BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 014381
(425 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255551004|ref|XP_002516550.1| transcription factor, putative [Ricinus communis]
gi|223544370|gb|EEF45891.1| transcription factor, putative [Ricinus communis]
Length = 629
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 256/411 (62%), Positives = 297/411 (72%), Gaps = 25/411 (6%)
Query: 3 QGETISDFAPKKLARQLDFT-ICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQPQV 61
Q E ISDFAPKKLARQLDFT +CRAS N L E S + Q QP P P P Q+
Sbjct: 5 QSEAISDFAPKKLARQLDFTAVCRASVNVALSEHSSRLQQQQPPPLPPPPPPPMSQP-QL 63
Query: 62 QSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRM---VNRVPHP 118
K+ Q ++L+ PP QP QARPQHV + + R+PHP
Sbjct: 64 HLKLQQQPQPVLQLKLET-PPKQQPQA-------------TQARPQHVMVPPAMRRIPHP 109
Query: 119 VHKLPLPTLPPGKQESPRSRPRANTDGKD--GTPKKQKQCNCRNSRCLKLYCECFAAGIY 176
V KL L TL KQ+SP SRPR N + KD GTPKK K CNC+NSRCLKLYCECFAAG++
Sbjct: 110 VQKLSLHTLQLIKQQSPSSRPRNNIEAKDNSGTPKKAKHCNCKNSRCLKLYCECFAAGVH 169
Query: 177 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 236
C GCNC NCHNNVE+E +RQEAVGA LERNP AF+PKIASSPHG++DA+EDA E QL K
Sbjct: 170 CTGCNCTNCHNNVENEASRQEAVGAVLERNPEAFKPKIASSPHGSRDAKEDAMEVQLVGK 229
Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 296
HNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH +HNGI +MQ+
Sbjct: 230 HNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFHDNHNGIVYMQQ 289
Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCS 356
AANAAI GA+GSSGYGT +TSNKRKSEE LFG +DQ R ++QQ N+ +N A S
Sbjct: 290 AANAAICGAIGSSGYGTPLTSNKRKSEE-LFGSTNRDQLGRRVAKTQQENHQRNPAASAS 348
Query: 357 -QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATK 406
SVPV NA+ LGS+K +SPLA ILQPQD+K++CSLLV++S EA K
Sbjct: 349 PLSVPVPH--NATALGSSKFTYKSPLAGILQPQDVKKMCSLLVVLSQEARK 397
>gi|224141773|ref|XP_002324239.1| predicted protein [Populus trichocarpa]
gi|222865673|gb|EEF02804.1| predicted protein [Populus trichocarpa]
Length = 682
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 233/417 (55%), Positives = 280/417 (67%), Gaps = 63/417 (15%)
Query: 1 MEQGETISDFAPKKLARQ-LDFT--ICR-ASANGLLPEQSVKSQSSQPQSHSPLQPQPPK 56
MEQ ET++++ PKKLA Q LDFT ICR A AN V SQ+ Q Q + LQ Q P
Sbjct: 1 MEQSETVTEYTPKKLAPQKLDFTAPICRTAPAN-------VPSQTQQVQVN--LQTQTPP 51
Query: 57 SQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARP--QH--VRMV 112
P +Q RP QH V +
Sbjct: 52 RP-------------------------------------EMLPPPYQERPVSQHEAVAVA 74
Query: 113 NRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFA 172
R+PHPVHK+ +P GKQ+SP SR AN DGTPKK KQCNC+NS CLKLYCECFA
Sbjct: 75 QRIPHPVHKIS--AMPRGKQDSPGSR-LAN----DGTPKKPKQCNCKNSMCLKLYCECFA 127
Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
AGI+C+GCNC+NC NNVE+E R+EAV TL+RNPNAFRPKIASSPHG++D EDA+E Q
Sbjct: 128 AGIHCNGCNCLNCRNNVENEKERKEAVETTLQRNPNAFRPKIASSPHGSRDTTEDAQEVQ 187
Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIA 292
+ KHNKGC+CK+SGCLKKYCECFQ NI CSENC+CL+CKNFEGS+ERRALFHG + A
Sbjct: 188 MLGKHNKGCNCKRSGCLKKYCECFQGNIFCSENCKCLECKNFEGSDERRALFHGSPHATA 247
Query: 293 FMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSA 352
+ Q+A NAAI GA+GSSGYGT + S KRKSEE LFG+A KDQ + QGN ++NSA
Sbjct: 248 YAQQATNAAISGAIGSSGYGTTLASKKRKSEE-LFGIATKDQPAHSTAKFSQGNTLRNSA 306
Query: 353 VPCS-QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
S SVPVS+TAN +VLGSTK RSPLA +L+PQ +KE+C+LLV +S +AT+ L
Sbjct: 307 AFSSPLSVPVSRTANGAVLGSTKPTYRSPLAGVLKPQFVKEICTLLVTISGKATEAL 363
>gi|356510007|ref|XP_003523732.1| PREDICTED: uncharacterized protein LOC100811459 [Glycine max]
Length = 515
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 221/409 (54%), Positives = 270/409 (66%), Gaps = 30/409 (7%)
Query: 1 MEQGETISDFAPKKLARQLDFTICRA--SANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQ 58
MEQ ET SD AP+KLARQLDF S L P +P SHSPL+ Q
Sbjct: 1 MEQSETASDLAPRKLARQLDFAAVYGDPSHQKLPPPLPPPQSPLRPPSHSPLRLQ----- 55
Query: 59 PQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHP 118
L L PP +P + + + + + RP+ V V R+PHP
Sbjct: 56 ------------------LDLQPPGQRPWLYSEAQEQQPGL--YSPRPKLVSPVRRIPHP 95
Query: 119 VHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD 178
V K P+ L K ESPRS+PR N + KD TPKKQKQCNC+NSRCLKLYCECFAAGIYCD
Sbjct: 96 VVKFPVKVLQVVKPESPRSQPRCNVELKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCD 155
Query: 179 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 238
GCNCVNCHNNV++E ARQEAVG TLERNPNAFRPKIASSP +D++E E ++ KHN
Sbjct: 156 GCNCVNCHNNVDNEAARQEAVGITLERNPNAFRPKIASSPQEQRDSKE--CEIKVIGKHN 213
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAA 298
KGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGS+ER A+FH D+N + M++AA
Sbjct: 214 KGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSDERIAIFHKDYN-LVHMKQAA 272
Query: 299 NAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQS 358
NA I GAVGSSGYGT +T KRK++E+ G +A DQ+V Q Q + S+
Sbjct: 273 NATISGAVGSSGYGTHITPKKRKNQEMFPGKSAMDQTVNMTAQYLQEIDPMASSPSSLSD 332
Query: 359 VPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
+S +N + GS++S RS LAD+L+P ++K LCSL V++S A KT
Sbjct: 333 SFISDPSNTRISGSSRSTYRSVLADVLKPNNVKNLCSLFVVLSRVAAKT 381
>gi|255546437|ref|XP_002514278.1| transcription factor, putative [Ricinus communis]
gi|223546734|gb|EEF48232.1| transcription factor, putative [Ricinus communis]
Length = 601
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 224/414 (54%), Positives = 274/414 (66%), Gaps = 62/414 (14%)
Query: 3 QGETISDFAPKKLARQLDFTI----CRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQ 58
Q E++ DF KKLARQLDFT + N +LPE PQS Q QPP +
Sbjct: 18 QSESL-DFPTKKLARQLDFTTQGNNNNNNNNVVLPEH--------PQS----QQQPPATV 64
Query: 59 PQVQSKVASPSQSQPLV--RLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVP 116
Q V +P Q Q + LLPP QP V+A
Sbjct: 65 ALPQEPVVAPQQPQAVAIPEAPLLPP--QPPVRAI------------------------- 97
Query: 117 HPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIY 176
K ESP+S+PR N + KDGTPK+QKQCNC++SRCLKLYCECFA+G Y
Sbjct: 98 --------------KPESPKSKPRPNAELKDGTPKRQKQCNCKHSRCLKLYCECFASGTY 143
Query: 177 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 236
CDGCNCVNC+NNVE+E AR+EAV ATLERNPNAFRPKIASSPHG +D+RE+ E + K
Sbjct: 144 CDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREENGEGLILGK 203
Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQ 295
HNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q
Sbjct: 204 HNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHTNNMAYIQ 263
Query: 296 RAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ-NSAVP 354
+AANAAI GA+GSSGY + S KRK +ELLFG KD S R QQ +++ +++
Sbjct: 264 QAANAAITGAIGSSGYVSPPISKKRKGQELLFGPTTKDPSFHRLGHFQQAVHIRPSTSSS 323
Query: 355 CSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
S P+++ +++ LG +K RS LADI+QPQD+KELCS+LV++S EA KTL
Sbjct: 324 SLSSNPIARAGSSATLGPSKFTYRSLLADIIQPQDLKELCSVLVVLSGEAAKTL 377
>gi|225445198|ref|XP_002280793.1| PREDICTED: uncharacterized protein LOC100249023 [Vitis vinifera]
Length = 592
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 200/308 (64%), Positives = 242/308 (78%), Gaps = 9/308 (2%)
Query: 117 HPVHKLPLP--TLPPG----KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCEC 170
HPV +P+P T P K ESPRSRPR N D KDGTPKKQKQCNC++SRCLKLYCEC
Sbjct: 65 HPV-TMPVPPQTTHPSVRVVKPESPRSRPRPNIDVKDGTPKKQKQCNCKHSRCLKLYCEC 123
Query: 171 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 230
FA+GIYCDGCNCVNCHNNVE+E AR+EAV TLERNPNAFRPKIASSPHGA+D+RE++ E
Sbjct: 124 FASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRPKIASSPHGARDSREESGE 183
Query: 231 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-N 289
A + KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N
Sbjct: 184 ALVLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHAN 243
Query: 290 GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ 349
+A++Q+AANAAI GA+GSSG+G+ S KRK +EL +G +KD S+ R Q QQ + +
Sbjct: 244 SMAYIQQAANAAITGAIGSSGFGSPPVSKKRKGQELFYGPTSKDPSLHRLAQFQQASLTK 303
Query: 350 -NSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
+ S PVS+ N++ G++K RS LADI+QPQD+KELCS+LV+VS EA +T
Sbjct: 304 ASVTTSSLSSTPVSRVPNSAASGTSKFTYRSLLADIIQPQDLKELCSVLVVVSEEAARTF 363
Query: 409 TSIYHYIQ 416
+ ++
Sbjct: 364 ADKRNLVE 371
>gi|224119800|ref|XP_002331164.1| predicted protein [Populus trichocarpa]
gi|222873247|gb|EEF10378.1| predicted protein [Populus trichocarpa]
Length = 519
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 193/282 (68%), Positives = 229/282 (81%), Gaps = 4/282 (1%)
Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVE 190
K ESP+SRPR N + KDGTPKKQKQCNC++SRCLKLYCECFA+G YCDGCNCVNC+NNVE
Sbjct: 90 KPESPKSRPRLNAELKDGTPKKQKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVE 149
Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
+E AR+EAV ATLERNPNAFRPKIASSPHG +D+RE+ E + KHNKGCHCKKSGCLK
Sbjct: 150 NEAARREAVEATLERNPNAFRPKIASSPHGTRDSREETGEGLVFVKHNKGCHCKKSGCLK 209
Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSS 309
KYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q+AANAAI GA+GSS
Sbjct: 210 KYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHGNNMAYIQQAANAAITGAIGSS 269
Query: 310 GYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCS--QSVPVSQTANA 367
GY +L S KRK +EL FG KD S R QQ ++++ A P S S PV++ N
Sbjct: 270 GYASLPVSRKRKGQELFFGQTVKDPSFHRLGHFQQASHIR-PAAPSSPLPSNPVARAGNT 328
Query: 368 SVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
+ LGS+K RS LADI+Q QD+KELCS+LV++S EA KTL+
Sbjct: 329 TTLGSSKITYRSLLADIIQSQDLKELCSVLVVLSGEAAKTLS 370
>gi|356522716|ref|XP_003529992.1| PREDICTED: uncharacterized protein LOC100818116 [Glycine max]
Length = 559
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 198/297 (66%), Positives = 230/297 (77%), Gaps = 8/297 (2%)
Query: 118 PVHKLPLPTLPP----GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAA 173
PV LPLP P GK ESP+SR R N + KD TPKKQKQCNC++S+CLKLYCECFA+
Sbjct: 52 PVAVLPLPLQAPPARVGKPESPKSRSRPNFEMKDATPKKQKQCNCKHSKCLKLYCECFAS 111
Query: 174 GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL 233
GIYCDGCNCVNC NNVE+E AR+EAV ATLERNPNAFRPKIASSPHG +D+REDA E +
Sbjct: 112 GIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREDAGEVLI 171
Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAF 293
KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGD N +A+
Sbjct: 172 LGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQNNMAY 231
Query: 294 MQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV 353
+Q+AANAAI GA+GSSGY + S KRK +EL F KD S+ + QQ N+V+ A
Sbjct: 232 IQQAANAAITGAIGSSGYSSPPVSKKRKGQELFFWPTTKDPSISK--LGQQVNHVRGPA- 288
Query: 354 PCSQSVPVSQT-ANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
P S PVS + LG +K RS LADI+QPQ +KELCS+LV+VS +A KTLT
Sbjct: 289 PSSSLSPVSGARVGTATLGPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTLT 345
>gi|297738803|emb|CBI28048.3| unnamed protein product [Vitis vinifera]
Length = 524
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 194/294 (65%), Positives = 236/294 (80%), Gaps = 2/294 (0%)
Query: 125 PTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVN 184
P++ K ESPRSRPR N D KDGTPKKQKQCNC++SRCLKLYCECFA+GIYCDGCNCVN
Sbjct: 10 PSVRVVKPESPRSRPRPNIDVKDGTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVN 69
Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 244
CHNNVE+E AR+EAV TLERNPNAFRPKIASSPHGA+D+RE++ EA + KHNKGCHCK
Sbjct: 70 CHNNVENEAARREAVEVTLERNPNAFRPKIASSPHGARDSREESGEALVLGKHNKGCHCK 129
Query: 245 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAII 303
KSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q+AANAAI
Sbjct: 130 KSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHANSMAYIQQAANAAIT 189
Query: 304 GAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ-NSAVPCSQSVPVS 362
GA+GSSG+G+ S KRK +EL +G +KD S+ R Q QQ + + + S PVS
Sbjct: 190 GAIGSSGFGSPPVSKKRKGQELFYGPTSKDPSLHRLAQFQQASLTKASVTTSSLSSTPVS 249
Query: 363 QTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSIYHYIQ 416
+ N++ G++K RS LADI+QPQD+KELCS+LV+VS EA +T + ++
Sbjct: 250 RVPNSAASGTSKFTYRSLLADIIQPQDLKELCSVLVVVSEEAARTFADKRNLVE 303
>gi|224143568|ref|XP_002325000.1| predicted protein [Populus trichocarpa]
gi|222866434|gb|EEF03565.1| predicted protein [Populus trichocarpa]
Length = 545
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 223/405 (55%), Positives = 270/405 (66%), Gaps = 37/405 (9%)
Query: 10 FAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQPQVQSKVASPS 69
F KKLARQLDFT G+LP+ PQS QP P + P + S A+ S
Sbjct: 23 FQTKKLARQLDFT------QGVLPDH--------PQS----QPSPLQKPPLLVSPAAAQS 64
Query: 70 QSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPP 129
Q QP + QL +Q + +Q Q R V R+ P+ + P
Sbjct: 65 QPQPHPQSQLQLQPQVAEIQVVPQQQTQQQQQP------ARAVLRLLPPMFRKP------ 112
Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
ESP+S P NT+ KDGTPKKQ+QCNC++SRCLKLYCECFA+G YCDGCNCVNC+NNV
Sbjct: 113 ---ESPKSIPIPNTELKDGTPKKQRQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNV 169
Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
E+E AR+EAV ATLERNPNAFRPKIASSPHG +D+RE+ + + KHNKGCHCKKSGCL
Sbjct: 170 ENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREETGDGLVFVKHNKGCHCKKSGCL 229
Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGS 308
KKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q+AANAAI GA+GS
Sbjct: 230 KKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHGNNMAYIQQAANAAITGAIGS 289
Query: 309 SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN---SAVPCSQSVPVSQTA 365
SGY + S KRK +EL FG KDQS R QQ N + S P+++
Sbjct: 290 SGYASPPVSRKRKGQELFFGHTVKDQSFDRLGHFQQVNGSHTRPPAPSSSLPSNPIARAG 349
Query: 366 NASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTS 410
NA LG +K RS LADI+QPQD+KELCS+LV++S EA KT +
Sbjct: 350 NAITLGPSKITYRSLLADIIQPQDLKELCSVLVVLSGEAAKTFSD 394
>gi|356511363|ref|XP_003524396.1| PREDICTED: uncharacterized protein LOC100786353 [Glycine max]
Length = 561
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 198/305 (64%), Positives = 238/305 (78%), Gaps = 8/305 (2%)
Query: 116 PHPVHKLPLP---TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFA 172
P P LPLP ++ GK ESP+SRPR + K+ TPKKQKQCNC++S+CLKLYCECFA
Sbjct: 52 PQPPVILPLPPSASVRVGKPESPKSRPRPGFEIKEATPKKQKQCNCKHSKCLKLYCECFA 111
Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
+GIYCDGCNCVNC+NNVE+E AR+EAV ATLERNPNAFRPKIASSPHG +D+REDA E
Sbjct: 112 SGIYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREDAGEVL 171
Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD-HNGI 291
+ KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGD +N +
Sbjct: 172 ILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQNNNM 231
Query: 292 AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNS 351
++Q+AANAAI GA+GSSGY + S KRK +ELLFG KD SV R Q QQ NNV+
Sbjct: 232 VYIQQAANAAITGAIGSSGYSSPPISKKRKGQELLFGPTVKDPSVGR--QGQQANNVRVP 289
Query: 352 AVPCSQS-VPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTS 410
A S S +P ++ A+ LG +K RS LADI+QPQ +KELCS+LV+VS +A K T
Sbjct: 290 APSSSLSPIPGARVGPAT-LGPSKLMYRSLLADIIQPQHLKELCSVLVLVSGQAAKMFTD 348
Query: 411 IYHYI 415
+++
Sbjct: 349 HKNFM 353
>gi|356529628|ref|XP_003533391.1| PREDICTED: uncharacterized protein LOC100777698 [Glycine max]
Length = 559
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 198/314 (63%), Positives = 236/314 (75%), Gaps = 9/314 (2%)
Query: 102 WQARPQHVRMVNRVPHPVHKLPLPTLPP----GKQESPRSRPRANTDGKDGTPKKQKQCN 157
+ P+H ++ PV LPL P GK ESP+SR R N + KD TPKKQKQCN
Sbjct: 34 FTGAPEHPQLSQPPQLPVAVLPLQPQAPHARVGKPESPKSRSRPNFEIKDATPKKQKQCN 93
Query: 158 CRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS 217
C++S+CLKLYCECFA+GIYCDGCNCVNC NNVE+E AR+EAV ATLERNPNAFRPKIASS
Sbjct: 94 CKHSKCLKLYCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAFRPKIASS 153
Query: 218 PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 277
PHG +D+RE+A E + KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGS
Sbjct: 154 PHGTRDSREEAGEVLILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGS 213
Query: 278 EERRALFHGD-HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSV 336
EER+ALFHGD +N +A++Q+AANAAI GA+GSSGY + S KRK +EL F KD S+
Sbjct: 214 EERQALFHGDQNNNMAYIQQAANAAITGAIGSSGYSSPPVSKKRKGQELFFWPTTKDPSI 273
Query: 337 IRNPQSQQGNNVQNSAVPCSQSVPVSQT-ANASVLGSTKSALRSPLADILQPQDMKELCS 395
+ QQ N+V+ A P S PVS + LG +K RS LADI+QPQ +KELCS
Sbjct: 274 SK--LGQQVNHVRGPA-PSSSLSPVSGARVGTATLGPSKFMYRSLLADIIQPQHLKELCS 330
Query: 396 LLVMVSSEATKTLT 409
+LV+VS +A KTLT
Sbjct: 331 VLVLVSGQAAKTLT 344
>gi|225429991|ref|XP_002281512.1| PREDICTED: uncharacterized protein LOC100243881 [Vitis vinifera]
Length = 579
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 200/305 (65%), Positives = 243/305 (79%), Gaps = 5/305 (1%)
Query: 109 VRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYC 168
V +V R+ +P PTL KQESPRS PR+N++ KDGTPKK+K CNC+NSRCLKLYC
Sbjct: 70 VPVVPRITNPSR----PTLQAVKQESPRSLPRSNSEVKDGTPKKKKHCNCKNSRCLKLYC 125
Query: 169 ECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 228
ECFA+G+YC+ C+C NC NNV +E AR+EAVG+TL+RNPNAFRPKIA+SPHG+ D E+A
Sbjct: 126 ECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHGSHDGGEEA 185
Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH 288
+ L KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH +
Sbjct: 186 GKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFHDNG 245
Query: 289 NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNV 348
N +A++Q+AANAAI GA+GSSGYGT + S KRK +++ F A +QS RN Q QQ N++
Sbjct: 246 NYMAYIQQAANAAISGAIGSSGYGTAVASKKRK-QQVSFCTTANNQSNHRNTQFQQDNHL 304
Query: 349 QNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
+ SAV S S+ V+ T NA+VLGS+K RSPLADILQ QD+K+LCSLLV+VS EA KTL
Sbjct: 305 RASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQLQDVKDLCSLLVVVSQEAGKTL 364
Query: 409 TSIYH 413
T H
Sbjct: 365 TEREH 369
>gi|296081871|emb|CBI20876.3| unnamed protein product [Vitis vinifera]
Length = 676
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 199/301 (66%), Positives = 242/301 (80%), Gaps = 5/301 (1%)
Query: 109 VRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYC 168
V +V R+ +P PTL KQESPRS PR+N++ KDGTPKK+K CNC+NSRCLKLYC
Sbjct: 151 VPVVPRITNPSR----PTLQAVKQESPRSLPRSNSEVKDGTPKKKKHCNCKNSRCLKLYC 206
Query: 169 ECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 228
ECFA+G+YC+ C+C NC NNV +E AR+EAVG+TL+RNPNAFRPKIA+SPHG+ D E+A
Sbjct: 207 ECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHGSHDGGEEA 266
Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH 288
+ L KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH +
Sbjct: 267 GKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFHDNG 326
Query: 289 NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNV 348
N +A++Q+AANAAI GA+GSSGYGT + S KRK +++ F A +QS RN Q QQ N++
Sbjct: 327 NYMAYIQQAANAAISGAIGSSGYGTAVASKKRK-QQVSFCTTANNQSNHRNTQFQQDNHL 385
Query: 349 QNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
+ SAV S S+ V+ T NA+VLGS+K RSPLADILQ QD+K+LCSLLV+VS EA KTL
Sbjct: 386 RASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQLQDVKDLCSLLVVVSQEAGKTL 445
Query: 409 T 409
T
Sbjct: 446 T 446
>gi|449449629|ref|XP_004142567.1| PREDICTED: protein tesmin/TSO1-like CXC 5-like [Cucumis sativus]
Length = 602
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 183/290 (63%), Positives = 222/290 (76%), Gaps = 6/290 (2%)
Query: 121 KLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQC---NCRNSRCLKLYCECFAAGIYC 177
K PLP P K +SP+SR R+N + KD TPKKQKQC NC++SRCLKLYCECFA+G+YC
Sbjct: 94 KAPLP--PARKPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKLYCECFASGVYC 151
Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 237
DGCNC NCHNNVEHE +R+EAV TLERNPNAFRPKIA+SPHG +++R++ E + KH
Sbjct: 152 DGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKIANSPHGTRESRDEIGELVMLGKH 211
Query: 238 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQR 296
NKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N IA++Q+
Sbjct: 212 NKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHANNIAYIQQ 271
Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCS 356
AANAAI GA+GSSGY L S KRK EL FG KD + PQ Q NN+ +S + S
Sbjct: 272 AANAAITGAIGSSGYACLPVSKKRKGPELGFGPVGKDSPLNSIPQLHQANNIMSSGMSTS 331
Query: 357 QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATK 406
P + + G +K + RS LAD++QP D+KELCS+LV+++SE K
Sbjct: 332 SPFPAAHVGSPVASGPSKFSFRSLLADLIQPNDLKELCSVLVVLTSEVAK 381
>gi|449519509|ref|XP_004166777.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 5-like
[Cucumis sativus]
Length = 556
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 210/415 (50%), Positives = 258/415 (62%), Gaps = 64/415 (15%)
Query: 1 MEQGETISDFAP-KKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQP 59
+E G +S P KKLARQLDFT + +LPE
Sbjct: 22 VELGNVVSSDVPAKKLARQLDFT---GTGGAVLPEH------------------------ 54
Query: 60 QVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMR---SPSQAMPQWQARPQHVRMVNRVP 116
PH Q+ R S S A+ Q++ Q V
Sbjct: 55 --------------------------PHFQSRSRCQESESPAVMVVQSQSQPQSPQQLVV 88
Query: 117 HPV-HKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQC---NCRNSRCLKLYCECFA 172
P+ K PLP P K +SP+SR R+N + KD TPKKQKQC NC++SRCLKLYCECFA
Sbjct: 89 LPIGTKAPLP--PARKPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKLYCECFA 146
Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
+G+YCDGCNC NCHNNVEHE +R+EAV TLERNPNAFRPK A+SPHG +++R++ E
Sbjct: 147 SGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKXANSPHGTRESRDEIGELV 206
Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGI 291
+ KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N I
Sbjct: 207 MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHANNI 266
Query: 292 AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNS 351
A++Q+AANAAI GA+GSSGY L S KRK EL FG KD + PQ Q NN+ +S
Sbjct: 267 AYIQQAANAAITGAIGSSGYACLPVSKKRKGPELGFGPVGKDSPLNSIPQLHQANNIMSS 326
Query: 352 AVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATK 406
+ S P + + G +K + RS LAD++QP D+KELCS+LV+++SE K
Sbjct: 327 GMSTSSPFPAAHVGSPVASGPSKFSFRSLLADLIQPNDLKELCSVLVVLTSEVAK 381
>gi|15235524|ref|NP_194629.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75213581|sp|Q9SZD1.1|TCX5_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 5; Short=AtTCX5
gi|4972046|emb|CAB43914.1| putative transcription factor [Arabidopsis thaliana]
gi|7269798|emb|CAB79658.1| putative transcription factor [Arabidopsis thaliana]
gi|26452336|dbj|BAC43254.1| putative transcription factor [Arabidopsis thaliana]
gi|29028974|gb|AAO64866.1| At4g29000 [Arabidopsis thaliana]
gi|332660171|gb|AEE85571.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 603
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 180/289 (62%), Positives = 218/289 (75%), Gaps = 9/289 (3%)
Query: 133 ESPRSR-PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEH 191
ESP++R PR N +G+DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNC NNV++
Sbjct: 110 ESPKARGPRPNVEGRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDN 169
Query: 192 EVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKK 251
E AR+EAV ATLERNP AFRPKIASSPHG +D RED E L KHNKGCHCKKSGCLKK
Sbjct: 170 EPARREAVEATLERNPFAFRPKIASSPHGGRDKREDIGEVVLLGKHNKGCHCKKSGCLKK 229
Query: 252 YCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSG 310
YCECFQANILCSENC+CLDCKNFEGSEER+ALFHG+H N +A++Q+AANAAI GAVGSSG
Sbjct: 230 YCECFQANILCSENCKCLDCKNFEGSEERQALFHGEHSNHMAYLQQAANAAITGAVGSSG 289
Query: 311 YGTLMTSNKRKSEELLFGVAAKDQSVIRN-PQSQQGNNVQNSAVPCSQS--VPVSQTANA 367
+ +RK +E+LF A KD S + + PQ G + P S + PVS+
Sbjct: 290 FAPSPAPKRRKGQEILFNQAIKDSSRLSHFPQVNNG----RTGGPTSGTSPSPVSRAGGN 345
Query: 368 SVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSIYHYIQ 416
+ +K RS LADI+QP D++ LCS+LV V+ EA KT T + I+
Sbjct: 346 ASSVPSKFVYRSLLADIIQPHDVRALCSVLVTVAGEAAKTSTDKRNEIE 394
>gi|356523755|ref|XP_003530500.1| PREDICTED: uncharacterized protein LOC100777327 [Glycine max]
Length = 551
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 193/289 (66%), Positives = 228/289 (78%), Gaps = 5/289 (1%)
Query: 123 PLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNC 182
P ++ GK ESP+SRPR + K+ TPKKQKQCNC++S+CLKLYCECFA+GIYCDGCNC
Sbjct: 64 PSTSVRVGKPESPKSRPRPGFEIKEATPKKQKQCNCKHSKCLKLYCECFASGIYCDGCNC 123
Query: 183 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 242
VNC NNVE+E AR+EAV ATLERNPNAFRPKIASSPH +D REDA E + KHNKGCH
Sbjct: 124 VNCCNNVENEAARREAVEATLERNPNAFRPKIASSPHRTRDIREDAGEILILGKHNKGCH 183
Query: 243 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD-HNGIAFMQRAANAA 301
CKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGD +N + ++Q+AANAA
Sbjct: 184 CKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQNNNLVYIQQAANAA 243
Query: 302 IIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQS-VP 360
I GA+GSSGY + S KRK +ELLFG KD SV R Q QQ NNV+ A S S +P
Sbjct: 244 ITGAIGSSGYSSPPISKKRKGQELLFGPTVKDPSVGR--QGQQANNVRVPAPSSSLSPIP 301
Query: 361 VSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
++ A+ LG +K RS LADI+QPQ +KELCS+LV+VS +A KT T
Sbjct: 302 GARVGPAT-LGPSKLMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTFT 349
>gi|297803150|ref|XP_002869459.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
lyrata]
gi|297315295|gb|EFH45718.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
lyrata]
Length = 601
Score = 351 bits (901), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 209/411 (50%), Positives = 257/411 (62%), Gaps = 45/411 (10%)
Query: 10 FAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQPQVQSKVASPS 69
F KK ARQLDFT G EQS+ ++ + +P V S + PS
Sbjct: 21 FPTKKPARQLDFT-------GGSDEQSLSKAAAPTVVAT-------AVKPVVTSSI--PS 64
Query: 70 QSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNR-VPHPVHKLPLPTLP 128
+P V + + +R P+ M P +++N + HP
Sbjct: 65 TIRPGVTIAI----------GQVR-PTLPMATTSNPPSQSQILNAPIRHP---------- 103
Query: 129 PGKQESPRSR-PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHN 187
K ESP++R PR +G+DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNC N
Sbjct: 104 --KPESPKARGPRPIVEGRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFN 161
Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
NV++E AR+EAV ATLERNP AFRPKIASSPHG +D RED E L KHNKGCHCKKSG
Sbjct: 162 NVDNEPARREAVEATLERNPFAFRPKIASSPHGVRDKREDIGEVVLLGKHNKGCHCKKSG 221
Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAV 306
CLKKYCECFQANILCSENC+CLDCKNFEGSEER+ALFHG+H N +A++Q+AANAAI GAV
Sbjct: 222 CLKKYCECFQANILCSENCKCLDCKNFEGSEERQALFHGEHANHMAYLQQAANAAITGAV 281
Query: 307 GSSGYGTLMTSNKRKSEELLFGVAAKDQSVI-RNPQSQQGNNVQNSAVPCSQSVPVSQTA 365
GSSG+ +RK +E+LF A KD S + + PQ G S S
Sbjct: 282 GSSGFAPSPAPKRRKGQEILFNQATKDSSRLGQFPQVNSG-RASGPTSGSSPSPVSRAGG 340
Query: 366 NASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSIYHYIQ 416
NAS S K RS LADI+QP D++ LCS+LV V+ EA KT T + I+
Sbjct: 341 NASSAPS-KFVYRSLLADIIQPHDVRALCSVLVAVAGEAAKTSTDKRNEIE 390
>gi|357500623|ref|XP_003620600.1| Lin-54-like protein [Medicago truncatula]
gi|357500779|ref|XP_003620678.1| Lin-54-like protein [Medicago truncatula]
gi|355495615|gb|AES76818.1| Lin-54-like protein [Medicago truncatula]
gi|355495693|gb|AES76896.1| Lin-54-like protein [Medicago truncatula]
Length = 629
Score = 350 bits (899), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 191/345 (55%), Positives = 234/345 (67%), Gaps = 34/345 (9%)
Query: 97 QAMPQWQARPQHVRMVNRVPHPVHKLPLP-----------TLPPGKQESPRSRPRANTDG 145
Q+ PQ Q Q MV + P V LP+P ++ GKQESP+ R R N +
Sbjct: 61 QSKPQQQVVTQGSVMVQK-PVGVGGLPMPVPAQVQTLQHSSVRVGKQESPKPRSRPNFEV 119
Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLER 205
K+GTPKKQ+QCNC++S+CLKLYCECFA+GIYCDGCNCVNC NNV++E AR+EAV ATLER
Sbjct: 120 KEGTPKKQRQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLER 179
Query: 206 NPNAFRPKIASSPHGAQDAR---------EDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
NPNAFRPKIASSP GA+D+R E+A E + KH+KGCHCKKSGCLKKYCECF
Sbjct: 180 NPNAFRPKIASSPQGARDSRVSRSSILWQEEAGEGLILIKHHKGCHCKKSGCLKKYCECF 239
Query: 257 QANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMT 316
QAN+LCSENCRC+DCKNFEGSEER+ALF GD N ++Q+AANAAI GA+GS G+ +
Sbjct: 240 QANVLCSENCRCMDCKNFEGSEERQALFRGDQNNNVYLQQAANAAITGAIGSYGFSSPPA 299
Query: 317 SNKRKSEELLFGVAAKDQSVIRNPQ--------SQQG---NNVQNSAVPCSQSVPVSQTA 365
S KRK +EL AKD S+ + Q S+ G N V+ A S S S
Sbjct: 300 SKKRKGQELFLWPTAKDPSISKPGQQVKFAYCTSRTGTGVNLVKGPAPSSSASPVSSARG 359
Query: 366 NASVLGSTKSAL--RSPLADILQPQDMKELCSLLVMVSSEATKTL 408
LG + S L RS L+D++QP +KELCS+LV+VS +A KTL
Sbjct: 360 TNPTLGQSPSKLKYRSLLSDVVQPHHLKELCSVLVLVSGQAAKTL 404
>gi|356518324|ref|XP_003527829.1| PREDICTED: uncharacterized protein LOC100812944 [Glycine max]
Length = 656
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 175/286 (61%), Positives = 210/286 (73%), Gaps = 3/286 (1%)
Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCN 181
L + K +SPRS+P+ N + KD TPKKQKQCNC+NSRCLKLYCECFAAGIYCDGCN
Sbjct: 170 LNFSAIKIVKPKSPRSQPQCNVELKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCDGCN 229
Query: 182 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 241
CVNCHNNV++E ARQEAVG TLERNPNAFRPKIASSP +D++E E Q+ KHNKGC
Sbjct: 230 CVNCHNNVDNEAARQEAVGITLERNPNAFRPKIASSPLERRDSKE--CEIQVIGKHNKGC 287
Query: 242 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAA 301
HCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGS+ERRA+FH D+N + +++AANA
Sbjct: 288 HCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSDERRAIFHKDYN-LVHIKQAANAT 346
Query: 302 IIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPV 361
I GSSGYGT +T KRK E+ G +A DQ+ Q QQ + S+ V
Sbjct: 347 ISRVAGSSGYGTHLTPKKRKIHEMFPGKSAMDQTDNITAQYQQEIDPMASSPSFLSDSFV 406
Query: 362 SQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
S +N + S++S RS LAD QPQ++K LCSL V++S A KT
Sbjct: 407 SDPSNTRISASSRSTYRSVLADFFQPQNVKNLCSLFVVLSQVAAKT 452
>gi|30680884|ref|NP_849995.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|330251872|gb|AEC06966.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 578
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 199/419 (47%), Positives = 253/419 (60%), Gaps = 63/419 (15%)
Query: 1 MEQGETISDFAPK---------KLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQ 51
M +GE F PK K ARQLDFT + E S +Q+S + S
Sbjct: 1 MGEGEEGDKFPPKTDEVTQESMKSARQLDFTGGSSDV-----EHSHSNQASSMAAASIPS 55
Query: 52 PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRM 111
P ++P + S Q+ P V + PP PQ Q HV +
Sbjct: 56 PIVTVTRPIITS------QAPPTVATPIPPP-----------------PQSQGIILHVPI 92
Query: 112 VNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECF 171
R P P ESP S PR + +DGTP+K+KQCNC++SRCLKLYCECF
Sbjct: 93 --RHPRP--------------ESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKLYCECF 136
Query: 172 AAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREA 231
A+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG +D RE+ +
Sbjct: 137 ASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDV 196
Query: 232 QLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NG 290
+ A+HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R++LFHG+H +
Sbjct: 197 VMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQSLFHGEHSHN 256
Query: 291 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN 350
+A++Q ANAAI GA+GSSG+ + +RK +E+ F KD S R Q+ G
Sbjct: 257 LAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLGQANNGRT--- 312
Query: 351 SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
+ S S+ + LG +K +S LA+I++P D+K LCS+LV V+ EA KTLT
Sbjct: 313 -----TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAVAGEAAKTLT 366
>gi|15225302|ref|NP_179601.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75266022|sp|Q9SL70.1|TCX6_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 6; Short=AtTCX6
gi|4580462|gb|AAD24386.1| hypothetical protein [Arabidopsis thaliana]
gi|225898124|dbj|BAH30394.1| hypothetical protein [Arabidopsis thaliana]
gi|330251873|gb|AEC06967.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 571
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 199/419 (47%), Positives = 253/419 (60%), Gaps = 63/419 (15%)
Query: 1 MEQGETISDFAPK---------KLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQ 51
M +GE F PK K ARQLDFT + E S +Q+S + S
Sbjct: 1 MGEGEEGDKFPPKTDEVTQESMKSARQLDFTGGSSDV-----EHSHSNQASSMAAASIPS 55
Query: 52 PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRM 111
P ++P + S Q+ P V + PP PQ Q HV +
Sbjct: 56 PIVTVTRPIITS------QAPPTVATPIPPP-----------------PQSQGIILHVPI 92
Query: 112 VNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECF 171
R P P ESP S PR + +DGTP+K+KQCNC++SRCLKLYCECF
Sbjct: 93 --RHPRP--------------ESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKLYCECF 136
Query: 172 AAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREA 231
A+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG +D RE+ +
Sbjct: 137 ASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDV 196
Query: 232 QLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NG 290
+ A+HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R++LFHG+H +
Sbjct: 197 VMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQSLFHGEHSHN 256
Query: 291 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN 350
+A++Q ANAAI GA+GSSG+ + +RK +E+ F KD S R Q+ G
Sbjct: 257 LAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLGQANNGRT--- 312
Query: 351 SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
+ S S+ + LG +K +S LA+I++P D+K LCS+LV V+ EA KTLT
Sbjct: 313 -----TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAVAGEAAKTLT 366
>gi|26984067|gb|AAN85197.1| hypothetical protein [Arabidopsis thaliana]
Length = 536
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 164/278 (58%), Positives = 206/278 (74%), Gaps = 10/278 (3%)
Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
ESP S PR + +DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNC NNVE+E
Sbjct: 63 ESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENE 122
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
AR++AV +TLERNPNAFRPKIA+SPHG +D RE+ + + A+HNKGCHCKKSGCLKKY
Sbjct: 123 PARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKY 182
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGY 311
CECFQANILCSENC CLDCKNFEGSE R++LFHG+H + +A++Q ANAAI GA+GSSG+
Sbjct: 183 CECFQANILCSENCXCLDCKNFEGSEVRQSLFHGEHSHNLAYLQH-ANAAITGAIGSSGF 241
Query: 312 GTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLG 371
+ +RK +E+ F KD S R Q+ G + S S+ + LG
Sbjct: 242 ASAPPPKRRKGQEIFFNQGTKDSSTHRLGQANNGRT--------TSSQTGSRAGGNASLG 293
Query: 372 STKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
+K +S LA+I++P D+K LCS+LV V+ EA KTLT
Sbjct: 294 PSKVVYKSLLANIIKPMDVKALCSVLVAVAGEAAKTLT 331
>gi|26984069|gb|AAN85198.1| hypothetical protein [Arabidopsis thaliana]
Length = 543
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 164/278 (58%), Positives = 206/278 (74%), Gaps = 10/278 (3%)
Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
ESP S PR + +DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNC NNVE+E
Sbjct: 63 ESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENE 122
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
AR++AV +TLERNPNAFRPKIA+SPHG +D RE+ + + A+HNKGCHCKKSGCLKKY
Sbjct: 123 PARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKY 182
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGY 311
CECFQANILCSENC CLDCKNFEGSE R++LFHG+H + +A++Q ANAAI GA+GSSG+
Sbjct: 183 CECFQANILCSENCXCLDCKNFEGSEVRQSLFHGEHSHNLAYLQH-ANAAITGAIGSSGF 241
Query: 312 GTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLG 371
+ +RK +E+ F KD S R Q+ G + S S+ + LG
Sbjct: 242 ASAPPPKRRKGQEIFFNQGTKDSSTHRLGQANNGRT--------TSSQTGSRAGGNASLG 293
Query: 372 STKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
+K +S LA+I++P D+K LCS+LV V+ EA KTLT
Sbjct: 294 PSKVVYKSLLANIIKPMDVKALCSVLVAVAGEAAKTLT 331
>gi|326532502|dbj|BAK05180.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 615
Score = 337 bits (864), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 194/397 (48%), Positives = 248/397 (62%), Gaps = 16/397 (4%)
Query: 13 KKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQPQVQSKVASPSQSQ 72
KKL RQLDF + + S+S P++ PQP Q + + P Q Q
Sbjct: 34 KKLVRQLDFNYGNPAMAA---AAAAVSRSVPPRALQVGYPQP--HQLRAALPMGVPHQLQ 88
Query: 73 PLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQ 132
P P +PH H + P A+P P + + + PV + P+ P K
Sbjct: 89 PRGL-----PVMRPHQMVHAQLPRPALPMAMPVPV-PQARSALSQPVQRPPVAV--PLKP 140
Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
ESP+ R R DGKDGTP K+K CNCR+S+CLKLYCECFA+GIYCDGCNC NC NNVE+E
Sbjct: 141 ESPKPRGRL-YDGKDGTPTKKKCCNCRHSKCLKLYCECFASGIYCDGCNCTNCFNNVENE 199
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
VAR+EA+ ATLERNP+AFRPKI SSPH ++ E A + L KHNKGCHCKKSGCLKKY
Sbjct: 200 VARREAIEATLERNPDAFRPKIGSSPHTNRNNMEVAGDLPLIGKHNKGCHCKKSGCLKKY 259
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYG 312
CECFQANILCSENC+C DCKNF+GSE+R+ L DH + MQ+AANAA+ GA+G++ +
Sbjct: 260 CECFQANILCSENCKCRDCKNFDGSEDRKHLSL-DHKNVVHMQQAANAAVNGAIGATAFS 318
Query: 313 TLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGS 372
+ TS KRK + ++ K+ +RN Q Q Q +A+ S+PV Q+ + ++G
Sbjct: 319 SPSTSKKRKHIDSSLDLSTKEHLAMRNGQLSQ-VIYQKNAMASDGSLPVGQSVHPHMMGP 377
Query: 373 TKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
K R LADI+QP D+KELC LLV VS +A K T
Sbjct: 378 FKVTYRPLLADIIQPGDVKELCKLLVTVSGQAAKAYT 414
>gi|297836236|ref|XP_002886000.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297331840|gb|EFH62259.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 568
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 163/267 (61%), Positives = 200/267 (74%), Gaps = 10/267 (3%)
Query: 144 DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATL 203
+ +DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNCHNNVE+E R+EA+G+TL
Sbjct: 107 EARDGTPQKKKQCNCKHSRCLKLYCECFASGSYCDGCNCVNCHNNVENEPERREAIGSTL 166
Query: 204 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 263
ERNPNAFRPKIA+SPHG +D RE+ E L +HNKGCHCKKSGCLKKYCECFQANILCS
Sbjct: 167 ERNPNAFRPKIAASPHGGRDNREEVGEVVLLGRHNKGCHCKKSGCLKKYCECFQANILCS 226
Query: 264 ENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKS 322
ENC+CLDCKNFEGSE R++LFHG+H + +A++Q ANAAI GA+GSSG+ + +RK
Sbjct: 227 ENCKCLDCKNFEGSEVRQSLFHGEHSHNVAYLQH-ANAAITGAIGSSGFASAPAPKRRKG 285
Query: 323 EELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLA 382
+E+ F KD S R Q+ G S NAS L +K RS LA
Sbjct: 286 QEIFFNQGPKDSSTHRLGQANSGRTT-------SPKTGSHAGGNAS-LRPSKVVYRSLLA 337
Query: 383 DILQPQDMKELCSLLVMVSSEATKTLT 409
DI++P+D+K LCS+LV V+ EA KTLT
Sbjct: 338 DIIKPKDVKALCSVLVSVAGEAAKTLT 364
>gi|26984071|gb|AAN85199.1| hypothetical protein [Arabidopsis thaliana]
Length = 532
Score = 333 bits (855), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 164/278 (58%), Positives = 205/278 (73%), Gaps = 14/278 (5%)
Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
ESP S PR + +DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNC NNVE+E
Sbjct: 63 ESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENE 122
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
AR++AV +TLERNPNAFRPKIA+SPHG +D RE+ + + A+HNKGCHCKKSGCLKKY
Sbjct: 123 PARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKY 182
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGY 311
CECFQANILCSENC CLDCKNFEGSE R++LFHG+H + +A++Q ANAAI GA+GSSG+
Sbjct: 183 CECFQANILCSENCXCLDCKNFEGSEVRQSLFHGEHSHNLAYLQH-ANAAITGAIGSSGF 241
Query: 312 GTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLG 371
+ +RK +E+ F KD S R Q+ G + S S+ + LG
Sbjct: 242 ASAPPPKRRKGQEIFFNQGTKDSSTHRLGQANNGRT--------TSSQTGSRAGGNASLG 293
Query: 372 STKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
+KS LA+I++P D+K LCS+LV V+ EA KTLT
Sbjct: 294 PSKSL----LANIIKPMDVKALCSVLVAVAGEAAKTLT 327
>gi|242058765|ref|XP_002458528.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
gi|241930503|gb|EES03648.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
Length = 550
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 180/332 (54%), Positives = 223/332 (67%), Gaps = 10/332 (3%)
Query: 78 QLLPPSSQPH-VQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPR 136
Q + P +PH V H+ P A+P PQ +R + P + +P P ESP+
Sbjct: 41 QRILPVMRPHQVVGHVPLPRHAVPMAVPVPQ-LRPMPPQPVQRPPVAVPLKP----ESPK 95
Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQ 196
R R +GKD TP K+K CNCRNSRCLKLYCECFA+G +CDGCNC NC NN E+EVAR+
Sbjct: 96 PRARL-YEGKDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARR 154
Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
EA+ ATLERNP+AFRPKI SSPH ++ E + + L KHNKGCHCKKSGCLKKYCECF
Sbjct: 155 EAIEATLERNPDAFRPKIGSSPHTNRN-NEVSSDLPLVGKHNKGCHCKKSGCLKKYCECF 213
Query: 257 QANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMT 316
QANILCSENC+C+DCKNFEGSEERR LF GDH MQ+A NAA+ GA+G++G+ + T
Sbjct: 214 QANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAINMQQATNAAVNGAIGATGFPSPST 273
Query: 317 SNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSA 376
S KRK + + K+ RN Q + Q +AVP +P+SQ+A+ LG K
Sbjct: 274 SRKRKHIDPSLDNSNKEHVAHRNYHISQ-SVYQKNAVP-DGPIPISQSAHPPNLGPFKVT 331
Query: 377 LRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
R LADI+Q +D+KELC LLV+VS EA KT
Sbjct: 332 YRPLLADIVQEEDIKELCKLLVVVSGEAAKTY 363
>gi|218189097|gb|EEC71524.1| hypothetical protein OsI_03825 [Oryza sativa Indica Group]
Length = 576
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 161/277 (58%), Positives = 199/277 (71%), Gaps = 3/277 (1%)
Query: 131 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
K ESP+ PRA +GKDGTP K+K CNC++S+CLKLYCECFA+G+YCDGCNC NC NNV
Sbjct: 89 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNV 147
Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
++E AR+EA+ ATLERNP+AFRPKI SSPH ++ E A + L KHNKGCHCKKSGCL
Sbjct: 148 KNETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCL 207
Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSS 309
KKYCECFQANILCSENC+C+DCKNFEGSEER+ALF GDH MQ+AANAA+ GA+G++
Sbjct: 208 KKYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSIHMQQAANAAVNGAIGAA 267
Query: 310 GYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASV 369
+ + TS KRK + +AK+ R Q Q + +P +P++Q +
Sbjct: 268 AFSSPSTSRKRKHIDPSLDHSAKENGAHRTNHVPQV-FYQKNGMPSDGPLPINQPVHPPT 326
Query: 370 LGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATK 406
LG K R LADI+QP D+KELC LLV+VS A K
Sbjct: 327 LGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAK 363
>gi|413952361|gb|AFW85010.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
Length = 412
Score = 320 bits (821), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 163/279 (58%), Positives = 200/279 (71%), Gaps = 8/279 (2%)
Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNN 188
P K +SP+ R +GKD TP K+K CNCRNSRCLKLYCECFA+G +CDGCNC NC NN
Sbjct: 131 PLKPDSPKPWARL-YEGKDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNN 189
Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
E+EVAR+EA+ ATLERNP+AFRPKI SSPH ++ E + + L KHNKGCHCKKSGC
Sbjct: 190 PENEVARREAIEATLERNPDAFRPKIGSSPHTNRN-NEVSSDLPLVGKHNKGCHCKKSGC 248
Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGS 308
LKKYCECFQANILCSENC+C+DCKNFEGSEERR LF GDH +Q+A NAA+ GA+G+
Sbjct: 249 LKKYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAINIQQATNAAVNGAIGT 308
Query: 309 SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANAS 368
+G+ + TS KRK + + K+ + RN Q N V + S+P+SQ+A+
Sbjct: 309 TGFPSPSTSRKRKHIDPSLDHSNKEHAAHRNCHIPQKNAVPDG------SIPISQSAHPP 362
Query: 369 VLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
LG K R LADI+Q +D+KELC LLV+VS EA KT
Sbjct: 363 TLGPFKVTYRPLLADIVQAEDVKELCKLLVVVSGEAAKT 401
>gi|115440083|ref|NP_001044321.1| Os01g0761100 [Oryza sativa Japonica Group]
gi|57900356|dbj|BAD87346.1| tesmin-like [Oryza sativa Japonica Group]
gi|113533852|dbj|BAF06235.1| Os01g0761100 [Oryza sativa Japonica Group]
gi|215768567|dbj|BAH00796.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 619
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 161/278 (57%), Positives = 200/278 (71%), Gaps = 9/278 (3%)
Query: 131 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
K ESP+ PRA +GKDGTP K+K CNC++S+CLKLYCECFA+G+YCDGCNC NC NNV
Sbjct: 137 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNV 195
Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
++E AR+EA+ ATLERNP+AFRPKI SSPH ++ E A + L KHNKGCHCKKSGCL
Sbjct: 196 KNETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCL 255
Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSS 309
KKYCECFQANILCSENC+C+DCKNFEGSEER+ALF GDH MQ+AANAA+ GA+G++
Sbjct: 256 KKYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSIHMQQAANAAVNGAIGAA 315
Query: 310 GYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNV-QNSAVPCSQSVPVSQTANAS 368
+ + S KRK + +AK+ R N+V Q + +P +P++Q +
Sbjct: 316 AFSSPSASRKRKHIDPSLDHSAKENGAHRT------NHVPQKNGMPSDGPLPINQPVHPP 369
Query: 369 VLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATK 406
LG K R LADI+QP D+KELC LLV+VS A K
Sbjct: 370 TLGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAK 407
>gi|222619282|gb|EEE55414.1| hypothetical protein OsJ_03534 [Oryza sativa Japonica Group]
Length = 529
Score = 314 bits (804), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 160/279 (57%), Positives = 198/279 (70%), Gaps = 3/279 (1%)
Query: 131 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
K ESP+ PRA +GKDGTP K+K CNC++S+CLKLYCECFA+G+YCDGCNC NC NNV
Sbjct: 43 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNV 101
Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
++E AR+EA+ ATLERNP+AFRPKI SSPH ++ E A + L KHNKGCHCKKSGCL
Sbjct: 102 KNETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCL 161
Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSS 309
KKYCECFQANILCSENC+C+DCKNFEGSEER+ALF GDH MQ+AANAA+ GA+G++
Sbjct: 162 KKYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSIHMQQAANAAVNGAIGAA 221
Query: 310 GYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASV 369
+ + S KRK + +AK+ R Q Q + +P +P++Q +
Sbjct: 222 AFSSPSASRKRKHIDPSLDHSAKENGAHRTNHVPQV-FYQKNGMPSDGPLPINQPVHPPT 280
Query: 370 LGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
LG K R LADI+QP D+KELC LLV+VS A K
Sbjct: 281 LGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAKAY 319
>gi|293335083|ref|NP_001169566.1| uncharacterized protein LOC100383445 [Zea mays]
gi|224030123|gb|ACN34137.1| unknown [Zea mays]
gi|323433699|gb|ADX60083.1| CPP transcription factor [Zea mays]
Length = 607
Score = 305 bits (781), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 160/280 (57%), Positives = 194/280 (69%), Gaps = 10/280 (3%)
Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNN 188
P K ESP+ R R +GKD TP K+K NCRNS+CLKLYCECFA+G YCDGCNC NC NN
Sbjct: 126 PLKPESPKLRARL-YEGKDSTPTKKKCYNCRNSKCLKLYCECFASGTYCDGCNCTNCFNN 184
Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
++EVAR+EAV A L+RNP+AFRPKI SSPH ++ E + L KHNKGCHCKKSGC
Sbjct: 185 PDNEVARREAVEAILDRNPDAFRPKIGSSPHMHRN-NEVPSDLPLVGKHNKGCHCKKSGC 243
Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGS 308
LKKYCECFQANILCSENC+C+DCKNFEGSEERR LF GDH MQ+ NAA+ GA+G+
Sbjct: 244 LKKYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAINMQQVTNAAVNGAIGA 303
Query: 309 SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANAS 368
+G+ + TS KRK + K+ +N Q N AVP S+P+SQ+A+
Sbjct: 304 TGFPSPSTSRKRKHNDPSLDHFNKEHIAHKNYHIPQKN-----AVP-DGSIPISQSAHPP 357
Query: 369 VLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
LG + LAD +Q +D+KELC LLV+VS EA KT
Sbjct: 358 TLGPF--TYKPLLADTVQAEDIKELCKLLVVVSGEAAKTY 395
>gi|358344677|ref|XP_003636414.1| Lin-54-like protein [Medicago truncatula]
gi|355502349|gb|AES83552.1| Lin-54-like protein [Medicago truncatula]
Length = 646
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 198/470 (42%), Positives = 253/470 (53%), Gaps = 84/470 (17%)
Query: 1 MEQGETISDFAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQP------ 54
ME+ ET D A +K ARQLDF A+ G P S + PQS S P+P
Sbjct: 1 MEKSETTLDLANRKFARQLDFA---AACGG--PSYLTLSPPT-PQSLSKTIPRPPTPSYL 54
Query: 55 ---PKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRM 111
+ QS V P + P + Q S P V H +SP RPQ V
Sbjct: 55 PPPAQMHLDFQSPVRRPWKQSP-PQWQQQRTSQSPPVMPHFQSP---------RPQLVSP 104
Query: 112 VNRVPHPVHKLPLPTLPPGKQE-SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCEC 170
V+R+P+ KLP+ + SPRS+ + KD T KKQK+CNC+NS+CLKLYCEC
Sbjct: 105 VHRLPYSTPKLPVKRFQALNESPSPRSQSQNKAGLKDNTLKKQKRCNCKNSKCLKLYCEC 164
Query: 171 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP------------ 218
+AAGIYCDGCNC NCHNN+ +E AR+EA+G TLE+NPNAFRPKIASSP
Sbjct: 165 YAAGIYCDGCNCQNCHNNLNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVSSSI 224
Query: 219 -------------------------HGA--------------QDAREDAREAQLAAKHNK 239
+GA + +E+ E QL +HNK
Sbjct: 225 QLPHVFIVGVLHQSFELKFGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGRHNK 284
Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAAN 299
GCHCKK GCLKKYCECF AN+LCSENC+C+DCKNFEGS+ R + + + +++A N
Sbjct: 285 GCHCKK-GCLKKYCECFHANVLCSENCKCIDCKNFEGSDVWRIVLQ-EECSLVQIRQATN 342
Query: 300 AAIIGAVGS--SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQ 357
AAI GAVG S GT +T KRK +E G + DQ V Q Q+ + S+
Sbjct: 343 AAINGAVGFGPSISGTHITPKKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPLSLS 402
Query: 358 SVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
+ VS TA + ++ RS LAD+LQ Q++K LCSLLV++S EA KT
Sbjct: 403 ASFVSDTAYKRI---SRPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKT 449
>gi|357520979|ref|XP_003630778.1| Lin-54-like protein, partial [Medicago truncatula]
gi|355524800|gb|AET05254.1| Lin-54-like protein, partial [Medicago truncatula]
Length = 220
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 135/181 (74%), Positives = 152/181 (83%), Gaps = 6/181 (3%)
Query: 115 VPHPVHKLPLPTLPP------GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYC 168
P P H P PTLPP GK ESP+S+ R N + KD TPKKQKQCNC++SRCLKLYC
Sbjct: 38 APLPEHPQPSPTLPPLPATKLGKPESPKSKSRPNFETKDATPKKQKQCNCKHSRCLKLYC 97
Query: 169 ECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 228
ECFA+GIYCDGCNCVNC NNV++E AR+EAV ATLERNPNAFRPKIASSPHG +D +E+
Sbjct: 98 ECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEET 157
Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH 288
E ++ KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGD
Sbjct: 158 GEVKVLVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQ 217
Query: 289 N 289
N
Sbjct: 218 N 218
>gi|79521003|ref|NP_197951.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|395455100|sp|F4JY84.1|TCX7_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 7; Short=AtTCX7
gi|332006105|gb|AED93488.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 459
Score = 284 bits (727), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 157/352 (44%), Positives = 215/352 (61%), Gaps = 27/352 (7%)
Query: 62 QSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHK 121
QS ++ P++ + L PP P V++ +S ++ P PQ P+H+
Sbjct: 13 QSDISIPAEKMDSFQKSLEPPPCSP-VKSSQQSETEVTP-----PQKP--------PLHQ 58
Query: 122 LPLPTLPPGK---QESPRSRPRANTDGKDGTP-KKQKQCNCRNSRCLKLYCECFAAGIYC 177
+ + Q+ SR + K+ TP K+QK CNC+NS+CLKLYCECFA+G YC
Sbjct: 59 FGHNQVQKNRVSNQDPSTSRRHNEVESKENTPNKQQKHCNCKNSKCLKLYCECFASGSYC 118
Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 237
+GCNCVNCHN +E+E +RQ A+ LERNP+AF+PKIA SPHG +D +E+ ++ L KH
Sbjct: 119 NGCNCVNCHNKLENESSRQVAISGILERNPDAFKPKIAGSPHGMKDLQENVQQVLLIGKH 178
Query: 238 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRA 297
+KGCHC+KSGCLKKYCEC+QANILCSENCRC DCKNFEGSEER+AL HG ++Q+
Sbjct: 179 SKGCHCRKSGCLKKYCECYQANILCSENCRCQDCKNFEGSEERKALLHGSQVSDTYIQQM 238
Query: 298 ANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQ 357
NAA+ A+ S Y S KRKS+++ V + Q Q+ N+V+ +
Sbjct: 239 TNAAVNRAIDMSAYLYPPESRKRKSKDISDSVVSS-----YGVQYQRANHVRRNGENSLF 293
Query: 358 SVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
S+P N +V GST SA RS ++ QP ++ELCSLLV S + L+
Sbjct: 294 SLP----NNKAVSGSTTSAYRSSWSNTFQPHHVRELCSLLVSNSVDVANKLS 341
>gi|358344762|ref|XP_003636456.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
gi|355502391|gb|AES83594.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
Length = 649
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 194/473 (41%), Positives = 249/473 (52%), Gaps = 87/473 (18%)
Query: 1 MEQGETISDFAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQP------ 54
ME+ ET D A +K ARQLDF A+ G P S + PQS S P+P
Sbjct: 1 MEKSETTLDLANRKFARQLDFA---AACGG--PSYLTLSPPT-PQSLSKTIPRPPTPSYL 54
Query: 55 ---PKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRM 111
+ QS V P + P + Q S P V H +SP RPQ V
Sbjct: 55 PPPAQMHLDFQSPVRRPWKQSP-PQWQQQRTSQSPPVMPHFQSP---------RPQLVSP 104
Query: 112 VNRVPHPVHKLPLPTLPPGKQE-SPRSRPRANTDGKDGTPKKQKQCNCRNSR---CLKLY 167
V+R+P+ KLP+ + SPRS+ + KD T KKQK+CNC+NS+ Y
Sbjct: 105 VHRLPYSTPKLPVKRFQALNESPSPRSQSQNKAGLKDNTLKKQKRCNCKNSKSDFATARY 164
Query: 168 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP--------- 218
CEC+AAGIYCDGCNC NCHNN+ +E AR+EA+G TLE+NPNAFRPKIASSP
Sbjct: 165 CECYAAGIYCDGCNCQNCHNNLNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVS 224
Query: 219 ----------------------------HGA--------------QDAREDAREAQLAAK 236
+GA + +E+ E QL +
Sbjct: 225 SSIQLPHVFIVGVLHQSFELKFGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGR 284
Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 296
HNKGCHCKK GCLKKYCECF AN+LCSENC+C+DCKNFEGS+ R + + + +++
Sbjct: 285 HNKGCHCKK-GCLKKYCECFHANVLCSENCKCIDCKNFEGSDVWRIVLQ-EECSLVQIRQ 342
Query: 297 AANAAIIGAVGS--SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVP 354
A NAAI GAVG S GT +T KRK +E G + DQ V Q Q+ + S+
Sbjct: 343 ATNAAINGAVGFGPSISGTHITPKKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPL 402
Query: 355 CSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
+ VS TA + ++ RS LAD+LQ Q++K LCSLLV++S EA KT
Sbjct: 403 SLSASFVSDTAYKRI---SRPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKT 452
>gi|414880393|tpg|DAA57524.1| TPA: hypothetical protein ZEAMMB73_933477 [Zea mays]
Length = 356
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 137/234 (58%), Positives = 165/234 (70%), Gaps = 6/234 (2%)
Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNN 188
P K ESP+ R R +GKD TP K+K CNCRNS+CLKLYCECFA+G YCDGCNC NC NN
Sbjct: 126 PLKPESPKLRARL-YEGKDSTPTKKKCCNCRNSKCLKLYCECFASGTYCDGCNCTNCFNN 184
Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
++EVAR+EAV A L+RNP+AFRPKI SSPH ++ E + L KHNKGCHCKKSGC
Sbjct: 185 PDNEVARREAVEAILDRNPDAFRPKIGSSPHMHRN-NEVPSDLPLVGKHNKGCHCKKSGC 243
Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGS 308
LKKYCECFQANILCSENC+C+DCKNFEGSEERR LF GDH MQ+ NAA+ GA+G+
Sbjct: 244 LKKYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAINMQQVTNAAVNGAIGA 303
Query: 309 SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVS 362
+G+ + TS KRK + K+ +N Q + + VPC + VS
Sbjct: 304 TGFPSPSTSRKRKHNDPSLDHFNKEHIAHKNYHIPQ----EQAEVPCYITRSVS 353
>gi|168052317|ref|XP_001778597.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670051|gb|EDQ56627.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 912
Score = 261 bits (666), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 155/291 (53%), Positives = 195/291 (67%), Gaps = 23/291 (7%)
Query: 128 PPGKQESPR---SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVN 184
P K SPR SR R+ DGKDGTPKK KQCNC+NSRCLKLYCECFA+G YCDGCNCVN
Sbjct: 440 PSIKHSSPRRPVSRSRSALDGKDGTPKKCKQCNCKNSRCLKLYCECFASGTYCDGCNCVN 499
Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ-LAAKHNKGCHC 243
C NN EHEV RQEAV ATLERNPNAFRPKIA+SP D +ED + AAKHNKGCHC
Sbjct: 500 CCNNSEHEVVRQEAVEATLERNPNAFRPKIANSPRILHDGKEDVVDRNGSAAKHNKGCHC 559
Query: 244 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHN-GIAFMQRAANAAI 302
KKSGCLKKYCEC+QANILCSENC+C+DC+NF S+ERRALF GD G++F+QR + A
Sbjct: 560 KKSGCLKKYCECYQANILCSENCKCVDCRNFITSDERRALFPGDPALGLSFVQRPSGCA- 618
Query: 303 IGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGN--NVQNSAVPCSQSVP 360
SG+ + + S KRK++++ + ++ Q+ + N V+N A +
Sbjct: 619 ------SGHMSPL-SRKRKTQDVEY--------IMPGLQTNKSNLVKVENHAATTPKMSI 663
Query: 361 VSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSI 411
+ +A K RS LAD+L +++LC LLV+VS++ ++ I
Sbjct: 664 YTASAEHIPEPMVKVVNRSLLADVLHTDVIQDLCKLLVIVSADVQHEVSGI 714
>gi|168001515|ref|XP_001753460.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695339|gb|EDQ81683.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 405
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 162/287 (56%), Positives = 196/287 (68%), Gaps = 15/287 (5%)
Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVE 190
K ESPR R R+ D KDGTPKK KQCNC+NSRCLKLYCECFA+G YC+GCNCVNC NNVE
Sbjct: 4 KHESPRQRSRS-FDAKDGTPKKCKQCNCKNSRCLKLYCECFASGTYCEGCNCVNCCNNVE 62
Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
+E+ RQEAV ATLERNPNAFRPKI SSP G +D+ ED E LA KHNKGCHCKKSGCLK
Sbjct: 63 NEIVRQEAVEATLERNPNAFRPKIFSSP-GIRDSGEDVGEHPLAGKHNKGCHCKKSGCLK 121
Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHN-GIAFMQRAANAAIIGAVGSS 309
KYCECFQANILCSENC+C+DCKNF+GSEERRALF DHN + FMQ + ++ S+
Sbjct: 122 KYCECFQANILCSENCKCVDCKNFDGSEERRALFQADHNMSVNFMQPLGGSGVL----SN 177
Query: 310 GYGTLMTS---NKRKSEELLF-GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTA 365
L S KR++ EL+F G K+Q + + +S + S P SQ
Sbjct: 178 SPAYLSPSPLLKKRRTHELVFSGPGLKEQLTSGGGTAPSLSLALSSTI---ASTPSSQGV 234
Query: 366 NASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSIY 412
+ +V K +S LA ++Q + + ELC LLV+VSSE K + +
Sbjct: 235 SPAV-PPAKPMHKSLLAGVVQIEAIHELCKLLVVVSSETQKEILGML 280
>gi|147817113|emb|CAN75367.1| hypothetical protein VITISV_008552 [Vitis vinifera]
Length = 628
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 125/184 (67%), Positives = 149/184 (80%)
Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
E+A + L KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH
Sbjct: 215 EEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFH 274
Query: 286 GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQG 345
+ N +A++Q+AANAAI GA+GSSGYGT + S KRK +E+ F A +QS RN Q QQ
Sbjct: 275 DNGNYMAYIQQAANAAISGAIGSSGYGTAVASKKRKQQEVSFCTTANNQSNHRNTQFQQD 334
Query: 346 NNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEAT 405
N+++ SAV S S+ V+ T NA+VLGS+K RSPLADILQPQD+K+LCSLLV+VS EA
Sbjct: 335 NHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQPQDVKDLCSLLVVVSQEAG 394
Query: 406 KTLT 409
KTLT
Sbjct: 395 KTLT 398
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
K K C+C+ S CLK YCECF A I C + C C++C N
Sbjct: 225 KHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKN 262
>gi|224059104|ref|XP_002299717.1| predicted protein [Populus trichocarpa]
gi|222846975|gb|EEE84522.1| predicted protein [Populus trichocarpa]
Length = 497
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 144/302 (47%), Positives = 187/302 (61%), Gaps = 6/302 (1%)
Query: 119 VHKLPLPTLPPGKQESPR-SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC 177
+H LP L K ESPR S P+A T +GTPK +QC+C+ S+CLKLYCECFA+G YC
Sbjct: 57 LHVLPRSPLSFKKSESPRISLPQARTKIINGTPKSHRQCHCKQSKCLKLYCECFASGSYC 116
Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 237
D C+C NCHNNVE+E R+EA LERNPNAF+PKI SP D + A++ + AKH
Sbjct: 117 DECSCANCHNNVENEDVRREAAECILERNPNAFKPKITGSPCTPPDDGDAAKDVLVMAKH 176
Query: 238 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQR 296
KGCHCK++GCLKKYCECFQANILCSENC+C+ CKN EG E + +H A++Q+
Sbjct: 177 IKGCHCKRTGCLKKYCECFQANILCSENCKCVSCKNLEGRELGMVVTSENHCKTKAYIQQ 236
Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ-NSAVPC 355
+ V SS + S KR E LF +D + + N V + ++P
Sbjct: 237 ENANSSSAVVSSSHIFS-QESRKRTFRE-LFDSNMRDTEIHEFTKHHAVNPVGVHVSIPT 294
Query: 356 SQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSIYHYI 415
++ V Q A+++ LGS+K RS LAD + P D +ELCSLLV+V SEA K L +
Sbjct: 295 LRADCVCQIASSAALGSSKLTYRSLLADAIHPLDTRELCSLLVVV-SEAAKFLADEHCKA 353
Query: 416 QI 417
QI
Sbjct: 354 QI 355
>gi|357139249|ref|XP_003571196.1| PREDICTED: uncharacterized protein LOC100845699 [Brachypodium
distachyon]
Length = 487
Score = 240 bits (613), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 142/308 (46%), Positives = 181/308 (58%), Gaps = 22/308 (7%)
Query: 115 VPHPVHKLPLPTLPPGKQE----SPRSRPRANT------DGKDGTPKKQKQCNCRNSRCL 164
VP + P + P +Q+ PR P A T + K+ TPKK+K CNC+NSRCL
Sbjct: 19 VPAAAGGMKAPVVAPQQQKPAAPVPRPWPVAFTPMKPAAEVKNVTPKKRKHCNCKNSRCL 78
Query: 165 KLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 224
KLYCECFAA + CDGCNC NC NN E+E R+ A+ ATL RNP AF+PKI + P
Sbjct: 79 KLYCECFAARVSCDGCNCSNCGNNSENEKIRKGAIEATLLRNPLAFQPKIENGPSTVNVR 138
Query: 225 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
++++ L KHNKGCHCKKSGCLKKYCECFQANILCS+NCRC+DCKNFEGSEERRAL
Sbjct: 139 KDNSGAVPLRPKHNKGCHCKKSGCLKKYCECFQANILCSKNCRCMDCKNFEGSEERRALV 198
Query: 285 HGDHNGI-AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQ 343
GD+ +Q+AAN A+ GA+GSSGY K E+ L + Q Q
Sbjct: 199 QGDNTSDRNNIQQAANTALNGAIGSSGYKCSPARRKISQEDSLSEM-----------QFQ 247
Query: 344 QGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSE 403
Q N S + S + S S+K RSPLA+ + D+ +L +V+ +
Sbjct: 248 QVNLSDVSQLTPSCTGFGGHNGCYSQSNSSKMIYRSPLANTIHLSDVNDLVKHVVIACRK 307
Query: 404 ATKTLTSI 411
A + +I
Sbjct: 308 AAEAFPTI 315
>gi|302801582|ref|XP_002982547.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
gi|300149646|gb|EFJ16300.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
Length = 460
Score = 234 bits (596), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 149/332 (44%), Positives = 200/332 (60%), Gaps = 40/332 (12%)
Query: 87 HVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGK 146
+ + + SQA+P Q P R PV +L ++ P RP + K
Sbjct: 42 YSSSRLSRTSQALPSSQVPPAQPGEPGR--KPVRQLNFSSILP--------RPPV-YELK 90
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
+ KK KQCNC+N+RCLKLYCECFA+G YCDGCNC+NC NNVE+E RQEAV TLERN
Sbjct: 91 EVAAKKCKQCNCKNTRCLKLYCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERN 150
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
PNAFRPKIA++ H +D+ +D R KHNKGC+CKKSGCLKKYCECFQANILCS+NC
Sbjct: 151 PNAFRPKIANTQHAIRDSADDMR----LGKHNKGCNCKKSGCLKKYCECFQANILCSDNC 206
Query: 267 RCLDCKNFEGSEERRALFHGD------HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKR 320
+C+DCKN++ S+ERRAL+ D H G +A + + V SS M + KR
Sbjct: 207 KCVDCKNYDSSDERRALYQPDYSQSTYHGGPPPYTTSATGS-LACVSSSP----MPAKKR 261
Query: 321 KSEELLF-GVAAK-DQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALR 378
++ +L+F G + K Q + R Q Q + S +P + S+ S K+
Sbjct: 262 RTSDLVFVGESLKVQQPIYRTLQGIQDAKLSGSMLPPNSSL------------SAKTIQT 309
Query: 379 SPLADILQPQDMKELCSLLVMVSSEATKTLTS 410
S LA ++QP+ + +LC+LLVM S+EA K + +
Sbjct: 310 SLLAGVVQPETVSDLCNLLVMASAEAAKAMIA 341
>gi|302798629|ref|XP_002981074.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
gi|300151128|gb|EFJ17775.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
Length = 461
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 149/332 (44%), Positives = 200/332 (60%), Gaps = 40/332 (12%)
Query: 87 HVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGK 146
+ + + SQA+P Q P R PV +L ++ P RP + K
Sbjct: 42 YSSSRLSRTSQALPSSQVPPAQPGEPGR--KPVRQLNFSSILP--------RPPV-YELK 90
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
+ KK KQCNC+N+RCLKLYCECFA+G YCDGCNC+NC NNVE+E RQEAV TLERN
Sbjct: 91 EVAAKKCKQCNCKNTRCLKLYCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERN 150
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
PNAFRPKIA++ H +D+ +D R KHNKGC+CKKSGCLKKYCECFQANILCS+NC
Sbjct: 151 PNAFRPKIANTQHAIRDSADDMR----LGKHNKGCNCKKSGCLKKYCECFQANILCSDNC 206
Query: 267 RCLDCKNFEGSEERRALFHGD------HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKR 320
+C+DCKN++ S+ERRAL+ D H G +A + + V SS M + KR
Sbjct: 207 KCVDCKNYDSSDERRALYQPDYSQSTYHGGPPPYTTSATGS-LACVSSSP----MPAKKR 261
Query: 321 KSEELLF-GVAAK-DQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALR 378
++ +L+F G + K Q + R Q Q + S +P + S+ S K+
Sbjct: 262 RTSDLVFVGESLKVQQPIYRTLQGIQDAKLSGSMLPPNSSL------------SAKTIQT 309
Query: 379 SPLADILQPQDMKELCSLLVMVSSEATKTLTS 410
S LA ++QP+ + +LC+LLVM S+EA K + +
Sbjct: 310 SLLAGVVQPETVSDLCNLLVMASAEAAKAMIA 341
>gi|297605751|ref|NP_001057547.2| Os06g0332700 [Oryza sativa Japonica Group]
gi|255677011|dbj|BAF19461.2| Os06g0332700 [Oryza sativa Japonica Group]
Length = 448
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 128/291 (43%), Positives = 174/291 (59%), Gaps = 14/291 (4%)
Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCN 181
P+ +PP +S P + K++K CNC+NS+CLKLYCECFAAG+YCDGC+
Sbjct: 7 WPVAFIPPKPVAEIKSTP---------STKRKKHCNCKNSQCLKLYCECFAAGLYCDGCH 57
Query: 182 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 241
C C N V +E ARQEA+ +T +RNP AF+PKI + + ++DA KHNKGC
Sbjct: 58 CKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKDDAGAPASLPKHNKGC 117
Query: 242 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGI-AFMQRAANA 300
HCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R + GD++ +Q AAN
Sbjct: 118 HCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQGDNSSDRNNIQHAANV 177
Query: 301 AIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVP 360
A+ GA+GSSGY KR E+ L+ + S ++ Q Q+ N+V +S + S +
Sbjct: 178 ALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQEANHVDSSEITSSTGL- 235
Query: 361 VSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSI 411
+ ++ + RS LA+ + P D L LV+V +A +
Sbjct: 236 --EGCYSNYQSRSNVVYRSALANTISPTDATGLAKHLVIVCRKAADAFLTT 284
>gi|242095900|ref|XP_002438440.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
gi|241916663|gb|EER89807.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
Length = 509
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 196/321 (61%), Gaps = 10/321 (3%)
Query: 95 PSQAMPQWQARPQHVRMVNRVPHPVHKL--PLPTLPPGKQESPRSRPRANTDGKDGTP-K 151
P +P+ A P+ + + P P ++ +P L P E S + + K GTP K
Sbjct: 10 PPALVPEAVAEPRAAPQLQQQPKPAVQVQPSMPVLRPWPMEITLSM-KPVVEMKSGTPAK 68
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
K+K CNC+NS+CLKLYCECFAAG YCDGCNC C N VE+E RQ+A+ T RNPNAF+
Sbjct: 69 KKKHCNCKNSQCLKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQDAINNTKLRNPNAFQ 128
Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
PKI + P R+DA KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C+DC
Sbjct: 129 PKIENGPIPPS-VRKDAGALPSLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCMDC 187
Query: 272 KNFEGSEERRALFHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVA 330
KN+EGSEE R+ GD++ +Q+AAN A+ GA+GSSGY KR +E+ +
Sbjct: 188 KNYEGSEELRSTIQGDNSCDRNNLQQAANVALNGAIGSSGYRFSPVRRKRPAEDPHYQRL 247
Query: 331 AKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDM 390
+ S+++ Q Q+ NNV S + S + + G +K RSPLA+ + D+
Sbjct: 248 NVEGSMMQT-QFQEANNVDVSQITSSTG---RDGSTGNFQGRSKLVYRSPLANTIHVTDV 303
Query: 391 KELCSLLVMVSSEATKTLTSI 411
+L + LV+V +A + T+I
Sbjct: 304 NDLANHLVIVCRKAAERFTTI 324
>gi|242064794|ref|XP_002453686.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
gi|241933517|gb|EES06662.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
Length = 497
Score = 230 bits (587), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 127/276 (46%), Positives = 173/276 (62%), Gaps = 15/276 (5%)
Query: 135 PRSRPRA------NTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNN 188
PR P A +T+ K TPKK+K CNCRNS+CLK+YCECF YCDGCNC NC N
Sbjct: 43 PRQWPMAFNPLKPSTEVKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNI 102
Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
V +E AR EA+ A +RNP+AF+PKI + P+ ++++ + AKH+KGCHCKKSGC
Sbjct: 103 VGNENARNEAIEAIRQRNPSAFQPKIENGPNTLNVRKDNSGAVPVVAKHHKGCHCKKSGC 162
Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVG 307
LKKYCEC+QAN+ CS+NCRC+ CKN EG+E+ DH + + +Q+AAN A G VG
Sbjct: 163 LKKYCECYQANVFCSKNCRCMHCKNSEGNEDTETSSQRDHASDRSHIQQAANVAFNGTVG 222
Query: 308 SSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQS--QQGNNVQNSAV-PCSQSVPVSQT 364
SSGY + + ++++S E GV K + + Q+ QQGN+ S + PCS
Sbjct: 223 SSGY--ICSLSRKRSHEDALGVRIKSEGSMPETQTQYQQGNHADVSPLAPCSTGFDRHNA 280
Query: 365 ANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 400
AN+ S RSPLA+ + +++ +L LVMV
Sbjct: 281 ANSK---SNNPIYRSPLANTIHLREVNDLVKHLVMV 313
>gi|259489914|ref|NP_001159253.1| uncharacterized protein LOC100304343 [Zea mays]
gi|223943031|gb|ACN25599.1| unknown [Zea mays]
gi|413936375|gb|AFW70926.1| hypothetical protein ZEAMMB73_955245 [Zea mays]
Length = 530
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 120/271 (44%), Positives = 167/271 (61%), Gaps = 7/271 (2%)
Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
+++T+ K TPKK+K CNCRNS+CLK+YCECF YCDGCNC NC N V +E AR EA+
Sbjct: 56 KSSTEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAI 115
Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
A +RNP+AF+PKI + + ++++ L AKH+KGCHCKKSGCLKKYCEC+QAN
Sbjct: 116 EAIRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHHKGCHCKKSGCLKKYCECYQAN 175
Query: 260 ILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSN 318
+ CS+NCRC+ CKN EG+++ DH + +++AAN A G+VGSSGY + + +
Sbjct: 176 VFCSKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQAANIAFNGSVGSSGY--ICSLS 233
Query: 319 KRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV-PCSQSVPVSQTANASVLGSTKSAL 377
+++S E V + + Q QQGN+ S + PCS AN
Sbjct: 234 RKRSHEDTQCVRINSEGSMPETQYQQGNHADVSMLPPCSTGFDGHNAANTKPYNPN---Y 290
Query: 378 RSPLADILQPQDMKELCSLLVMVSSEATKTL 408
RSPLA+ + ++ +L LVMV A T+
Sbjct: 291 RSPLANTIHLSEVNDLVKQLVMVCRMAAATI 321
>gi|413936376|gb|AFW70927.1| hypothetical protein ZEAMMB73_955245, partial [Zea mays]
Length = 485
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 120/271 (44%), Positives = 167/271 (61%), Gaps = 7/271 (2%)
Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
+++T+ K TPKK+K CNCRNS+CLK+YCECF YCDGCNC NC N V +E AR EA+
Sbjct: 56 KSSTEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAI 115
Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
A +RNP+AF+PKI + + ++++ L AKH+KGCHCKKSGCLKKYCEC+QAN
Sbjct: 116 EAIRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHHKGCHCKKSGCLKKYCECYQAN 175
Query: 260 ILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSN 318
+ CS+NCRC+ CKN EG+++ DH + +++AAN A G+VGSSGY + + +
Sbjct: 176 VFCSKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQAANIAFNGSVGSSGY--ICSLS 233
Query: 319 KRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV-PCSQSVPVSQTANASVLGSTKSAL 377
+++S E V + + Q QQGN+ S + PCS AN
Sbjct: 234 RKRSHEDTQCVRINSEGSMPETQYQQGNHADVSMLPPCSTGFDGHNAANTKPYNPN---Y 290
Query: 378 RSPLADILQPQDMKELCSLLVMVSSEATKTL 408
RSPLA+ + ++ +L LVMV A T+
Sbjct: 291 RSPLANTIHLSEVNDLVKQLVMVCRMAAATI 321
>gi|50725423|dbj|BAD32896.1| tesmin-like [Oryza sativa Japonica Group]
gi|50725495|dbj|BAD32965.1| tesmin-like [Oryza sativa Japonica Group]
Length = 473
Score = 221 bits (562), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 128/308 (41%), Positives = 174/308 (56%), Gaps = 32/308 (10%)
Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKL--------------- 166
P+ +PP +S P + K++K CNC+NS+CLKL
Sbjct: 58 WPVAFIPPKPVAEIKSTP---------STKRKKHCNCKNSQCLKLACSSILDLFTGQIDQ 108
Query: 167 ---YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
YCECFAAG+YCDGC+C C N V +E ARQEA+ +T +RNP AF+PKI + +
Sbjct: 109 LYGYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNL 168
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
++DA KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R +
Sbjct: 169 RKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLI 228
Query: 284 FHGDHNGI-AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQS 342
GD++ +Q AAN A+ GA+GSSGY KR E+ L+ + S ++ Q
Sbjct: 229 TQGDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QF 287
Query: 343 QQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSS 402
Q+ N+V +S + S + + ++ + RS LA+ + P D L LV+V
Sbjct: 288 QEANHVDSSEITSSTGL---EGCYSNYQSRSNVVYRSALANTISPTDATGLAKHLVIVCR 344
Query: 403 EATKTLTS 410
+A +
Sbjct: 345 KAADAFLT 352
>gi|293336528|ref|NP_001167886.1| uncharacterized protein LOC100381595 [Zea mays]
gi|223944643|gb|ACN26405.1| unknown [Zea mays]
gi|413944443|gb|AFW77092.1| hypothetical protein ZEAMMB73_278537 [Zea mays]
Length = 499
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 121/252 (48%), Positives = 160/252 (63%), Gaps = 6/252 (2%)
Query: 161 SRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 220
S+CLKLYCECFAAG YCDGCNC C N VE+E RQEA+ T RNPNAF+PKI + P
Sbjct: 80 SQCLKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQEAINNTKLRNPNAFQPKIENGPIP 139
Query: 221 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 280
+ +DA KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C+DCKN+EGSEE
Sbjct: 140 SS-VWKDAGALPSHPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCMDCKNYEGSEEL 198
Query: 281 RALFHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 339
R+ GD++ +Q+AAN A+ GA+GSSGY KR E+ + S+++
Sbjct: 199 RSTTQGDNSCDRNNIQQAANVALNGAIGSSGYRFSPVRRKRPPEDPHCQTLNVEGSMMQT 258
Query: 340 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVM 399
Q ++ NNV S + S + + G++K RSPLA+ + D+ +L + LV+
Sbjct: 259 -QFRETNNVDVSQMTSSTG---RDGSTVNFQGTSKLVYRSPLANTIHITDVNDLANHLVI 314
Query: 400 VSSEATKTLTSI 411
+A + T+I
Sbjct: 315 ACRKAAERFTTI 326
>gi|357141227|ref|XP_003572141.1| PREDICTED: uncharacterized protein LOC100829506 [Brachypodium
distachyon]
Length = 540
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 116/266 (43%), Positives = 157/266 (59%), Gaps = 13/266 (4%)
Query: 141 ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAV 199
+++ +GTP++ K C C+NS+CLKLYC+CFA+G YC D CNC NC N+V HE ARQ+A+
Sbjct: 137 TSSESINGTPRR-KNCKCKNSKCLKLYCDCFASGRYCNDDCNCKNCCNDVSHETARQDAI 195
Query: 200 GATLERNPNAFRPKIASSP-HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
A +ERNP AF PKI + P H AQ+ A E KH KGC CK+S CLKKYCECFQ+
Sbjct: 196 NAVMERNPVAFMPKIGNIPRHAAQNREYRAAEGPRVGKHMKGCQCKRSECLKKYCECFQS 255
Query: 259 NILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSN 318
N+LCSENC+C DCKN+E SE+ + + ++ N A+ G +G S ++
Sbjct: 256 NVLCSENCKCTDCKNYESSEDMKEMRRMTQQHGVYVHHVQNLALKGMIGPSA---VLPRA 312
Query: 319 KRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALR 378
K L +DQ + N SQ P SVP T ++ LG R
Sbjct: 313 AEKFSGLSVASLGRDQPINNNDSSQ-------VLSPLLTSVPTEDTESSVRLGRHGVTYR 365
Query: 379 SPLADILQPQDMKELCSLLVMVSSEA 404
+ LADI+Q +D+ LC +LV+VS +A
Sbjct: 366 TLLADIIQIEDVNVLCKVLVLVSRQA 391
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 39/79 (49%), Gaps = 7/79 (8%)
Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR---- 281
E + E+ K C CK S CLK YC+CF + C+++C C +C N E R
Sbjct: 136 ETSSESINGTPRRKNCKCKNSKCLKLYCDCFASGRYCNDDCNCKNCCNDVSHETARQDAI 195
Query: 282 -ALFHGDHNGIAFMQRAAN 299
A+ + N +AFM + N
Sbjct: 196 NAVM--ERNPVAFMPKIGN 212
>gi|222635505|gb|EEE65637.1| hypothetical protein OsJ_21206 [Oryza sativa Japonica Group]
Length = 437
Score = 204 bits (518), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 112/246 (45%), Positives = 150/246 (60%), Gaps = 5/246 (2%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YCECFAAG+YCDGC+C C N V +E ARQEA+ +T +RNP AF+PKI + + ++
Sbjct: 76 YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 135
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG 286
DA KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R + G
Sbjct: 136 DAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQG 195
Query: 287 DHNGI-AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQG 345
D++ +Q AAN A+ GA+GSSGY KR E+ L+ + S ++ Q Q+
Sbjct: 196 DNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQ-TQFQEA 254
Query: 346 NNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEAT 405
N+V +S + S + + ++ + RS LA+ + P D L LV+V +A
Sbjct: 255 NHVDSSEITSSTGL---EGCYSNYQSRSNVVYRSALANTISPTDATGLAKHLVIVCRKAA 311
Query: 406 KTLTSI 411
+
Sbjct: 312 DAFLTT 317
Score = 45.8 bits (107), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
PK K C+C+ S CLK YCECF A I C C C +C N
Sbjct: 144 PKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKN 182
>gi|307108583|gb|EFN56823.1| hypothetical protein CHLNCDRAFT_144374 [Chlorella variabilis]
Length = 702
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 95/140 (67%), Positives = 107/140 (76%), Gaps = 11/140 (7%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC+NSRCLKLYCECFA+G YCDGCNC+NC+NN EHE RQ AV A LERNPNAFRP
Sbjct: 153 KKHCNCKNSRCLKLYCECFASGRYCDGCNCLNCYNNREHEATRQSAVEAILERNPNAFRP 212
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KIA GA + R A+HNKGC+CKKSGCLKKYCECFQA I+CSE C+CLDCK
Sbjct: 213 KIAP---GADNRRAG------GARHNKGCNCKKSGCLKKYCECFQAGIVCSEICKCLDCK 263
Query: 273 NFEGSEERRALF--HGDHNG 290
N+EGSE R AL G+ G
Sbjct: 264 NYEGSEARDALVSPQGERQG 283
>gi|47848051|dbj|BAD21836.1| putative tesmin/TSO1-like CXC domain-containing protein [Oryza
sativa Japonica Group]
Length = 508
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 113/252 (44%), Positives = 149/252 (59%), Gaps = 18/252 (7%)
Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLER 205
K+ +PKK+K CNC+NS YCECFAA +YCDGC+C C N +E+E R+EA+ L R
Sbjct: 115 KNASPKKRKHCNCKNS-----YCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLR 169
Query: 206 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 265
NP AF+PKI +SP+ ++++ KHNKGCHC+KS CLKKYCECFQANILCS+N
Sbjct: 170 NPLAFQPKIENSPNTVTVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKN 229
Query: 266 CRCLDCKNFEGSEERRALFH----GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRK 321
CRC DCKNFEGSEER+AL D N I Q AAN A+ GA GS GY K+
Sbjct: 230 CRCQDCKNFEGSEERKALVQMKNASDRNHI---QEAANFALNGATGSLGYKNSPVRRKKY 286
Query: 322 SEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPL 381
E L + ++ Q Q ++ S + S + A S++ RSPL
Sbjct: 287 QENSL------GEQILSEAQFQPTDHADVSQLASSCTGFGGDIAINYQSKSSEMIYRSPL 340
Query: 382 ADILQPQDMKEL 393
A+ + ++ +L
Sbjct: 341 ANTIPLIEVNDL 352
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
+ N++ PK K C+CR S CLK YCECF A I C C C +C N
Sbjct: 189 KDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKN 237
>gi|412988366|emb|CCO17702.1| unnamed protein product [Bathycoccus prasinos]
Length = 321
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 99/182 (54%), Positives = 119/182 (65%), Gaps = 23/182 (12%)
Query: 127 LPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNC 185
+PP + P + TD +K CNCRNSRCLKLYCECFA+G YC CNCV C
Sbjct: 2 IPPVEGGLPLNNANIQTD--------RKTCNCRNSRCLKLYCECFASGQYCLQNCNCVEC 53
Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
NNVE+ V RQEAV ATLERNPNAFRPK+ + +G + KHNKGCHCKK
Sbjct: 54 KNNVENSVQRQEAVEATLERNPNAFRPKVLPTLNGGEGDE----------KHNKGCHCKK 103
Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGA 305
SGCLKKYCECFQA+I CS+ CRCLDCKNFEGS+ +A+ FM ++A+++ A
Sbjct: 104 SGCLKKYCECFQASIFCSDMCRCLDCKNFEGSDAHKAIASQQE----FMPAGSDASLLVA 159
Query: 306 VG 307
G
Sbjct: 160 PG 161
>gi|224166209|ref|XP_002338900.1| predicted protein [Populus trichocarpa]
gi|222873866|gb|EEF10997.1| predicted protein [Populus trichocarpa]
Length = 154
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 94/152 (61%), Positives = 114/152 (75%), Gaps = 4/152 (2%)
Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
E+ E + KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFH
Sbjct: 1 EETGEGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFH 60
Query: 286 GDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQ 344
GDH N +A++Q+AANAAI GA+GSSGY +L S KRK +EL FG KD S R QQ
Sbjct: 61 GDHGNNMAYIQQAANAAITGAIGSSGYASLPVSRKRKGQELFFGQTVKDPSFHRLGHFQQ 120
Query: 345 GNNVQNSAVPCS--QSVPVSQTANASVLGSTK 374
++++ A P S S PV++ N + LGS+K
Sbjct: 121 ASHIR-PAAPSSPLPSNPVARAGNTTTLGSSK 151
Score = 45.4 bits (106), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
K K C+C+ S CLK YCECF A I C + C C++C N
Sbjct: 11 KHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKN 48
>gi|303274438|ref|XP_003056539.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
gi|226462623|gb|EEH59915.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
Length = 255
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 84/130 (64%), Positives = 99/130 (76%), Gaps = 6/130 (4%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K++K CNCRNSRCLKLYCECFA+G++CD CNCVNC NN+ R AV +TLERNPNAF
Sbjct: 4 KRKKYCNCRNSRCLKLYCECFASGLHCDSCNCVNCSNNLASAAIRCTAVESTLERNPNAF 63
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
RPKIA + RE A++HN+GCHCKKS CLKKYCECFQANI CS+ CRC++
Sbjct: 64 RPKIAPGVGPLEKRREG------ASRHNRGCHCKKSSCLKKYCECFQANIFCSDICRCME 117
Query: 271 CKNFEGSEER 280
CKNF GS +R
Sbjct: 118 CKNFLGSSQR 127
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
++G + + C+C+ S CLK YCECF A I+C D C C+ C N
Sbjct: 78 REGASRHNRGCHCKKSSCLKKYCECFQANIFCSDICRCMECKN 120
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 27/39 (69%), Gaps = 1/39 (2%)
Query: 235 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+K K C+C+ S CLK YCECF + + C ++C C++C N
Sbjct: 3 SKRKKYCNCRNSRCLKLYCECFASGLHC-DSCNCVNCSN 40
>gi|218190473|gb|EEC72900.1| hypothetical protein OsI_06724 [Oryza sativa Indica Group]
Length = 274
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 94/173 (54%), Positives = 115/173 (66%), Gaps = 8/173 (4%)
Query: 159 RNSRCLK-LYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS 217
R +R +K YCECFAA +YCDGC+C C N +E+E R+EA+ L RNP AF+PKI +S
Sbjct: 79 RATRLVKSWYCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENS 138
Query: 218 PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 277
P+ ++++ KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGS
Sbjct: 139 PNTVTVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGS 198
Query: 278 EERRALFH----GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELL 326
EER+AL D N I Q AAN A+ GA GS GY + K+ E L
Sbjct: 199 EERKALVQMKNASDRNHI---QEAANFALSGATGSLGYKSSPVRRKKYQENSL 248
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
+ N++ PK K C+CR S CLK YCECF A I C C C +C N
Sbjct: 146 KDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKN 194
>gi|91088685|ref|XP_974962.1| PREDICTED: similar to Lin54 protein [Tribolium castaneum]
Length = 695
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 78/163 (47%), Positives = 103/163 (63%), Gaps = 10/163 (6%)
Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
ES P +N + P+K CNC S+CLKLYC+CFA G +C CNC+NC NN+E+E
Sbjct: 464 ESITMTPESNLESNGIRPRKP--CNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENE 521
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
RQ A+ LERNPNAFRPKI + A D+ KH KGC+CK+SGCLK Y
Sbjct: 522 DHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSS--------IRKHTKGCNCKRSGCLKNY 573
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQ 295
CEC++A I CS NC+C+ C+N E S E+++L +G+ +Q
Sbjct: 574 CECYEAKIACSNNCKCMGCRNIEDSMEKKSLRASIEDGLHNLQ 616
>gi|308814306|ref|XP_003084458.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
tauri]
gi|116056343|emb|CAL56726.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
tauri]
Length = 766
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 79/143 (55%), Positives = 95/143 (66%), Gaps = 16/143 (11%)
Query: 138 RPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG--CNCVNCHNNVEHEVAR 195
+ RA DG K CNC+NSRCLKLYCECFAAG C C C NC N VEH R
Sbjct: 5 KDRAQIQRGDG-----KGCNCKNSRCLKLYCECFAAGALCASARCRCANCMNTVEHAGER 59
Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
A+ L+RNPNAFRPKIA+ ++D +A +HN+GCHCK+SGCLKKYCEC
Sbjct: 60 TAAIETVLDRNPNAFRPKIAAVASPSRD---------VALRHNRGCHCKRSGCLKKYCEC 110
Query: 256 FQANILCSENCRCLDCKNFEGSE 278
FQA I C+E CRC+ C+N+EGS+
Sbjct: 111 FQAAIYCAETCRCVSCENYEGSD 133
>gi|270011672|gb|EFA08120.1| hypothetical protein TcasGA2_TC005724 [Tribolium castaneum]
Length = 752
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 78/163 (47%), Positives = 103/163 (63%), Gaps = 10/163 (6%)
Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
ES P +N + P+K CNC S+CLKLYC+CFA G +C CNC+NC NN+E+E
Sbjct: 521 ESITMTPESNLESNGIRPRKP--CNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENE 578
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
RQ A+ LERNPNAFRPKI + A D+ KH KGC+CK+SGCLK Y
Sbjct: 579 DHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSS--------IRKHTKGCNCKRSGCLKNY 630
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQ 295
CEC++A I CS NC+C+ C+N E S E+++L +G+ +Q
Sbjct: 631 CECYEAKIACSNNCKCMGCRNIEDSMEKKSLRASIEDGLHNLQ 673
>gi|145356362|ref|XP_001422401.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582643|gb|ABP00718.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 151
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 82/141 (58%), Positives = 99/141 (70%), Gaps = 2/141 (1%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEV 193
SPRS + +D + ++K+CNC+NS+CLKLYCECFA+G YCD CNC C NN EH
Sbjct: 12 SPRSIAARASTPRD-SASQRKRCNCKNSKCLKLYCECFASGKYCDACNCAGCKNNDEHAH 70
Query: 194 ARQEAVGATLERNPNAFRPKIAS-SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
RQ A+ TLERNPNAFRPKI + + +G D A A+HN+GCHCK+SGCLKKY
Sbjct: 71 ERQSAIEQTLERNPNAFRPKIINQTTNGTLDGAGAGDGATTEARHNRGCHCKRSGCLKKY 130
Query: 253 CECFQANILCSENCRCLDCKN 273
CECFQA I C E CRC +CKN
Sbjct: 131 CECFQAAIYCVERCRCAECKN 151
>gi|428168961|gb|EKX37899.1| hypothetical protein GUITHDRAFT_44838, partial [Guillardia theta
CCMP2712]
Length = 138
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 80/130 (61%), Positives = 100/130 (76%), Gaps = 4/130 (3%)
Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERN 206
GTP K+K CNC+NS+CLKLYCECFA+G+YC+ CNC +CHNNV E RQ AV +TLERN
Sbjct: 9 GTPAKRKPCNCKNSKCLKLYCECFASGLYCEVQCNCHSCHNNVRFEWDRQAAVVSTLERN 68
Query: 207 PNAFRPKIA-SSPHGAQDA-REDAREAQLA-AKHNKGCHCKKSGCLKKYCECFQANILCS 263
P+AFRPKI + G D+ + AR A+H KGCHCKKSGCLKKYCECFQA ++CS
Sbjct: 69 PHAFRPKITQTGQQGIVDSPAKGARWLMFVKARHMKGCHCKKSGCLKKYCECFQAAVMCS 128
Query: 264 ENCRCLDCKN 273
+ C+C++C+N
Sbjct: 129 DTCKCIECRN 138
>gi|222640454|gb|EEE68586.1| hypothetical protein OsJ_27092 [Oryza sativa Japonica Group]
Length = 612
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 130/368 (35%), Positives = 185/368 (50%), Gaps = 39/368 (10%)
Query: 54 PPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQW-QARPQHVRMV 112
PP + QVQ + Q P +L++LP +Q RSP P Q P+ R +
Sbjct: 82 PPAKKLQVQQSSSMLPQDFP--KLEVLPVEKPTKLQVR-RSPQHVSPALKQDSPKMERPL 138
Query: 113 NRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFA 172
P + L + P + ++ KD T ++ C+C+ S+CLKLYCECF
Sbjct: 139 LPRPATFIDVMLSSQKPSSE--------MWSETKDVTLTRKTNCSCKYSKCLKLYCECFE 190
Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
G YC GCNC NC NNV HE RQ+A+ LERNP AF PK+ +S A + A E
Sbjct: 191 KGRYCIGCNCTNCCNNVNHENYRQDAINVALERNPAAFMPKVLNS--TAHNCESKAAEGD 248
Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGDHN 289
+ KH KGC CK++ LKKYCECF+A++ CSENCRC CKNF+ +E+R + H H
Sbjct: 249 IVGKHTKGCKCKRTEYLKKYCECFKASVFCSENCRCTGCKNFKSNEDRISQKNTVHAHH- 307
Query: 290 GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ 349
N A G VG S ++ L + D S+ N + +Q
Sbjct: 308 -------VQNPASSGMVGQS---VIIFHAAENDSSLSLAASVSDHSI--NNNTSHVILLQ 355
Query: 350 NS-------AVPCSQSVPVSQTANASVLGST--KSALRSPLADILQPQDMKELCSLLVMV 400
++ A+ CS + V++ S++ + R LADI+Q +D+ ELC+LL++V
Sbjct: 356 HTFQISPFQALSCSLTSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILV 415
Query: 401 SSEATKTL 408
S +A TL
Sbjct: 416 SRQAGATL 423
>gi|302780747|ref|XP_002972148.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
gi|302791403|ref|XP_002977468.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
gi|300154838|gb|EFJ21472.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
gi|300160447|gb|EFJ27065.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
Length = 116
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 78/116 (67%), Positives = 86/116 (74%), Gaps = 9/116 (7%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD--- 223
YCECFA+G+YCDGCNC+NC NN E+E RQEAV LERNPNAFRPKI S D
Sbjct: 1 YCECFASGVYCDGCNCLNCWNNSENEKVRQEAVELALERNPNAFRPKIGSKSVSVSDFLI 60
Query: 224 -----AREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+E RE Q A+HNKGC CKKSGCLKKYCECFQANILCSENCRC+DC+N
Sbjct: 61 CVFSLIQELVREPQATVARHNKGCQCKKSGCLKKYCECFQANILCSENCRCVDCRN 116
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
+ K C C+ S CLK YCECF A I C + C CV+C N
Sbjct: 77 VARHNKGCQCKKSGCLKKYCECFQANILCSENCRCVDCRN 116
>gi|218201063|gb|EEC83490.1| hypothetical protein OsI_29022 [Oryza sativa Indica Group]
Length = 586
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 126/359 (35%), Positives = 176/359 (49%), Gaps = 47/359 (13%)
Query: 54 PPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQW-QARPQHVRMV 112
PP + QVQ + Q P +L++LP +Q RSP P Q P+ R +
Sbjct: 82 PPAKKLQVQQSSSMLPQDFP--KLEVLPVEKPTKLQVR-RSPQHVSPALKQDSPKMERPL 138
Query: 113 NRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFA 172
P + L + P + ++ KD T ++ C+C+ S+CLKLYCECF
Sbjct: 139 LPRPATFIDVMLSSQKPSSE--------MWSETKDVTLTRKTNCSCKYSKCLKLYCECFE 190
Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
G YC GCNC NC NNV HE ARQ+A+ LERNP AF PK+++S A + A E
Sbjct: 191 KGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVSNS--TAHNCESKAAEGD 248
Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGDHN 289
+ KH KGC CK++ CLKKYCECF+A++ CSENCRC CKN++ + +R + H H
Sbjct: 249 IVGKHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRISQKNTVHAHH- 307
Query: 290 GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ 349
N A G VG S ++F A D S+ + +
Sbjct: 308 -------VQNPASSGMVGQSV--------------IIFHAAENDSSLSLAASALSCSL-- 344
Query: 350 NSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
+ SV + + R LADI+Q +D+ ELC+LL++VS +A TL
Sbjct: 345 ------TSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILVSRQAGATL 397
>gi|157111432|ref|XP_001651563.1| hypothetical protein AaeL_AAEL005893 [Aedes aegypti]
gi|108878391|gb|EAT42616.1| AAEL005893-PA [Aedes aegypti]
Length = 757
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 75/146 (51%), Positives = 97/146 (66%), Gaps = 11/146 (7%)
Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
G ++ R P T D KK + CNC S+CLKLYC+CFA G +C CNC +C+NN+
Sbjct: 467 GSEDETRKGPAVGTIFPDEAYKK-RPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNL 525
Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
++E RQ+A+ ATLERNP+AF+PKI GA A EDA H KGC+CK+SGCL
Sbjct: 526 DNEEERQKAIRATLERNPSAFKPKI-----GAVSADEDALRL-----HTKGCNCKRSGCL 575
Query: 250 KKYCECFQANILCSENCRCLDCKNFE 275
K YCEC++A I CS NC+C+ C+N E
Sbjct: 576 KNYCECYEAKIACSANCKCIGCRNTE 601
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C+C KS CLK YC+CF C NC C DC N +EE R
Sbjct: 492 CNCTKSQCLKLYCDCFANGEFC-YNCNCRDCYNNLDNEEER 531
>gi|328722237|ref|XP_001950656.2| PREDICTED: protein lin-54 homolog [Acyrthosiphon pisum]
Length = 750
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 78/176 (44%), Positives = 107/176 (60%), Gaps = 23/176 (13%)
Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQ 196
++P++ +D K +K CNC S+CLKLYC+CFA G +C CNC +C+NN+EHE R
Sbjct: 525 TKPKSTSD----VNKTKKPCNCTRSQCLKLYCDCFANGEFCYQCNCNSCYNNMEHEEDRA 580
Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
+A+ + L+RNPNAFRPKI S Q +H KGC+CK+SGCLK YCECF
Sbjct: 581 QAIRSVLDRNPNAFRPKIKSFV------------GQAERQHTKGCNCKRSGCLKNYCECF 628
Query: 257 QANILCSENCRCLDCKNFEGSEERRALFHGDHNGIA----FMQRAANAAIIGAVGS 308
+A I CS+NC+C+ C+N E ++ F GDH G R +A + + GS
Sbjct: 629 EARIACSQNCKCIGCRNME--DDSHTSF-GDHPGSLSVDNLKSRTKSANFVTSFGS 681
>gi|427785019|gb|JAA57961.1| Putative dna-dependent [Rhipicephalus pulchellus]
Length = 916
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 83/158 (52%), Positives = 102/158 (64%), Gaps = 17/158 (10%)
Query: 147 DGTP-----KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 201
+GTP + +K CNC S+CLKLYC+CFA G +C CNC NC NN+EHE RQ+A+GA
Sbjct: 652 NGTPCEDNSRPRKPCNCTKSQCLKLYCDCFANGEFCHSCNCNNCFNNLEHEEERQKAIGA 711
Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
LERNPNAFRPKI G + R H KGC+CK+SGCLK YCEC++A IL
Sbjct: 712 CLERNPNAFRPKIGKGKEGDHERR-----------HTKGCNCKRSGCLKNYCECYEAKIL 760
Query: 262 CSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 298
CS C+C+ CKNFE S ER+ L D + Q+AA
Sbjct: 761 CSSMCKCVGCKNFEDSSERKTLMQLADAAEVRVQQQAA 798
>gi|323457208|gb|EGB13074.1| hypothetical protein AURANDRAFT_60657 [Aureococcus anophagefferens]
Length = 1772
Score = 156 bits (395), Expect = 2e-35, Method: Composition-based stats.
Identities = 93/268 (34%), Positives = 124/268 (46%), Gaps = 59/268 (22%)
Query: 54 PPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVN 113
PP++ + +A + P + LLPP S A + A P W + + R
Sbjct: 204 PPRAGTYKERAIARVHELVPATPVALLPPRS-----AGEKETKPAAPGWLS-GLYARAAE 257
Query: 114 RVPHPVHKLPLPTLPPGKQESPRSR-----------------------PRANTDGKDG-- 148
+ + T P + S S+ P A D DG
Sbjct: 258 KEERVLQASATTTPEPPAKRSRTSKTSQGLRGAAAPPARGGRGGGAATPTAEVDDDDGGD 317
Query: 149 --------TPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAV 199
K+ K CNCRNSRCLKLYC+CFAAG +C+ C CV+CHN+ H R +A+
Sbjct: 318 RPTAAKASATKRFKPCNCRNSRCLKLYCDCFAAGRFCEESCKCVDCHNDQAHARDRDDAI 377
Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
ATLE+NP AFR K+ + A H GCHCKK+ CLKKYCECF+A
Sbjct: 378 KATLEKNPKAFRAKVDAE----------------ATTHQNGCHCKKTKCLKKYCECFEAG 421
Query: 260 ILCSENCRCLDCKNFEGSEE---RRALF 284
I C C+C DC+N+ GS + RR L+
Sbjct: 422 ITCGAKCKCADCENYPGSLQLAARRGLY 449
>gi|403340562|gb|EJY69569.1| hypothetical protein OXYTRI_09693 [Oxytricha trifallax]
Length = 596
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 87/164 (53%), Positives = 100/164 (60%), Gaps = 28/164 (17%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC+NS+CLKLYCECFA+G YC CNC C NN+E+E R+E + LERNPNAFRP
Sbjct: 33 KKNCNCKNSKCLKLYCECFASGEYCKNCNCNGCCNNIENESIRKETIAIILERNPNAFRP 92
Query: 213 KIAS----SPHGAQDA----------------------REDAREAQLAAKHNKGCHCKKS 246
KIAS SP Q + A KH KGC CKKS
Sbjct: 93 KIASLPPQSPQITQKVVGVTGGPINNPSQINIGLSTPIMTGVNDGVAAGKHAKGCACKKS 152
Query: 247 GCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNG 290
GCLKKYCECFQA I CS+NC+C DCKNF+GS +RR L D NG
Sbjct: 153 GCLKKYCECFQAGIFCSDNCKCCDCKNFDGSLDRRILL--DSNG 194
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
K K C C+ S CLK YCECF AGI+C D C C +C N
Sbjct: 142 KHAKGCACKKSGCLKKYCECFQAGIFCSDNCKCCDCKN 179
>gi|302851277|ref|XP_002957163.1| TSO1-like protein [Volvox carteri f. nagariensis]
gi|300257570|gb|EFJ41817.1| TSO1-like protein [Volvox carteri f. nagariensis]
Length = 1133
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 72/128 (56%), Positives = 84/128 (65%), Gaps = 3/128 (2%)
Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNP 207
G P Q Q ++ YCECFA+G YC+ CNCV C NN EHE RQ AV A LERNP
Sbjct: 128 GLPLFQAQAGNMTAQS---YCECFASGRYCENCNCVQCFNNREHEATRQSAVEAILERNP 184
Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
NAFRPKI S+ A A +H KGC+CKKS CLKKYCECFQA I CS+NC+
Sbjct: 185 NAFRPKIQSNEQAPAAAAAVVNNAAAPGRHLKGCNCKKSSCLKKYCECFQAGIYCSDNCK 244
Query: 268 CLDCKNFE 275
C++CKN+E
Sbjct: 245 CVECKNYE 252
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/38 (65%), Positives = 26/38 (68%), Gaps = 1/38 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
K CNC+ S CLK YCECF AGIYC D C CV C N E
Sbjct: 216 KGCNCKKSSCLKKYCECFQAGIYCSDNCKCVECKNYEE 253
>gi|281363313|ref|NP_001163143.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
gi|272432466|gb|ACZ94416.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
Length = 952
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 110/200 (55%), Gaps = 36/200 (18%)
Query: 89 QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRP--------- 139
QA R Q +P Q+ P V P LPTLPPG + + ++P
Sbjct: 674 QAKQRIRQQQLPTEQSTPIKVE-----PK------LPTLPPGVKANVPAKPLFEVLKPPA 722
Query: 140 ----RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVAR 195
D G ++K CNC S+CLKLYC+CFA G +C C C +C NN+++EV R
Sbjct: 723 TAAAAGAVDPLGGMTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVER 782
Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
+ A+ + L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK YCEC
Sbjct: 783 ERAIRSCLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCEC 830
Query: 256 FQANILCSENCRCLDCKNFE 275
++A I CS C+C+ C+N E
Sbjct: 831 YEAKIPCSSICKCVGCRNME 850
>gi|449676261|ref|XP_002165629.2| PREDICTED: uncharacterized protein LOC100197918, partial [Hydra
magnipapillata]
Length = 530
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/171 (45%), Positives = 102/171 (59%), Gaps = 13/171 (7%)
Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
+S +S +AN + +K CNC S+CLKLYCECFA G +C+ CNC C NN+ HE
Sbjct: 298 DSNKSASKANVGPITESKILKKPCNCTKSQCLKLYCECFANGEFCNNCNCRICFNNIAHE 357
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
V R +A+ + L+RNP AF PKI R +H KGC+C++SGCLK Y
Sbjct: 358 VERSKAIKSCLDRNPYAFHPKIGKGKIKGDTER----------RHTKGCNCRRSGCLKNY 407
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAANAAI 302
CEC++A ILCS C+C CKNFE S +R+ L D + +Q+ NAAI
Sbjct: 408 CECYEAKILCSTLCKCSGCKNFEESADRKTLMQLADAAEVRVLQQ--NAAI 456
>gi|24653408|ref|NP_610879.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
gi|190359804|sp|A1Z9E2.1|LIN54_DROME RecName: Full=Protein lin-54 homolog; AltName: Full=Myb complex
protein of 120 kDa
gi|7303305|gb|AAF58365.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
gi|372466647|gb|AEX93136.1| FI18123p1 [Drosophila melanogaster]
Length = 950
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 110/200 (55%), Gaps = 36/200 (18%)
Query: 89 QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRP--------- 139
QA R Q +P Q+ P V P LPTLPPG + + ++P
Sbjct: 672 QAKQRIRQQQLPTEQSTPIKVE-----PK------LPTLPPGVKANVPAKPLFEVLKPPA 720
Query: 140 ----RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVAR 195
D G ++K CNC S+CLKLYC+CFA G +C C C +C NN+++EV R
Sbjct: 721 TAAAAGAVDPLGGMTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVER 780
Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
+ A+ + L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK YCEC
Sbjct: 781 ERAIRSCLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCEC 828
Query: 256 FQANILCSENCRCLDCKNFE 275
++A I CS C+C+ C+N E
Sbjct: 829 YEAKIPCSSICKCVGCRNME 848
>gi|159884199|gb|ABX00778.1| RE32583p [Drosophila melanogaster]
Length = 950
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 110/200 (55%), Gaps = 36/200 (18%)
Query: 89 QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRP--------- 139
QA R Q +P Q+ P V P LPTLPPG + + ++P
Sbjct: 672 QAKQRIRQQQLPTEQSTPIKVE-----PK------LPTLPPGVKANVPAKPLFEVLKPPA 720
Query: 140 ----RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVAR 195
D G ++K CNC S+CLKLYC+CFA G +C C C +C NN+++EV R
Sbjct: 721 TAAAAGAVDPLGGMTSRRKHCNCSRSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVER 780
Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
+ A+ + L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK YCEC
Sbjct: 781 ERAIRSCLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCEC 828
Query: 256 FQANILCSENCRCLDCKNFE 275
++A I CS C+C+ C+N E
Sbjct: 829 YEAKIPCSSICKCVGCRNME 848
>gi|320163778|gb|EFW40677.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 491
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/122 (59%), Positives = 91/122 (74%), Gaps = 13/122 (10%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
+K CNC+NSRCLKLYCECFA+G YCD C CV+C NN ++ ARQ+AV ATLERNP AFR
Sbjct: 137 KKTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEARQQAVDATLERNPAAFR 196
Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
PKI S+ A E+ A+H+KGCHC+KS C K+YCEC+QA I C+ +C+C+DC
Sbjct: 197 PKINSA----------ATESW--ARHSKGCHCRKSNCQKRYCECYQAGIQCTAHCKCVDC 244
Query: 272 KN 273
KN
Sbjct: 245 KN 246
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 1/52 (1%)
Query: 138 RPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNN 188
RP+ N+ + + K C+CR S C K YCEC+ AGI C C CV+C N+
Sbjct: 196 RPKINSAATESWARHSKGCHCRKSNCQKRYCECYQAGIQCTAHCKCVDCKNH 247
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 28/43 (65%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
K C+CK S CLK YCECF + C +C+C+ C N E ++E R
Sbjct: 138 KTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEAR 180
>gi|194883323|ref|XP_001975752.1| GG20386 [Drosophila erecta]
gi|190658939|gb|EDV56152.1| GG20386 [Drosophila erecta]
Length = 944
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 80/198 (40%), Positives = 108/198 (54%), Gaps = 34/198 (17%)
Query: 89 QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQ-----------ESPRS 137
QA R Q +P Q+ P V P LPT PPG + P +
Sbjct: 668 QAKQRIRQQQLPTEQSAPTKVE-----PK------LPTQPPGFNPVPTKPLFDILKQPAT 716
Query: 138 RPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQE 197
A D G ++K CNC S+CLKLYC+CFA G +C C C +C NN+++EV R+
Sbjct: 717 GAAATVDALGGMTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNMDYEVERER 776
Query: 198 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
A+ + L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK YCEC++
Sbjct: 777 AIRSCLDRNPSAFQPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYE 824
Query: 258 ANILCSENCRCLDCKNFE 275
A I C+ C+C+ C+N E
Sbjct: 825 AKIPCTSICKCVGCRNME 842
>gi|391347269|ref|XP_003747887.1| PREDICTED: uncharacterized protein LOC100905647 [Metaseiulus
occidentalis]
Length = 609
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 83/192 (43%), Positives = 106/192 (55%), Gaps = 24/192 (12%)
Query: 97 QAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRP-RANTDGKD---GTPKK 152
QA PQ Q PQ++ + P H + + S P +A+ D D
Sbjct: 341 QASPQSQ-HPQYLVVKQTTPQGQHSFAV--------QPQNSEPAQADYDAYDKQLSHTSS 391
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S+CLKLYC+CFA G +C CNCV C+NN+EHE R AV LERNPNAF P
Sbjct: 392 KKPCNCTKSQCLKLYCDCFANGEFCQNCNCVQCYNNLEHEEERSLAVKLCLERNPNAFHP 451
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G ++ R H KGC+CK+SGCLK YCEC++A ILCS CRC+ C
Sbjct: 452 KIGKYKPGDKERR-----------HTKGCNCKRSGCLKNYCECYEARILCSGVCRCVGCH 500
Query: 273 NFEGSEERRALF 284
N E + + +L
Sbjct: 501 NIEENMDAESLL 512
>gi|268552505|ref|XP_002634235.1| C. briggsae CBR-LIN-54 protein [Caenorhabditis briggsae]
Length = 437
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 83/217 (38%), Positives = 114/217 (52%), Gaps = 22/217 (10%)
Query: 114 RVPHPVHKLPLPTLPPGKQESPRSRP------------RANTDGKDGTPKKQKQCNCRNS 161
R P++ P + P G + + R R T K P ++K CNC S
Sbjct: 123 RASRPMYSRPGASRPAGGELTGEQRKAFIEMKEQVNQVRMKTKKKVYAPGQRKPCNCTKS 182
Query: 162 RCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 221
+CLKLYC+CFA G +C CNC +CHNN++++ R +A+ +LERNPNAF+PKI + G
Sbjct: 183 QCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGGT 242
Query: 222 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
D + H KGCHCKKSGCLK YCEC++A + C++ C+C C+N E R
Sbjct: 243 AD---------IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTR 293
Query: 282 ALFHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTS 317
G G + NA A SG G++MT
Sbjct: 294 FKNSGGPGAGANALMSLTNANSSTATPDSGPGSVMTD 330
>gi|195334093|ref|XP_002033719.1| GM21473 [Drosophila sechellia]
gi|194125689|gb|EDW47732.1| GM21473 [Drosophila sechellia]
Length = 949
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 71/165 (43%), Positives = 99/165 (60%), Gaps = 25/165 (15%)
Query: 124 LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLYCEC 170
LPTLPPG + + ++P D G ++K CNC S+CLKLYC+C
Sbjct: 695 LPTLPPGAKANVPAKPLFEVLKPAATPAAAGAVDQLGGMTSRRKHCNCSKSQCLKLYCDC 754
Query: 171 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 230
FA G +C C C +C NN+++EV R+ A+ + L+RNP+AF+PKI + G D R
Sbjct: 755 FANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSG--DMRL---- 808
Query: 231 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
HNKGC+CK+SGCLK YCEC++A I C+ C+C+ C+N E
Sbjct: 809 ------HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 847
>gi|198433102|ref|XP_002124866.1| PREDICTED: similar to lin-54 homolog [Ciona intestinalis]
Length = 605
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 70/133 (52%), Positives = 87/133 (65%), Gaps = 15/133 (11%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K +K CNC S CLKLYCECFA G +CD CNC+NCHNN+E + R +A+ + LERNP AF
Sbjct: 375 KIRKPCNCTKSMCLKLYCECFANGHFCDSCNCINCHNNLEFDTDRSKAIKSCLERNPMAF 434
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
RPKI +DA H KGC+CK+SGCLK YCEC++A I C+ C+C+
Sbjct: 435 RPKIGR----GRDANR---------THQKGCNCKRSGCLKNYCECYEARIPCTSKCKCIG 481
Query: 271 CKNFEGSEERRAL 283
CKN E E+ AL
Sbjct: 482 CKNLE--EDSVAL 492
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 31/54 (57%), Gaps = 1/54 (1%)
Query: 145 GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQE 197
G+D QK CNC+ S CLK YCEC+ A I C C C+ C N E VA +E
Sbjct: 441 GRDANRTHQKGCNCKRSGCLKNYCECYEARIPCTSKCKCIGCKNLEEDSVALKE 494
>gi|341881213|gb|EGT37148.1| CBN-LIN-54 protein [Caenorhabditis brenneri]
Length = 407
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 78/179 (43%), Positives = 104/179 (58%), Gaps = 11/179 (6%)
Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
R T K P ++K CNC S+CLKLYC+CFA G +C CNC +CHNN++++ R +A+
Sbjct: 134 RMKTKKKVYAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAI 193
Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
+LERNPNAF+PKI + G D + H KGCHCKKSGCLK YCEC++A
Sbjct: 194 RQSLERNPNAFKPKIGIA-RGTAD---------IERLHQKGCHCKKSGCLKNYCECYEAK 243
Query: 260 ILCSENCRCLDCKNFEGSEERRALFHG-DHNGIAFMQRAANAAIIGAVGSSGYGTLMTS 317
+ C++ C+C C+N E R G G + NAA A SG G++MT
Sbjct: 244 VPCTDRCKCKGCQNTETYRMTRFKNSGVSVPGTNALISLTNAASSSATPDSGPGSVMTD 302
>gi|308492241|ref|XP_003108311.1| CRE-LIN-54 protein [Caenorhabditis remanei]
gi|308249159|gb|EFO93111.1| CRE-LIN-54 protein [Caenorhabditis remanei]
Length = 433
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 87/252 (34%), Positives = 126/252 (50%), Gaps = 21/252 (8%)
Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
R T K P ++K CNC S+CLKLYC+CFA G +C CNC +CHNN++++ R +A+
Sbjct: 159 RLKTKKKVYAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAI 218
Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
+LERNPNAF+PKI + G D + H KGCHCKKSGCLK YCEC++A
Sbjct: 219 RQSLERNPNAFKPKIGIARGGTAD---------IERLHQKGCHCKKSGCLKNYCECYEAK 269
Query: 260 ILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNK 319
+ C++ C+C C+N E R G + NA+ A SG G++MT
Sbjct: 270 VPCTDRCKCKCCQNTEAYRMTRFKNSGSLPSTNALMSLTNASST-ATPDSGPGSVMTDE- 327
Query: 320 RKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRS 379
A D + PQ + + V + A ++ + AL +
Sbjct: 328 ----------AGDDYDDMMLPQKPKAEMDPRRFPWYYMTDEVVEAATMCMVAQAEEALNA 377
Query: 380 PLADILQPQDMK 391
++ +Q +D K
Sbjct: 378 SFSEKMQTEDDK 389
>gi|17541142|ref|NP_502544.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
gi|75019534|sp|Q95QD7.1|LIN54_CAEEL RecName: Full=Protein lin-54; AltName: Full=Abnormal cell lineage
protein 54
gi|6425180|emb|CAB05229.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
Length = 435
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 109/195 (55%), Gaps = 17/195 (8%)
Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQ 196
S R T K P ++K CNC S+CLKLYC+CFA G +C CNC +CHNN+E++ R
Sbjct: 158 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRS 217
Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
+A+ +LERNPNAF+PKI + G D + H KGCHCKKSGCLK YCEC+
Sbjct: 218 KAIRQSLERNPNAFKPKIGIARGGITD---------IERLHQKGCHCKKSGCLKNYCECY 268
Query: 257 QANILCSENCRCLDCKNFEGSEERRALFHG----DHNGIAFMQRAANAAIIGAVGSSGYG 312
+A + C++ C+C C+N E R G + N + + A++ A SG G
Sbjct: 269 EAKVPCTDRCKCKGCQNTETYRMTRYKNSGGAVSNTNALMSLTNASST----ATPDSGPG 324
Query: 313 TLMTSNKRKSEELLF 327
+++T E +
Sbjct: 325 SVVTDEHGDDYEDML 339
>gi|116831214|gb|ABK28561.1| unknown [Arabidopsis thaliana]
Length = 369
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 109/300 (36%), Positives = 146/300 (48%), Gaps = 66/300 (22%)
Query: 118 PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC 177
PV +LPL PP +E S + TD + T +K K C C+ S+CLKLYC+CFA+G+ C
Sbjct: 34 PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVC 89
Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAK 236
C+CV+CHNN E AR+ A+ L RNPNAF K S Q A D + L+
Sbjct: 90 TDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLS-- 147
Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 296
+GC CK++ CLKKYCECFQAN+LCS+NC+C++CKN SE + HN +R
Sbjct: 148 --RGCKCKRTRCLKKYCECFQANLLCSDNCKCINCKNV--SEAFQPPAFSAHNSPQVYRR 203
Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEEL---------LFGVAAKDQSVIRNPQSQQGNN 347
R+ EL LF + D S+ N
Sbjct: 204 -----------------------RRDRELTEWNSCPAPLFSIP--DNSI---------QN 229
Query: 348 VQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
S + CS +P + S++G T + L D+ +LCSLLV S AT T
Sbjct: 230 ALGSPMSCSPKLPYRK--KRSLMGYTSTLL----------PDLGDLCSLLVAASESATTT 277
>gi|79403647|ref|NP_188237.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75136698|sp|Q700D0.1|TCX8_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 8; Short=AtTCX8
gi|45935045|gb|AAS79557.1| tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|46367484|emb|CAG25868.1| hypothetical protein [Arabidopsis thaliana]
gi|332642257|gb|AEE75778.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 368
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 109/300 (36%), Positives = 146/300 (48%), Gaps = 66/300 (22%)
Query: 118 PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC 177
PV +LPL PP +E S + TD + T +K K C C+ S+CLKLYC+CFA+G+ C
Sbjct: 34 PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVC 89
Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAK 236
C+CV+CHNN E AR+ A+ L RNPNAF K S Q A D + L+
Sbjct: 90 TDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLS-- 147
Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 296
+GC CK++ CLKKYCECFQAN+LCS+NC+C++CKN SE + HN +R
Sbjct: 148 --RGCKCKRTRCLKKYCECFQANLLCSDNCKCINCKNV--SEAFQPPAFSAHNSPQVYRR 203
Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEEL---------LFGVAAKDQSVIRNPQSQQGNN 347
R+ EL LF + D S+ N
Sbjct: 204 -----------------------RRDRELTEWNSCPAPLFSIP--DNSI---------QN 229
Query: 348 VQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
S + CS +P + S++G T + L D+ +LCSLLV S AT T
Sbjct: 230 ALGSPMSCSPKLPYRK--KRSLMGYTSTLL----------PDLGDLCSLLVAASESATTT 277
>gi|17541144|ref|NP_502545.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
gi|3924783|emb|CAB05228.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
Length = 429
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 109/195 (55%), Gaps = 17/195 (8%)
Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQ 196
S R T K P ++K CNC S+CLKLYC+CFA G +C CNC +CHNN+E++ R
Sbjct: 152 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRS 211
Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
+A+ +LERNPNAF+PKI + G D + H KGCHCKKSGCLK YCEC+
Sbjct: 212 KAIRQSLERNPNAFKPKIGIARGGITD---------IERLHQKGCHCKKSGCLKNYCECY 262
Query: 257 QANILCSENCRCLDCKNFEGSEERRALFHG----DHNGIAFMQRAANAAIIGAVGSSGYG 312
+A + C++ C+C C+N E R G + N + + A++ A SG G
Sbjct: 263 EAKVPCTDRCKCKGCQNTETYRMTRYKNSGGAVSNTNALMSLTNASST----ATPDSGPG 318
Query: 313 TLMTSNKRKSEELLF 327
+++T E +
Sbjct: 319 SVVTDEHGDDYEDML 333
>gi|195583082|ref|XP_002081353.1| GD10970 [Drosophila simulans]
gi|194193362|gb|EDX06938.1| GD10970 [Drosophila simulans]
Length = 950
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 71/165 (43%), Positives = 98/165 (59%), Gaps = 25/165 (15%)
Query: 124 LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLYCEC 170
LPTLPPG + ++P D G ++K CNC S+CLKLYC+C
Sbjct: 696 LPTLPPGVKADVPAKPLFEVLKPAATPAAAGAVDPLGGMTSRRKHCNCSKSQCLKLYCDC 755
Query: 171 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 230
FA G +C C C +C NN+++EV R+ A+ + L+RNP+AF+PKI + G D R
Sbjct: 756 FANGEFCQDCTCKDCCNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSG--DMR----- 808
Query: 231 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
HNKGC+CK+SGCLK YCEC++A I C+ C+C+ C+N E
Sbjct: 809 -----LHNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 848
>gi|146182846|ref|XP_001025406.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|146143702|gb|EAS05161.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 546
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 66/120 (55%), Positives = 86/120 (71%), Gaps = 4/120 (3%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
K+CNC+ S+C+KLYCEC+AAG+YC+GC+CV+C NN + E R +A+ TL +NP AF PK
Sbjct: 69 KKCNCKQSQCIKLYCECYAAGVYCNGCHCVSCKNNDQSEGWRNQAIQNTLVKNPTAFLPK 128
Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
I+ A + KHNKGC CK+SGC+KKYCECFQA + CSENC+C DC+N
Sbjct: 129 ISQKQETASKVVDPLN----LGKHNKGCACKRSGCMKKYCECFQAGVPCSENCKCTDCQN 184
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 29/43 (67%), Gaps = 1/43 (2%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
K C+CK+S C+K YCEC+ A + C+ C C+ CKN + SE R
Sbjct: 69 KKCNCKQSQCIKLYCECYAAGVYCN-GCHCVSCKNNDQSEGWR 110
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
K K C C+ S C+K YCECF AG+ C + C C +C N
Sbjct: 147 KHNKGCACKRSGCMKKYCECFQAGVPCSENCKCTDCQN 184
>gi|198458221|ref|XP_001360959.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
gi|198136263|gb|EAL25535.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
Length = 978
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 67/143 (46%), Positives = 92/143 (64%), Gaps = 12/143 (8%)
Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
+SP+ TD G ++K CNC S+CLKLYC+CFA G +C C C +C NN+++E
Sbjct: 745 KSPQLASAPTTDALAGMASRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYE 804
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
V R+ A+ + L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK Y
Sbjct: 805 VERERAIRSCLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNY 852
Query: 253 CECFQANILCSENCRCLDCKNFE 275
CEC++A I CS C+C+ C+N E
Sbjct: 853 CECYEAKIPCSSICKCVGCRNME 875
>gi|9294463|dbj|BAB02682.1| unnamed protein product [Arabidopsis thaliana]
Length = 356
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 109/300 (36%), Positives = 146/300 (48%), Gaps = 66/300 (22%)
Query: 118 PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC 177
PV +LPL PP +E S + TD + T +K K C C+ S+CLKLYC+CFA+G+ C
Sbjct: 34 PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVC 89
Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAK 236
C+CV+CHNN E AR+ A+ L RNPNAF K S Q A D + L+
Sbjct: 90 TDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLS-- 147
Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 296
+GC CK++ CLKKYCECFQAN+LCS+NC+C++CKN SE + HN +R
Sbjct: 148 --RGCKCKRTRCLKKYCECFQANLLCSDNCKCINCKNV--SEAFQPPAFSAHNSPQVYRR 203
Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEEL---------LFGVAAKDQSVIRNPQSQQGNN 347
R+ EL LF + D S+ N
Sbjct: 204 -----------------------RRDRELTEWNSCPAPLFSIP--DNSI---------QN 229
Query: 348 VQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
S + CS +P + S++G T + L D+ +LCSLLV S AT T
Sbjct: 230 ALGSPMSCSPKLPYRK--KRSLMGYTSTLL----------PDLGDLCSLLVAASESATTT 277
>gi|359476108|ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261127 [Vitis vinifera]
Length = 1001
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/196 (42%), Positives = 109/196 (55%), Gaps = 20/196 (10%)
Query: 116 PHPVHKLP----------LPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLK 165
P +H++P + +L Q SP+ R + + DG K+CNC+ S+CLK
Sbjct: 521 PIELHEIPCDKRKFNSEHMDSLEEFNQPSPKKRRKKASSTNDG--DGCKRCNCKKSKCLK 578
Query: 166 LYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-D 223
LYC+CFAAGIYC +GC CV C N E++ E RNP AF PKI +G+ +
Sbjct: 579 LYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAPKIVPPVNGSPIN 638
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ ED R +A+H +GC+CKKS CLKKYCEC+QAN+ CS CRC CKN G +E
Sbjct: 639 SGEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGCKNVYGRKEEYGA 698
Query: 284 FHGDHNGIAFMQRAAN 299
F M R AN
Sbjct: 699 FK------EMMSRRAN 708
>gi|326680100|ref|XP_002666845.2| PREDICTED: protein lin-54 homolog [Danio rerio]
Length = 460
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 71/165 (43%), Positives = 95/165 (57%), Gaps = 11/165 (6%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K ++ CNC S+CLKLYCECFA G +C C C NC NN EH R +AV A L+RNP AF
Sbjct: 262 KSKRPCNCTKSQCLKLYCECFANGEFCSSCKCTNCFNNTEHVFERSQAVKACLDRNPGAF 321
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
RPKI S G + H KGC+CK+SGCLK YCEC++A I+C+ C+C+
Sbjct: 322 RPKIGSRKQG-----------NVKGCHTKGCNCKRSGCLKNYCECYEAKIMCTSTCKCVG 370
Query: 271 CKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLM 315
C+N++ + + D + I F R + I V + G L+
Sbjct: 371 CRNYDEGCDMKKSSEQDRHDIYFPTRCPVSVITRDVVEATCGCLL 415
>gi|158289963|ref|XP_559078.3| AGAP010380-PA [Anopheles gambiae str. PEST]
gi|157018411|gb|EAL41036.3| AGAP010380-PA [Anopheles gambiae str. PEST]
Length = 725
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 96/156 (61%), Gaps = 15/156 (9%)
Query: 133 ESPRSRPRANTDGKDGTPK---KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
++ + PRA T G P K++ CNC S+CLKLYC+CFA G YC CNC +C N
Sbjct: 432 DTSKDSPRALTTTVYGFPDEAYKKRPCNCTKSQCLKLYCDCFANGEYCYNCNCKDCFNTF 491
Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
+H+ RQ+A+ +TLERNPNAF+PKI G+ + +D H KGC+CK+SGCL
Sbjct: 492 DHDNERQKAIRSTLERNPNAFKPKI-----GSIGSTDDGTRL-----HTKGCNCKRSGCL 541
Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
K YCEC++ I CS NC+C+ +F + LFH
Sbjct: 542 KNYCECYEGKIACSSNCKCVGTPSFHLLQSH--LFH 575
>gi|195124896|ref|XP_002006919.1| GI21332 [Drosophila mojavensis]
gi|193911987|gb|EDW10854.1| GI21332 [Drosophila mojavensis]
Length = 879
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 64/124 (51%), Positives = 86/124 (69%), Gaps = 12/124 (9%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
++K CNC S+CLKLYC+CFA G +C C C +C NN+++EV R+EA+ + LERNP+AF+
Sbjct: 663 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKREEAIRSCLERNPSAFK 722
Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
PKI + G D R HNKGC+CK+SGCLK YCEC++A I CS C+C+ C
Sbjct: 723 PKITAPNSG--DIRL----------HNKGCNCKRSGCLKNYCECYEAKIPCSTMCKCVGC 770
Query: 272 KNFE 275
+N E
Sbjct: 771 RNME 774
>gi|218198100|gb|EEC80527.1| hypothetical protein OsI_22807 [Oryza sativa Indica Group]
Length = 426
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 101/290 (34%), Positives = 142/290 (48%), Gaps = 43/290 (14%)
Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCN 181
P+ +PP +S P + K++K CNC+NS+CLKL
Sbjct: 58 WPVAFIPPKPVAEIKSTP---------STKRKKHCNCKNSQCLKL--------------- 93
Query: 182 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 241
QEA+ + KI + + ++DA KHNKGC
Sbjct: 94 --------------QEAITVRSNGIQKPSKLKIENGSNALNLRKDDAGAPASLPKHNKGC 139
Query: 242 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGI-AFMQRAANA 300
HCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R + GD++ +Q AAN
Sbjct: 140 HCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQGDNSSDRNNIQHAANV 199
Query: 301 AIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVP 360
A+ GA+GSSGY KR E+ L+ + S ++ Q Q+ N+V +S + S +
Sbjct: 200 ALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQEANHVDSSEITSSTGL- 257
Query: 361 VSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTS 410
+ ++ + RS LA+ + P D L LV+V +A +
Sbjct: 258 --EGCYSNYQCRSNVVYRSALANTISPTDATGLAKHLVIVCRKAADAFLT 305
>gi|165972329|ref|NP_766302.2| protein lin-54 homolog [Mus musculus]
gi|169234729|ref|NP_001108482.1| protein lin-54 homolog [Mus musculus]
gi|190359887|sp|Q571G4.2|LIN54_MOUSE RecName: Full=Protein lin-54 homolog
gi|74141051|dbj|BAE22096.1| unnamed protein product [Mus musculus]
gi|148688344|gb|EDL20291.1| expressed sequence AI461788, isoform CRA_a [Mus musculus]
Length = 749
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 74/139 (53%), Positives = 95/139 (68%), Gaps = 11/139 (7%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
+ T + +K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RN
Sbjct: 516 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
P AF+PKI G D R H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 267 RCLDCKNFEGSEERRALFH 285
+C+ CKNFE S ER+ L H
Sbjct: 625 KCIGCKNFEESPERKTLMH 643
>gi|195484960|ref|XP_002090893.1| GE12548 [Drosophila yakuba]
gi|194176994|gb|EDW90605.1| GE12548 [Drosophila yakuba]
Length = 950
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 80/203 (39%), Positives = 110/203 (54%), Gaps = 35/203 (17%)
Query: 85 QPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANT- 143
Q QA R Q +P Q P V P LPTLP G + + ++P +
Sbjct: 669 QLQAQAKQRIRQQQLPTEQNAPTKVE-----PK------LPTLPTGLKANAPAKPLFDIL 717
Query: 144 -----------DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
D G ++K CNC S+CLKLYC+CFA G +C C C +C NN+++E
Sbjct: 718 KPPATGAAAAVDALGGMTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYE 777
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
V R+ A+ + L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK Y
Sbjct: 778 VERERAIRSCLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNY 825
Query: 253 CECFQANILCSENCRCLDCKNFE 275
CEC++A I C+ C+C+ C+N E
Sbjct: 826 CECYEAKIPCTSICKCVGCRNME 848
>gi|297834488|ref|XP_002885126.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297330966|gb|EFH61385.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 421
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 76/176 (43%), Positives = 105/176 (59%), Gaps = 18/176 (10%)
Query: 124 LPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCV 183
+P PP +E ++ + +G +K+C C+ S+CLKLYC+CFA+G+ C C+CV
Sbjct: 38 IPPFPPISREHSETKDTTDQEGITW----RKRCRCKQSKCLKLYCDCFASGVLCTDCDCV 93
Query: 184 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCH 242
+CHNN ++ AR AV L RNPNAF K+ SS + Q A D R L+ +GC
Sbjct: 94 DCHNNSDNYDARDAAVVNVLGRNPNAFNEKLFSSINDKQCKAAPDTRPGLLS----RGCK 149
Query: 243 CKKSGCLKKYCECFQANILCSENCRCLDCKN---------FEGSEERRALFHGDHN 289
CK++ CLKKYCECFQAN+LCS+NC+C++CKN F R L+ G N
Sbjct: 150 CKRTKCLKKYCECFQANVLCSDNCKCINCKNVSEAFQPSVFAWGLNSRKLYEGFDN 205
>gi|301094740|ref|XP_002896474.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109449|gb|EEY67501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 972
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 72/153 (47%), Positives = 89/153 (58%), Gaps = 31/153 (20%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K+ CNC+ S+CLKLYCECFA+G YCD CNC++C N E RQ+A+ + LE+NPNAF
Sbjct: 560 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAEDVRQQAIASRLEKNPNAF 619
Query: 211 RPKIASSPH-------GAQDAREDAR---------------------EAQLAAK--HNKG 240
+PKI ++ GA+ D R A +A H G
Sbjct: 620 KPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMATTKMHKHG 679
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
CHCKKS C KKYCECFQA + C ENCRC+DCKN
Sbjct: 680 CHCKKSACQKKYCECFQAGVSCGENCRCIDCKN 712
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 28/41 (68%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C+CKKS CLK YCECF + C E+C CLDC N +E+ R
Sbjct: 564 CNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAEDVR 604
>gi|60359862|dbj|BAD90150.1| mKIAA2037 protein [Mus musculus]
Length = 793
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 74/139 (53%), Positives = 95/139 (68%), Gaps = 11/139 (7%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
+ T + +K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RN
Sbjct: 560 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 619
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
P AF+PKI G D R H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 620 PEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSIC 668
Query: 267 RCLDCKNFEGSEERRALFH 285
+C+ CKNFE S ER+ L H
Sbjct: 669 KCIGCKNFEESPERKTLMH 687
>gi|428168842|gb|EKX37782.1| hypothetical protein GUITHDRAFT_52378, partial [Guillardia theta
CCMP2712]
Length = 118
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 77/124 (62%), Positives = 92/124 (74%), Gaps = 9/124 (7%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K+KQCNC+NS+CLKLYCECFA+G YC+ CNC C NN +E R+ A+ ATLERNP AF
Sbjct: 2 KRKQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAF 61
Query: 211 RPKIASS-PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 269
+PKIASS P G +D+ R H KGCHCKKSGCLKKYCECFQA +LCS C+C
Sbjct: 62 QPKIASSAPPGIEDSPVQGR-------HTKGCHCKKSGCLKKYCECFQAGVLCSAQCKCH 114
Query: 270 DCKN 273
+C+N
Sbjct: 115 ECRN 118
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 3/66 (4%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE-ERRALFHG--DHNGIAFMQ 295
K C+CK S CLK YCECF + C+ +C C C+N E E +R++ + N +AF
Sbjct: 4 KQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAFQP 63
Query: 296 RAANAA 301
+ A++A
Sbjct: 64 KIASSA 69
>gi|390332563|ref|XP_792482.3| PREDICTED: protein lin-54 homolog [Strongylocentrotus purpuratus]
Length = 705
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 78/142 (54%), Positives = 96/142 (67%), Gaps = 13/142 (9%)
Query: 144 DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATL 203
DG P+K CNC S+CLKLYC+CFA G +C CNC NC NN++HE R +AV A L
Sbjct: 475 DGSGSRPRKP--CNCTKSQCLKLYCDCFANGEFCRNCNCNNCLNNLDHEDERTKAVKACL 532
Query: 204 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 263
+RNP+AF PKI HG+Q R +HNKGC+CK+SGCLK YCEC++A ILCS
Sbjct: 533 DRNPHAFHPKIGKG-HGSQTNR----------RHNKGCNCKRSGCLKNYCECYEAKILCS 581
Query: 264 ENCRCLDCKNFEGSEERRALFH 285
C+C+ CKNFE S ER+ L H
Sbjct: 582 NFCKCVGCKNFEESPERKTLMH 603
>gi|194376810|dbj|BAG57551.1| unnamed protein product [Homo sapiens]
Length = 709
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 98/149 (65%), Gaps = 12/149 (8%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
+ +K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+PKI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+
Sbjct: 580 KPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIG 628
Query: 271 CKNFEGSEERRALFH-GDHNGIAFMQRAA 298
CKNFE S ER+ L H D + Q+ A
Sbjct: 629 CKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|410903926|ref|XP_003965444.1| PREDICTED: protein lin-54 homolog [Takifugu rubripes]
Length = 807
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/161 (47%), Positives = 100/161 (62%), Gaps = 12/161 (7%)
Query: 139 PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEA 198
P T + P+ +K CNC S+CLKLYC+CFA G +C+ CNC NC NN+EHE R +A
Sbjct: 566 PLNGTSTTETAPRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKA 625
Query: 199 VGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
+ L+RNP AF+PKI G D R H+KGC+CK+SGCLK YCEC++A
Sbjct: 626 IKTCLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEA 674
Query: 259 NILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 298
I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 675 KIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 715
>gi|325183843|emb|CCA18301.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183968|emb|CCA18426.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 983
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 71/152 (46%), Positives = 89/152 (58%), Gaps = 24/152 (15%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K+ CNC+ S+CLKLYCECFA G YCD CNC++C N +E RQ AV + LE+NPNAF
Sbjct: 583 KKAPCNCKKSKCLKLYCECFANGGYCDENCNCLDCSNTLERIDERQAAVASRLEKNPNAF 642
Query: 211 RPKIASSPHGAQDAREDAREAQLAAK-----------------------HNKGCHCKKSG 247
+PKI S A + +A + + H GCHCKKS
Sbjct: 643 KPKILPSQSSAVKSLYEALSSTKSTSSGVFPMRRSTADGHRTPTGKKRMHKDGCHCKKSA 702
Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
C KKYCECFQA +LC +NCRC+DC+N E +E
Sbjct: 703 CQKKYCECFQAGVLCGDNCRCIDCRNVEKKKE 734
>gi|348671359|gb|EGZ11180.1| hypothetical protein PHYSODRAFT_520155 [Phytophthora sojae]
Length = 999
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 78/186 (41%), Positives = 99/186 (53%), Gaps = 44/186 (23%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K+ CNC+ S+CLKLYCECFA+G YCD CNC++C N E RQ+A+ A LE+NPNAF
Sbjct: 563 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCSNTPATEEVRQQAIAARLEKNPNAF 622
Query: 211 RPKIASSPH-----------GAQDAREDAREAQLAAK---------------------HN 238
+PKI ++P GA + L+ H
Sbjct: 623 KPKIGATPGMTPGSARRLSVGASGGSHASPGTFLSPSGRLNLQQQHQLLSAGLLSTKMHK 682
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAA 298
GCHCKKS C KKYCECFQA + C ENCRC+DCKN +A NG+A A
Sbjct: 683 HGCHCKKSACQKKYCECFQAGVPCGENCRCIDCKN-------QAPCVAHANGVA----TA 731
Query: 299 NAAIIG 304
+A++G
Sbjct: 732 GSAVVG 737
>gi|324510685|gb|ADY44467.1| Protein lin-54 [Ascaris suum]
Length = 448
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 67/134 (50%), Positives = 87/134 (64%), Gaps = 11/134 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C CNC +CHNN+++E R A+ A+LERNPNAF+P
Sbjct: 221 KKPCNCTKSMCLKLYCDCFANGEFCKDCNCKDCHNNLDNEAERSRAIKASLERNPNAFKP 280
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI + G D+ H KGCHCKKS CLK YCEC++A + C+E C+C C+
Sbjct: 281 KIGVASRGRVDSER---------LHQKGCHCKKSNCLKNYCECYEAKVPCTERCKCSSCR 331
Query: 273 NFEGSEERRALFHG 286
N E +R + F G
Sbjct: 332 NTES--DRASKFRG 343
>gi|195400539|ref|XP_002058874.1| GJ19678 [Drosophila virilis]
gi|194156225|gb|EDW71409.1| GJ19678 [Drosophila virilis]
Length = 980
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 63/124 (50%), Positives = 85/124 (68%), Gaps = 12/124 (9%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
++K CNC S+CLKLYC+CFA G +C C C +C NN+++EV R+ A+ + LERNP+AF+
Sbjct: 765 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAIRSCLERNPSAFK 824
Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
PKI + G D R HNKGC+CK+SGCLK YCEC++A I CS C+C+ C
Sbjct: 825 PKITAPNSG--DMRL----------HNKGCNCKRSGCLKNYCECYEAKIPCSAMCKCVGC 872
Query: 272 KNFE 275
+N E
Sbjct: 873 RNME 876
>gi|301089591|ref|XP_002895080.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102431|gb|EEY60483.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 972
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/153 (47%), Positives = 89/153 (58%), Gaps = 31/153 (20%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K+ CNC+ S+CLKLYCECFA+G YCD CNC++C N E RQ+A+ + LE+NPNAF
Sbjct: 560 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAEDVRQQAIASRLEKNPNAF 619
Query: 211 RPKIASSPH-------GAQDAREDAR---------------------EAQLA--AKHNKG 240
+PKI ++ GA+ D R A +A H G
Sbjct: 620 KPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMANTKMHKHG 679
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
CHCKKS C KKYCECFQA + C ENCRC+DCKN
Sbjct: 680 CHCKKSACQKKYCECFQAGVPCGENCRCIDCKN 712
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 28/41 (68%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C+CKKS CLK YCECF + C E+C CLDC N +E+ R
Sbjct: 564 CNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAEDVR 604
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 145 GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
G T + C+C+ S C K YCECF AG+ C + C C++C N
Sbjct: 669 GMANTKMHKHGCHCKKSACQKKYCECFQAGVPCGENCRCIDCKN 712
>gi|34365308|emb|CAE45981.1| hypothetical protein [Homo sapiens]
Length = 749
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|190689471|gb|ACE86510.1| lin-54 homolog (C. elegans) protein [synthetic construct]
Length = 749
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|195431948|ref|XP_002063989.1| GK15963 [Drosophila willistoni]
gi|194160074|gb|EDW74975.1| GK15963 [Drosophila willistoni]
Length = 587
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 110/188 (58%), Gaps = 17/188 (9%)
Query: 89 QAHMRSPSQAMPQWQ-ARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKD 147
Q H++ +A PQ + ++ V VP P +K L P S S P D +
Sbjct: 304 QQHLQQLQKAKPQLKGSQLNQSANVKPVPAPANKPNFEILKP--PSSAASTPAV--DAQL 359
Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNP 207
++K CNC S+CLKLYC+CFA G +C C C +C NN+++EV R++A+ + L+RNP
Sbjct: 360 NLASRRKHCNCSKSQCLKLYCDCFANGEFCKDCTCKDCFNNLDNEVEREKAIRSCLDRNP 419
Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
+AF+PKI + G D R HNKGC+CK+SGCLK YCEC++A I CS C+
Sbjct: 420 SAFKPKITAPASG--DMRL----------HNKGCNCKRSGCLKNYCECYEAKIPCSSMCK 467
Query: 268 CLDCKNFE 275
C+ C+N E
Sbjct: 468 CVGCRNME 475
>gi|149701464|ref|XP_001494323.1| PREDICTED: protein lin-54 homolog isoform 2 [Equus caballus]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|402869363|ref|XP_003898732.1| PREDICTED: protein lin-54 homolog isoform 2 [Papio anubis]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643
>gi|291401518|ref|XP_002717112.1| PREDICTED: lin-54 homolog [Oryctolagus cuniculus]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643
>gi|426344811|ref|XP_004038946.1| PREDICTED: protein lin-54 homolog isoform 1 [Gorilla gorilla
gorilla]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643
>gi|344284817|ref|XP_003414161.1| PREDICTED: protein lin-54 homolog isoform 1 [Loxodonta africana]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/135 (54%), Positives = 93/135 (68%), Gaps = 11/135 (8%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
+ +K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+PKI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+
Sbjct: 580 KPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIG 628
Query: 271 CKNFEGSEERRALFH 285
CKNFE S ER+ L H
Sbjct: 629 CKNFEESPERKTLMH 643
>gi|329664578|ref|NP_001192410.1| protein lin-54 homolog [Bos taurus]
gi|296486396|tpg|DAA28509.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
gi|440898648|gb|ELR50096.1| Protein lin-54-like protein [Bos grunniens mutus]
Length = 748
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 521 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 580
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 581 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 629
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 630 NFEESPERKTLMH 642
>gi|169234719|ref|NP_919258.2| protein lin-54 homolog isoform a [Homo sapiens]
gi|114593921|ref|XP_517185.2| PREDICTED: protein lin-54 homolog isoform 3 [Pan troglodytes]
gi|397524648|ref|XP_003832301.1| PREDICTED: protein lin-54 homolog isoform 1 [Pan paniscus]
gi|313104222|sp|Q6MZP7.3|LIN54_HUMAN RecName: Full=Protein lin-54 homolog; AltName: Full=CXC
domain-containing protein 1
gi|119626322|gb|EAX05917.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
gi|119626324|gb|EAX05919.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
gi|355687283|gb|EHH25867.1| CXC domain-containing protein 1 [Macaca mulatta]
gi|355762609|gb|EHH62025.1| CXC domain-containing protein 1 [Macaca fascicularis]
gi|383415851|gb|AFH31139.1| protein lin-54 homolog isoform a [Macaca mulatta]
gi|410222154|gb|JAA08296.1| lin-54 homolog [Pan troglodytes]
gi|410249630|gb|JAA12782.1| lin-54 homolog [Pan troglodytes]
gi|410296420|gb|JAA26810.1| lin-54 homolog [Pan troglodytes]
gi|410329877|gb|JAA33885.1| lin-54 homolog [Pan troglodytes]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643
>gi|165972337|ref|NP_001094034.1| protein lin-54 homolog [Rattus norvegicus]
gi|190359881|sp|Q641Z1.2|LIN54_RAT RecName: Full=Protein lin-54 homolog
gi|149046795|gb|EDL99569.1| similar to hypothetical protein, isoform CRA_b [Rattus norvegicus]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643
>gi|332233371|ref|XP_003265875.1| PREDICTED: protein lin-54 homolog isoform 1 [Nomascus leucogenys]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643
>gi|426231958|ref|XP_004010003.1| PREDICTED: protein lin-54 homolog isoform 1 [Ovis aries]
Length = 748
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 521 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 580
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 581 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 629
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 630 NFEESPERKTLMH 642
>gi|118404776|ref|NP_001072590.1| protein lin-54 homolog [Xenopus (Silurana) tropicalis]
gi|123905701|sp|Q0IHV2.1|LIN54_XENTR RecName: Full=Protein lin-54 homolog
gi|114107615|gb|AAI22958.1| lin-54 homolog [Xenopus (Silurana) tropicalis]
Length = 741
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 101/153 (66%), Gaps = 14/153 (9%)
Query: 136 RSRPRANTDG---KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
+++PR +G + + +K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE
Sbjct: 494 QNQPRGPLNGIISSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHE 553
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
RQ+A+ A L+RNP AF+PKI G D R H+KGC+CK+SGCLK Y
Sbjct: 554 NERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNY 602
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFH 285
CEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 603 CECYEAKIMCSSICKCIGCKNFEESPERKTLMH 635
>gi|452819860|gb|EME26911.1| transcription factor [Galdieria sulphuraria]
Length = 284
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 67/127 (52%), Positives = 91/127 (71%), Gaps = 11/127 (8%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K C C S+CLK+YC+CF +G+YC C CV C+NN +H+V R+ A+ + LE+NP+AF+P
Sbjct: 13 KGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIKSVLEKNPDAFQP 72
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+A+ +D+ L KHNKGC+C+K+ CLKKYCECFQ+ +LCSE C+C CK
Sbjct: 73 KVAT----FRDS------GGLIVKHNKGCNCRKTACLKKYCECFQSGVLCSELCKCSGCK 122
Query: 273 NFEGSEE 279
NFEGS E
Sbjct: 123 NFEGSLE 129
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
KGC C KS CLK YC+CF + + C +C C+ C N E + R
Sbjct: 13 KGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVER 55
Score = 45.4 bits (106), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
K K CNCR + CLK YCECF +G+ C + C C C N
Sbjct: 86 KHNKGCNCRKTACLKKYCECFQSGVLCSELCKCSGCKN 123
>gi|354499573|ref|XP_003511883.1| PREDICTED: protein lin-54 homolog [Cricetulus griseus]
gi|344242817|gb|EGV98920.1| Protein lin-54-like [Cricetulus griseus]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|190690833|gb|ACE87191.1| lin-54 homolog (C. elegans) protein [synthetic construct]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 96/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQRAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|403263377|ref|XP_003924012.1| PREDICTED: protein lin-54 homolog isoform 1 [Saimiri boliviensis
boliviensis]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 630
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|296082007|emb|CBI21012.3| unnamed protein product [Vitis vinifera]
Length = 1094
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/161 (47%), Positives = 97/161 (60%), Gaps = 12/161 (7%)
Query: 141 ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAV 199
++T+ DG K+CNC+ S+CLKLYC+CFAAGIYC +GC CV C N E++ E
Sbjct: 565 SSTNDGDGC----KRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETR 620
Query: 200 GATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
RNP AF PKI +G+ ++ ED R +A+H +GC+CKKS CLKKYCEC+QA
Sbjct: 621 KQIESRNPLAFAPKIVPPVNGSPINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQA 680
Query: 259 NILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAAN 299
N+ CS CRC CKN G +E F M R AN
Sbjct: 681 NVGCSAGCRCEGCKNVYGRKEEYGAFK------EMMSRRAN 715
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 44/106 (41%), Gaps = 21/106 (19%)
Query: 196 QEAVGATLERNPN----AFRPKIASSPHGAQDAREDAREAQL------------AAKHNK 239
+ + G RN N A + S HG Q + +A L A+ N
Sbjct: 510 ENSAGTEDRRNENKASIAMSSATSQSSHGIQPTKPKKEKASLSFHVISHNFDFSASSTND 569
Query: 240 GCHCKKSGC-----LKKYCECFQANILCSENCRCLDCKNFEGSEER 280
G CK+ C LK YC+CF A I C+E C C+ C N ++R
Sbjct: 570 GDGCKRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDR 615
>gi|301753331|ref|XP_002912513.1| PREDICTED: protein lin-54 homolog [Ailuropoda melanoleuca]
gi|281346804|gb|EFB22388.1| hypothetical protein PANDA_000261 [Ailuropoda melanoleuca]
Length = 749
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 630
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|359323632|ref|XP_003640147.1| PREDICTED: protein lin-54 homolog isoform 2 [Canis lupus
familiaris]
Length = 749
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 630
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643
>gi|410957317|ref|XP_003985276.1| PREDICTED: protein lin-54 homolog isoform 1 [Felis catus]
Length = 749
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 630
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643
>gi|431916156|gb|ELK16408.1| Protein lin-54 like protein [Pteropus alecto]
Length = 827
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 600 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 659
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 660 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 708
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 709 NFEESPERKTLMH 721
>gi|297292592|ref|XP_001083154.2| PREDICTED: protein lin-54 homolog, partial [Macaca mulatta]
Length = 813
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/135 (54%), Positives = 93/135 (68%), Gaps = 11/135 (8%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
+ +K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF
Sbjct: 584 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 643
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+PKI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+
Sbjct: 644 KPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIG 692
Query: 271 CKNFEGSEERRALFH 285
CKNFE S ER+ L H
Sbjct: 693 CKNFEESPERKTLMH 707
>gi|145539662|ref|XP_001455521.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423329|emb|CAK88124.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 106/298 (35%), Positives = 147/298 (49%), Gaps = 50/298 (16%)
Query: 7 ISDFAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKS------QPQ 60
I D +P+++ ++L+FT C S N + S+ S +S+ Q +SPL Q K P
Sbjct: 41 IKDQSPEQVCKKLEFTPCLKSPN----QYSIFSINSRFQPYSPLNHQTTKKANIFVISPA 96
Query: 61 VQSKVASPSQSQPLVRLQ-----LLPP-----------SSQPHVQAHMRSPSQAMPQWQA 104
Q Q LV + L P S + + RSP ++ +
Sbjct: 97 RQINFDHTKTEQRLVNISSPLSSLKVPDDLLCEEEEILSDDQDIHSQFRSPPKSPHRKLE 156
Query: 105 RPQHVRMVNRVPHPVHKLPLPTLP-PGKQESPRSRPRANTDGKDGTPKKQKQ-----CNC 158
R ++ +N V TL P K S + R T K P +++ CNC
Sbjct: 157 RATKLKFINFV----------TLKKPLKFTSDQVNNRKRTSSKRRKPLTEQKPKIIVCNC 206
Query: 159 RNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS 217
+ S+CLKLYC+CFAAG+ C CNC +CHNN +H R+ + +ERNP AFRPK+ S
Sbjct: 207 KKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNDDHTKERENIIQQIMERNPQAFRPKVDSR 266
Query: 218 PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
+ ED E +H KGC+CKKS CLKKYCEC+Q + CSE C+C DCKN E
Sbjct: 267 SNS-----ED--EIDHKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDDCKNCE 317
Score = 45.1 bits (105), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C+CKKS CLK YC+CF A + C ++C C C N + + R
Sbjct: 204 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNDDHTKER 244
>gi|410038272|ref|XP_003950370.1| PREDICTED: protein lin-54 homolog [Pan troglodytes]
gi|426344815|ref|XP_004038948.1| PREDICTED: protein lin-54 homolog isoform 3 [Gorilla gorilla
gorilla]
gi|80475938|gb|AAI09279.1| LIN54 protein [Homo sapiens]
Length = 660
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 541
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554
>gi|441625230|ref|XP_004089055.1| PREDICTED: protein lin-54 homolog [Nomascus leucogenys]
Length = 660
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 541
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554
>gi|311262890|ref|XP_003129406.1| PREDICTED: protein lin-54 homolog [Sus scrofa]
Length = 763
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 536 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 595
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 596 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 644
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 645 NFEESPERKTLMH 657
>gi|402869365|ref|XP_003898733.1| PREDICTED: protein lin-54 homolog isoform 3 [Papio anubis]
Length = 660
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 541
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554
>gi|149701467|ref|XP_001494305.1| PREDICTED: protein lin-54 homolog isoform 1 [Equus caballus]
Length = 660
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 541
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554
>gi|426231960|ref|XP_004010004.1| PREDICTED: protein lin-54 homolog isoform 2 [Ovis aries]
Length = 659
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 432 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 491
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 492 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 540
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 541 NFEESPERKTLMH 553
>gi|403263381|ref|XP_003924014.1| PREDICTED: protein lin-54 homolog isoform 3 [Saimiri boliviensis
boliviensis]
Length = 660
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 541
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554
>gi|345795675|ref|XP_003434061.1| PREDICTED: protein lin-54 homolog isoform 1 [Canis lupus
familiaris]
Length = 660
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 541
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 542 NFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|145479557|ref|XP_001425801.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392873|emb|CAK58403.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 150/298 (50%), Gaps = 36/298 (12%)
Query: 7 ISDFAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQPQVQSKVA 66
I D +P+++ ++L+FT C S N P + Q+++ + + P + +++
Sbjct: 41 IKDQSPEQVCKKLEFTPCLKSPNQFQPYSPLNRQTTKKANIFVISPARQINFDHSKTEQR 100
Query: 67 SPSQSQPLVRLQLLPP---------SSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPH 117
+ S PL L++ S + + RSP ++ + R ++ +N V
Sbjct: 101 LVNISSPLSSLKVPDDLLCDEEEIFSDDQEIHSQFRSPPKSPHRKLERATKLKFINFV-- 158
Query: 118 PVHKLPLPTLP-PGKQESPRSRPRANTDGKDGTPKKQKQ-----CNCRNSRCLKLYCECF 171
TL P K S + R T K P+ +++ CNC+ S+CLKLYC+CF
Sbjct: 159 --------TLKKPLKFTSDQVNNRKRTSSKRRKPQTEQKPKIIVCNCKKSKCLKLYCDCF 210
Query: 172 AAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 230
AAG+ C CNC +CHNN +H R+ + +ERNP AFRPK+ S + + + R
Sbjct: 211 AAGVPCGKDCNCCSCHNNEDHVKERENIIKQIMERNPQAFRPKVESRSNSEDEIDQKPR- 269
Query: 231 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE---GSEERRALFH 285
H KGC+CKKS CLKKYCEC+Q + CSE C+C +CKN E E R+ + H
Sbjct: 270 ------HFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDECKNCEMPVKKESRKRVKH 321
>gi|117306452|gb|AAI25581.1| Lin54 protein [Mus musculus]
Length = 515
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 74/139 (53%), Positives = 95/139 (68%), Gaps = 11/139 (7%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
+ T + +K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RN
Sbjct: 282 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 341
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
P AF+PKI G D R H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 342 PEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSIC 390
Query: 267 RCLDCKNFEGSEERRALFH 285
+C+ CKNFE S ER+ L H
Sbjct: 391 KCIGCKNFEESPERKTLMH 409
>gi|194757683|ref|XP_001961092.1| GF13699 [Drosophila ananassae]
gi|190622390|gb|EDV37914.1| GF13699 [Drosophila ananassae]
Length = 921
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 61/124 (49%), Positives = 85/124 (68%), Gaps = 12/124 (9%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
++K CNC S+CLKLYC+CFA G +C C C +C NN+++EV R+ A+ + L+RNP+AF+
Sbjct: 706 RRKHCNCSKSQCLKLYCDCFANGEFCQNCTCKDCFNNLDYEVERERAIRSCLDRNPSAFK 765
Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
PKI + G D R HNKGC+CK+SGCLK YCEC++A I C+ C+C+ C
Sbjct: 766 PKITAPNSG--DMRL----------HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGC 813
Query: 272 KNFE 275
+N E
Sbjct: 814 RNME 817
>gi|31873949|emb|CAD97902.1| hypothetical protein [Homo sapiens]
Length = 621
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 73/135 (54%), Positives = 93/135 (68%), Gaps = 11/135 (8%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
+ +K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF
Sbjct: 392 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 451
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+PKI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+
Sbjct: 452 KPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIG 500
Query: 271 CKNFEGSEERRALFH 285
CKNFE S ER+ L H
Sbjct: 501 CKNFEESPERKTLMH 515
>gi|410957319|ref|XP_003985277.1| PREDICTED: protein lin-54 homolog isoform 2 [Felis catus]
Length = 660
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 541
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554
>gi|363733235|ref|XP_003641221.1| PREDICTED: protein lin-54 homolog isoform 1 [Gallus gallus]
Length = 804
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN++HE RQ+A+ A L+RNP AF+P
Sbjct: 577 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 636
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 637 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 685
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 686 NFEESPERKTLMH 698
>gi|380799221|gb|AFE71486.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
Length = 560
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 333 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 392
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 393 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 441
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 442 NFEESPERKTLMHLADAAEVRVQQQTA 468
>gi|327272952|ref|XP_003221248.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Anolis
carolinensis]
Length = 749
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN++HE RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCANCYNNLDHENDRQKAIKACLDRNPEAFKP 581
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643
>gi|432885357|ref|XP_004074681.1| PREDICTED: protein lin-54 homolog [Oryzias latipes]
Length = 821
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 72/139 (51%), Positives = 93/139 (66%), Gaps = 11/139 (7%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
+ T + +K CNC S+CLKLYC+CFA G +C+ CNC NC NN+EHE R +A+ L+RN
Sbjct: 588 ETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNMCNCNNCFNNLEHETERLKAIKTCLDRN 647
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
P AF+PKI G D R H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 648 PEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSIC 696
Query: 267 RCLDCKNFEGSEERRALFH 285
+C+ CKNFE S ER+ L H
Sbjct: 697 KCIGCKNFEESPERKTLMH 715
>gi|224049388|ref|XP_002189980.1| PREDICTED: protein lin-54 homolog [Taeniopygia guttata]
Length = 751
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN++HE RQ+A+ A L+RNP AF+P
Sbjct: 524 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 583
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 584 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 632
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 633 NFEESPERKTLMH 645
>gi|326918692|ref|XP_003205622.1| PREDICTED: protein lin-54 homolog isoform 1 [Meleagris gallopavo]
Length = 756
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN++HE RQ+A+ A L+RNP AF+P
Sbjct: 529 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 588
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 589 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 637
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 638 NFEESPERKTLMH 650
>gi|126330910|ref|XP_001376980.1| PREDICTED: protein lin-54 homolog [Monodelphis domestica]
Length = 756
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 91/133 (68%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 529 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAFKP 588
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 589 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 637
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 638 NFEESPERKTLMH 650
>gi|118090184|ref|XP_420554.2| PREDICTED: protein lin-54 homolog isoform 2 [Gallus gallus]
Length = 752
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN++HE RQ+A+ A L+RNP AF+P
Sbjct: 525 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 584
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 585 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 633
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 634 NFEESPERKTLMH 646
>gi|169234721|ref|NP_001108479.1| protein lin-54 homolog isoform b [Homo sapiens]
gi|169234723|ref|NP_001108480.1| protein lin-54 homolog isoform b [Homo sapiens]
gi|197098666|ref|NP_001125469.1| protein lin-54 homolog [Pongo abelii]
gi|332233373|ref|XP_003265876.1| PREDICTED: protein lin-54 homolog isoform 2 [Nomascus leucogenys]
gi|332233375|ref|XP_003265877.1| PREDICTED: protein lin-54 homolog isoform 3 [Nomascus leucogenys]
gi|332233377|ref|XP_003265878.1| PREDICTED: protein lin-54 homolog isoform 4 [Nomascus leucogenys]
gi|397524650|ref|XP_003832302.1| PREDICTED: protein lin-54 homolog isoform 2 [Pan paniscus]
gi|402869361|ref|XP_003898731.1| PREDICTED: protein lin-54 homolog isoform 1 [Papio anubis]
gi|75042058|sp|Q5RBN8.1|LIN54_PONAB RecName: Full=Protein lin-54 homolog
gi|34364698|emb|CAE45799.1| hypothetical protein [Homo sapiens]
gi|55728144|emb|CAH90822.1| hypothetical protein [Pongo abelii]
gi|158258975|dbj|BAF85458.1| unnamed protein product [Homo sapiens]
gi|190689473|gb|ACE86511.1| lin-54 homolog (C. elegans) protein [synthetic construct]
gi|190690835|gb|ACE87192.1| lin-54 homolog (C. elegans) protein [synthetic construct]
gi|410249632|gb|JAA12783.1| lin-54 homolog [Pan troglodytes]
gi|410329875|gb|JAA33884.1| lin-54 homolog [Pan troglodytes]
Length = 528
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 409
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|449276576|gb|EMC85038.1| Protein lin-54 like protein, partial [Columba livia]
Length = 757
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN++HE RQ+A+ A L+RNP AF+P
Sbjct: 530 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 589
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 590 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 638
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 639 NFEESPERKTLMH 651
>gi|426344813|ref|XP_004038947.1| PREDICTED: protein lin-54 homolog isoform 2 [Gorilla gorilla
gorilla]
Length = 528
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 409
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|348535421|ref|XP_003455199.1| PREDICTED: protein lin-54 homolog [Oreochromis niloticus]
Length = 775
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 94/147 (63%), Gaps = 11/147 (7%)
Query: 139 PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEA 198
P + T + +K CNC S+CLKLYC+CFA G +C+ CNC NC NN+EHE R +A
Sbjct: 534 PLNGLSAAETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKA 593
Query: 199 VGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
+ L+RNP AF+PKI G D R H+KGC+CK+SGCLK YCEC++A
Sbjct: 594 IKTCLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEA 642
Query: 259 NILCSENCRCLDCKNFEGSEERRALFH 285
I+CS C+C+ CKNFE S ER+ L H
Sbjct: 643 KIMCSSICKCIGCKNFEESPERKTLMH 669
>gi|426231962|ref|XP_004010005.1| PREDICTED: protein lin-54 homolog isoform 3 [Ovis aries]
Length = 528
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 409
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|417402343|gb|JAA48022.1| Putative protein lin-54 [Desmodus rotundus]
Length = 527
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 300 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 359
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 360 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 408
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 409 NFEESPERKTLMH 421
>gi|338723424|ref|XP_003364724.1| PREDICTED: protein lin-54 homolog [Equus caballus]
Length = 528
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 409
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|403263379|ref|XP_003924013.1| PREDICTED: protein lin-54 homolog isoform 2 [Saimiri boliviensis
boliviensis]
Length = 528
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 409
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|380799223|gb|AFE71487.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
Length = 521
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 294 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 353
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 354 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 402
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 403 NFEESPERKTLMH 415
>gi|344284819|ref|XP_003414162.1| PREDICTED: protein lin-54 homolog isoform 2 [Loxodonta africana]
Length = 528
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 98/149 (65%), Gaps = 12/149 (8%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
+ +K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+PKI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+
Sbjct: 359 KPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIG 407
Query: 271 CKNFEGSEERRALFH-GDHNGIAFMQRAA 298
CKNFE S ER+ L H D + Q+ A
Sbjct: 408 CKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|359323634|ref|XP_003640148.1| PREDICTED: protein lin-54 homolog isoform 3 [Canis lupus
familiaris]
gi|410957321|ref|XP_003985278.1| PREDICTED: protein lin-54 homolog isoform 3 [Felis catus]
Length = 528
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 409
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|340376737|ref|XP_003386888.1| PREDICTED: protein lin-54 homolog [Amphimedon queenslandica]
Length = 408
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/161 (47%), Positives = 99/161 (61%), Gaps = 19/161 (11%)
Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN 184
T+PP + +S+P++ + P+K CNC+NS+CLKLYC+CFA G +C D CNC N
Sbjct: 155 TIPPPSHQ--QSKPKSGANLASNKPRKP--CNCKNSQCLKLYCDCFANGEFCRDSCNCQN 210
Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG-AQDAREDAREAQLAAKHNKGCHC 243
C N+ +HE R AV A LERNPNAF+PK+ HG D R +H KGC+C
Sbjct: 211 CKNSFQHEGDRSRAVKACLERNPNAFKPKVG---HGRVGDER----------RHIKGCNC 257
Query: 244 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
KKS CLK YCEC++A I CS CRC+ C+N E + L
Sbjct: 258 KKSSCLKNYCECYEAKIPCSHLCRCVGCQNLADRPEGKGLM 298
>gi|115497062|ref|NP_001070035.1| protein lin-54 homolog [Danio rerio]
gi|123905351|sp|Q08CM4.1|LIN54_DANRE RecName: Full=Protein lin-54 homolog
gi|115313049|gb|AAI24176.1| Zgc:152921 [Danio rerio]
Length = 771
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 98/149 (65%), Gaps = 12/149 (8%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
+ +K CNC S+CLKLYC+CFA G +C+ CNCVNC NN++HE R +A+ A L+RNP AF
Sbjct: 543 RPRKPCNCTRSQCLKLYCDCFANGEFCNNCNCVNCFNNLDHESERLKAIKACLDRNPVAF 602
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+PKI G D R H+KGC+CKKSGCLK YCEC++A I+CS C+C+
Sbjct: 603 KPKIGKGKEGESDRR-----------HSKGCNCKKSGCLKNYCECYEAKIMCSSICKCMG 651
Query: 271 CKNFEGSEERRALFH-GDHNGIAFMQRAA 298
CKNFE S ER+ L H D + Q+ A
Sbjct: 652 CKNFEESPERKTLMHLADAAEVRVQQQTA 680
>gi|326918694|ref|XP_003205623.1| PREDICTED: protein lin-54 homolog isoform 2 [Meleagris gallopavo]
Length = 667
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN++HE RQ+A+ A L+RNP AF+P
Sbjct: 440 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 499
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 500 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 548
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 549 NFEESPERKTLMH 561
>gi|37360936|dbj|BAC98377.1| KIAA2037 protein [Homo sapiens]
Length = 494
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 267 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 326
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 327 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 375
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 376 NFEESPERKTLMH 388
>gi|291243053|ref|XP_002741422.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
[Saccoglossus kowalevskii]
Length = 946
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/153 (50%), Positives = 95/153 (62%), Gaps = 24/153 (15%)
Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
E+P +RPR K CNC S+CLKLYC+CFA G +C CNC NC NN+EHE
Sbjct: 495 EAPGTRPR-------------KPCNCTKSQCLKLYCDCFANGEFCSNCNCTNCFNNLEHE 541
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
R +A+ + LERNP AF PKI G D R HNKGC+CK+SGCLK Y
Sbjct: 542 SERAKAIKSCLERNPLAFHPKIGKGKEGQADRR-----------HNKGCNCKRSGCLKNY 590
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFH 285
CEC++A I+CS C+C+ CKNFE S +R+ L H
Sbjct: 591 CECYEAKIMCSSICKCVGCKNFEESPDRKTLMH 623
>gi|26325238|dbj|BAC26373.1| unnamed protein product [Mus musculus]
gi|148688345|gb|EDL20292.1| expressed sequence AI461788, isoform CRA_b [Mus musculus]
gi|219520492|gb|AAI45026.1| Lin54 protein [Mus musculus]
Length = 419
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/139 (53%), Positives = 95/139 (68%), Gaps = 11/139 (7%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
+ T + +K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RN
Sbjct: 186 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 245
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
P AF+PKI G D R H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 246 PEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSIC 294
Query: 267 RCLDCKNFEGSEERRALFH 285
+C+ CKNFE S ER+ L H
Sbjct: 295 KCIGCKNFEESPERKTLMH 313
>gi|51980517|gb|AAH82043.1| Lin54 protein [Rattus norvegicus]
Length = 495
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 268 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 327
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 328 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 376
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 377 NFEESPERKTLMH 389
>gi|449455360|ref|XP_004145421.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
sativus]
gi|449494974|ref|XP_004159699.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
sativus]
Length = 602
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 66/132 (50%), Positives = 86/132 (65%), Gaps = 1/132 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ SRCLKLYCECFAAG+YC + C+C +C N HE E RNP AF P
Sbjct: 308 KRCNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAP 367
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ + + R+D+ + +A+H +GC+CKKSGCLKKYCEC+Q + CS NCRC CK
Sbjct: 368 KVIMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCK 427
Query: 273 NFEGSEERRALF 284
N G ++ AL
Sbjct: 428 NAFGRKDESALL 439
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
R +++ + + ++ CNC+ S CLK YCEC+ G+ C C C C N
Sbjct: 380 RDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKN 428
>gi|449471001|ref|XP_004153179.1| PREDICTED: CRC domain-containing protein TSO1-like, partial
[Cucumis sativus]
Length = 581
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 66/132 (50%), Positives = 86/132 (65%), Gaps = 1/132 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ SRCLKLYCECFAAG+YC + C+C +C N HE E RNP AF P
Sbjct: 287 KRCNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAP 346
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ + + R+D+ + +A+H +GC+CKKSGCLKKYCEC+Q + CS NCRC CK
Sbjct: 347 KVIMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCK 406
Query: 273 NFEGSEERRALF 284
N G ++ AL
Sbjct: 407 NAFGRKDESALL 418
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
R +++ + + ++ CNC+ S CLK YCEC+ G+ C C C C N
Sbjct: 359 RDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKN 407
>gi|355699397|gb|AES01114.1| lin-54-like protein [Mustela putorius furo]
Length = 748
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 101/260 (38%), Positives = 138/260 (53%), Gaps = 32/260 (12%)
Query: 52 PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMR----------SPSQAMPQ 101
P+P PQ+ VAS SQP RL ++P + P +Q ++ +P
Sbjct: 417 PKPINPTPQI---VAS---SQPQQRL-IMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVG 469
Query: 102 WQARP-QHVRMVNRVPHPVHKLPLP-TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCR 159
+ P Q+V + + + T PG Q R P + + +K CNC
Sbjct: 470 YAVLPAQYVTQLQQSSYVSIASSSSFTGTPGIQTQAR-LPFNGIIPSESASRPRKPCNCT 528
Query: 160 NSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI
Sbjct: 529 KSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKE 588
Query: 220 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S E
Sbjct: 589 GESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPE 637
Query: 280 RRALFH-GDHNGIAFMQRAA 298
R+ L H D + Q+ A
Sbjct: 638 RKTLMHLADAAEVRVQQQTA 657
>gi|115476122|ref|NP_001061657.1| Os08g0369600 [Oryza sativa Japonica Group]
gi|38637224|dbj|BAD03490.1| transcription factor-like [Oryza sativa Japonica Group]
gi|38637277|dbj|BAD03541.1| transcription factor-like [Oryza sativa Japonica Group]
gi|113623626|dbj|BAF23571.1| Os08g0369600 [Oryza sativa Japonica Group]
Length = 369
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 100/257 (38%), Positives = 142/257 (55%), Gaps = 21/257 (8%)
Query: 52 PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQW-QARPQHVR 110
P PP ++ ++Q + +S Q +L++LP +Q RSP P Q P+ R
Sbjct: 79 PSPPPAK-KLQVQQSSSMLPQDFPKLEVLPVEKPTKLQVR-RSPQHVSPALKQDSPKMER 136
Query: 111 MVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCEC 170
+ +P P T S + ++ KD T ++ C+C+ S+CLKLYCEC
Sbjct: 137 PL--LPRPA------TFIDVMLSSQKPSSEMWSETKDVTLTRKTNCSCKYSKCLKLYCEC 188
Query: 171 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 230
F G YC GCNC NC NNV HE ARQ+A+ LERNP AF PK+ +S A + A E
Sbjct: 189 FEKGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVLNS--TAHNCESKAAE 246
Query: 231 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGD 287
+ KH KGC CK++ CLKKYCECF+A++ CSENCRC CKN++ + +R + H
Sbjct: 247 GDIVGKHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRISQKNTVHAH 306
Query: 288 HNGIAFMQRAANAAIIG 304
H +Q A++ ++G
Sbjct: 307 H-----VQNPASSGMVG 318
>gi|297830990|ref|XP_002883377.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
lyrata]
gi|297329217|gb|EFH59636.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
lyrata]
Length = 684
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/135 (51%), Positives = 89/135 (65%), Gaps = 9/135 (6%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLE----RNPN 208
K+CNC+ S+CLKLYCECFAAG+YC + C+C++C N HE E V AT + RNP
Sbjct: 390 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHE----ETVLATRKQIESRNPL 445
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
AF PK+ S +A +DA + +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC
Sbjct: 446 AFAPKVIRSADSIMEAGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSVNCRC 505
Query: 269 LDCKNFEGSEERRAL 283
CKN G +E L
Sbjct: 506 EGCKNVFGRKEGSLL 520
>gi|71051367|gb|AAH99076.1| Lin54 protein [Rattus norvegicus]
Length = 438
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 211 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 270
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 271 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 319
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 320 NFEESPERKTLMH 332
>gi|156544375|ref|XP_001607403.1| PREDICTED: protein lin-54 homolog isoform 1 [Nasonia vitripennis]
gi|345480219|ref|XP_003424108.1| PREDICTED: protein lin-54 homolog isoform 2 [Nasonia vitripennis]
Length = 818
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 91/139 (65%), Gaps = 12/139 (8%)
Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQ 196
SR N+D P+K CNC S+CLKLYC+CFA G +C CNC NC NN+ +E RQ
Sbjct: 583 SRSFNNSDANGIRPRKP--CNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQ 640
Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
A+ + LERNPNAFRPKI +D R +HNKGC+CK+SGCLK YCEC+
Sbjct: 641 RAIKSCLERNPNAFRPKIGKGRETGEDIR----------RHNKGCNCKRSGCLKNYCECY 690
Query: 257 QANILCSENCRCLDCKNFE 275
+A I CSENC+C+ C+N E
Sbjct: 691 EAKIPCSENCKCIGCRNVE 709
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQ 196
K CNC+ S CLK YCEC+ A I C + C C+ C N E + +Q
Sbjct: 673 KGCNCKRSGCLKNYCECYEAKIPCSENCKCIGCRNVEEPALHKQ 716
>gi|260803175|ref|XP_002596466.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
gi|229281723|gb|EEN52478.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
Length = 458
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 78/158 (49%), Positives = 100/158 (63%), Gaps = 15/158 (9%)
Query: 132 QESPRSRPRANTDGKDGT----PKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHN 187
Q + ++PR +G T + +K CNC S+CLKLYC+CFA G +C+ CNC NC N
Sbjct: 210 QATTATQPRQQVNGTLNTEQVGTRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFN 269
Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
N+EHE R +++ A LERNP AF PKI G D R HNKGC+CK+SG
Sbjct: 270 NLEHEAERAKSIKACLERNPYAFHPKIGKGKEGEGDRR-----------HNKGCNCKRSG 318
Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
CLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 319 CLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMH 356
>gi|296196136|ref|XP_002745695.1| PREDICTED: protein lin-54 homolog isoform 1 [Callithrix jacchus]
Length = 749
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 95/246 (38%), Positives = 132/246 (53%), Gaps = 31/246 (12%)
Query: 52 PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMR----------SPSQAMPQ 101
P+P S Q+ + SQP RL ++P + P +Q ++ +P
Sbjct: 417 PKPINSTSQIVTT------SQPQQRL-IMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVG 469
Query: 102 WQARP-QHVRMVNRVPH-PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCR 159
+ P Q+V + + + + T PG Q R P + + +K CNC
Sbjct: 470 YAVLPAQYVTQLQQSSYVSIASNSTFTGTPGIQTQARP-PFNGIIPSESASRPRKPCNCT 528
Query: 160 NSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI
Sbjct: 529 KSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKE 588
Query: 220 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S E
Sbjct: 589 GESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPE 637
Query: 280 RRALFH 285
R+ L H
Sbjct: 638 RKTLMH 643
>gi|443711997|gb|ELU05498.1| hypothetical protein CAPTEDRAFT_229042 [Capitella teleta]
Length = 614
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/140 (52%), Positives = 93/140 (66%), Gaps = 10/140 (7%)
Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLER 205
++ + +K CNC S+CLKLYC+CFA G +C CNCVNC NN+EHE R ++ LER
Sbjct: 386 QNAAARPRKACNCTKSQCLKLYCDCFANGEFCSNCNCVNCFNNLEHEEERSRSIKTCLER 445
Query: 206 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 265
NP AF PKI S G D R +HNKGC+CK+SGCLK YCEC++A ILC+
Sbjct: 446 NPQAFHPKIGKSKAGQGDIR----------RHNKGCNCKRSGCLKNYCECYEAKILCTSL 495
Query: 266 CRCLDCKNFEGSEERRALFH 285
C+C+ CKNFE S +R+ L H
Sbjct: 496 CKCVGCKNFEESPDRKTLMH 515
>gi|149046796|gb|EDL99570.1| similar to hypothetical protein, isoform CRA_c [Rattus norvegicus]
Length = 419
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 192 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 251
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 252 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 300
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 301 NFEESPERKTLMH 313
>gi|296196138|ref|XP_002745696.1| PREDICTED: protein lin-54 homolog isoform 2 [Callithrix jacchus]
Length = 660
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 95/246 (38%), Positives = 132/246 (53%), Gaps = 31/246 (12%)
Query: 52 PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMR----------SPSQAMPQ 101
P+P S Q+ + SQP RL ++P + P +Q ++ +P
Sbjct: 328 PKPINSTSQIVTT------SQPQQRL-IMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVG 380
Query: 102 WQARP-QHVRMVNRVPH-PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCR 159
+ P Q+V + + + + T PG Q R P + + +K CNC
Sbjct: 381 YAVLPAQYVTQLQQSSYVSIASNSTFTGTPGIQTQARP-PFNGIIPSESASRPRKPCNCT 439
Query: 160 NSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI
Sbjct: 440 KSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKE 499
Query: 220 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S E
Sbjct: 500 GESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPE 548
Query: 280 RRALFH 285
R+ L H
Sbjct: 549 RKTLMH 554
>gi|395850121|ref|XP_003797647.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Otolemur
garnettii]
Length = 544
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 82/132 (62%), Gaps = 11/132 (8%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
K C C S CLKL C+CFA G + + CNC + +N++EHE RQ+A+ A L+RNP AF+PK
Sbjct: 345 KPCECVISLCLKLDCDCFANGEFYNNCNCTSYYNSLEHENERQKAIKACLDRNPEAFKPK 404
Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+E H+KGC+CK SGCLK YCEC++A I+CS C+C CKN
Sbjct: 405 TG-----------KGKEEDSDXHHSKGCNCKXSGCLKNYCECYEAKIMCSSICKCKGCKN 453
Query: 274 FEGSEERRALFH 285
FE ER+ L H
Sbjct: 454 FEEXPERKTLMH 465
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 23/43 (53%), Gaps = 1/43 (2%)
Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
+D K CNC+ S CLK YCEC+ A I C C C C N
Sbjct: 411 EDSDXHHSKGCNCKXSGCLKNYCECYEAKIMCSSICKCKGCKN 453
>gi|298708780|emb|CBJ30741.1| tesmin-like, animal TCX proteins are associated with development of
both male and female reproductiv [Ectocarpus
siliculosus]
Length = 735
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/126 (51%), Positives = 78/126 (61%), Gaps = 14/126 (11%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 215
CNC+ S+CLKLYCECF YC+GCNCV C N E RQ A+ T+ERNP AF K
Sbjct: 440 CNCKKSKCLKLYCECFQRQQYCNGCNCVECLNTERTEDLRQLAIQGTIERNPQAFVSKF- 498
Query: 216 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
+ R R HN GC+CKKS CLKKYCECFQA + C NC+C++CKN+E
Sbjct: 499 -------ERRAGKRS------HNAGCNCKKSACLKKYCECFQAGVACGTNCKCVNCKNYE 545
Query: 276 GSEERR 281
G+ R
Sbjct: 546 GAAGGR 551
>gi|66812974|ref|XP_640666.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
discoideum AX4]
gi|60468739|gb|EAL66741.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
discoideum AX4]
Length = 902
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 69/137 (50%), Positives = 87/137 (63%), Gaps = 11/137 (8%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
K+C+C+NS+CLK+YCECFAA + C+GC C C NN + + A TLERNP+AF PK
Sbjct: 360 KKCHCKNSKCLKMYCECFAAKVLCNGCLCFGCQNNEANIEKVERARFQTLERNPDAFNPK 419
Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
I S + KH+KGCHC+KS CLKKYCECFQA+I C+ENC+C DCKN
Sbjct: 420 IKPS-------NSKINNQIIIDKHSKGCHCRKSSCLKKYCECFQASIPCNENCKCYDCKN 472
Query: 274 FEGSEERRALFHGDHNG 290
+ L H D+ G
Sbjct: 473 ----QAHHHLHHPDNGG 485
>gi|14587305|dbj|BAB61216.1| P0460E08.26 [Oryza sativa Japonica Group]
Length = 553
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 66/102 (64%), Positives = 82/102 (80%), Gaps = 2/102 (1%)
Query: 131 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
K ESP+ PRA +GKDGTP K+K CNC++S+CLKLYCECFA+G+YCDGCNC NC NNV
Sbjct: 137 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNV 195
Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREA 231
++E AR+EA+ ATLERNP+AFRPKI SSPH ++ + A A
Sbjct: 196 KNETARREAIDATLERNPDAFRPKIGSSPHANRNNMQAANAA 237
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 65/184 (35%), Positives = 91/184 (49%), Gaps = 18/184 (9%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDC-KNFEGSEERRALFHG--DHNGIAFMQ 295
K C+CK S CLK YCECF + + C + C C +C N + RR + N AF
Sbjct: 160 KCCNCKHSKCLKLYCECFASGVYC-DGCNCSNCFNNVKNETARREAIDATLERNPDAFRP 218
Query: 296 R-------------AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQS 342
+ AANAA+ GA+G++ + + S KRK + +AK+ R
Sbjct: 219 KIGSSPHANRNNMQAANAAVNGAIGAAAFSSPSASRKRKHIDPSLDHSAKENGAHRTNHV 278
Query: 343 QQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSS 402
Q Q + +P +P++Q + LG K R LADI+QP D+KELC LLV+VS
Sbjct: 279 PQVF-YQKNGMPSDGPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLVVVSG 337
Query: 403 EATK 406
A K
Sbjct: 338 NAAK 341
>gi|55729585|emb|CAH91522.1| hypothetical protein [Pongo abelii]
Length = 528
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/147 (51%), Positives = 96/147 (65%), Gaps = 12/147 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+C A G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCCANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 409
Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
NFE S ER+ L H D + Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|452822459|gb|EME29478.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
sulphuraria]
Length = 542
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 79/177 (44%), Positives = 100/177 (56%), Gaps = 14/177 (7%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K++K C C+NS CLKLYCECFAAG C C+C NC N+ +HE EA A L RNP AF
Sbjct: 228 KERKSCKCKNSMCLKLYCECFAAGQLCSNCSCQNCLNDEDHEKEVSEARNAILLRNPAAF 287
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
PK+ + P +ED LA KH KGC+CK+SGC K YCECF A + CS C+C D
Sbjct: 288 EPKMTAVP------KED---GALAIKHQKGCNCKRSGCQKNYCECFHAGVFCSNICKCND 338
Query: 271 CKNFE--GSEERRALFH-GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEE 324
C N E SE+ L H +GI R A + + G + M S +++ E
Sbjct: 339 CGNREDDNSEKSTNLSHILSDSGIGI--RGALSPLRGETNGYQHLLFMESPRKRWRE 393
>gi|312078614|ref|XP_003141815.1| hypothetical protein LOAG_06231 [Loa loa]
Length = 495
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 84/244 (34%), Positives = 126/244 (51%), Gaps = 37/244 (15%)
Query: 82 PSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRA 141
PS +Q SP A QA Q NRV PV L +P+ RP
Sbjct: 216 PSKSRLLQTGPLSPPLAYRAAQASIQR----NRVFTPVLAAKL---------APKKRPLG 262
Query: 142 NTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 201
+ +K C+C S CLKLYC+CFA G +C+ C+C +C N +E+E+ R A+
Sbjct: 263 S----------KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRL 312
Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
+LERNPNAF+PKI + + R+ + H KGCHCKKS CLK YCEC++A +
Sbjct: 313 SLERNPNAFKPKIGVATN---------RQVEPERLHQKGCHCKKSNCLKNYCECYEAKVP 363
Query: 262 CSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRK 321
C++ C+C+ C+N +E RA + + +AA +G+ + + T + S++
Sbjct: 364 CTDRCKCVCCRN---TESDRAARLANTKSATALTDIRSAATMGS--ADFHKTTLYSDEDS 418
Query: 322 SEEL 325
+E+
Sbjct: 419 DDEV 422
>gi|219126634|ref|XP_002183557.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404794|gb|EEC44739.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1852
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/191 (40%), Positives = 105/191 (54%), Gaps = 28/191 (14%)
Query: 115 VPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAG 174
P + K PL PGK+ N +G + +CNC+ S+CLKLYCECF+A
Sbjct: 1481 FPSSIVKTPLAV--PGKE---------NPNGITPMSTPRNRCNCKKSQCLKLYCECFSAE 1529
Query: 175 IYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 234
+ C+GCNC +C N R +A+ L +N +AF+P+IA++P GA D +
Sbjct: 1530 VMCEGCNCTDCRNLPAFAAVRAKAMTDCLNKNAHAFKPRIAAAP-GAADLQ--------- 1579
Query: 235 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS----EERRALFHGDHNG 290
H+ GC CKKS CLKKYCECFQA +LC C+C CKN GS ++RR + D +G
Sbjct: 1580 -GHSMGCKCKKSECLKKYCECFQAGVLCGLKCKCESCKNIAGSQKLIDKRRQM--KDTSG 1636
Query: 291 IAFMQRAANAA 301
F R + A
Sbjct: 1637 AEFAMRVSQEA 1647
>gi|390460682|ref|XP_003732523.1| PREDICTED: protein lin-54 homolog [Callithrix jacchus]
Length = 528
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 98/260 (37%), Positives = 137/260 (52%), Gaps = 32/260 (12%)
Query: 52 PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMR----------SPSQAMPQ 101
P+P S Q+ + SQP RL ++P + P +Q ++ +P
Sbjct: 196 PKPINSTSQIVTT------SQPQQRL-IMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVG 248
Query: 102 WQARP-QHVRMVNRVPH-PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCR 159
+ P Q+V + + + + T PG Q R P + + +K CNC
Sbjct: 249 YAVLPAQYVTQLQQSSYVSIASNSTFTGTPGIQTQARP-PFNGIIPSESASRPRKPCNCT 307
Query: 160 NSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI
Sbjct: 308 KSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKE 367
Query: 220 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S E
Sbjct: 368 GESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPE 416
Query: 280 RRALFH-GDHNGIAFMQRAA 298
R+ L H D + Q+ A
Sbjct: 417 RKTLMHLADAAEVRVQQQTA 436
>gi|194389404|dbj|BAG61668.1| unnamed protein product [Homo sapiens]
Length = 348
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 121 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 180
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 181 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 229
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 230 NFEESPERKTLMH 242
>gi|393905053|gb|EFO22256.2| hypothetical protein LOAG_06231 [Loa loa]
Length = 452
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 84/244 (34%), Positives = 126/244 (51%), Gaps = 37/244 (15%)
Query: 82 PSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRA 141
PS +Q SP A QA Q NRV PV L +P+ RP
Sbjct: 173 PSKSRLLQTGPLSPPLAYRAAQASIQR----NRVFTPVLAAKL---------APKKRPLG 219
Query: 142 NTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 201
+ +K C+C S CLKLYC+CFA G +C+ C+C +C N +E+E+ R A+
Sbjct: 220 S----------KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRL 269
Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
+LERNPNAF+PKI + + R+ + H KGCHCKKS CLK YCEC++A +
Sbjct: 270 SLERNPNAFKPKIGVATN---------RQVEPERLHQKGCHCKKSNCLKNYCECYEAKVP 320
Query: 262 CSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRK 321
C++ C+C+ C+N +E RA + + +AA +G+ + + T + S++
Sbjct: 321 CTDRCKCVCCRN---TESDRAARLANTKSATALTDIRSAATMGS--ADFHKTTLYSDEDS 375
Query: 322 SEEL 325
+E+
Sbjct: 376 DDEV 379
>gi|334186533|ref|NP_193213.5| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
gi|395455099|sp|F4JIF5.1|TCX2_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 2; Short=AtTCX2;
AltName: Full=Protein TSO1-like 2; Short=Protein SOL2
gi|332658096|gb|AEE83496.1| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
Length = 674
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 70/159 (44%), Positives = 95/159 (59%), Gaps = 6/159 (3%)
Query: 129 PGKQESPRSRP---RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN 184
P QE S P R D +G + K+CNC+ S+CLKLYCECFAAG+YC + C+C++
Sbjct: 349 PALQELNLSSPKKKRVKLDSGEG--ESCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCID 406
Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 244
C N HE RNP AF PK+ + Q+ +DA + +A+H +GC+CK
Sbjct: 407 CFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQETGDDASKTPASARHKRGCNCK 466
Query: 245 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
KS CLKKYCEC+Q + CS NCRC CKN G ++ ++
Sbjct: 467 KSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGSSI 505
>gi|297830988|ref|XP_002883376.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
lyrata]
gi|297329216|gb|EFH59635.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
lyrata]
Length = 623
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 104/179 (58%), Gaps = 9/179 (5%)
Query: 110 RMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCE 169
+++ VP + L + P K+ R ++ G+ G+ K+CNC+ S+CLKLYCE
Sbjct: 302 KVIEVVPKSSGLVELTPISPKKK-----RRKSEQSGEGGS--SCKRCNCKKSKCLKLYCE 354
Query: 170 CFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 228
CFAAG+YC + C+C++C N HE RNP AF PK+ + + EDA
Sbjct: 355 CFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDA 414
Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 287
+ +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G ++ +LF D
Sbjct: 415 SKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 472
>gi|328865133|gb|EGG13519.1| tesmin/TSO1-like protein [Dictyostelium fasciculatum]
Length = 597
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 65/129 (50%), Positives = 84/129 (65%), Gaps = 13/129 (10%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K C+C+NS+CLKLYCECFA + C+GC+C CHNN + Q + LERNP AF P
Sbjct: 341 RKGCHCKNSKCLKLYCECFANKLLCNGCHCFGCHNNDANIEVVQRSRSNILERNPEAFNP 400
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K +E+ ++ KH KGCHC+KS CLKKYCECFQA I C E+C+C+DC+
Sbjct: 401 KFKQ--------KEETKQ----HKHTKGCHCRKSECLKKYCECFQAGIPCGEHCKCIDCR 448
Query: 273 NF-EGSEER 280
N E +ER
Sbjct: 449 NTGEDIKER 457
>gi|449015362|dbj|BAM78764.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 707
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 100/296 (33%), Positives = 145/296 (48%), Gaps = 71/296 (23%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
+ +K CNC+NS+CLKLYC+CFAAG YC+GC+C NC N ++ RQ A+ +LERNP+AF
Sbjct: 99 RPRKPCNCKNSKCLKLYCDCFAAGTYCNGCHCTNCLNLPAYDTLRQNAIRTSLERNPHAF 158
Query: 211 RPKIASSPHGA-------------------------QDAREDARE--------------- 230
R K+ + + +D+ E +E
Sbjct: 159 RSKVVTLEYDTSSKDTASRQNCAATAAAAKSADVAPEDSSEPVQEQVPNEEASSTSTQPR 218
Query: 231 -----AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE----RR 281
Q A+H KGC C +S CLKKYCECFQ + CS +CRC +C+NFEGS E R
Sbjct: 219 TTGTTTQRHAQHIKGCSCLRSLCLKKYCECFQNGVYCSASCRCSNCRNFEGSAELSQARE 278
Query: 282 ALFHGDHNGIAFM-QRAAN-----AAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQS 335
L + GI+F Q+ A+ ++++G V S G L + R L G+ K QS
Sbjct: 279 RLVLAEL-GISFSHQKGASGTPTKSSLVGVVES---GLLSEESVRNIG--LEGILEKVQS 332
Query: 336 V-IRNPQSQQGNNVQNSA------VPCSQSVPVSQTANASVLGSTKSALRSPLADI 384
+ R P + +S S+ +QT+N S + +S+ R+P A +
Sbjct: 333 LRKRTPGESNAQSTGSSKSHRAKRSETSRRTSYAQTSNGSTV---ESSPRTPTAKL 385
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
+A E + K C+CK S CLK YC+CF A C+ C C +C N
Sbjct: 90 NALEGGSVTRPRKPCNCKNSKCLKLYCDCFAAGTYCN-GCHCTNCLNL 136
>gi|300121347|emb|CBK21727.2| unnamed protein product [Blastocystis hominis]
Length = 305
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 89/145 (61%), Gaps = 17/145 (11%)
Query: 141 ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVG 200
+++DG P CNCR SRCLKLYCECFA+ ++C+GCNCVNC NN EH R+E
Sbjct: 174 SSSDGSPHDPNHSHTCNCRKSRCLKLYCECFASDVFCNGCNCVNCKNNPEHLKEREEVKQ 233
Query: 201 ATLERNPNAFRPKIA-SSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQA 258
A ++R+P+ F P + +P A + N +GC C+K+ C+KKYC+CFQ
Sbjct: 234 ALIQRDPHVFEPHVVWDTP---------------ATRTNLRGCRCQKTRCVKKYCDCFQN 278
Query: 259 NILCSENCRCLDCKNFEGSEERRAL 283
I C NCRC+DCKN E + RAL
Sbjct: 279 GICCGPNCRCVDCKNTEEAALNRAL 303
>gi|170582258|ref|XP_001896048.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
gi|158596825|gb|EDP35103.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
Length = 469
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 59/129 (45%), Positives = 84/129 (65%), Gaps = 9/129 (6%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K C+C S CLKLYC+CFA G +C+ C+C +C N +E+E+ R A+ +LERNPNAF+P
Sbjct: 238 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 297
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI + + R+ + H KGCHCKKS CLK YCEC++A + C++ C+C+ C+
Sbjct: 298 KIGVATN---------RQVEPERLHQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCR 348
Query: 273 NFEGSEERR 281
N E R
Sbjct: 349 NTESDRAAR 357
>gi|321459097|gb|EFX70154.1| hypothetical protein DAPPUDRAFT_217492 [Daphnia pulex]
Length = 209
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/123 (55%), Positives = 83/123 (67%), Gaps = 12/123 (9%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K +K CNC S+CLKLYC+CFA G +C GCNCV C NN+EHE R A+ + L+RNP+AF
Sbjct: 20 KPRKPCNCTKSQCLKLYCDCFANGEFCHGCNCVCCANNLEHEELRLRAIRSCLDRNPHAF 79
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+PKI G E R HNKGCHCK+SGCLK YCEC++A I CS C+C+
Sbjct: 80 KPKI-----GVGWGPEPRR-------HNKGCHCKRSGCLKNYCECYEAKIACSAICKCIG 127
Query: 271 CKN 273
CKN
Sbjct: 128 CKN 130
>gi|395542051|ref|XP_003772948.1| PREDICTED: protein lin-54 homolog, partial [Sarcophilus harrisii]
Length = 360
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 91/133 (68%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYC+CFA G +C+ CNC NC NN+EHE RQ+A+ A L+RNP AF+P
Sbjct: 133 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAFKP 192
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 193 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 241
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 242 NFEESPERKTLMH 254
>gi|380022721|ref|XP_003695187.1| PREDICTED: protein lin-54 homolog [Apis florea]
Length = 827
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)
Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
TL G +S R+ T + + +K CNC S+CLKLYC+CFA G +C CNC NC
Sbjct: 580 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 639
Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
NN+ +E RQ A+ + LERNPNAFRPKI D R +HNKGC+CK+
Sbjct: 640 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 689
Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 690 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 719
>gi|340722920|ref|XP_003399847.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus terrestris]
Length = 829
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)
Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
TL G +S R+ T + + +K CNC S+CLKLYC+CFA G +C CNC NC
Sbjct: 582 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 641
Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
NN+ +E RQ A+ + LERNPNAFRPKI D R +HNKGC+CK+
Sbjct: 642 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 691
Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 692 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 721
>gi|340722922|ref|XP_003399848.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus terrestris]
Length = 835
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)
Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
TL G +S R+ T + + +K CNC S+CLKLYC+CFA G +C CNC NC
Sbjct: 588 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 647
Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
NN+ +E RQ A+ + LERNPNAFRPKI D R +HNKGC+CK+
Sbjct: 648 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 697
Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 698 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 727
>gi|328785527|ref|XP_392991.3| PREDICTED: protein lin-54 homolog isoform 1 [Apis mellifera]
Length = 804
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)
Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
TL G +S R+ T + + +K CNC S+CLKLYC+CFA G +C CNC NC
Sbjct: 580 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 639
Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
NN+ +E RQ A+ + LERNPNAFRPKI D R +HNKGC+CK+
Sbjct: 640 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 689
Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 690 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 719
>gi|350403514|ref|XP_003486824.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus impatiens]
Length = 829
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)
Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
TL G +S R+ T + + +K CNC S+CLKLYC+CFA G +C CNC NC
Sbjct: 582 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 641
Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
NN+ +E RQ A+ + LERNPNAFRPKI D R +HNKGC+CK+
Sbjct: 642 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 691
Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 692 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 721
>gi|350403512|ref|XP_003486823.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus impatiens]
Length = 835
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)
Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
TL G +S R+ T + + +K CNC S+CLKLYC+CFA G +C CNC NC
Sbjct: 588 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 647
Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
NN+ +E RQ A+ + LERNPNAFRPKI D R +HNKGC+CK+
Sbjct: 648 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 697
Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 698 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 727
>gi|30686784|ref|NP_566718.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75274232|sp|Q9LUI3.1|TSO1_ARATH RecName: Full=CRC domain-containing protein TSO1
gi|9279696|dbj|BAB01253.1| DNA binding protein-like [Arabidopsis thaliana]
gi|332643155|gb|AEE76676.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 695
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 65/135 (48%), Positives = 88/135 (65%), Gaps = 9/135 (6%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLE----RNPN 208
K+CNC+ S+CLKLYCECFAAG+YC + C+C++C N HE E V AT + RNP
Sbjct: 401 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHE----ETVLATRKQIESRNPL 456
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
AF PK+ + +A +DA + +A+H +GC+CKKS C+KKYCEC+Q + CS NCRC
Sbjct: 457 AFAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRC 516
Query: 269 LDCKNFEGSEERRAL 283
C N G ++ L
Sbjct: 517 EGCTNVFGRKDGSLL 531
>gi|6739821|gb|AAF27433.1|AF206324_1 putative DNA binding protein [Arabidopsis thaliana]
gi|7767425|gb|AAF69124.1|AF204059_1 CXC domain protein TSO1 [Arabidopsis thaliana]
Length = 695
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 65/135 (48%), Positives = 88/135 (65%), Gaps = 9/135 (6%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLE----RNPN 208
K+CNC+ S+CLKLYCECFAAG+YC + C+C++C N HE E V AT + RNP
Sbjct: 401 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHE----ETVLATRKQIESRNPL 456
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
AF PK+ + +A +DA + +A+H +GC+CKKS C+KKYCEC+Q + CS NCRC
Sbjct: 457 AFAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRC 516
Query: 269 LDCKNFEGSEERRAL 283
C N G ++ L
Sbjct: 517 EGCTNVFGRKDGSLL 531
>gi|223995127|ref|XP_002287247.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976363|gb|EED94690.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1174
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/174 (40%), Positives = 94/174 (54%), Gaps = 25/174 (14%)
Query: 145 GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLE 204
GK+ + + + C C+NS+CLKLYC+CF+A YC GC C++C N E R +A+ T
Sbjct: 852 GKENS-ENESTCKCKNSKCLKLYCDCFSAEKYCVGCKCIDCQNTPTFESIRAKAIADTRA 910
Query: 205 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
RNPNAF+ K+ +S +H GC CKKS CLKKYCECF ++C E
Sbjct: 911 RNPNAFKTKLNAS----------------VTEHATGCKCKKSECLKKYCECFANLVVCGE 954
Query: 265 NCRCLDCKNFEGS----EERRALFHGDHNGIAFMQRAANAAIIGAVGSS--GYG 312
+C+C +CKN+ GS E RR L D G R G++ S YG
Sbjct: 955 SCKCSNCKNYVGSQALIERRRKL--KDAKGTELAMRPVARGWKGSMSDSKVAYG 1006
>gi|145502865|ref|XP_001437410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404560|emb|CAK70013.1| unnamed protein product [Paramecium tetraurelia]
Length = 220
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 64/122 (52%), Positives = 89/122 (72%), Gaps = 9/122 (7%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+NS+C+KLYCEC+ ++CD C+C NC NN + RQ+A+ TLE+NP+AF+P
Sbjct: 16 KKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFNNSSNAKQRQKAMQYTLEKNPSAFQP 75
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI +S D++ D KHNKGC CKKSGC+KKYCECFQA + CS+ C+C++C+
Sbjct: 76 KITTS-----DSKPDPLNF---GKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECR 127
Query: 273 NF 274
N+
Sbjct: 128 NY 129
Score = 45.4 bits (106), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 27/43 (62%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
K C+CK S C+K YCEC+Q + C NC C +C N + ++R
Sbjct: 16 KKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFNNSSNAKQR 58
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
K K C C+ S C+K YCECF A + C D C C+ C N
Sbjct: 91 KHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECRN 128
>gi|413952362|gb|AFW85011.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
Length = 235
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 64/91 (70%), Positives = 74/91 (81%), Gaps = 1/91 (1%)
Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNN 188
P K +SP+ R +GKD TP K+K CNCRNSRCLKLYCECFA+G +CDGCNC NC NN
Sbjct: 131 PLKPDSPKPWARL-YEGKDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNN 189
Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPH 219
E+EVAR+EA+ ATLERNP+AFRPKI SSPH
Sbjct: 190 PENEVARREAIEATLERNPDAFRPKIGSSPH 220
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
K C+C+ S CLK YCECF + C + C C +C N +E R
Sbjct: 155 KCCNCRNSRCLKLYCECFASGTHC-DGCNCTNCFNNPENEVAR 196
>gi|402586637|gb|EJW80574.1| hypothetical protein WUBG_08516, partial [Wuchereria bancrofti]
Length = 391
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 59/129 (45%), Positives = 84/129 (65%), Gaps = 9/129 (6%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K C+C S CLKLYC+CFA G +C+ C+C +C N +E+E+ R A+ +LERNPNAF+P
Sbjct: 234 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 293
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI + + R+ + H KGCHCKKS CLK YCEC++A + C++ C+C+ C+
Sbjct: 294 KIGVATN---------RQVEPERLHQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCR 344
Query: 273 NFEGSEERR 281
N E R
Sbjct: 345 NTESDRAAR 353
>gi|414877954|tpg|DAA55085.1| TPA: hypothetical protein ZEAMMB73_985885 [Zea mays]
Length = 772
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 69/154 (44%), Positives = 92/154 (59%), Gaps = 4/154 (2%)
Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
Q SP+ + R D DGT K+C+C+ S+CLKLYCECFAAG+YC + C+C+ C NN
Sbjct: 443 QGSPKKK-RHKFDSGDGT--SCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCMNNQS 499
Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
H RNP AF PK+ + + R+ + + +A+H +GC+CKKS CLK
Sbjct: 500 HTETVLSTRQQIESRNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKKSSCLK 559
Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
KYCECFQ + CS +CRC CKN G E A+
Sbjct: 560 KYCECFQGGVGCSISCRCEGCKNAFGKREGAAVL 593
>gi|18403510|ref|NP_566717.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75153799|sp|Q8L548.1|TCX3_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 3; Short=AtTCX3;
AltName: Full=Protein TSO1-like 1; Short=Protein SOL1
gi|20466374|gb|AAM20504.1| unknown protein [Arabidopsis thaliana]
gi|22136318|gb|AAM91237.1| unknown protein [Arabidopsis thaliana]
gi|332643153|gb|AEE76674.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 609
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 69/155 (44%), Positives = 93/155 (60%), Gaps = 3/155 (1%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
SP+ + R + +G K+CNC+ S+CLKLYCECFAAG YC + C+C+NC N H+
Sbjct: 309 SPKKKRRKSEQSGEGD-SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHK 367
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
RNP AF PK+ + + EDA + +A+H +GC+CKKS CLKKY
Sbjct: 368 DVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKY 427
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGD 287
CEC+Q + CS NCRC CKN G ++ +LF D
Sbjct: 428 CECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 461
>gi|242019702|ref|XP_002430298.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212515413|gb|EEB17560.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 995
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 70/125 (56%), Positives = 90/125 (72%), Gaps = 11/125 (8%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
+ +K CNC S+CLKLYCECFA G +C CNC NC+NN+EHE ARQ+++ + LERNP AF
Sbjct: 779 RMKKPCNCTRSQCLKLYCECFANGEFCFQCNCNNCYNNIEHEEARQKSIKSCLERNPCAF 838
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
RPKI + ED R +HNKGCHCK+SGCLK YCEC++A I CS++C+C+
Sbjct: 839 RPKIQ-----IGETEEDER------RHNKGCHCKRSGCLKNYCECYEAKITCSKSCKCIG 887
Query: 271 CKNFE 275
C+N E
Sbjct: 888 CRNTE 892
>gi|7767427|gb|AAF69125.1|AF205142_1 CXC domain containing TSO1-like protein 1 [Arabidopsis thaliana]
Length = 609
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 69/155 (44%), Positives = 93/155 (60%), Gaps = 3/155 (1%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
SP+ + R + +G K+CNC+ S+CLKLYCECFAAG YC + C+C+NC N H+
Sbjct: 309 SPKKKRRKSEQSGEGD-SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHK 367
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
RNP AF PK+ + + EDA + +A+H +GC+CKKS CLKKY
Sbjct: 368 DVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKY 427
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGD 287
CEC+Q + CS NCRC CKN G ++ +LF D
Sbjct: 428 CECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 461
>gi|168046141|ref|XP_001775533.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673088|gb|EDQ59616.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1223
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 63/127 (49%), Positives = 81/127 (63%), Gaps = 1/127 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYCECFAA +C G C C NC N E+E RNP AF P
Sbjct: 737 KRCNCKKSKCLKLYCECFAAREFCVGSCACRNCFNKPEYEATVLNTRQQIESRNPLAFAP 796
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI +P + ++A + +A+H +GC+CKKS CLKKYCEC+QA + CSE CRC C+
Sbjct: 797 KIVQAPESSPIPGDEALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCR 856
Query: 273 NFEGSEE 279
N G +E
Sbjct: 857 NMYGRKE 863
>gi|255542340|ref|XP_002512233.1| tso1, putative [Ricinus communis]
gi|223548194|gb|EEF49685.1| tso1, putative [Ricinus communis]
Length = 729
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 63/126 (50%), Positives = 81/126 (64%), Gaps = 1/126 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYC+CFAAG+YC + C+C +C N HE E RNP AF P
Sbjct: 450 KRCNCKRSKCLKLYCDCFAAGLYCIEPCSCQDCFNKPAHEDTVLETRKQIESRNPLAFAP 509
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ S D ++ + +A+H +GC+CKKS CLKKYCECFQ + CS NCRC CK
Sbjct: 510 KVIRSTDYVSDFGDETNKTPASARHKRGCNCKKSNCLKKYCECFQGGVGCSSNCRCEGCK 569
Query: 273 NFEGSE 278
N GS+
Sbjct: 570 NSFGSK 575
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 4/48 (8%)
Query: 144 DGKDGTP---KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
D + TP + ++ CNC+ S CLK YCECF G+ C C C C N
Sbjct: 523 DETNKTPASARHKRGCNCKKSNCLKKYCECFQGGVGCSSNCRCEGCKN 570
>gi|383858991|ref|XP_003704982.1| PREDICTED: protein lin-54 homolog isoform 2 [Megachile rotundata]
Length = 802
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 92/149 (61%), Gaps = 10/149 (6%)
Query: 127 LPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCH 186
L G +S R+ T + + +K CNC S+CLKLYC+CFA G +C CNC NC
Sbjct: 580 LSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCS 639
Query: 187 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 246
NN+ +E RQ A+ + LERNPNAFRPKI D R +HNKGC+CK+S
Sbjct: 640 NNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKRS 689
Query: 247 GCLKKYCECFQANILCSENCRCLDCKNFE 275
GCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 690 GCLKNYCECYEAKIPCSANCKCIGCRNVE 718
>gi|15028139|gb|AAK76693.1| putative DNA binding protein [Arabidopsis thaliana]
Length = 526
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 65/136 (47%), Positives = 88/136 (64%), Gaps = 9/136 (6%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLE----RNPN 208
K+CNC+ S+CLKLYCECFAAG+YC + C+C++C N HE E V AT + RNP
Sbjct: 232 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHE----ETVLATRKQIESRNPL 287
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
AF PK+ + +A +DA + +A+H +GC+CKKS C+KKYCEC+Q + CS NCRC
Sbjct: 288 AFAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRC 347
Query: 269 LDCKNFEGSEERRALF 284
C N G ++ L
Sbjct: 348 EGCTNVFGRKDGSLLV 363
>gi|23297130|gb|AAN13100.1| putative DNA binding protein [Arabidopsis thaliana]
Length = 526
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 65/136 (47%), Positives = 88/136 (64%), Gaps = 9/136 (6%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLE----RNPN 208
K+CNC+ S+CLKLYCECFAAG+YC + C+C++C N HE E V AT + RNP
Sbjct: 232 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHE----ETVLATRKQIESRNPL 287
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
AF PK+ + +A +DA + +A+H +GC+CKKS C+KKYCEC+Q + CS NCRC
Sbjct: 288 AFAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRC 347
Query: 269 LDCKNFEGSEERRALF 284
C N G ++ L
Sbjct: 348 EGCTNVFGRKDGSLLV 363
>gi|383858989|ref|XP_003704981.1| PREDICTED: protein lin-54 homolog isoform 1 [Megachile rotundata]
Length = 826
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 92/149 (61%), Gaps = 10/149 (6%)
Query: 127 LPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCH 186
L G +S R+ T + + +K CNC S+CLKLYC+CFA G +C CNC NC
Sbjct: 580 LSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCS 639
Query: 187 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 246
NN+ +E RQ A+ + LERNPNAFRPKI D R +HNKGC+CK+S
Sbjct: 640 NNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKRS 689
Query: 247 GCLKKYCECFQANILCSENCRCLDCKNFE 275
GCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 690 GCLKNYCECYEAKIPCSANCKCIGCRNVE 718
>gi|307206123|gb|EFN84203.1| Protein lin-54-like protein [Harpegnathos saltator]
Length = 825
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 67/123 (54%), Positives = 83/123 (67%), Gaps = 10/123 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S+CLKLYC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRP
Sbjct: 605 RKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRP 664
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI D R +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+
Sbjct: 665 KIGKGRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCR 714
Query: 273 NFE 275
N E
Sbjct: 715 NIE 717
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
D + K CNC+ S CLK YCEC+ A I C C C+ C N
Sbjct: 674 DDIRRHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRN 715
>gi|307182142|gb|EFN69485.1| Protein lin-54-like protein [Camponotus floridanus]
Length = 822
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 67/123 (54%), Positives = 83/123 (67%), Gaps = 10/123 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S+CLKLYC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRP
Sbjct: 602 RKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRP 661
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI D R +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+
Sbjct: 662 KIGKGRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCR 711
Query: 273 NFE 275
N E
Sbjct: 712 NIE 714
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
D + K CNC+ S CLK YCEC+ A I C C C+ C N
Sbjct: 671 DDIRRHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRN 712
>gi|414883728|tpg|DAA59742.1| TPA: hypothetical protein ZEAMMB73_893320 [Zea mays]
Length = 800
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 67/148 (45%), Positives = 87/148 (58%), Gaps = 3/148 (2%)
Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNN 188
G+ SP+ + R +G DG + K+C+C+ S+CLKLYCECFAAG+YC + C+C C N
Sbjct: 460 GQSVSPQKKRRKTDNGDDG--EACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCMNK 517
Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
HE RNP AF PK+ + ED +A+H +GC+CKKS C
Sbjct: 518 PIHEETVLSTRKQIEFRNPLAFAPKVIRMSDAGLETGEDPNNTPASARHKRGCNCKKSSC 577
Query: 249 LKKYCECFQANILCSENCRCLDCKNFEG 276
LKKYCEC+Q + CS NCRC CKN G
Sbjct: 578 LKKYCECYQGGVGCSSNCRCESCKNAFG 605
>gi|326500406|dbj|BAK06292.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 782
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/142 (45%), Positives = 86/142 (60%), Gaps = 1/142 (0%)
Query: 139 PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQE 197
P+ +G + K+C+C+ S+CLKLYCECFAAG+YC + C+C C N HE
Sbjct: 458 PKKRRKSDNGDDESCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLS 517
Query: 198 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
RNP AF PK+ Q+A+ED + +A+H +GC+CKKS CLKKYCEC+Q
Sbjct: 518 TRKHIEFRNPLAFAPKVIRMSEAGQEAQEDPKNTPASARHKRGCNCKKSSCLKKYCECYQ 577
Query: 258 ANILCSENCRCLDCKNFEGSEE 279
+ CS NCRC CKN G+ +
Sbjct: 578 GGVGCSNNCRCETCKNTFGTRD 599
>gi|242086180|ref|XP_002443515.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
gi|241944208|gb|EES17353.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
Length = 767
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 91/154 (59%), Gaps = 4/154 (2%)
Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
Q SP+ + R D DGT K+C+C+ S+CLKLYCECFAAG+YC + C+C+ C NN
Sbjct: 438 QGSPKKK-RQKFDNGDGT--SCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCLNNQS 494
Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
H RNP AF PK+ + + + + + +A+H +GC+CKKS CLK
Sbjct: 495 HTETVLSTRQQIESRNPLAFAPKVIHTSESGLELGDFSNKTPASARHKRGCNCKKSSCLK 554
Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
KYCECFQ + CS +CRC CKN G E A+
Sbjct: 555 KYCECFQGGVGCSISCRCEGCKNAFGKREGAAVL 588
>gi|156378675|ref|XP_001631267.1| predicted protein [Nematostella vectensis]
gi|156218304|gb|EDO39204.1| predicted protein [Nematostella vectensis]
Length = 226
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 75/133 (56%), Positives = 90/133 (67%), Gaps = 11/133 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S+CLKLYC+CFA G +C CNCVNC NN+EHE R +A+ A LERNP+AF P
Sbjct: 4 RKPCNCTKSQCLKLYCDCFAQGEFCSNCNCVNCSNNIEHERERSKAIKACLERNPHAFHP 63
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI G + R HNKGCHCK+SGCLK YCEC++A ILC+ C+C CK
Sbjct: 64 KIGKGKVGESERR-----------HNKGCHCKRSGCLKNYCECYEAKILCTSLCKCTGCK 112
Query: 273 NFEGSEERRALFH 285
NFE S ER+ L H
Sbjct: 113 NFEESPERKTLMH 125
>gi|343959662|dbj|BAK63688.1| hypothetical protein [Pan troglodytes]
Length = 348
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 70/129 (54%), Positives = 89/129 (68%), Gaps = 11/129 (8%)
Query: 157 NCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 216
NC S CL+LYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI
Sbjct: 125 NCTKSLCLRLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGK 184
Query: 217 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE
Sbjct: 185 GKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEE 233
Query: 277 SEERRALFH 285
S ER+ L H
Sbjct: 234 SPERKTLMH 242
>gi|322788388|gb|EFZ14059.1| hypothetical protein SINV_04654 [Solenopsis invicta]
Length = 834
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/122 (54%), Positives = 83/122 (68%), Gaps = 10/122 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S+CLKLYC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRP
Sbjct: 614 RKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRP 673
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI +D R +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+
Sbjct: 674 KIGKGRETGEDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSGNCKCIGCR 723
Query: 273 NF 274
N
Sbjct: 724 NI 725
>gi|414868877|tpg|DAA47434.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
gi|414868878|tpg|DAA47435.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
Length = 587
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 68/149 (45%), Positives = 89/149 (59%), Gaps = 4/149 (2%)
Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
Q SP+ + R D DGT K+C+C+ S+CLKLYCECFAAG+YC + C+C+ C NN
Sbjct: 441 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCQNNQS 497
Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
H RNP AF PK+ + D + + + +A+H +GC+CKKS CLK
Sbjct: 498 HMETVLSTRQQIESRNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLK 557
Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEE 279
KYCECFQ + CS +CRC CKN G E
Sbjct: 558 KYCECFQGGVGCSVSCRCEGCKNAFGKRE 586
>gi|242043084|ref|XP_002459413.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
gi|241922790|gb|EER95934.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
Length = 776
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 67/148 (45%), Positives = 87/148 (58%), Gaps = 3/148 (2%)
Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNN 188
G+ SP+ + R +G DG + K+C+C+ S+CLKLYCECFAAG+YC + C+C C N
Sbjct: 442 GQSVSPQKKRRKTDNGDDG--EACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNK 499
Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
HE RNP AF PK+ + ED +A+H +GC+CKKS C
Sbjct: 500 PIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGLETGEDPNNTPASARHKRGCNCKKSSC 559
Query: 249 LKKYCECFQANILCSENCRCLDCKNFEG 276
LKKYCEC+Q + CS NCRC CKN G
Sbjct: 560 LKKYCECYQGGVGCSTNCRCESCKNTFG 587
>gi|167536200|ref|XP_001749772.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771699|gb|EDQ85361.1| predicted protein [Monosiga brevicollis MX1]
Length = 429
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 62/121 (51%), Positives = 79/121 (65%), Gaps = 10/121 (8%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+ CNC+NS+CLKLYCECFA G YC CNCV+C N E R++A+ TL +NP+AFRP
Sbjct: 216 RTCNCKNSKCLKLYCECFANGEYCGPNCNCVSCGNREATETLRKQAIEVTLAKNPDAFRP 275
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K GA +R+ A + H KGC C +S C+K+YCECFQA I C+ C+C DC
Sbjct: 276 KF-----GANKSRQSA----APSAHVKGCRCARSKCIKRYCECFQAGIPCTSACQCRDCH 326
Query: 273 N 273
N
Sbjct: 327 N 327
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 26/47 (55%)
Query: 235 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
A + C+CK S CLK YCECF C NC C+ C N E +E R
Sbjct: 212 ASGGRTCNCKNSKCLKLYCECFANGEYCGPNCNCVSCGNREATETLR 258
>gi|255548982|ref|XP_002515547.1| tso1, putative [Ricinus communis]
gi|223545491|gb|EEF46996.1| tso1, putative [Ricinus communis]
Length = 873
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 65/133 (48%), Positives = 83/133 (62%), Gaps = 2/133 (1%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ ++CLKLYC+CFAAGIYC D C C +C N E+E E RNP AF P
Sbjct: 453 KRCNCKKTKCLKLYCDCFAAGIYCADPCACQDCFNRPEYEDTVLETRQQIESRNPLAFAP 512
Query: 213 KIASSPHGAQDAREDAREAQLA-AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
KI +RED + + ++H +GC+CKKS CLKKYCEC+QAN+ CS CRC C
Sbjct: 513 KIVQHAKEFAASREDRSSSMPSLSRHKRGCNCKKSMCLKKYCECYQANVGCSSECRCEGC 572
Query: 272 KNFEGSEERRALF 284
KN G +E +
Sbjct: 573 KNGYGRKEEYGII 585
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 30/51 (58%)
Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
D E+ + + K C+CKK+ CLK YC+CF A I C++ C C DC N
Sbjct: 437 DNFEEKKSLSIDGDGCKRCNCKKTKCLKLYCDCFAAGIYCADPCACQDCFN 487
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLE 204
+ ++ CNC+ S CLK YCEC+ A + C C C C N R+E G E
Sbjct: 537 RHKRGCNCKKSMCLKKYCECYQANVGCSSECRCEGCKNG----YGRKEEYGIIEE 587
>gi|332021648|gb|EGI62007.1| Protein lin-54-like protein [Acromyrmex echinatior]
Length = 859
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/122 (54%), Positives = 83/122 (68%), Gaps = 10/122 (8%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S+CLKLYC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRP
Sbjct: 639 RKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRP 698
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI +D R +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+
Sbjct: 699 KIGKGRETGEDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSGNCKCIGCR 748
Query: 273 NF 274
N
Sbjct: 749 NI 750
>gi|357111377|ref|XP_003557490.1| PREDICTED: uncharacterized protein LOC100836677 [Brachypodium
distachyon]
Length = 787
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 87/143 (60%), Gaps = 4/143 (2%)
Query: 135 PRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEV 193
P+ + R + +G D + K+C+C+ S+CLKLYCECFAAG+YC + C+C+ C N HE
Sbjct: 459 PQKKRRKSDNGDD---ESCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCLGCLNKPIHEE 515
Query: 194 ARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYC 253
RNP AF PK+ Q+ +ED +A+H +GC+CKKS CLKKYC
Sbjct: 516 IVLSTRKQIEFRNPLAFAPKVIRLSDAGQETQEDPNNTPASARHKRGCNCKKSSCLKKYC 575
Query: 254 ECFQANILCSENCRCLDCKNFEG 276
EC+Q + CS NCRC CKN G
Sbjct: 576 ECYQGGVGCSSNCRCESCKNTFG 598
>gi|115470839|ref|NP_001059018.1| Os07g0176200 [Oryza sativa Japonica Group]
gi|34393829|dbj|BAC83433.1| putative CXC domain protein TSO1 [Oryza sativa Japonica Group]
gi|113610554|dbj|BAF20932.1| Os07g0176200 [Oryza sativa Japonica Group]
Length = 782
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/148 (45%), Positives = 88/148 (59%), Gaps = 4/148 (2%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
SP+ + R + +G DG + K+C+C+ S+CLKLYCECFAAG+YC + C+C C N HE
Sbjct: 450 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHE 507
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQD-AREDAREAQLAAKHNKGCHCKKSGCLKK 251
RNP AF PK+ QD ED +A+H +GC+CKKS CLKK
Sbjct: 508 EIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKK 567
Query: 252 YCECFQANILCSENCRCLDCKNFEGSEE 279
YCEC+Q + CS NCRC CKN G +
Sbjct: 568 YCECYQGGVGCSSNCRCEGCKNTFGKRD 595
>gi|222636534|gb|EEE66666.1| hypothetical protein OsJ_23296 [Oryza sativa Japonica Group]
Length = 782
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 68/148 (45%), Positives = 88/148 (59%), Gaps = 4/148 (2%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
SP+ + R + +G DG + K+C+C+ S+CLKLYCECFAAG+YC + C+C C N HE
Sbjct: 438 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHE 495
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQD-AREDAREAQLAAKHNKGCHCKKSGCLKK 251
RNP AF PK+ QD ED +A+H +GC+CKKS CLKK
Sbjct: 496 EIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKK 555
Query: 252 YCECFQANILCSENCRCLDCKNFEGSEE 279
YCEC+Q + CS NCRC CKN G +
Sbjct: 556 YCECYQGGVGCSSNCRCEGCKNTFGKRD 583
>gi|218199174|gb|EEC81601.1| hypothetical protein OsI_25091 [Oryza sativa Indica Group]
Length = 794
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 68/148 (45%), Positives = 88/148 (59%), Gaps = 4/148 (2%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
SP+ + R + +G DG + K+C+C+ S+CLKLYCECFAAG+YC + C+C C N HE
Sbjct: 450 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHE 507
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQD-AREDAREAQLAAKHNKGCHCKKSGCLKK 251
RNP AF PK+ QD ED +A+H +GC+CKKS CLKK
Sbjct: 508 EIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKK 567
Query: 252 YCECFQANILCSENCRCLDCKNFEGSEE 279
YCEC+Q + CS NCRC CKN G +
Sbjct: 568 YCECYQGGVGCSSNCRCEGCKNTFGKRD 595
>gi|297737091|emb|CBI26292.3| unnamed protein product [Vitis vinifera]
Length = 717
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 90/154 (58%), Gaps = 3/154 (1%)
Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
Q SP+ + R G G + K+CNC+ S+CLKLYCECFAAG+YC + C+C C N
Sbjct: 403 QSSPKKKRRRLEHG--GETEGCKRCNCKKSKCLKLYCECFAAGVYCVEPCSCQECFNKPI 460
Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
HE RNP AF PK+ S + ++ + +A+H +GC+CKKS CLK
Sbjct: 461 HEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVGDEPSKTPASARHKRGCNCKKSSCLK 520
Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
KYCEC+Q + CS NCRC CKN G ++ AL
Sbjct: 521 KYCECYQGGVGCSINCRCEGCKNAFGRKDGSALI 554
>gi|40241245|emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japonicus]
gi|40241253|emb|CAF02298.1| cysteine-rich polycomb-like protein [Lotus japonicus]
Length = 897
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 63/128 (49%), Positives = 81/128 (63%), Gaps = 2/128 (1%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYC+CFAAG++C D C+C +C N E+ E RNP AF P
Sbjct: 509 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 568
Query: 213 KIASSPHGAQDAREDAR-EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
KI S A ED +A+H +GC+CK+S CLKKYCEC+Q+N+ CS CRC C
Sbjct: 569 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVGCSSGCRCEGC 628
Query: 272 KNFEGSEE 279
KN G +E
Sbjct: 629 KNVYGKKE 636
>gi|51949820|gb|AAU14844.1| cysteine-rich polycomb-like protein 1 [Lotus japonicus]
Length = 821
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 63/128 (49%), Positives = 81/128 (63%), Gaps = 2/128 (1%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYC+CFAAG++C D C+C +C N E+ E RNP AF P
Sbjct: 433 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 492
Query: 213 KIASSPHGAQDAREDAR-EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
KI S A ED +A+H +GC+CK+S CLKKYCEC+Q+N+ CS CRC C
Sbjct: 493 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVGCSSGCRCEGC 552
Query: 272 KNFEGSEE 279
KN G +E
Sbjct: 553 KNVYGKKE 560
>gi|348567366|ref|XP_003469470.1| PREDICTED: protein lin-54 homolog [Cavia porcellus]
Length = 749
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 81/150 (54%), Gaps = 41/150 (27%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV---------------EHEVAR 195
+ +K CNC S CLKLYC+CFA G EHE R
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANG---------------EFCNNCNCTNCYNNLEHENER 564
Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
Q+A+ A L+RNP AF+PKI G D R H+KGC+CK+SGCLK YCEC
Sbjct: 565 QKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCEC 613
Query: 256 FQANILCSENCRCLDCKNFEGSEERRALFH 285
++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 614 YEAKIMCSSICKCIGCKNFEESPERKTLMH 643
>gi|359477548|ref|XP_002279154.2| PREDICTED: uncharacterized protein LOC100261336 [Vitis vinifera]
Length = 738
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 62/132 (46%), Positives = 81/132 (61%), Gaps = 1/132 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYCECFAAG+YC + C+C C N HE RNP AF P
Sbjct: 444 KRCNCKKSKCLKLYCECFAAGVYCVEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAP 503
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ S + ++ + +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CK
Sbjct: 504 KVIRSSDSLPEVGDEPSKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCK 563
Query: 273 NFEGSEERRALF 284
N G ++ AL
Sbjct: 564 NAFGRKDGSALI 575
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
+ ++ CNC+ S CLK YCEC+ G+ C C C C N
Sbjct: 527 RHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCKN 564
>gi|356538172|ref|XP_003537578.1| PREDICTED: uncharacterized protein LOC100815907 [Glycine max]
Length = 774
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 91/152 (59%), Gaps = 3/152 (1%)
Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
Q SPR + R + G + K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N
Sbjct: 466 QNSPRMKRRKSE--TPGDTEGCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPI 523
Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
HE + RNP AF PK+ + + +D + +A+H +GC+CKKS CLK
Sbjct: 524 HEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSSCLK 583
Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRA 282
KYCEC+Q + CS +CRC CKN G ++ A
Sbjct: 584 KYCECYQGGVGCSISCRCEGCKNAFGRKDGSA 615
>gi|298204891|emb|CBI34198.3| unnamed protein product [Vitis vinifera]
Length = 829
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 60/127 (47%), Positives = 82/127 (64%), Gaps = 1/127 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N HE + RNP AF P
Sbjct: 503 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 562
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ S + +++ + +A+H +GC+CKKS CLKKYCECFQ + CS +CRC CK
Sbjct: 563 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCK 622
Query: 273 NFEGSEE 279
N G ++
Sbjct: 623 NTFGRKD 629
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 229 REAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
R ++ AA++ K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 491 RRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537
>gi|147840870|emb|CAN68781.1| hypothetical protein VITISV_018991 [Vitis vinifera]
Length = 806
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 60/127 (47%), Positives = 82/127 (64%), Gaps = 1/127 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N HE + RNP AF P
Sbjct: 503 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 562
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ S + +++ + +A+H +GC+CKKS CLKKYCECFQ + CS +CRC CK
Sbjct: 563 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCK 622
Query: 273 NFEGSEE 279
N G ++
Sbjct: 623 NTFGRKD 629
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 229 REAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
R ++ AA++ K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 491 RRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537
>gi|356497442|ref|XP_003517569.1| PREDICTED: uncharacterized protein LOC100804447 [Glycine max]
Length = 774
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 92/155 (59%), Gaps = 9/155 (5%)
Query: 132 QESPRSRPRANTDGKDGTPKKQ---KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
Q SP+ + R K TP + K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N
Sbjct: 466 QNSPKKKRR-----KSETPGETEGCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFN 520
Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
HE + RNP AF PK+ + + +D + +A+H +GC+CKKS
Sbjct: 521 KPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSS 580
Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
CLKKYCEC+Q + CS +CRC CKN G ++ A
Sbjct: 581 CLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGSA 615
>gi|359487901|ref|XP_002272709.2| PREDICTED: uncharacterized protein LOC100265943 [Vitis vinifera]
Length = 777
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 60/127 (47%), Positives = 82/127 (64%), Gaps = 1/127 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N HE + RNP AF P
Sbjct: 474 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 533
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ S + +++ + +A+H +GC+CKKS CLKKYCECFQ + CS +CRC CK
Sbjct: 534 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCK 593
Query: 273 NFEGSEE 279
N G ++
Sbjct: 594 NTFGRKD 600
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 229 REAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
R ++ AA++ K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 462 RRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 508
>gi|255552089|ref|XP_002517089.1| tso1, putative [Ricinus communis]
gi|223543724|gb|EEF45252.1| tso1, putative [Ricinus communis]
Length = 727
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 80/130 (61%), Gaps = 1/130 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYCECFAAG+YC + C+C C N HE RNP AF P
Sbjct: 435 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAP 494
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ S + ++ + +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CK
Sbjct: 495 KVIRSSEPVTEIGDEFSKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCK 554
Query: 273 NFEGSEERRA 282
N G ++ A
Sbjct: 555 NAFGRKDGSA 564
>gi|118384765|ref|XP_001025522.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89307289|gb|EAS05277.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1084
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 60/118 (50%), Positives = 78/118 (66%), Gaps = 1/118 (0%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 215
CNC+ ++CLKLYCECFAA C+GC C C N E E R+EA A LERN +AF PKI
Sbjct: 533 CNCKRTKCLKLYCECFAASRMCEGCTCQGCFNKPEFEAMRKEARQAILERNNSAFDPKID 592
Query: 216 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
S Q E+ + + H+KGC+CKKS CLKKYCEC+Q + C++ C+C +C+N
Sbjct: 593 SQSKLKQLCNEEGVDNEQKV-HSKGCNCKKSNCLKKYCECYQLGVKCTKLCKCDNCRN 649
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 142 NTDGKDGTPK-KQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQE 197
N +G D K K CNC+ S CLK YCEC+ G+ C C C NC NN+ H + +E
Sbjct: 602 NEEGVDNEQKVHSKGCNCKKSNCLKKYCECYQLGVKCTKLCKCDNCRNNINHPHSHEE 659
>gi|357436899|ref|XP_003588725.1| Lin-54-like protein [Medicago truncatula]
gi|355477773|gb|AES58976.1| Lin-54-like protein [Medicago truncatula]
Length = 921
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 84/149 (56%), Gaps = 11/149 (7%)
Query: 136 RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVA 194
R + + +TD G K CNC+ S+CLKLYC+CF AGI+C DGC C C N VE +
Sbjct: 506 RKKKKTSTDDASGP----KTCNCKKSKCLKLYCDCFGAGIFCGDGCACEGCGNRVEFQDK 561
Query: 195 RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----AAKHNKGCHCKKSGCLK 250
E RNP AF PKI P A + + + +A+H +GC+CK+S C K
Sbjct: 562 VVETKQQIESRNPQAFAPKIV--PCAADVPPNNMEDVNMTTPASARHKRGCNCKRSKCTK 619
Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEE 279
KYCECFQAN+ CS CRC C N G E
Sbjct: 620 KYCECFQANVGCSTGCRCDGCMNAFGKRE 648
>gi|193237591|dbj|BAG50072.1| transcription factor CPP [Lotus japonicus]
Length = 764
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 68/167 (40%), Positives = 94/167 (56%), Gaps = 14/167 (8%)
Query: 130 GKQESPRSRPRANTDGKDGTPKKQ-------------KQCNCRNSRCLKLYCECFAAGIY 176
G Q + S A+ D +PKK+ K+CNC+ S+CLKLYCECFAAG+Y
Sbjct: 439 GVQPAEDSSQAASEDFNQNSPKKKRRKVEPAGDTEGCKRCNCKKSKCLKLYCECFAAGVY 498
Query: 177 C-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAA 235
C + C+C +C N HE + RNP AF PK+ + + +D + +A
Sbjct: 499 CIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASA 558
Query: 236 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
+H +GC+CKKS CLKKYCEC+Q + CS +CRC CKN G ++ A
Sbjct: 559 RHKRGCNCKKSNCLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGSA 605
>gi|357157337|ref|XP_003577764.1| PREDICTED: uncharacterized protein LOC100829642 [Brachypodium
distachyon]
Length = 771
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 121/226 (53%), Gaps = 26/226 (11%)
Query: 81 PPSSQPHVQAHMRS----PSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLP-PGKQESP 135
PP + P + H+ + P + + A+ ++ N P V P + P ++S
Sbjct: 375 PPCALPGIGLHLNALATIPKDKIVPYAAQSSTIQASN-FPSAVSSSPATSEPNTANEDSS 433
Query: 136 RSRPRANTD-GKDGTPKKQ------------KQCNCRNSRCLKLYCECFAAGIYC-DGCN 181
++ AN D G+PKK+ K+C+C+ S+CLKLYCECF AG++C + C+
Sbjct: 434 QAIVVANADESGQGSPKKKRHKFDNGDGTSCKRCSCKKSKCLKLYCECFHAGVFCSEPCS 493
Query: 182 CVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 238
C C N N+E ++ +E + + RNP AF PK+ + Q+ E + + +A+H
Sbjct: 494 CQGCLNKPSNMETVLSTREQIES---RNPLAFAPKVIRTSEPGQELGEQSNKTPASARHK 550
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
+GC+CKKS CLKKYCECFQ + CS +CRC CKN G E L
Sbjct: 551 RGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKNAFGRREGVVLL 596
>gi|356575873|ref|XP_003556061.1| PREDICTED: uncharacterized protein LOC100811662 [Glycine max]
Length = 774
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 64/150 (42%), Positives = 89/150 (59%), Gaps = 3/150 (2%)
Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNV 189
+Q SP+ + R +G + K+CNC+ S+CLKLYCECFAAG+YC + C C +C N
Sbjct: 467 QQNSPKKKRRKLEPAGEG--ESCKRCNCKKSKCLKLYCECFAAGVYCIEPCACRDCFNKP 524
Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
H + RNP AF PK+ S + +D + +A+H +GC+CKKS CL
Sbjct: 525 IHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSEIGDDLNKTPASARHKRGCNCKKSSCL 584
Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEE 279
KKYCEC+Q + CS +CRC CKN G ++
Sbjct: 585 KKYCECYQGGVGCSVSCRCEGCKNTYGRKD 614
>gi|358332933|dbj|GAA51516.1| protein lin-54 homolog [Clonorchis sinensis]
Length = 638
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 56/121 (46%), Positives = 77/121 (63%), Gaps = 13/121 (10%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
++C+C S CLKLYCECFAAG++C C+CV C+N +HE RQ+A+ + R P+AF+ K
Sbjct: 163 RRCSCSRSFCLKLYCECFAAGVFCSDCSCVGCYNLAQHENYRQKAIMRIVNRKPDAFQSK 222
Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
IA S + A H +GC+CK+SGCLK YCEC++A + C+ CRC C N
Sbjct: 223 IAHS-------------LENATVHARGCNCKRSGCLKNYCECYEARVRCTSRCRCQYCYN 269
Query: 274 F 274
Sbjct: 270 M 270
>gi|297833182|ref|XP_002884473.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
lyrata]
gi|297330313|gb|EFH60732.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
lyrata]
Length = 611
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 69/179 (38%), Positives = 100/179 (55%), Gaps = 9/179 (5%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+C C+ S+CLKLYCECFAAG+YC + C+C NC N HE ++ RNP AF P
Sbjct: 431 KRCKCKKSQCLKLYCECFAAGLYCVEPCSCQNCFNKPIHEDLVMKSREVIEARNPLAFAP 490
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ S+ A D + + +A+H +GC+C+KSGCLKKYCEC + CS +CRC+ CK
Sbjct: 491 KVVSTSDTAIDLWVENSKTPASARHTRGCNCRKSGCLKKYCECNLMGVRCSSSCRCIGCK 550
Query: 273 NFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAA 331
N G + G+ + + A + G S ++++R++ LL G A
Sbjct: 551 NVFGHTNEKCA--GESDAVTINDEAKHC------GDSRQNEETSNSERRNRLLLPGSVA 601
>gi|218187206|gb|EEC69633.1| hypothetical protein OsI_39028 [Oryza sativa Indica Group]
Length = 780
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 69/157 (43%), Positives = 99/157 (63%), Gaps = 11/157 (7%)
Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN--- 187
Q SP+ + R D DGT K+C+C+ S+CLKLYCECF AG++C + C+C C N
Sbjct: 458 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPS 514
Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
N+E ++ +E + + RNP AF PK+ + G Q+ +D+ + +++H +GC+CKKS
Sbjct: 515 NMETVLSTREQIES---RNPLAFAPKVIRTEPG-QELADDSNKTPASSRHKRGCNCKKSS 570
Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
CLKKYCEC+Q + CS +CRC CKN G E AL
Sbjct: 571 CLKKYCECYQGGVGCSVSCRCEGCKNAFGRREGVALL 607
>gi|89953384|gb|ABD83288.1| Fgenesh protein 45 [Beta vulgaris]
Length = 665
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 88/145 (60%), Gaps = 10/145 (6%)
Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEA 198
+A+T D T K+ CNC+ ++CLKLYC+CFAAG YC + C C C N E+E E
Sbjct: 476 KASTSTDDDTTKR---CNCKKTKCLKLYCDCFAAGFYCGESCACQGCFNRPEYEDTVLET 532
Query: 199 VGATLERNPNAFRPKI----ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCE 254
RNP AF PK+ +SP + + + + +A+H +GC+CKKS CLKKYCE
Sbjct: 533 RQQIESRNPLAFAPKVMQTVTASPTTGKVVEKLSTPS--SARHKRGCNCKKSMCLKKYCE 590
Query: 255 CFQANILCSENCRCLDCKNFEGSEE 279
C+QAN+ CSE CRC CKN G E
Sbjct: 591 CYQANVGCSEGCRCEGCKNIYGVRE 615
>gi|115489476|ref|NP_001067225.1| Os12g0605500 [Oryza sativa Japonica Group]
gi|77557032|gb|ABA99828.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|77557033|gb|ABA99829.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|113649732|dbj|BAF30244.1| Os12g0605500 [Oryza sativa Japonica Group]
gi|215707264|dbj|BAG93724.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 760
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 69/157 (43%), Positives = 99/157 (63%), Gaps = 11/157 (7%)
Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN--- 187
Q SP+ + R D DGT K+C+C+ S+CLKLYCECF AG++C + C+C C N
Sbjct: 438 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPS 494
Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
N+E ++ +E + + RNP AF PK+ + G Q+ +D+ + +++H +GC+CKKS
Sbjct: 495 NMETVLSTREQIES---RNPLAFAPKVIRTEPG-QELADDSNKTPASSRHKRGCNCKKSS 550
Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
CLKKYCEC+Q + CS +CRC CKN G E AL
Sbjct: 551 CLKKYCECYQGGVGCSVSCRCEGCKNAFGRREGVALL 587
>gi|351726712|ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine max]
gi|4218187|emb|CAA09028.1| cysteine-rich polycomb-like protein [Glycine max]
Length = 896
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/133 (47%), Positives = 77/133 (57%), Gaps = 2/133 (1%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYC+CFAAG YC D C C C N E+ E RNP AF P
Sbjct: 474 KRCNCKKSKCLKLYCDCFAAGTYCTDPCACQGCLNRPEYVETVVETKQQIESRNPIAFAP 533
Query: 213 KIASSPHGAQDAREDAR-EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
KI +D +A+H +GC+CK+S CLKKYCEC+QAN+ CS CRC C
Sbjct: 534 KIVQPTTDISSHMDDENLTTPSSARHKRGCNCKRSMCLKKYCECYQANVGCSSGCRCEGC 593
Query: 272 KNFEGSEERRALF 284
KN G +E F
Sbjct: 594 KNVHGKKEDYVAF 606
Score = 45.1 bits (105), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
+ ++ CNC+ S CLK YCEC+ A + C GC C C N
Sbjct: 558 RHKRGCNCKRSMCLKKYCECYQANVGCSSGCRCEGCKN 595
>gi|42563508|ref|NP_187136.3| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75146756|sp|Q84JZ8.1|TCX4_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 4; Short=AtTCX4
gi|28393078|gb|AAO41973.1| unknown protein [Arabidopsis thaliana]
gi|28827398|gb|AAO50543.1| unknown protein [Arabidopsis thaliana]
gi|332640623|gb|AEE74144.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 639
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 59/124 (47%), Positives = 79/124 (63%), Gaps = 1/124 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+C CR S+CLKLYCECF+AG++C + C+C NC N HE ++ RNP AF P
Sbjct: 453 KRCKCRKSQCLKLYCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAP 512
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ S+ D + + +A+H +GC+C+KSGC KKYCECF + CS NCRC+ CK
Sbjct: 513 KVVSTSDTVIDLWVENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCK 572
Query: 273 NFEG 276
N G
Sbjct: 573 NTFG 576
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEH 191
+ + ++ CNCR S C K YCECF G+ C C C+ C N H
Sbjct: 533 ASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFGH 577
>gi|255553069|ref|XP_002517577.1| tso1, putative [Ricinus communis]
gi|223543209|gb|EEF44741.1| tso1, putative [Ricinus communis]
Length = 348
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 61/124 (49%), Positives = 76/124 (61%), Gaps = 1/124 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYCECFAAG+YC D C C +C N E+E + RNP AF P
Sbjct: 20 KRCNCKRSKCLKLYCECFAAGVYCLDSCACEDCFNRPEYEDTVLDTRQQIEARNPLAFAP 79
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ + E +A+H +GC+CKKS CLKKYCEC+QA + CS CRC CK
Sbjct: 80 KVVKQATNSPANEEGNWTTPSSARHKRGCNCKKSKCLKKYCECYQAGVGCSSGCRCEGCK 139
Query: 273 NFEG 276
N G
Sbjct: 140 NSYG 143
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 33/53 (62%), Gaps = 4/53 (7%)
Query: 141 ANTDGKDGTP---KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNV 189
AN +G TP + ++ CNC+ S+CLK YCEC+ AG+ C GC C C N+
Sbjct: 90 ANEEGNWTTPSSARHKRGCNCKKSKCLKKYCECYQAGVGCSSGCRCEGCKNSY 142
>gi|452819762|gb|EME26815.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
sulphuraria]
Length = 444
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 61/119 (51%), Positives = 78/119 (65%), Gaps = 5/119 (4%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC+ S+CLKLYCECFA+G YC C C C NN ++ + A TLERNP AF PKI
Sbjct: 318 CNCKRSQCLKLYCECFASGSYCTSKCKCNGCKNNGDNANHVKTARERTLERNPRAFSPKI 377
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+ S + + R A H++GC+CK+S C KKYCECFQA + C +NC+C+DCKN
Sbjct: 378 SESTAAVTEEGQVLR----VAAHHRGCNCKRSNCRKKYCECFQAGVPCGDNCKCVDCKN 432
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 24/36 (66%), Gaps = 1/36 (2%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
+ CNC+ S C K YCECF AG+ C D C CV+C N
Sbjct: 397 HRGCNCKRSNCRKKYCECFQAGVPCGDNCKCVDCKN 432
>gi|403352890|gb|EJY75972.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
trifallax]
Length = 1042
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 79/132 (59%), Gaps = 20/132 (15%)
Query: 143 TDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGA 201
+D K P K CNC+ S+CLKLYC+CFA G+ C CNC +C N E R++A+
Sbjct: 642 SDNKSTQPLKTT-CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCAN-TEGNEERKQAMDQ 699
Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
ERNPNAF+PKI Q H+KGCHCKKSGCLKKYCEC+Q+ ++
Sbjct: 700 ITERNPNAFKPKI-----------------QEKGFHSKGCHCKKSGCLKKYCECYQSGVV 742
Query: 262 CSENCRCLDCKN 273
C+ C C CKN
Sbjct: 743 CTNLCACEGCKN 754
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C+CKKS CLK YC+CF + CS +C C DC N EG+EER+
Sbjct: 654 CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANTEGNEERK 694
>gi|449522630|ref|XP_004168329.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 3-like
[Cucumis sativus]
Length = 788
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 58/132 (43%), Positives = 83/132 (62%), Gaps = 1/132 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N HE RNP AF P
Sbjct: 493 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATRKQIESRNPLAFAP 552
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ + + +++ + +A+H +GC+CKKS CLKKYCEC+Q + CS +CRC CK
Sbjct: 553 KVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCK 612
Query: 273 NFEGSEERRALF 284
N G ++ ++
Sbjct: 613 NAFGRKDGSSIL 624
>gi|449432678|ref|XP_004134126.1| PREDICTED: protein tesmin/TSO1-like CXC 3-like [Cucumis sativus]
Length = 788
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/132 (43%), Positives = 83/132 (62%), Gaps = 1/132 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N HE RNP AF P
Sbjct: 493 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATRKQIESRNPLAFAP 552
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ + + +++ + +A+H +GC+CKKS CLKKYCEC+Q + CS +CRC CK
Sbjct: 553 KVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCK 612
Query: 273 NFEGSEERRALF 284
N G ++ ++
Sbjct: 613 NAFGRKDGSSIL 624
>gi|344295768|ref|XP_003419583.1| PREDICTED: tesmin [Loxodonta africana]
Length = 503
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 108/194 (55%), Gaps = 31/194 (15%)
Query: 99 MPQWQARPQHVRMVNRVPHPVHKL-----PLPT---LPPGKQESPRSRPRANTDGKDGTP 150
+PQ+Q + + + + VP P+ L P+P L + E+ P N+ G P
Sbjct: 238 LPQYQDQSSYPQ--SDVPKPMTTLVGRFLPVPAKLNLITQQLENGALSPVVNSSALPGPP 295
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K L YC+CFA+G +C+ C+ NC NN+ HE+ R +A+ A L+RNP AF
Sbjct: 296 KLT----------LAGYCDCFASGDFCNNCSRHNCCNNLRHEIERFKAIKACLDRNPEAF 345
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+PKI + G + +HNKGC+C++SGCLK YCEC++A I+CS C+C+
Sbjct: 346 QPKIGTGNLG-----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIG 394
Query: 271 CKNFEGSEERRALF 284
CKN++ S ER+ L
Sbjct: 395 CKNYDESPERKTLM 408
>gi|357606575|gb|EHJ65118.1| hypothetical protein KGM_22197 [Danaus plexippus]
Length = 844
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 74/151 (49%), Positives = 100/151 (66%), Gaps = 12/151 (7%)
Query: 133 ESPRSRPRANTD---GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
+SP + + TD + G + +K CNC S+CLKLYC+CFA G +C+ CNC NCHNN+
Sbjct: 605 QSPTTPALSETDQVTTELGGLRPRKACNCTKSQCLKLYCDCFANGEFCNRCNCNNCHNNL 664
Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
E+E RQ+A+ L+RNPNAFRPKI S G ++ +HNKGC+CK+SGCL
Sbjct: 665 ENEELRQKAIRGCLDRNPNAFRPKIGKSKAGG---------PEIIRRHNKGCNCKRSGCL 715
Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEER 280
K YCEC++A I CS C+C+ C+N E + ER
Sbjct: 716 KNYCECYEAKIACSSICKCVGCRNVEETLER 746
>gi|356534189|ref|XP_003535640.1| PREDICTED: uncharacterized protein LOC100820491 [Glycine max]
Length = 760
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 59/127 (46%), Positives = 79/127 (62%), Gaps = 1/127 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYCECFAAG+YC + C C +C N H + RNP AF P
Sbjct: 475 KRCNCKKSKCLKLYCECFAAGVYCIEPCACHDCFNKPIHVETVLQTRQQIESRNPLAFAP 534
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K+ S + +D + +A+H +GC+CKKS CLKKYCEC+Q + CS +CRC CK
Sbjct: 535 KVIRSSDSVSEIGDDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCK 594
Query: 273 NFEGSEE 279
N G ++
Sbjct: 595 NTYGRKD 601
>gi|302843447|ref|XP_002953265.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
nagariensis]
gi|300261362|gb|EFJ45575.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
nagariensis]
Length = 1987
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 62/148 (41%), Positives = 85/148 (57%), Gaps = 25/148 (16%)
Query: 145 GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE---VARQEAVGA 201
G+ + K C C+ S+CLKLYC+CFAAG YC C+C++CHN EH + R+E + A
Sbjct: 779 GRRTSENSSKSCRCKKSQCLKLYCDCFAAGQYCGSCSCISCHNRPEHADRVLQRREDIAA 838
Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
R+P AF KI +P+G KH +GC+C+KS CLKKYCEC+Q +
Sbjct: 839 ---RDPQAFTRKIQLAPNG-------------NGKHKRGCNCRKSHCLKKYCECYQGGVK 882
Query: 262 CSENCRCLDCKNF------EGSEERRAL 283
C C C++C+N EG+ R AL
Sbjct: 883 CGIQCTCMECENMDVGSSQEGAGARGAL 910
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 33/74 (44%), Gaps = 1/74 (1%)
Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
G NPNA R A R + + +K C CKKS CLK YC+CF A
Sbjct: 749 GINANANPNAHRSSGAGGGLTQIQGGGPNRGRRTSENSSKSCRCKKSQCLKLYCDCFAAG 808
Query: 260 ILCSENCRCLDCKN 273
C +C C+ C N
Sbjct: 809 QYCG-SCSCISCHN 821
>gi|359482667|ref|XP_003632803.1| PREDICTED: uncharacterized protein LOC100855293 [Vitis vinifera]
Length = 613
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 85/143 (59%), Gaps = 5/143 (3%)
Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNP 207
T +K CNC+ ++CLKLYC+C AAG+YC D C C NC N E+E Q R+P
Sbjct: 282 TESDRKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDP 341
Query: 208 NAFRPKIASSPHGAQD--AREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
AF P+I + D +ED +A+H +GC+CKKS C KKYCEC+QA I CS+
Sbjct: 342 LAFAPRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQAGIGCSD 401
Query: 265 NCRCLDCKN-FEGSEERRALFHG 286
CRC DC+N F +RRA G
Sbjct: 402 GCRCEDCRNSFGIKADRRAERWG 424
>gi|405973431|gb|EKC38148.1| lin-54-like protein [Crassostrea gigas]
Length = 764
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/119 (52%), Positives = 79/119 (66%), Gaps = 11/119 (9%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CFA G +C CNC NC NN++HE R A+ + L+RNP AF PKI G +D R
Sbjct: 529 YCDCFANGEFCHNCNCNNCANNLDHEEERSRAIKSCLDRNPMAFHPKIGKGRDGERDRR- 587
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
HNKGC+CK+SGCLK YCEC++A I+CS +C+C+ CKNFE S ER+ L H
Sbjct: 588 ----------HNKGCNCKRSGCLKNYCECYEAKIMCSSSCKCIGCKNFEESPERKTLMH 636
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 3/46 (6%)
Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
G+DG ++ K CNC+ S CLK YCEC+ A I C C C+ C N
Sbjct: 579 GRDGERDRRHNKGCNCKRSGCLKNYCECYEAKIMCSSSCKCIGCKN 624
>gi|326427032|gb|EGD72602.1| hypothetical protein PTSG_04338 [Salpingoeca sp. ATCC 50818]
Length = 584
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 58/111 (52%), Positives = 74/111 (66%), Gaps = 12/111 (10%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+ CNC SRCLKLYCEC+AAG YC C C NC NN+E+E RQ AV +TL++NP+AFRP
Sbjct: 407 RSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQLAVLSTLDKNPDAFRP 466
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 263
KI Q+ + + A H +GC C KS CL++YCECFQA + C+
Sbjct: 467 KIK------QETTTNRK-----ANHLRGCKCTKSKCLQRYCECFQAGVPCT 506
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ C+C KS CLK YCEC+ A C C+C +C+N +E RR L
Sbjct: 407 RSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQL 451
>gi|118352258|ref|XP_001009402.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89291169|gb|EAR89157.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 805
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/148 (41%), Positives = 94/148 (63%), Gaps = 7/148 (4%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLER 205
+G+ KK + CNC+ ++CLKLYC+CFAAG +C CNC C N V ++ AR + V LE+
Sbjct: 471 NGSNKKTRICNCKKTKCLKLYCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEGLLEK 530
Query: 206 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 265
NP AF K +Q + + + +LA+K KGC+C++SGCLKKYC+C+Q + C E+
Sbjct: 531 NPFAFNVKDIEIEEPSQLSLKAQK--KLASK--KGCNCRRSGCLKKYCQCYQDGLQCGEH 586
Query: 266 CRCLDCKNFEGSEERRALFHGDHNGIAF 293
C+C C+N E + + LF + N ++
Sbjct: 587 CKCNGCENCE--KPAKKLFLAEQNSLSL 612
>gi|224143100|ref|XP_002324849.1| predicted protein [Populus trichocarpa]
gi|222866283|gb|EEF03414.1| predicted protein [Populus trichocarpa]
Length = 561
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/161 (44%), Positives = 94/161 (58%), Gaps = 16/161 (9%)
Query: 132 QESPRSRPR----ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCH 186
Q SP+ + + ++TDG DG K+CNC+ +RCLK YC+CFAAGIYC + C C C
Sbjct: 405 QSSPKKKRQDHKVSSTDG-DGC----KRCNCKKTRCLKRYCDCFAAGIYCAETCACQGCF 459
Query: 187 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----AAKHNKGCH 242
N +E+E EA NP AF PKI H + + +A L + +H GC+
Sbjct: 460 NRLEYEDTVLEARQQKESHNPLAFAPKIVQ--HVTEFHAINVEDASLFTPSSGRHKTGCN 517
Query: 243 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
CK+S C+KKYCEC+QAN+ CS CRC CKN G +E L
Sbjct: 518 CKRSMCVKKYCECYQANVGCSNACRCEGCKNIHGRKEGEKL 558
>gi|414871812|tpg|DAA50369.1| TPA: hypothetical protein ZEAMMB73_080577 [Zea mays]
Length = 662
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 100/190 (52%), Gaps = 34/190 (17%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQ-----------CNCRNSRCLKLYCECFAAGIYCDG-CN 181
S ++ P A+ G+ KK+++ CNCR S+CLKLYC CFAA +YC C+
Sbjct: 334 SQKTMPNADKSGQQIHKKKRRKLQKDDGESCRHCNCRRSKCLKLYCPCFAAKVYCSSLCS 393
Query: 182 CVNCHNNVEHEVARQEAVGATLER----NPNAFRPKIASSPHGAQDAREDAREAQLAAKH 237
C C NN HE E V T +R NP AF P + + + +D+ + +A+H
Sbjct: 394 CQGCSNNHAHE----ELVSCTRKRTESHNPLAFAPTVTHTYGTVLEFGDDSNKTPASARH 449
Query: 238 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF------------- 284
+GC+C+KS CLKKYCECFQ+ + C +CRC+ CKN G E L
Sbjct: 450 KRGCNCRKSSCLKKYCECFQSGVGCCISCRCMSCKNSFGKREGVLLLTTEKLEKGGKAKG 509
Query: 285 -HGDHNGIAF 293
HG+ + +AF
Sbjct: 510 THGEEDKLAF 519
>gi|422292803|gb|EKU20105.1| tesmin tso1-like cxc domain-containing protein [Nannochloropsis
gaditana CCMP526]
Length = 365
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 108/227 (47%), Gaps = 56/227 (24%)
Query: 89 QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDG 148
Q +S + + P +RP V R +P QE+ RS P + G G
Sbjct: 98 QRRNQSSNNSHPGGSSRPVRVEAAIRRQNP------------SQEARRSPPLNGSGG--G 143
Query: 149 TP-----------KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQ 196
TP + K+CNCR S+CLKLYCECFA G YC D C C++C+N + E R
Sbjct: 144 TPQGRGTSGLAATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVRT 203
Query: 197 EAVGATLERNPNAFRPKI-----------ASSPH------GAQDAREDAREAQL------ 233
+AV + L RNP+AF KI A H G Q + D E +
Sbjct: 204 QAVESILSRNPHAFTEKIQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQG 263
Query: 234 -------AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
A H +GC+CKKS C+KKYCECF A + C ENC+C C+N
Sbjct: 264 TAGGALPTAAHVRGCNCKKSLCIKKYCECFFAGVYCGENCQCEGCQN 310
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 30/48 (62%)
Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
A + +K C+C+KS CLK YCECF C ++C CLDC N E+ R
Sbjct: 155 ATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVR 202
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 26/38 (68%), Gaps = 1/38 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
+ CNC+ S C+K YCECF AG+YC + C C C NN E
Sbjct: 276 RGCNCKKSLCIKKYCECFFAGVYCGENCQCEGCQNNPE 313
>gi|89111949|ref|NP_001034746.1| tesmin isoform a [Mus musculus]
gi|205371774|sp|Q9WTJ6.2|MTL5_MOUSE RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific; AltName: Full=Testis-specific
metallothionein-like protein
gi|15617422|dbj|BAB64935.1| tesmin [Mus musculus]
gi|19353778|gb|AAH24377.1| Metallothionein-like 5, testis-specific (tesmin) [Mus musculus]
gi|148700990|gb|EDL32937.1| mCG3887, isoform CRA_d [Mus musculus]
Length = 475
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 57/125 (45%), Positives = 80/125 (64%), Gaps = 15/125 (12%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+PK+ GA
Sbjct: 275 LSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 330
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 331 LR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKML 379
Query: 284 FHGDH 288
H
Sbjct: 380 MSTPH 384
Score = 45.1 bits (105), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEE 372
>gi|47228462|emb|CAG05282.1| unnamed protein product [Tetraodon nigroviridis]
Length = 798
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/122 (51%), Positives = 80/122 (65%), Gaps = 11/122 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA G +C+ CNC NC NN+EHE R +A+ L+RNP AF+PKI G D
Sbjct: 587 LNGYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESD 646
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L
Sbjct: 647 RR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTL 695
Query: 284 FH 285
H
Sbjct: 696 MH 697
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 3/46 (6%)
Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
GK+G ++ K CNC+ S CLK YCEC+ A I C C C+ C N
Sbjct: 640 GKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKN 685
>gi|15617420|dbj|BAB64934.1| tesmin [Mus musculus]
Length = 475
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 57/125 (45%), Positives = 80/125 (64%), Gaps = 15/125 (12%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+PK+ GA
Sbjct: 275 LSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 330
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 331 LR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKML 379
Query: 284 FHGDH 288
H
Sbjct: 380 MSTPH 384
Score = 45.1 bits (105), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEE 372
>gi|118349584|ref|XP_001008073.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89289840|gb|EAR87828.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 471
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/144 (43%), Positives = 80/144 (55%), Gaps = 21/144 (14%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNA 209
KKQK CNC+ ++CLKLYC+CFAAG C G CNC C N E R A+ + ++R P+A
Sbjct: 127 KKQKTCNCKKTKCLKLYCDCFAAGELCGGECNCFGCFNTSNSE-KRAGAIVSIMDRQPDA 185
Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 269
F PK+ + H KGC+C +SGCLKKYCEC+ + C E C+C
Sbjct: 186 FGPKVQQNTHK------------------KGCNCTRSGCLKKYCECYTLGVNCGEYCKCT 227
Query: 270 DCKNFE-GSEERRALFHGDHNGIA 292
CKN + ER AL NGI
Sbjct: 228 QCKNVDLNHSERLALSMNGSNGIG 251
>gi|37718857|gb|AAR01728.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 766
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 89/156 (57%), Gaps = 5/156 (3%)
Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGC 180
+P G++ R R + D D K+C+C+ S+CLKLYCECFA+ +YC + C
Sbjct: 443 MPTSAGNSGQENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKLYCECFASKVYCSESC 498
Query: 181 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 240
+C C N+ HE RNP AF PK+ + + ED+ +++H +G
Sbjct: 499 SCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFGEDSNATPASSRHKRG 558
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
C+CK+S C+KKYCECFQ+ + CS +CRC +CKN G
Sbjct: 559 CNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 594
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 471 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 511
>gi|108710010|gb|ABF97805.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
Length = 757
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 89/156 (57%), Gaps = 5/156 (3%)
Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGC 180
+P G++ R R + D D K+C+C+ S+CLKLYCECFA+ +YC + C
Sbjct: 434 MPTSAGNSGQENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKLYCECFASKVYCSESC 489
Query: 181 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 240
+C C N+ HE RNP AF PK+ + + ED+ +++H +G
Sbjct: 490 SCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFGEDSNATPASSRHKRG 549
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
C+CK+S C+KKYCECFQ+ + CS +CRC +CKN G
Sbjct: 550 CNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 585
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 462 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 502
>gi|358340153|dbj|GAA48108.1| protein lin-54 homolog [Clonorchis sinensis]
Length = 450
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 71/132 (53%), Positives = 82/132 (62%), Gaps = 14/132 (10%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYCECFA G C CNC NC NN+ +E R A+ TLERNP AF P
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCHNCMNNLAYEEERGRAIKMTLERNPTAFHP 203
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI E R KH KGC+CK+SGCLK YCEC++A I CS+ CRC C+
Sbjct: 204 KIGRG--------EGER------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCR 249
Query: 273 NFEGSEERRALF 284
N E S ERR+L
Sbjct: 250 NTEDSVERRSLM 261
>gi|387219095|gb|AFJ69256.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 365
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 82/227 (36%), Positives = 108/227 (47%), Gaps = 56/227 (24%)
Query: 89 QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDG 148
Q +S + + P +RP V R +P QE+ RS P + G G
Sbjct: 98 QRRNQSSNNSHPGGSSRPVRVEAAIRRQNP------------SQEARRSPPLNGSGG--G 143
Query: 149 TP-----------KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQ 196
TP + K+CNCR S+CLKLYCECFA G YC D C C++C+N + E R
Sbjct: 144 TPQGRGTSGLAATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVRT 203
Query: 197 EAVGATLERNPNAFRPKI-----------ASSPH------GAQDAREDAREAQL------ 233
+AV + L RNP+AF KI A H G Q + D E +
Sbjct: 204 QAVESILSRNPHAFTEKIQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQG 263
Query: 234 -------AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
A H +GC+CKKS C++KYCECF A + C ENC+C C+N
Sbjct: 264 TAGGALPTAAHVRGCNCKKSLCIQKYCECFFAGVYCGENCQCEGCQN 310
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 30/48 (62%)
Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
A + +K C+C+KS CLK YCECF C ++C CLDC N E+ R
Sbjct: 155 ATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVR 202
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 26/38 (68%), Gaps = 1/38 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
+ CNC+ S C++ YCECF AG+YC + C C C NN E
Sbjct: 276 RGCNCKKSLCIQKYCECFFAGVYCGENCQCEGCQNNPE 313
>gi|222625427|gb|EEE59559.1| hypothetical protein OsJ_11844 [Oryza sativa Japonica Group]
Length = 644
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 89/156 (57%), Gaps = 5/156 (3%)
Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGC 180
+P G++ R R + D D K+C+C+ S+CLKLYCECFA+ +YC + C
Sbjct: 321 MPTSAGNSGQENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKLYCECFASKVYCSESC 376
Query: 181 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 240
+C C N+ HE RNP AF PK+ + + ED+ +++H +G
Sbjct: 377 SCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFGEDSNATPASSRHKRG 436
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
C+CK+S C+KKYCECFQ+ + CS +CRC +CKN G
Sbjct: 437 CNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 472
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 349 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 389
>gi|148700988|gb|EDL32935.1| mCG3887, isoform CRA_b [Mus musculus]
Length = 360
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 57/125 (45%), Positives = 80/125 (64%), Gaps = 15/125 (12%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+PK+ GA
Sbjct: 160 LSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 215
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 216 LR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKML 264
Query: 284 FHGDH 288
H
Sbjct: 265 MSTPH 269
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 220 KGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEE 257
>gi|349501047|ref|NP_001012069.2| tesmin [Rattus norvegicus]
gi|205829210|sp|Q5XHX9.2|MTL5_RAT RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific
Length = 475
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 57/125 (45%), Positives = 79/125 (63%), Gaps = 15/125 (12%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+PK+ GA
Sbjct: 275 LAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 330
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
R H+KGC+CK+SGCLK YCEC++A I CS C+C+ CKN+E S ER+ L
Sbjct: 331 LR-----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEESPERKML 379
Query: 284 FHGDH 288
H
Sbjct: 380 MSTPH 384
Score = 45.1 bits (105), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEE 372
>gi|387192746|gb|AFJ68672.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 510
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 61/122 (50%), Positives = 72/122 (59%), Gaps = 15/122 (12%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC+ SRCLKLYC+CFAA +C CNC NC N R EA+ +ERNPNAF+ K
Sbjct: 225 CNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKF 284
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
P+G EDA H GC C+KS CLKKYCECF A CS+ C C+ C+N
Sbjct: 285 --RPNG-----EDA-------VHKMGCKCRKSACLKKYCECFNAGARCSDKCSCVGCQNV 330
Query: 275 EG 276
G
Sbjct: 331 AG 332
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%)
Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
GC+CK+S CLK YC+CF A C +C C +C N G RA
Sbjct: 224 GCNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRA 266
>gi|53733621|gb|AAH83920.1| Metallothionein-like 5, testis-specific (tesmin) [Rattus
norvegicus]
Length = 467
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 57/125 (45%), Positives = 79/125 (63%), Gaps = 15/125 (12%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+PK+ GA
Sbjct: 267 LAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 322
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
R H+KGC+CK+SGCLK YCEC++A I CS C+C+ CKN+E S ER+ L
Sbjct: 323 LR-----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEESPERKML 371
Query: 284 FHGDH 288
H
Sbjct: 372 MSTPH 376
Score = 45.1 bits (105), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 324 RHSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEE 364
>gi|354496627|ref|XP_003510427.1| PREDICTED: tesmin-like [Cricetulus griseus]
Length = 380
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 56/125 (44%), Positives = 79/125 (63%), Gaps = 15/125 (12%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C C+C NN+ HE+ R +A+ A ++RNP AF+PK+ G
Sbjct: 180 LAGYCDCFASGDFCTSCSC----NNLRHELERFKAIKACIDRNPEAFQPKMGKGRLGVSK 235
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 236 LR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKML 284
Query: 284 FHGDH 288
+ H
Sbjct: 285 MNTPH 289
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 3/49 (6%)
Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
G+ G K + K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 229 GRLGVSKLRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEE 277
>gi|299472731|emb|CBN80299.2| TSO1; transcription factor (Partial) [Ectocarpus siliculosus]
Length = 792
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/143 (43%), Positives = 77/143 (53%), Gaps = 19/143 (13%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC+ S+CLKLYC+CFA + C CNC +C+NN E R +A+ LERNP AF
Sbjct: 26 CNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYNNANQESMRLQAINGVLERNPAAF---- 81
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
DA+ E A H GC C+KS CLKKYCECF + CSE C C+ C+N
Sbjct: 82 --------DAKFRETEKHDAVVHKTGCKCRKSACLKKYCECFNKGVACSEKCNCVGCRNT 133
Query: 275 EGSEERRALFHGDHNGIAFMQRA 297
AL+ G A +RA
Sbjct: 134 V------ALYSEYRGGGAAAERA 150
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 26/42 (61%)
Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
GC+CK+S CLK YC+CF ++C+ +C C C N E R
Sbjct: 25 GCNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYNNANQESMR 66
>gi|422294067|gb|EKU21367.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 511
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 61/122 (50%), Positives = 72/122 (59%), Gaps = 15/122 (12%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC+ SRCLKLYC+CFAA +C CNC NC N R EA+ +ERNPNAF+ K
Sbjct: 225 CNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKF 284
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
P+G EDA H GC C+KS CLKKYCECF A CS+ C C+ C+N
Sbjct: 285 --RPNG-----EDA-------VHKMGCKCRKSACLKKYCECFNAGARCSDKCSCVGCQNV 330
Query: 275 EG 276
G
Sbjct: 331 AG 332
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%)
Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
GC+CK+S CLK YC+CF A C +C C +C N G RA
Sbjct: 224 GCNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRA 266
>gi|4581559|gb|AAD24666.1|U67176_1 tesmin-1 [Mus musculus]
gi|4581561|gb|AAD24667.1|U77383_1 tesmin-2 [Mus musculus]
gi|21464732|gb|AAM54491.1|AF329359_1 tesmin [Mus musculus]
gi|21464734|gb|AAM54492.1|AF329360_1 tesmin [Mus musculus]
gi|148700989|gb|EDL32936.1| mCG3887, isoform CRA_c [Mus musculus]
gi|148700991|gb|EDL32938.1| mCG3887, isoform CRA_c [Mus musculus]
Length = 295
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 57/125 (45%), Positives = 80/125 (64%), Gaps = 15/125 (12%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+PK+ GA
Sbjct: 95 LSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 150
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 151 LR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKML 199
Query: 284 FHGDH 288
H
Sbjct: 200 MSTPH 204
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 3/49 (6%)
Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
G+ G K + K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 144 GRLGAAKLRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEE 192
>gi|256089961|ref|XP_002581002.1| tesmin-related [Schistosoma mansoni]
Length = 364
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 71/133 (53%), Positives = 82/133 (61%), Gaps = 14/133 (10%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYCECFA G C CNC NC NN+ +E R A+ TLERNP AF P
Sbjct: 145 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 204
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI E R KH KGC+CK+SGCLK YCEC++A I CS+ CRC C+
Sbjct: 205 KIGRG--------EGER------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCR 250
Query: 273 NFEGSEERRALFH 285
N E S ERR+L
Sbjct: 251 NTEDSVERRSLMR 263
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 40/77 (51%), Gaps = 5/77 (6%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE----HEVARQEAVGA 201
+G K K CNC+ S CLK YCEC+ A I C D C C C N + + R A+GA
Sbjct: 210 EGERKHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCRNTEDSVERRSLMRLAAMGA 269
Query: 202 TLERNPNAFRPKIASSP 218
R P++F+ + P
Sbjct: 270 LRSRQPSSFKKHLVKPP 286
>gi|360044492|emb|CCD82040.1| tesmin-related [Schistosoma mansoni]
Length = 363
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 71/133 (53%), Positives = 82/133 (61%), Gaps = 14/133 (10%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+K CNC S CLKLYCECFA G C CNC NC NN+ +E R A+ TLERNP AF P
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 203
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI E R KH KGC+CK+SGCLK YCEC++A I CS+ CRC C+
Sbjct: 204 KIGRG--------EGER------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCR 249
Query: 273 NFEGSEERRALFH 285
N E S ERR+L
Sbjct: 250 NTEDSVERRSLMR 262
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 40/77 (51%), Gaps = 5/77 (6%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE----HEVARQEAVGA 201
+G K K CNC+ S CLK YCEC+ A I C D C C C N + + R A+GA
Sbjct: 209 EGERKHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCRNTEDSVERRSLMRLAAMGA 268
Query: 202 TLERNPNAFRPKIASSP 218
R P++F+ + P
Sbjct: 269 LRSRQPSSFKKHLVKPP 285
>gi|326495114|dbj|BAJ85653.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 766
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 94/152 (61%), Gaps = 10/152 (6%)
Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN--- 187
Q SP+ + R D DGT K+C+C+ S+CLKLYCECF AG++C + C+C +C N
Sbjct: 440 QGSPKKK-RHKFDDGDGT--SCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQDCLNKPS 496
Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
N+E ++ ++ + + RNP AF PK+ + Q+ E + +A+H +GC+CKKS
Sbjct: 497 NMEIVLSTRKQIES---RNPLAFAPKVIRTSEPGQELGEYSNRTPASARHKRGCNCKKSS 553
Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
CLKKYCEC+Q + CS +CR CKN G E
Sbjct: 554 CLKKYCECYQGGVGCSISCRYEGCKNAFGRRE 585
>gi|428180550|gb|EKX49417.1| hypothetical protein GUITHDRAFT_43196, partial [Guillardia theta
CCMP2712]
Length = 113
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 60/125 (48%), Positives = 78/125 (62%), Gaps = 12/125 (9%)
Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 208
+ ++K C+C+ SRCLKLYCECFAAG C GC CV+C N+ +HE R +AV +RN N
Sbjct: 1 SDSRKKVCSCKKSRCLKLYCECFAAGEICSGCKCVDCANDGDHEDMRLQAVDTIKQRNNN 60
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
AF PKI Q H +GC CKKS CLKKYCEC+QA + C++ C+C
Sbjct: 61 AFAPKIVDEIQ------------QDKGMHARGCRCKKSHCLKKYCECYQAGVQCTDKCKC 108
Query: 269 LDCKN 273
+C+N
Sbjct: 109 EECQN 113
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
K C CKKS CLK YCECF A +CS C+C+DC N E+ R
Sbjct: 6 KVCSCKKSRCLKLYCECFAAGEICS-GCKCVDCANDGDHEDMR 47
>gi|224099825|ref|XP_002311635.1| predicted protein [Populus trichocarpa]
gi|222851455|gb|EEE89002.1| predicted protein [Populus trichocarpa]
Length = 749
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 83/143 (58%), Gaps = 3/143 (2%)
Query: 144 DGKDGTPKKQKQCN--CRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVG 200
D +PKK++ R + L+LYCECFAAG+YC + C C +C N HE
Sbjct: 445 DFSQTSPKKKRHVGRRSRGLQALQLYCECFAAGVYCIEPCACQDCFNKPIHEDTVLATRK 504
Query: 201 ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANI 260
RNP AF PK+ S A + +++ + +A+H +GC+CKKS CLKKYCEC+Q +
Sbjct: 505 QIESRNPLAFAPKVIRSSEPAPEIGDESSKTPASARHKRGCNCKKSSCLKKYCECYQGGV 564
Query: 261 LCSENCRCLDCKNFEGSEERRAL 283
CS NCRC CKN G ++ AL
Sbjct: 565 GCSLNCRCEGCKNAFGRKDGSAL 587
>gi|118383712|ref|XP_001025010.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89306777|gb|EAS04765.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1040
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 37/180 (20%)
Query: 136 RSRPRANTDGKDG------------TPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNC 182
R R N DG T K++ +CNC+ S+CLKLYC+CF G +C+ CNC
Sbjct: 682 REREEGNLKDLDGIVEYEQTLRPQTTSKQKVKCNCKKSKCLKLYCDCFNQGQFCNSECNC 741
Query: 183 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED--------------- 227
C N ++ R+ A+ ERNP+AF+PKI + Q
Sbjct: 742 TECSNTENNKAERESAIKQLQERNPDAFKPKIQTKEQIIQQYLSKLQIFMLIISSLLSKN 801
Query: 228 ---------AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 278
++ + HNKGC+CKKSGC KKYCEC+ + CS+ C+C C+N + SE
Sbjct: 802 QIQYLIPKIIKDDEEILIHNKGCNCKKSGCEKKYCECYNTGVKCSDQCKCEGCRNRDPSE 861
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 28/50 (56%)
Query: 232 QLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
Q +K C+CKKS CLK YC+CF C+ C C +C N E ++ R
Sbjct: 705 QTTSKQKVKCNCKKSKCLKLYCDCFNQGQFCNSECNCTECSNTENNKAER 754
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 24/43 (55%), Gaps = 1/43 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVAR 195
K CNC+ S C K YCEC+ G+ C D C C C N E+ +
Sbjct: 822 KGCNCKKSGCEKKYCECYNTGVKCSDQCKCEGCRNRDPSEIVK 864
>gi|302789393|ref|XP_002976465.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
gi|300156095|gb|EFJ22725.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
Length = 710
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 69/176 (39%), Positives = 90/176 (51%), Gaps = 35/176 (19%)
Query: 136 RSRPRANTDGKDG----TPKKQ------------KQCNCRNSRCLKLYCECFAAGIYC-D 178
R R +D DG TPKK+ K+C+C+ S+CLKLYCECFAAG+YC D
Sbjct: 356 RKRLSMQSDSMDGEMGRTPKKRRKSLGEKAGDGCKRCHCKKSKCLKLYCECFAAGVYCLD 415
Query: 179 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS--------------SPHGAQDA 224
C C +C N E RNP AF PKI S H ++
Sbjct: 416 SCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKIVQTSDNTPTEGVGIWFSGHAMTNS 475
Query: 225 ----REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
+ + + +A+H +GC+CKKS CLKKYCEC+QA + CS+ C+C CKN G
Sbjct: 476 LFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKNIYG 531
>gi|301095347|ref|XP_002896774.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262108657|gb|EEY66709.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 394
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 64/160 (40%), Positives = 89/160 (55%), Gaps = 22/160 (13%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPN 208
P++ K C+C+ S CLKLYCEC AA CD CNC C N E R+ AV A L+RNP
Sbjct: 45 PERTKGCSCKKSNCLKLYCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAILDRNPL 104
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
AF+PK+AS +++H +GC+C+KSGC+K YCEC QA + C+ C C
Sbjct: 105 AFQPKVASG----------------SSQHLRGCNCRKSGCMKNYCECHQAGVACTSRCAC 148
Query: 269 LDCKNFEG--SEERRALFHG---DHNGIAFMQRAANAAII 303
C+N E S ++ +F G D N + R + ++
Sbjct: 149 HQCRNTETFVSAKKMLVFAGVSADDNDLRVTIRPSQRKLV 188
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 3/75 (4%)
Query: 230 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE---GSEERRALFHG 286
+ Q + KGC CKKS CLK YCEC A +C C C CKN ER
Sbjct: 40 QVQTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAIL 99
Query: 287 DHNGIAFMQRAANAA 301
D N +AF + A+ +
Sbjct: 100 DRNPLAFQPKVASGS 114
Score = 46.6 bits (109), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 34/68 (50%), Gaps = 1/68 (1%)
Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERN 206
G+ + + CNCR S C+K YCEC AG+ C C C C N A++ V A + +
Sbjct: 113 GSSQHLRGCNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAGVSAD 172
Query: 207 PNAFRPKI 214
N R I
Sbjct: 173 DNDLRVTI 180
>gi|2244834|emb|CAB10256.1| hypothetical protein [Arabidopsis thaliana]
gi|7268183|emb|CAB78519.1| hypothetical protein [Arabidopsis thaliana]
Length = 658
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 74/120 (61%), Gaps = 1/120 (0%)
Query: 165 KLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
K YCECFAAG+YC + C+C++C N HE RNP AF PK+ + Q+
Sbjct: 370 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQE 429
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+DA + +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G ++ ++
Sbjct: 430 TGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGSSI 489
Score = 44.7 bits (104), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
+ ++ CNC+ S CLK YCEC+ G+ C C C C N
Sbjct: 442 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKN 479
>gi|348686835|gb|EGZ26649.1| hypothetical protein PHYSODRAFT_473582 [Phytophthora sojae]
Length = 407
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/140 (44%), Positives = 81/140 (57%), Gaps = 19/140 (13%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPN 208
P++ K C+C+ S CLKLYCEC AA CD CNC C N E R+ AV A LERNP
Sbjct: 45 PERTKGCSCKKSNCLKLYCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAILERNPL 104
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
AF PK+AS +++H +GC+C+KSGC+K YCEC QA + C+ C C
Sbjct: 105 AFLPKVASG----------------SSQHLRGCNCRKSGCMKNYCECHQAGVACTSRCAC 148
Query: 269 LDCKNFEG--SEERRALFHG 286
C+N E S ++ +F G
Sbjct: 149 HQCRNTETFVSAKKMLVFAG 168
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 38/75 (50%), Gaps = 3/75 (4%)
Query: 230 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE---GSEERRALFHG 286
+ Q + KGC CKKS CLK YCEC A +C C C CKN + ER
Sbjct: 40 QVQTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAIL 99
Query: 287 DHNGIAFMQRAANAA 301
+ N +AF+ + A+ +
Sbjct: 100 ERNPLAFLPKVASGS 114
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERN 206
G+ + + CNCR S C+K YCEC AG+ C C C C N A++ V A + +
Sbjct: 113 GSSQHLRGCNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAGVSAD 172
Query: 207 PNAFR 211
++ R
Sbjct: 173 DSSLR 177
>gi|219109951|ref|XP_002176728.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411263|gb|EEC51191.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 490
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 71/122 (58%), Gaps = 11/122 (9%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C C+ +RCLKLYC+CF IYC C C++CHN E ARQ A+ L RNP+AF K
Sbjct: 217 CTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNVPAQEDARQNAMKVILSRNPHAFDTKF 276
Query: 215 ASSPHGAQDAREDAREAQLAAK---HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
+P D + +K H GC C+KS C+KKYCEC+ ++ C+ +CRC C
Sbjct: 277 QKTP-------VDGATVETPSKLLTHKLGCKCRKSACMKKYCECYAGHVYCNTHCRCTGC 329
Query: 272 KN 273
KN
Sbjct: 330 KN 331
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/67 (46%), Positives = 37/67 (55%), Gaps = 6/67 (8%)
Query: 221 AQDAREDARE-AQLAAKHNK-----GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
A D RE A +Q+ H + GC CKK+ CLK YC+CF I C NCRCLDC N
Sbjct: 191 ANDGREGATSPSQIETDHLEFVQAVGCTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNV 250
Query: 275 EGSEERR 281
E+ R
Sbjct: 251 PAQEDAR 257
>gi|313238970|emb|CBY13958.1| unnamed protein product [Oikopleura dioica]
Length = 394
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 52/111 (46%), Positives = 73/111 (65%), Gaps = 11/111 (9%)
Query: 164 LKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
LKLYCECF G YC C+C CHN ++ RQ+A+ + L+RN +AF+PK+A S
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS----- 259
Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
++ KH +GC+CK+SGCLKKYCEC+QA + C++ C C++CKN
Sbjct: 260 -----GKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVECKN 305
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 35/58 (60%), Gaps = 3/58 (5%)
Query: 138 RPRANTDGK--DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
+P+ GK + K + CNC+ S CLK YCEC+ AG+ C D CNCV C N +H+
Sbjct: 253 KPKVAKSGKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVECKNRDDHD 310
>gi|9279694|dbj|BAB01251.1| DNA binding protein-like [Arabidopsis thaliana]
Length = 593
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 52/106 (49%), Positives = 69/106 (65%), Gaps = 1/106 (0%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+CNC+ S+CLKLYCECFAAG YC + C+C+NC N H+ RNP AF P
Sbjct: 328 KRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAP 387
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
K+ + + EDA + +A+H +GC+CKKS CLKKYCEC+Q+
Sbjct: 388 KVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECYQS 433
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 43/88 (48%), Gaps = 5/88 (5%)
Query: 190 EHEVARQEAVGA----TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
+H++ R+ G +E P + + +P + R + ++ K C+CKK
Sbjct: 276 QHDIVRENESGKDSGQIIEVVPKSL-ASVDLTPISPKKKRRKSEQSGEGDSSCKRCNCKK 334
Query: 246 SGCLKKYCECFQANILCSENCRCLDCKN 273
S CLK YCECF A C E C C++C N
Sbjct: 335 SKCLKLYCECFAAGFYCIEPCSCINCFN 362
>gi|297800740|ref|XP_002868254.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
lyrata]
gi|297314090|gb|EFH44513.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
lyrata]
Length = 651
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 71/116 (61%), Gaps = 1/116 (0%)
Query: 165 KLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
K YCECFAAG+YC + C+C++C N HE RNP AF PK+ + +
Sbjct: 363 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSESVLE 422
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
+DA + +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G ++
Sbjct: 423 TGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD 478
Score = 45.1 bits (105), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
+ ++ CNC+ S CLK YCEC+ G+ C C C C N
Sbjct: 435 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKN 472
>gi|313245432|emb|CBY40166.1| unnamed protein product [Oikopleura dioica]
Length = 394
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 52/111 (46%), Positives = 72/111 (64%), Gaps = 11/111 (9%)
Query: 164 LKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
LKLYCECF G YC C+C CHN ++ RQ+A+ + L+RN +AF+PK+A S
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS----- 259
Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
++ KH +GC+CK+SGCLKKYCEC+QA + C++ C C+ CKN
Sbjct: 260 -----GKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVGCKN 305
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 35/58 (60%), Gaps = 3/58 (5%)
Query: 138 RPRANTDGK--DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
+P+ GK + K + CNC+ S CLK YCEC+ AG+ C D CNCV C N +H+
Sbjct: 253 KPKVAKSGKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVGCKNRDDHD 310
>gi|118399021|ref|XP_001031837.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89286171|gb|EAR84174.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 742
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 57/128 (44%), Positives = 75/128 (58%), Gaps = 19/128 (14%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLER 205
+ +P + K CNCRNS+CLK YC+CFAAG+YC C C CHN E+E RQ+A+ +R
Sbjct: 493 EKSPTEAKPCNCRNSKCLKRYCDCFAAGLYCQAECKCEECHNKPEYEDERQKAIQKKKKR 552
Query: 206 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 265
+AF P I ++ H KGCHCK S C KKYC C Q +LCS
Sbjct: 553 IKDAFLPVIQNNSHI------------------KGCHCKNSHCQKKYCVCHQNGVLCSSL 594
Query: 266 CRCLDCKN 273
C+C++C+N
Sbjct: 595 CQCVECEN 602
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 27/43 (62%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
K C+C+ S CLK+YC+CF A + C C+C +C N E+ R
Sbjct: 500 KPCNCRNSKCLKRYCDCFAAGLYCQAECKCEECHNKPEYEDER 542
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAV 199
K C+C+NS C K YC C G+ C C CV C N +E Q ++
Sbjct: 568 KGCHCKNSHCQKKYCVCHQNGVLCSSLCQCVECENKIESMKKEQSSI 614
>gi|323456808|gb|EGB12674.1| hypothetical protein AURANDRAFT_60658 [Aureococcus anophagefferens]
Length = 1430
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 58/138 (42%), Positives = 78/138 (56%), Gaps = 14/138 (10%)
Query: 145 GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVAR-----QEAV 199
G+DG+ K CNC+ S+CLKLYCEC+AAG++C CNC NC N E A A+
Sbjct: 1003 GRDGSGGGFKPCNCKKSKCLKLYCECYAAGVFCRDCNCSNCLNKPEDGEAHPPYPATHAL 1062
Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
G L A + +S R+ Q + GC CKKS CLKKYCECF+A
Sbjct: 1063 G-RLRHGDVAVPTAVLTSAQ--------KRKPQSKQRQESGCFCKKSKCLKKYCECFEAG 1113
Query: 260 ILCSENCRCLDCKNFEGS 277
+ C +C+C +C+N++GS
Sbjct: 1114 VHCEASCKCENCENYDGS 1131
>gi|224131306|ref|XP_002328506.1| predicted protein [Populus trichocarpa]
gi|222838221|gb|EEE76586.1| predicted protein [Populus trichocarpa]
Length = 414
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 69/114 (60%), Gaps = 1/114 (0%)
Query: 164 LKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
+K YCECFAAG+YC + C+C+ C NN HE E RNP AF PK+ +
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHEDTVLETRRQIESRNPLAFAPKVIRNSDSVS 197
Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
+ E+ + +A+H +GC+CKKS CLKKYCECFQ + CS CRC CKN G
Sbjct: 198 EFGEETNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSSYCRCEGCKNTFG 251
Score = 45.8 bits (107), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 30/55 (54%), Gaps = 7/55 (12%)
Query: 143 TDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGI------YCDGC-NCVNCHNNVE 190
T+ + + ++ CNC+ S CLK YCECF G+ C+GC N C N VE
Sbjct: 203 TNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSSYCRCEGCKNTFGCKNGVE 257
Score = 38.9 bits (89), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEE 279
+K YCECF A + C E C CL+C N E+
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHED 168
>gi|118373056|ref|XP_001019722.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89301489|gb|EAR99477.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 495
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 75/130 (57%), Gaps = 3/130 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLER 205
D K ++ C C+NS+CLKLYCECFA G YC D C C+ C N E E +EA TL R
Sbjct: 247 DSEQKAREGCRCKNSKCLKLYCECFAKGAYCRDICKCLQCSNTEECENEIKEARKVTLTR 306
Query: 206 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 265
NP+AF K+ QD +A L K KGC CK++ C KKYCEC+ A + CS
Sbjct: 307 NPDAFTSKLEVVGTIVQDEDIEANRGMLGYK--KGCKCKRTYCKKKYCECYNAGVKCSYL 364
Query: 266 CRCLDCKNFE 275
C C +C N E
Sbjct: 365 CICENCHNQE 374
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 29/56 (51%), Gaps = 9/56 (16%)
Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 278
D+ + ARE GC CK S CLK YCECF C + C+CL C N E E
Sbjct: 247 DSEQKARE---------GCRCKNSKCLKLYCECFAKGAYCRDICKCLQCSNTEECE 293
>gi|255084730|ref|XP_002504796.1| tso1-like transcription factor [Micromonas sp. RCC299]
gi|226520065|gb|ACO66054.1| tso1-like transcription factor [Micromonas sp. RCC299]
Length = 620
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 64/110 (58%), Gaps = 13/110 (11%)
Query: 166 LYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
LYCECFAAG++C C+C NC N ++ Q RNPNAF KI +S G
Sbjct: 277 LYCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKIVASGDGG---- 332
Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
+H KGCHCKKS CLKKYCECFQA +LC + C+C CKN E
Sbjct: 333 ---------GQHKKGCHCKKSACLKKYCECFQAGVLCQDYCKCDGCKNKE 373
>gi|145350962|ref|XP_001419860.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580092|gb|ABO98153.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 510
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 64/108 (59%), Gaps = 13/108 (12%)
Query: 166 LYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
LYCECFAAG +C C+C C N E+E + +RNP AF KI + DA
Sbjct: 207 LYCECFAAGAFCKDCSCQQCQNTTENEAIVTKTRQQIEQRNPYAFESKIMA------DAG 260
Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+DAR H KGCHCKKS CLKKYCECFQA + C + C+C CKN
Sbjct: 261 DDAR-------HTKGCHCKKSACLKKYCECFQAGVKCQDYCKCEGCKN 301
>gi|297721065|ref|NP_001172895.1| Os02g0274400 [Oryza sativa Japonica Group]
gi|255670792|dbj|BAH91624.1| Os02g0274400, partial [Oryza sativa Japonica Group]
Length = 140
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/95 (63%), Positives = 65/95 (68%), Gaps = 7/95 (7%)
Query: 236 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH----GDHNGI 291
KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+AL D N I
Sbjct: 23 KHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERKALVQMKNASDRNHI 82
Query: 292 AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELL 326
Q AAN A+ GA GS GY K+ E L
Sbjct: 83 ---QEAANFALNGATGSLGYKNSPVRRKKYQENSL 114
Score = 45.4 bits (106), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
PK K C+CR S CLK YCECF A I C C C +C N
Sbjct: 21 IPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKN 60
>gi|426252144|ref|XP_004019777.1| PREDICTED: tesmin [Ovis aries]
Length = 512
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/122 (48%), Positives = 82/122 (67%), Gaps = 11/122 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC N++ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 308 LAGYCDCFASGDFCNNCNCNNCCNDLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 364
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 365 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 416
Query: 284 FH 285
+
Sbjct: 417 MN 418
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 365 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 409
>gi|118355040|ref|XP_001010781.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89292548|gb|EAR90536.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 978
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 55/119 (46%), Positives = 73/119 (61%), Gaps = 9/119 (7%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC+ ++CLKLYCECFA YC +GC+C C N E+E RQ A+ RN +AF+PK
Sbjct: 405 CNCKKTKCLKLYCECFAKLGYCGEGCHCHECKNRPENEDERQNAIQEAKSRNNDAFQPKT 464
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
S Q+ + A Q+ KGC+C+K+ CLKKYCECF A C+ C+C C+N
Sbjct: 465 ESE----QERKNKANNEQMK----KGCNCRKTKCLKKYCECFNAGTYCNNMCKCDSCEN 515
>gi|307110312|gb|EFN58548.1| hypothetical protein CHLNCDRAFT_140658 [Chlorella variabilis]
Length = 664
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 55/128 (42%), Positives = 72/128 (56%), Gaps = 19/128 (14%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNA 209
++Q+ CNC+ S CLK+YCECFAAG +C C+C++C N A L +NPNA
Sbjct: 260 REQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSNTPAEMGVVMAAREVVLAKNPNA 319
Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRC 268
F K+ + A H +GC CK+S CLKKYCECF A C+ E C+C
Sbjct: 320 FEVKVTA-----------------ATGHRRGCRCKRSKCLKKYCECFHAGARCNPEVCQC 362
Query: 269 LDCKNFEG 276
DC+N EG
Sbjct: 363 EDCRNTEG 370
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 26/39 (66%)
Query: 235 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
A+ + C+CK+S CLK YCECF A C+ +C CL C N
Sbjct: 259 AREQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSN 297
>gi|224000241|ref|XP_002289793.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975001|gb|EED93330.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 712
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 60/147 (40%), Positives = 75/147 (51%), Gaps = 29/147 (19%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C C+ S+CLKLYC CF+A C C C C N E R++AV ATL+RNPNAFR K
Sbjct: 338 CTCKKSKCLKLYCHCFSANSMCSSFCRCTECMNTTEAGTVREDAVKATLQRNPNAFRSKF 397
Query: 215 ASSPHGAQDARE-----DAREAQLA-----------------------AKHNKGCHCKKS 246
G + A+E A++ QL H GC+C+KS
Sbjct: 398 VKVTGGEEFAKEWEKNERAKQMQLLNVGDSPAKTKTTGVEMAAVAIANPTHKLGCNCRKS 457
Query: 247 GCLKKYCECFQANILCSENCRCLDCKN 273
CLK+YCECF A C NC+CL C+N
Sbjct: 458 FCLKRYCECFGAQTPCGLNCKCLGCQN 484
>gi|118355648|ref|XP_001011083.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89292850|gb|EAR90838.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 974
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 76/124 (61%), Gaps = 7/124 (5%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC+ ++CLKLYC+CFA G C C CV C NN R +A+ L+RNP AF K
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEATLNLRNQAIEGILQRNPFAFNVK- 694
Query: 215 ASSPHGAQDAREDAREAQLAAKHNK--GCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
P A+ ++++ + L ++K GC+C+KS CLKKYC C+ + + CSE C+C +CK
Sbjct: 695 ---PEEAKKSKKELNNSILQNSNSKVIGCNCRKSNCLKKYCLCYHSGMKCSEACKCTECK 751
Query: 273 NFEG 276
N E
Sbjct: 752 NLEN 755
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 28/41 (68%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C+CKK+ CLK YC+CF LC ++C+C++C N E + R
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEATLNLR 676
>gi|311247083|ref|XP_003122474.1| PREDICTED: tesmin [Sus scrofa]
Length = 511
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/122 (49%), Positives = 82/122 (67%), Gaps = 11/122 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 307 LAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 363
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 364 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 415
Query: 284 FH 285
+
Sbjct: 416 MN 417
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 364 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 408
>gi|395851609|ref|XP_003798345.1| PREDICTED: tesmin [Otolemur garnettii]
Length = 509
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 60/122 (49%), Positives = 83/122 (68%), Gaps = 11/122 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 305 LAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 361
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER++L
Sbjct: 362 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKSL 413
Query: 284 FH 285
+
Sbjct: 414 IN 415
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 362 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 406
>gi|340505892|gb|EGR32168.1| hypothetical protein IMG5_093910 [Ichthyophthirius multifiliis]
Length = 260
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 88/153 (57%), Gaps = 15/153 (9%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
+Q+QCNC+ S+CLKLYC+CFA G YC+ C+C C NN ++ R+ A+ ERNPNAF
Sbjct: 60 QQRQCNCKKSKCLKLYCDCFAIGEYCNPICHCFECKNNESNKEKREMALKQNKERNPNAF 119
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+I +++QL H +GC+CKKSGC KKYC+C+ + CS CRC+
Sbjct: 120 TSRII-------------QQSQLNN-HQRGCNCKKSGCQKKYCQCYLDGMECSRYCRCIG 165
Query: 271 CKNFEGSEERRALFHGDHNGIAFMQRAANAAII 303
C+N E ++ + + Q+ N+ I
Sbjct: 166 CENCNIFENKKICLNNKNIEKQIQQQKGNSDYI 198
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 32/51 (62%)
Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
L A + C+CKKS CLK YC+CF C+ C C +CKN E ++E+R +
Sbjct: 56 LKALQQRQCNCKKSKCLKLYCDCFAIGEYCNPICHCFECKNNESNKEKREM 106
>gi|334332647|ref|XP_001371812.2| PREDICTED: tesmin [Monodelphis domestica]
Length = 728
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 59/119 (49%), Positives = 81/119 (68%), Gaps = 11/119 (9%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CFA G +C+ CNC NC+NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 527 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG------ 580
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 581 -----DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLLN 634
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 581 DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 625
>gi|403353185|gb|EJY76132.1| Tso1, putative [Oxytricha trifallax]
Length = 712
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 60/150 (40%), Positives = 78/150 (52%), Gaps = 28/150 (18%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K CNC+ S+CLKLYCECFA +C C C C N+ EH+ R A L RNP AFRP
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDERLHAKEQILMRNPLAFRP 508
Query: 213 KIASSPHGAQDAREDAR----------EAQLAA-----------------KHNKGCHCKK 245
K+ + +D + +A L+ +H +GC+CKK
Sbjct: 509 KVETQQDETSLFSKDIQITKINIQEQSDALLSGLSKPQTFGALAVSDKDKRHFRGCNCKK 568
Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
S C KKYCECFQ + CS+ C+C +CKN E
Sbjct: 569 SNCQKKYCECFQQGVNCSDLCKCEECKNNE 598
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 25/43 (58%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
K C+CKKS CLK YCECF N C NC C C N ++ R
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDER 491
>gi|345309503|ref|XP_001516587.2| PREDICTED: tesmin-like [Ornithorhynchus anatinus]
Length = 243
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 62/125 (49%), Positives = 84/125 (67%), Gaps = 11/125 (8%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CFA+G +C+ CNC NC+NN HE R +A+ A L RNP AF+PKI G
Sbjct: 52 YCDCFASGDFCNNCNCNNCYNNQSHEAERCKAIKACLNRNPEAFQPKIGKGKLG------ 105
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG 286
++ +HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E S +RR L++G
Sbjct: 106 -----EVKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSTCKCISCKNYEESPQRRILWNG 160
Query: 287 DHNGI 291
H +
Sbjct: 161 PHPAV 165
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
P+ K CNC+ S CLK YCECF A I C C C++C N E
Sbjct: 109 PRHNKGCNCKRSGCLKNYCECFEAKIMCSSTCKCISCKNYEE 150
>gi|146174407|ref|XP_001019363.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|146144792|gb|EAR99118.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1114
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/126 (46%), Positives = 77/126 (61%), Gaps = 9/126 (7%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC ++CLKLYC CF G+ C D C C +C+N + + R+ A+ A +R P+AF K+
Sbjct: 714 CNCGKTKCLKLYCVCFQQGLMCSDQCRCTDCYNKIAYLEERERAIKAIKQRYPDAFEKKV 773
Query: 215 ASSPHGA---QDA-REDAR---EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
P A +DA R DA + QL A H KGC+CKKS C KKYCECF A + C+ CR
Sbjct: 774 NFLPELARKIEDAERLDADLELQNQLLA-HKKGCNCKKSACKKKYCECFLAGVKCTYLCR 832
Query: 268 CLDCKN 273
C C+N
Sbjct: 833 CEQCQN 838
Score = 42.7 bits (99), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 22/36 (61%), Gaps = 1/36 (2%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
+K CNC+ S C K YCECF AG+ C C C C N
Sbjct: 803 KKGCNCKKSACKKKYCECFLAGVKCTYLCRCEQCQN 838
>gi|145545444|ref|XP_001458406.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426226|emb|CAK91009.1| unnamed protein product [Paramecium tetraurelia]
Length = 218
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/119 (46%), Positives = 72/119 (60%), Gaps = 15/119 (12%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC+ S+CLKLYC+CFAAG C CNC C NN + + R + + +ERNP AF K+
Sbjct: 85 CNCKKSKCLKLYCDCFAAGKLCSSKCNCCGCFNNSSNLLERNQFIEKMVERNPEAFNQKV 144
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+E H+KGC+C+KSGC KKYCEC+Q I CS+NC+C C+N
Sbjct: 145 --------------KEVDYKLAHSKGCNCRKSGCKKKYCECYQLGIECSDNCKCDGCQN 189
>gi|332249871|ref|XP_003274078.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Nomascus leucogenys]
Length = 517
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/122 (49%), Positives = 81/122 (66%), Gaps = 11/122 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+PKI G
Sbjct: 317 LARYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLG--- 373
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC C++SGCLK YCEC++A I+CS C+C+DCKN+E S ER+ L
Sbjct: 374 --------DVKPRHNKGCSCRRSGCLKNYCECYEAQIMCSSICKCIDCKNYEESPERKTL 425
Query: 284 FH 285
+
Sbjct: 426 MN 427
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K C+CR S CLK YCEC+ A I C C C++C N E
Sbjct: 374 DVKPRHNKGCSCRRSGCLKNYCECYEAQIMCSSICKCIDCKNYEE 418
>gi|118384767|ref|XP_001025523.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89307290|gb|EAS05278.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1022
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 70/118 (59%), Gaps = 6/118 (5%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 215
C C+ ++CLKLYCECF A YC G NC C N E+E R+ A+ +RN +AF PK
Sbjct: 766 CGCKKTKCLKLYCECFQAMKYCVGTNCQGCLNKPEYEDQRKHAMELIQQRNSSAFDPKAD 825
Query: 216 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
S + ++ A H+KGC+CKKS C KKYCEC+Q C+ C+C +CKN
Sbjct: 826 KSDQYYK------SDSGTKAIHSKGCNCKKSDCRKKYCECYQMGAKCTHLCKCYNCKN 877
>gi|145489787|ref|XP_001430895.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397996|emb|CAK63497.1| unnamed protein product [Paramecium tetraurelia]
Length = 276
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/245 (31%), Positives = 111/245 (45%), Gaps = 37/245 (15%)
Query: 32 LPEQSVKSQSSQPQSHSPLQPQPP-KSQPQVQSKVASPS-QSQPLVRLQLLPPSSQPHVQ 89
+P+ +KS Q L+ P KS +Q++ S S R+ L P + ++
Sbjct: 43 IPQIVIKSNFQQELICFMLELHKPNKSHSIIQNRNHSRSFNHNQQFRISLNYPQANLNLS 102
Query: 90 AHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGT 149
+++ +P H + N + + ++ P L QE
Sbjct: 103 PSIKNKGMYYYSIHLQP-HTQYYNLNLNIIFEILQPNLFYLNQE---------------- 145
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPN 208
K CNCR S+CLKLYC+CFA G C CNC C NN + + R + +ERNP
Sbjct: 146 ---HKPCNCRKSKCLKLYCDCFAVGKLCSSKCNCCGCFNNSSNLLERNSFIQKMIERNPQ 202
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
AF K+ +E + H KGC+C+KSGC KKYCEC+Q I CS+NC+C
Sbjct: 203 AFNQKV--------------QEVESKMTHAKGCNCRKSGCQKKYCECYQMGIECSDNCKC 248
Query: 269 LDCKN 273
C N
Sbjct: 249 DGCLN 253
>gi|301787847|ref|XP_002929338.1| PREDICTED: tesmin-like [Ailuropoda melanoleuca]
gi|281340625|gb|EFB16209.1| hypothetical protein PANDA_019494 [Ailuropoda melanoleuca]
Length = 510
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/137 (45%), Positives = 86/137 (62%), Gaps = 22/137 (16%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 306 LAGYCDCFASGDFCNNCNCNNCCNNLRHEIQRFKAIKACLDRNPEAFQPKIGKGKLG--- 362
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 363 --------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 414
Query: 284 -----------FHGDHN 289
F G+H+
Sbjct: 415 MNMPNYMEIGGFEGNHH 431
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNCR S CLK YCEC+ A I C C C+ C N E
Sbjct: 363 DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 407
>gi|410974670|ref|XP_003993766.1| PREDICTED: tesmin [Felis catus]
Length = 502
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/132 (45%), Positives = 83/132 (62%), Gaps = 15/132 (11%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C CNC NC NN+ HEV R +A+ A L+RNP AF+PKI
Sbjct: 299 LAGYCDCFASGDFCHNCNCNNCRNNLHHEVQRFKAMKACLDRNPEAFQPKIGK------- 351
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
R + +HN+GC+C++SGCLK YCEC++A I+CS C+C+ CKN++ ER+ L
Sbjct: 352 ----GRPGDVTPRHNRGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDEGPERKTL 407
Query: 284 ----FHGDHNGI 291
+H D G
Sbjct: 408 MSMPYHPDAGGF 419
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D TP+ + CNCR S CLK YCEC+ A I C C C+ C N E
Sbjct: 356 DVTPRHNRGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDE 400
>gi|395544681|ref|XP_003774236.1| PREDICTED: tesmin [Sarcophilus harrisii]
Length = 548
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/119 (49%), Positives = 81/119 (68%), Gaps = 11/119 (9%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CFA G +C+ CNC NC+NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 347 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG------ 400
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 401 -----DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLLN 454
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 401 DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 445
>gi|291415497|ref|XP_002723989.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
[Oryctolagus cuniculus]
Length = 450
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/122 (50%), Positives = 81/122 (66%), Gaps = 11/122 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN HE+ R +A+ A L+RNP AF+PKI G+
Sbjct: 247 LSGYCDCFASGDFCNNCNCNNCCNNWRHEIERFKAIKACLDRNPEAFQPKIGKGRPGSAK 306
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
R HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 307 PR-----------HNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNYEESPERKTL 355
Query: 284 FH 285
+
Sbjct: 356 MN 357
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 3/49 (6%)
Query: 145 GKDGT--PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
G+ G+ P+ K CNC+ S CLK YCEC+ A I C C CV C N E
Sbjct: 300 GRPGSAKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNYEE 348
>gi|440894221|gb|ELR46727.1| Tesmin [Bos grunniens mutus]
Length = 511
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/122 (49%), Positives = 81/122 (66%), Gaps = 11/122 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 307 LAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 363
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 364 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 415
Query: 284 FH 285
+
Sbjct: 416 MN 417
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 364 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 408
>gi|329663862|ref|NP_001192832.1| tesmin [Bos taurus]
gi|296471372|tpg|DAA13487.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
Length = 511
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/122 (49%), Positives = 81/122 (66%), Gaps = 11/122 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 307 LAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 363
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 364 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 415
Query: 284 FH 285
+
Sbjct: 416 MN 417
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 364 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 408
>gi|145535353|ref|XP_001453415.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421126|emb|CAK86018.1| unnamed protein product [Paramecium tetraurelia]
Length = 348
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/120 (43%), Positives = 71/120 (59%), Gaps = 1/120 (0%)
Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
QCNC+ S+CLKLYCECF C CNC C N +++ R +A+ L RNP AF P
Sbjct: 167 QCNCKKSKCLKLYCECFTNNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFAPI 226
Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+ ++ Q +E + + + KGC+CKKS C KKYCEC+ N C+E C+C +C N
Sbjct: 227 LTNNGQQPQVIQEQKSQKDIQKETKKGCNCKKSECKKKYCECYSINQKCTELCKCENCLN 286
>gi|330843039|ref|XP_003293472.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
gi|325076199|gb|EGC30006.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
Length = 478
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 55/120 (45%), Positives = 69/120 (57%), Gaps = 31/120 (25%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
K+C+C+NS+CLK+YCECFA I C+ C C C NN A E +G
Sbjct: 103 KKCHCKNSKCLKMYCECFANKILCNNCQCFGCQNNE----ANIEKIGND----------- 147
Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
++ L KH++GCHC+KSGCLKKYCECFQA I C+ENC+C DCKN
Sbjct: 148 ----------------QSVLIDKHSRGCHCRKSGCLKKYCECFQAGIPCNENCKCYDCKN 191
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/42 (57%), Positives = 29/42 (69%), Gaps = 1/42 (2%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 280
K CHCK S CLK YCECF ILC+ NC+C C+N E + E+
Sbjct: 103 KKCHCKNSKCLKMYCECFANKILCN-NCQCFGCQNNEANIEK 143
Score = 45.1 bits (105), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 32/63 (50%), Gaps = 1/63 (1%)
Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNN 188
G Q + + + D K + C+CR S CLK YCECF AGI C + C C +C N
Sbjct: 133 GCQNNEANIEKIGNDQSVLIDKHSRGCHCRKSGCLKKYCECFQAGIPCNENCKCYDCKNQ 192
Query: 189 VEH 191
H
Sbjct: 193 SHH 195
>gi|194218566|ref|XP_001492479.2| PREDICTED: tesmin-like [Equus caballus]
Length = 383
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/122 (49%), Positives = 82/122 (67%), Gaps = 11/122 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 179 LAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 235
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 236 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 287
Query: 284 FH 285
+
Sbjct: 288 MN 289
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 236 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 280
>gi|357480917|ref|XP_003610744.1| Lin-54-like protein [Medicago truncatula]
gi|355512079|gb|AES93702.1| Lin-54-like protein [Medicago truncatula]
Length = 808
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 81/160 (50%), Gaps = 31/160 (19%)
Query: 154 KQCNCRNSRCLKL-------------------YCECFAAGIYC-DGCNCVNCHNNVEHEV 193
K+CNC+ S+CLKL YCECFAAG+YC + C+C C N HE
Sbjct: 488 KRCNCKKSKCLKLIDLKMTILVLELTILFFFSYCECFAAGVYCIEPCSCQECFNKPIHED 547
Query: 194 ARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYC 253
+ RNP AF PK+ S + D + +A+H +GC+CKKS CLKKYC
Sbjct: 548 TVLQTRKQIESRNPLAFAPKVIRSADSVPETGIDPNKTPASARHKRGCNCKKSNCLKKYC 607
Query: 254 ECFQANIL-----------CSENCRCLDCKNFEGSEERRA 282
EC+Q IL CS +CRC CKN G ++ A
Sbjct: 608 ECYQVLILTISCLPQGGVGCSISCRCEGCKNAFGRKDGSA 647
>gi|242033599|ref|XP_002464194.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
gi|241918048|gb|EER91192.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
Length = 641
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 83/161 (51%), Gaps = 9/161 (5%)
Query: 136 RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVA 194
R R N DGK + C+C+ S+CLKLYC CFAA +YC + C+C C NN HE A
Sbjct: 344 RRRKLQNDDGKSC-----RHCSCKKSKCLKLYCACFAAKVYCSEFCSCQGCSNNHMHEEA 398
Query: 195 RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCE 254
T RNP AF P + + +D+ +A+H +GC+C+KS CLKKYCE
Sbjct: 399 VSHIRKQTESRNPLAFAPTVTRKCGSVSELGDDSNNTPASARHKRGCNCRKSSCLKKYCE 458
Query: 255 CFQANILCS--ENCRCLDCKNFEGSEERRALFHGDHNGIAF 293
CFQ +L E L + + E + HG +AF
Sbjct: 459 CFQVLMLLFSLEGVLLLTTEKLKKGGEAKGT-HGKEEKLAF 498
>gi|402892588|ref|XP_003909492.1| PREDICTED: tesmin [Papio anubis]
Length = 515
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 60/121 (49%), Positives = 80/121 (66%), Gaps = 11/121 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+PKI G
Sbjct: 311 LAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLG--- 367
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 368 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 419
Query: 284 F 284
Sbjct: 420 M 420
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 57/137 (41%), Gaps = 19/137 (13%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAV------ 199
D P+ K CNC+ S CLK YCEC+ A I C C C+ C N E R+ +
Sbjct: 368 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNYEESPERKTLMSMPNYV 426
Query: 200 -GATLERNPNAFRPKIASSPHGAQDAR------EDAREAQLAAKHNKGCHCKKSGCLKKY 252
LE + + K + P +QD R + EA A +G +K C K
Sbjct: 427 QAGGLEGSRHLPPTKFSGLPKFSQDRRPSSCISWEVVEATCACLLAQGEEAEKEHCSK-- 484
Query: 253 CECFQANILCSENCRCL 269
C ++ E RCL
Sbjct: 485 --CLAEQMILEEFGRCL 499
>gi|145478369|ref|XP_001425207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392276|emb|CAK57809.1| unnamed protein product [Paramecium tetraurelia]
Length = 373
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 63/146 (43%), Positives = 89/146 (60%), Gaps = 10/146 (6%)
Query: 155 QCNCRNSRCLKLYCECFAA-GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
QCNC+ S+CLKLYCECFA G+ CNC +C N +++ R +A+ L RN +AF +
Sbjct: 187 QCNCKKSKCLKLYCECFANNGVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNTDAFV-Q 245
Query: 214 IASSPHGAQDAREDAREAQLAAKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 268
S+ GAQ ++D + + +K N KGC+CKKSGC KKYCEC+ NI C+E C+C
Sbjct: 246 CFSTKGGAQFVQQD-KPLKEPSKDNSSVIRKGCNCKKSGCKKKYCECYSQNIRCNELCKC 304
Query: 269 LDCKNFEGSE--ERRALFHGDHNGIA 292
C N +E ++ L DHN ++
Sbjct: 305 EHCLNKTDAEIQAQQDLGQEDHNNLS 330
Score = 46.6 bits (109), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 6/77 (7%)
Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNV 189
+Q+ P P KD + +K CNC+ S C K YCEC++ I C+ C C +C N
Sbjct: 257 QQDKPLKEP-----SKDNSSVIRKGCNCKKSGCKKKYCECYSQNIRCNELCKCEHCLNKT 311
Query: 190 EHEVARQEAVGATLERN 206
+ E+ Q+ +G N
Sbjct: 312 DAEIQAQQDLGQEDHNN 328
>gi|412990962|emb|CCO18334.1| predicted protein [Bathycoccus prasinos]
Length = 671
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 54/109 (49%), Positives = 65/109 (59%), Gaps = 15/109 (13%)
Query: 166 LYCECFAAGIYCDGCNCVNCHNNV-EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 224
LYC+CFAAG++C C+C +C N + EV RQ RNPNAF KI
Sbjct: 358 LYCDCFAAGVFCRDCSCQSCSNTEGDLEVVRQTRY-QIESRNPNAFANKIVD-------- 408
Query: 225 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+ + AKH KGCHCKKS CLKKYCECFQAN+ C + C+C CKN
Sbjct: 409 -----DDSVDAKHAKGCHCKKSACLKKYCECFQANVRCQDYCKCEGCKN 452
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
K K C+C+ S CLK YCECF A + C D C C C N +
Sbjct: 415 KHAKGCHCKKSACLKKYCECFQANVRCQDYCKCEGCKNTTD 455
>gi|302753274|ref|XP_002960061.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
gi|300171000|gb|EFJ37600.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
Length = 155
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/144 (42%), Positives = 80/144 (55%), Gaps = 14/144 (9%)
Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
+C+C+ S+CLKLYCECFA+G YC D C C NC N EHE E RNP AF P+
Sbjct: 1 RCSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPR 60
Query: 214 IASSPHGAQDAREDAREAQL-------------AAKHNKGCHCKKSGCLKKYCECFQANI 260
+ S + + L +KH +GC+CKKS CLKKYCECFQ+ +
Sbjct: 61 VISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEV 120
Query: 261 LCSENCRCLDCKNFEGSEERRALF 284
CS+ C+C CKN G+++ F
Sbjct: 121 GCSDACKCRGCKNTFGAKQGDDFF 144
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNN 188
T K ++ CNC+ S CLK YCECF + + C D C C C N
Sbjct: 94 TSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKNT 134
>gi|300175264|emb|CBK20575.2| unnamed protein product [Blastocystis hominis]
Length = 396
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 53/121 (43%), Positives = 71/121 (58%), Gaps = 17/121 (14%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC+ SRCLKLYCECF + YC CNC+NC+NN +HE R+EAV RNP+ F +
Sbjct: 151 CNCKKSRCLKLYCECFKSQEYCSSECNCLNCYNNRKHESEREEAVQRMKARNPHCFENHV 210
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
D ++ ++ GC C+++ C KKYCECF + C + C+C DCKN
Sbjct: 211 --------DEKKGVNKS--------GCRCRRTHCDKKYCECFSHGVPCGDQCQCKDCKNE 254
Query: 275 E 275
E
Sbjct: 255 E 255
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 38/75 (50%), Gaps = 10/75 (13%)
Query: 217 SPHGAQDAREDAREAQLAAKHNKG----------CHCKKSGCLKKYCECFQANILCSENC 266
S ++DA E A + +AK + C+CKKS CLK YCECF++ CS C
Sbjct: 117 SAASSRDALESAFPREKSAKSTRATQPSIPGLVTCNCKKSRCLKLYCECFKSQEYCSSEC 176
Query: 267 RCLDCKNFEGSEERR 281
CL+C N E R
Sbjct: 177 NCLNCYNNRKHESER 191
Score = 44.7 bits (104), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 32/66 (48%), Gaps = 3/66 (4%)
Query: 142 NTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVG 200
+ D K G K C CR + C K YCECF+ G+ C D C C +C N + + VG
Sbjct: 209 HVDEKKGVNKSG--CRCRRTHCDKKYCECFSHGVPCGDQCQCKDCKNEEQWNNGMESMVG 266
Query: 201 ATLERN 206
E N
Sbjct: 267 DGEETN 272
>gi|145526438|ref|XP_001449030.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416596|emb|CAK81633.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 53/127 (41%), Positives = 75/127 (59%), Gaps = 1/127 (0%)
Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
QCNC+ S+CLKLYCECFA C CNC C N +++ R +A+ L RNP AF
Sbjct: 163 QCNCKKSKCLKLYCECFANNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFSTI 222
Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+ ++ Q E + + + + KGC+CKKS C KKYCEC+ N C++ C+C +C N
Sbjct: 223 LTNNGQQPQIIPEPKSQKEQSKETKKGCNCKKSECKKKYCECYSINQKCTDLCKCENCLN 282
Query: 274 FEGSEER 280
E +E+
Sbjct: 283 KEEPQEQ 289
>gi|12322849|gb|AAG51411.1|AC009465_11 hypothetical protein; 69435-72676 [Arabidopsis thaliana]
Length = 601
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 68/111 (61%), Gaps = 1/111 (0%)
Query: 167 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
YCECF+AG++C + C+C NC N HE ++ RNP AF PK+ S+ D
Sbjct: 428 YCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLW 487
Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
+ + +A+H +GC+C+KSGC KKYCECF + CS NCRC+ CKN G
Sbjct: 488 VENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFG 538
Score = 45.8 bits (107), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEH 191
+ ++ CNCR S C K YCECF G+ C C C+ C N H
Sbjct: 498 RHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFGH 539
>gi|224054140|ref|XP_002298111.1| predicted protein [Populus trichocarpa]
gi|222845369|gb|EEE82916.1| predicted protein [Populus trichocarpa]
Length = 781
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 63/128 (49%), Positives = 79/128 (61%), Gaps = 3/128 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
++CNC+ SRCLKLYCECFAAGIYC D C+CVNC N E+E + T RNP AF P
Sbjct: 441 RRCNCKKSRCLKLYCECFAAGIYCLDTCSCVNCINKPEYEDTVLDMRQQTEARNPLAFAP 500
Query: 213 KIASSPHG--AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
K+ ++ A E +++H KGC+CKKS C KKYCECFQ + C CRC
Sbjct: 501 KVVNNATNSPANMMEEGKWMKTSSSRHKKGCNCKKSKCSKKYCECFQGGVGCCNGCRCEG 560
Query: 271 CKNFEGSE 278
C N G++
Sbjct: 561 CYNPYGNK 568
>gi|355566221|gb|EHH22600.1| Metallothionein-like 5, testis-specific [Macaca mulatta]
Length = 508
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 59/121 (48%), Positives = 80/121 (66%), Gaps = 11/121 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+PKI G
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLG--- 360
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 361 --------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 412
Query: 284 F 284
Sbjct: 413 M 413
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 57/137 (41%), Gaps = 19/137 (13%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAV------ 199
D P+ K CNCR S CLK YCEC+ A I C C C+ C N E R+ +
Sbjct: 361 DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGC-KNYEESPERKTLMSMPNYV 419
Query: 200 -GATLERNPNAFRPKIASSPHGAQDAR------EDAREAQLAAKHNKGCHCKKSGCLKKY 252
LE + + K + P +QD R + EA A +G +K C K
Sbjct: 420 QAGGLEGSRHLPPTKFSGLPKFSQDRRPSSCISWEVVEATCACLLAQGEEAEKEHCSK-- 477
Query: 253 CECFQANILCSENCRCL 269
C ++ E RCL
Sbjct: 478 --CLAEQMILEEFGRCL 492
>gi|348564752|ref|XP_003468168.1| PREDICTED: tesmin-like [Cavia porcellus]
Length = 391
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 58/119 (48%), Positives = 79/119 (66%), Gaps = 11/119 (9%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CFA+G +C+ CNC NC NN+ HE+ R A+ L+RNP AF+PKI
Sbjct: 186 YCDCFASGDFCNNCNCNNCCNNLHHEMERFNAIKVCLDRNPEAFQPKIGK---------- 235
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
R + +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 236 -GRSGDVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 293
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 240 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 284
>gi|355751890|gb|EHH56010.1| Metallothionein-like 5, testis-specific, partial [Macaca
fascicularis]
Length = 490
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/121 (48%), Positives = 80/121 (66%), Gaps = 11/121 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+PKI G
Sbjct: 286 LAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLG--- 342
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 343 --------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 394
Query: 284 F 284
Sbjct: 395 M 395
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 57/137 (41%), Gaps = 19/137 (13%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAV------ 199
D P+ K CNCR S CLK YCEC+ A I C C C+ C N E R+ +
Sbjct: 343 DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGC-KNYEESPERKTLMSMPNYV 401
Query: 200 -GATLERNPNAFRPKIASSPHGAQDAR------EDAREAQLAAKHNKGCHCKKSGCLKKY 252
LE + + K + P +QD R + EA A +G +K C K
Sbjct: 402 QAGGLEGSRHLPPTKFSGLPKFSQDRRPSSCISWEVVEATCACLLAQGEEAEKEHCSK-- 459
Query: 253 CECFQANILCSENCRCL 269
C ++ E RCL
Sbjct: 460 --CLAEQMILEEFGRCL 474
>gi|345784018|ref|XP_854573.2| PREDICTED: tesmin [Canis lupus familiaris]
Length = 516
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 85/134 (63%), Gaps = 22/134 (16%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 315 YCDCFASGDFCNNCNCNNCCNNLHHEIQRFKAIKACLDRNPEAFQPKIGKGKLG------ 368
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL--- 283
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S E++ L
Sbjct: 369 -----DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPEQKTLMNM 423
Query: 284 --------FHGDHN 289
F G+H+
Sbjct: 424 PNYMEIGGFEGNHH 437
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNCR S CLK YCEC+ A I C C C+ C N E
Sbjct: 369 DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 413
>gi|301631659|ref|XP_002944915.1| PREDICTED: protein lin-54 homolog, partial [Xenopus (Silurana)
tropicalis]
Length = 183
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/125 (52%), Positives = 84/125 (67%), Gaps = 11/125 (8%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K +K CNC S+CLKLYC+CFA G C+ CNC NC+NN EHE+ R +AV A L RNP+AF
Sbjct: 9 KTKKPCNCTRSQCLKLYCDCFANGDLCNNCNCSNCYNNAEHELERFKAVKACLYRNPDAF 68
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+PKI E + + +HNKGC+CK+SGC K YCEC++A I CS C+C+
Sbjct: 69 QPKI-----------EKGKIGDVKPRHNKGCNCKRSGCFKNYCECYEAKIACSSVCKCVG 117
Query: 271 CKNFE 275
CKN +
Sbjct: 118 CKNVK 122
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
D P+ K CNC+ S C K YCEC+ A I C C CV C N
Sbjct: 79 DVKPRHNKGCNCKRSGCFKNYCECYEAKIACSSVCKCVGCKN 120
>gi|145552635|ref|XP_001461993.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124429830|emb|CAK94620.1| unnamed protein product [Paramecium tetraurelia]
Length = 384
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/130 (41%), Positives = 73/130 (56%), Gaps = 6/130 (4%)
Query: 155 QCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
QCNC+ S+CLKLYCECFA C CNC +C N +++ R +A+ L RNP+AF
Sbjct: 191 QCNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQC 250
Query: 214 IASSPHGAQDAREDAREAQLAAKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 268
++ + + K N KGC+CKKSGC KKYCEC+ N+ CS+ C+C
Sbjct: 251 FQQKGQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKC 310
Query: 269 LDCKNFEGSE 278
C N +E
Sbjct: 311 EQCLNRTDAE 320
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 30/41 (73%), Gaps = 1/41 (2%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 280
C+CKKS CLK YCECF N +CS++C C DCKN + +ER
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQER 232
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 1/59 (1%)
Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATL 203
KD + +K CNC+ S C K YCEC++ + C D C C C N + E+ Q+ + L
Sbjct: 273 KDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKCEQCLNRTDAEIQAQQDIAQVL 331
>gi|403301068|ref|XP_003941221.1| PREDICTED: tesmin [Saimiri boliviensis boliviensis]
Length = 263
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/118 (50%), Positives = 80/118 (67%), Gaps = 11/118 (9%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CFA+G +C+ CNC NC NN++HE+ R +A+ A L RNP AF+PKI +S G R
Sbjct: 53 YCDCFASGDFCNNCNCNNCCNNLQHEIERFKAIKACLGRNPEAFQPKIGTSQLGDVKPR- 111
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
H+KGC C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 112 ----------HSKGCSCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLM 159
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K C+CR S CLK YCEC+ A I C C C+ C N E
Sbjct: 107 DVKPRHSKGCSCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 151
>gi|145498644|ref|XP_001435309.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402440|emb|CAK67912.1| unnamed protein product [Paramecium tetraurelia]
Length = 242
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 55/119 (46%), Positives = 69/119 (57%), Gaps = 15/119 (12%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC+ S+CLKLYC+CFAA C CNC C NN + + R +ERNP AF K+
Sbjct: 109 CNCKKSKCLKLYCDCFAANKLCSSKCNCCGCFNNSSNLIERNLYREKMVERNPEAFNQKV 168
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+E H+KGC+C+KSGC KKYCEC+Q I CS+NC+C CKN
Sbjct: 169 --------------KEVDQKMAHSKGCNCRKSGCKKKYCECYQMGIECSDNCKCDGCKN 213
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 28/46 (60%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
K C+CKKS CLK YC+CF AN LCS C C C N + R L+
Sbjct: 107 KPCNCKKSKCLKLYCDCFAANKLCSSKCNCCGCFNNSSNLIERNLY 152
>gi|302804648|ref|XP_002984076.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
gi|300148428|gb|EFJ15088.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
Length = 141
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/139 (43%), Positives = 79/139 (56%), Gaps = 14/139 (10%)
Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
+C+C+ S+CLKLYCECFA+G YC D C C NC N EHE E RNP AF P+
Sbjct: 1 RCSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPR 60
Query: 214 IASSPHGAQDAREDAREAQL-------------AAKHNKGCHCKKSGCLKKYCECFQANI 260
+ S + + L +KH +GC+CKKS CLKKYCECFQ+ +
Sbjct: 61 VISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEV 120
Query: 261 LCSENCRCLDCKNFEGSEE 279
CS+ C+C CKN G+++
Sbjct: 121 GCSDACKCRGCKNTFGAKQ 139
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNV 189
T K ++ CNC+ S CLK YCECF + + C D C C C N
Sbjct: 94 TSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKNTF 135
>gi|307103614|gb|EFN51873.1| hypothetical protein CHLNCDRAFT_139785 [Chlorella variabilis]
Length = 929
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 66/117 (56%), Gaps = 14/117 (11%)
Query: 166 LYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 224
LYC+CFA G YC C+C +C N VE+ A +RNPNAF KI +
Sbjct: 540 LYCDCFANGGYCGPACSCSSCANKVENRAMVAAQREAIKQRNPNAFVQKIEA-------- 591
Query: 225 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
+A L +H +GC+CKKS C+KKYCECFQA + C E+C+C C N G RR
Sbjct: 592 -----DAALGGQHRRGCNCKKSHCMKKYCECFQAGVPCGEHCKCESCHNTAGHSGRR 643
>gi|145478035|ref|XP_001425040.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392108|emb|CAK57642.1| unnamed protein product [Paramecium tetraurelia]
Length = 386
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/130 (41%), Positives = 73/130 (56%), Gaps = 6/130 (4%)
Query: 155 QCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
QCNC+ S+CLKLYCECFA C CNC +C N +++ R +A+ L RNP+AF
Sbjct: 191 QCNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQC 250
Query: 214 IASSPHGAQDAREDAREAQLAAKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 268
++ + + K N KGC+CKKSGC KKYCEC+ N+ CS+ C+C
Sbjct: 251 FQQKSQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKC 310
Query: 269 LDCKNFEGSE 278
C N +E
Sbjct: 311 EQCLNRTDAE 320
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 30/41 (73%), Gaps = 1/41 (2%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 280
C+CKKS CLK YCECF N +CS++C C DCKN + +ER
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQER 232
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 1/59 (1%)
Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATL 203
KD + +K CNC+ S C K YCEC++ + C D C C C N + E+ Q+ + L
Sbjct: 273 KDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKCEQCLNRTDAEIQAQQDIAQVL 331
>gi|351709791|gb|EHB12710.1| Tesmin [Heterocephalus glaber]
Length = 283
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/119 (49%), Positives = 81/119 (68%), Gaps = 11/119 (9%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CFA+G +C+ C+C NC NN+ HE+ R A+ A L+RNP AF+PKI S
Sbjct: 77 YCDCFASGDFCNNCHCNNCCNNLHHEIERFNAIKACLDRNPEAFQPKIGKS--------- 127
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
R + +HNKGC+CK+SGCLK YCEC++A I+CS C+C+DC N+E S ER+ L +
Sbjct: 128 --RLGDVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIDCNNYEESPERKTLMN 184
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNC+ S CLK YCEC+ A I C C C++C+N E
Sbjct: 131 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIDCNNYEE 175
>gi|327260145|ref|XP_003214896.1| PREDICTED: tesmin-like [Anolis carolinensis]
Length = 377
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/137 (45%), Positives = 84/137 (61%), Gaps = 21/137 (15%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CFA G +C CNC NC+NN +HE+ R A+ A L+RNP AF PKI G
Sbjct: 176 YCDCFANGDFCCNCNCNNCYNNPQHEIERFNAIKACLDRNPEAFLPKIGKGKLG------ 229
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF-- 284
+ +HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E S ER+ L
Sbjct: 230 -----DIKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSICKCVGCKNYEESPERKTLINM 284
Query: 285 --------HGDHNGIAF 293
H +++ ++F
Sbjct: 285 PNDMEIKSHEENDAVSF 301
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P+ K CNC+ S CLK YCECF A I C C CV C N E
Sbjct: 230 DIKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSICKCVGCKNYEE 274
>gi|119595124|gb|EAW74718.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_a
[Homo sapiens]
Length = 529
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/121 (47%), Positives = 80/121 (66%), Gaps = 11/121 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI G
Sbjct: 325 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 381
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 382 --------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTL 433
Query: 284 F 284
Sbjct: 434 M 434
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
P+ K CNCR S CLK YCEC+ A I C C C+ C N E
Sbjct: 385 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEE 426
>gi|196011968|ref|XP_002115847.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
gi|190581623|gb|EDV21699.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
Length = 316
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/133 (48%), Positives = 82/133 (61%), Gaps = 13/133 (9%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YCECFA G +C CNC NCHNN+EHE R A+ A LERNP+AFRPKI + G +
Sbjct: 138 YCECFANGEFCKNCNCTNCHNNIEHENERSRAIKACLERNPHAFRPKIGKAKEGNE---- 193
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH- 285
+H+KGC+CK+SGCLK YCEC++A I C+ C+C +CKN E ER+ L
Sbjct: 194 --------RRHSKGCNCKRSGCLKNYCECYEAKIPCTVLCKCTNCKNREDCLERKNLMQL 245
Query: 286 GDHNGIAFMQRAA 298
D I Q+ A
Sbjct: 246 ADAADIRVKQQTA 258
Score = 45.1 bits (105), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 138 RPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
RP+ + + K CNC+ S CLK YCEC+ A I C C C NC N
Sbjct: 182 RPKIGKAKEGNERRHSKGCNCKRSGCLKNYCECYEAKIPCTVLCKCTNCKN 232
>gi|40806200|ref|NP_004914.2| tesmin isoform a [Homo sapiens]
gi|205371773|sp|Q9Y4I5.2|MTL5_HUMAN RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific; AltName: Full=Testis-specific
metallothionein-like protein
gi|40555823|gb|AAH64579.1| Metallothionein-like 5, testis-specific (tesmin) [Homo sapiens]
gi|119595125|gb|EAW74719.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_b
[Homo sapiens]
Length = 508
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/121 (47%), Positives = 80/121 (66%), Gaps = 11/121 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI G
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 360
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 361 --------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTL 412
Query: 284 F 284
Sbjct: 413 M 413
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
P+ K CNCR S CLK YCEC+ A I C C C+ C N E
Sbjct: 364 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEE 405
>gi|426369505|ref|XP_004051728.1| PREDICTED: tesmin [Gorilla gorilla gorilla]
Length = 508
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/121 (47%), Positives = 80/121 (66%), Gaps = 11/121 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI G
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 360
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 361 --------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTL 412
Query: 284 F 284
Sbjct: 413 M 413
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
P+ K CNCR S CLK YCEC+ A I C C C+ C N E
Sbjct: 364 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEE 405
>gi|119595126|gb|EAW74720.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_c
[Homo sapiens]
Length = 463
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/121 (47%), Positives = 80/121 (66%), Gaps = 11/121 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI G
Sbjct: 259 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 315
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 316 --------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTL 367
Query: 284 F 284
Sbjct: 368 M 368
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
P+ K CNCR S CLK YCEC+ A I C C C+ C N E
Sbjct: 319 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEE 360
>gi|403340785|gb|EJY69688.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
trifallax]
Length = 710
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 54/123 (43%), Positives = 72/123 (58%), Gaps = 13/123 (10%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K+C C S+CLKLYCECFAAG+ C C C +C N + A Q+A ++R+P AF
Sbjct: 525 KKCTCSKSKCLKLYCECFAAGLVCGLDCGCKDCCNTEDSSDAIQKAKDEIMKRDPLAFEI 584
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K++ Q++ D + H KGC CKKSGC K YCECFQ + C++ C+C C
Sbjct: 585 KVS------QNSGTDKLQ------HRKGCTCKKSGCQKGYCECFQLGVPCTDFCKCSGCA 632
Query: 273 NFE 275
N E
Sbjct: 633 NCE 635
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 26/41 (63%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
K C C KS CLK YCECF A ++C +C C DC N E S +
Sbjct: 525 KKCTCSKSKCLKLYCECFAAGLVCGLDCGCKDCCNTEDSSD 565
>gi|444723239|gb|ELW63898.1| Protein lin-54 like protein [Tupaia chinensis]
Length = 783
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 60/99 (60%), Gaps = 12/99 (12%)
Query: 201 ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANI 260
A L+RNP AF+PKI G D R H+KGC+CK+SGCLK YCEC++A I
Sbjct: 604 ACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKI 652
Query: 261 LCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 298
+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 653 MCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 691
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 3/46 (6%)
Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
GK+G ++ K CNC+ S CLK YCEC+ A I C C C+ C N
Sbjct: 620 GKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKN 665
>gi|4581563|gb|AAD24668.1|U86074_1 tesmin [Homo sapiens]
Length = 299
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/118 (48%), Positives = 79/118 (66%), Gaps = 11/118 (9%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI G
Sbjct: 98 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG------ 151
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 152 -----NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 204
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
P+ K CNCR S CLK YCEC+ A I C C C+ C N E
Sbjct: 155 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEE 196
>gi|145511942|ref|XP_001441893.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409154|emb|CAK74496.1| unnamed protein product [Paramecium tetraurelia]
Length = 374
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 73/121 (60%), Gaps = 3/121 (2%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC+ S+CLKLYCECFA C CNC C N +++ R +A+ L RNP+AF +
Sbjct: 169 CNCKKSKCLKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKAIEEALLRNPDAFAAIL 228
Query: 215 ASSPHGAQDAREDA--REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
++ Q +E+ +E+Q KGC+CKKS C KKYCEC+ N C++ C+C +C
Sbjct: 229 TNNGQQPQIIQEEKSQKESQKDITTRKGCNCKKSECKKKYCECYSINQRCTDLCKCENCL 288
Query: 273 N 273
N
Sbjct: 289 N 289
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
K CNC+ S C K YCEC++ C D C C NC N V+
Sbjct: 255 KGCNCKKSECKKKYCECYSINQRCTDLCKCENCLNKVQ 292
>gi|281211685|gb|EFA85847.1| hypothetical protein PPL_01079 [Polysphondylium pallidum PN500]
Length = 735
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 61/94 (64%), Gaps = 9/94 (9%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
K C+C+NS+CLK+YCECFA+ C+GC C CHNN +H + + A +TLERNP+AF PK
Sbjct: 326 KMCHCKNSKCLKMYCECFASKQLCNGCQCFGCHNNEDHIESVENARASTLERNPDAFGPK 385
Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
S +G E A KH KGCHC++SG
Sbjct: 386 FKQSFNG---------EKPKAEKHLKGCHCRRSG 410
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
K CHCK S CLK YCECF + LC+ C+C C N E E
Sbjct: 326 KMCHCKNSKCLKMYCECFASKQLCN-GCQCFGCHNNEDHIE 365
>gi|326920244|ref|XP_003206384.1| PREDICTED: tesmin-like [Meleagris gallopavo]
Length = 622
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 76/116 (65%), Gaps = 11/116 (9%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YCECFA+G +C+ CNC NC+NN HE R +A+ L+RNP AF PKI S G
Sbjct: 427 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQSKLG------ 480
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
+ HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E S +++
Sbjct: 481 -----DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEESPDKKT 531
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P K CNC+ S CLK YCECF A I C C C+ C N E
Sbjct: 481 DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 525
>gi|159478621|ref|XP_001697401.1| hypothetical protein CHLREDRAFT_105644 [Chlamydomonas reinhardtii]
gi|158274559|gb|EDP00341.1| predicted protein [Chlamydomonas reinhardtii]
Length = 91
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 52/92 (56%), Positives = 61/92 (66%), Gaps = 1/92 (1%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YCECFA+ YC+ CNC+ C NN E+E RQ AV A +ERNPNAF+PKI H
Sbjct: 1 YCECFASSRYCEMCNCMQCFNNRENEAVRQSAVEAIMERNPNAFKPKITVRGHETHTPVV 60
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
A A + +H KGC+CKKS CLKKYCECFQ
Sbjct: 61 VA-AAGASGRHLKGCNCKKSFCLKKYCECFQV 91
>gi|145534959|ref|XP_001453218.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420929|emb|CAK85821.1| unnamed protein product [Paramecium tetraurelia]
Length = 265
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/135 (40%), Positives = 76/135 (56%), Gaps = 11/135 (8%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC+ S+CLKLYCECFA C CNC C N +++ R +A L RNP+AF +
Sbjct: 52 CNCKKSKCLKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKATEEALLRNPDAFAAIL 111
Query: 215 ASS---PHGAQDA-------REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
++ P QD + +E+Q+ KGC+CKKS C KKYCEC+ N C++
Sbjct: 112 TNNGQQPQIIQDKTLILPLEEKSQKESQIDITTRKGCNCKKSECKKKYCECYSINQKCTD 171
Query: 265 NCRCLDCKNFEGSEE 279
C+C +C N S+E
Sbjct: 172 LCKCENCLNKAESQE 186
>gi|118091374|ref|XP_001231493.1| PREDICTED: tesmin [Gallus gallus]
Length = 503
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 76/116 (65%), Gaps = 11/116 (9%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YCECFA+G +C+ CNC NC+NN HE R +A+ L+RNP AF PKI S G
Sbjct: 307 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQSELG------ 360
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
+ HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E S +++
Sbjct: 361 -----DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEESPDKKT 411
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
D P K CNC+ S CLK YCECF A I C C C+ C N E
Sbjct: 361 DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 405
>gi|357133064|ref|XP_003568148.1| PREDICTED: uncharacterized protein LOC100845078 [Brachypodium
distachyon]
Length = 377
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 64/165 (38%), Positives = 78/165 (47%), Gaps = 13/165 (7%)
Query: 116 PHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGI 175
P ++L L T P R R DG K C C+ S+CLKLYC CFA G
Sbjct: 62 PSAANQLALATTPK------RQRVEEAADGNGC-----KHCACKKSKCLKLYCPCFAGGG 110
Query: 176 YC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 234
YC + C CV C N ++ Q L R KI +A EDA + +
Sbjct: 111 YCSEKCGCVPCFNKSDYAETVQTTRKVLLSRQKR-MSLKINRRSEANAEAMEDAHNSSSS 169
Query: 235 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
+GC+CKKS CLKKYC+C+Q CS CRC DCKN G E
Sbjct: 170 TPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCEDCKNPYGKNE 214
>gi|332837107|ref|XP_001154282.2| PREDICTED: tesmin [Pan troglodytes]
Length = 508
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 57/121 (47%), Positives = 79/121 (65%), Gaps = 11/121 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI G
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 360
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ C+N E S ER+ L
Sbjct: 361 --------NVKPRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEESPERKTL 412
Query: 284 F 284
Sbjct: 413 M 413
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
P+ K CNCR S CLK YCEC+ A I C C C+ C N
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCEN 402
>gi|397517193|ref|XP_003828803.1| PREDICTED: tesmin [Pan paniscus]
Length = 508
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 57/121 (47%), Positives = 79/121 (65%), Gaps = 11/121 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI G
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 360
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ C+N E S ER+ L
Sbjct: 361 --------NVKPRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEESPERKTL 412
Query: 284 F 284
Sbjct: 413 M 413
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
P+ K CNCR S CLK YCEC+ A I C C C+ C N
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCEN 402
>gi|168010127|ref|XP_001757756.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691032|gb|EDQ77396.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 113
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 67/111 (60%), Gaps = 1/111 (0%)
Query: 167 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
YCECFAA ++C G C C NC N E+E R+P AF PKI + R
Sbjct: 1 YCECFAARLFCVGSCACRNCFNKPEYEATVLNTRQQIEARDPLAFAPKIVQAAEPTPIPR 60
Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
+DA + +A+H +GC+CKKS CLKKYCEC+QA + CSE CRC CKN G
Sbjct: 61 DDALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCKNMYG 111
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 36/67 (53%), Gaps = 2/67 (2%)
Query: 123 PLPTLPPGKQES-PRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGC 180
PL P Q + P PR + + + ++ CNC+ S CLK YCEC+ AG+ C +GC
Sbjct: 42 PLAFAPKIVQAAEPTPIPRDDALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGC 101
Query: 181 NCVNCHN 187
C C N
Sbjct: 102 RCEGCKN 108
>gi|326491307|dbj|BAK05753.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326502882|dbj|BAJ99069.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 375
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 64/161 (39%), Positives = 78/161 (48%), Gaps = 21/161 (13%)
Query: 149 TPKKQK---------QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEA 198
TPK+QK C C+ SRCLKLYC CFA G YC + C CV C N + Q
Sbjct: 66 TPKRQKVEDAADGCKHCACKKSRCLKLYCPCFAGGGYCSEKCGCVPCFNKADFAETVQTT 125
Query: 199 VGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
L R KI + EDA + + +GC+CKKS CLKKYC+C+Q
Sbjct: 126 RKVLLSRQKR-MSLKINRRLEANAETMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCYQD 184
Query: 259 NILCSENCRCLDCKNFEG-------SEERRALFHG---DHN 289
CS CRC DCKN G E +R L+ G DH+
Sbjct: 185 GTGCSLFCRCEDCKNPFGKNEGIMADESKRFLYTGADLDHS 225
>gi|297743393|emb|CBI36260.3| unnamed protein product [Vitis vinifera]
Length = 326
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 68/116 (58%), Gaps = 4/116 (3%)
Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNP 207
T +K CNC+ ++CLKLYC+C AAG+YC D C C NC N E+E Q R+P
Sbjct: 10 TESDRKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDP 69
Query: 208 NAFRPKIASSPHGAQD--AREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANI 260
AF P+I + D +ED +A+H +GC+CKKS C KKYCEC+Q I
Sbjct: 70 LAFAPRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQVYI 125
Score = 45.4 bits (106), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 27/40 (67%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 278
K C+CK++ CLK YC+C A + C+++C C +C N +E
Sbjct: 15 KHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENE 54
>gi|403344595|gb|EJY71645.1| Tesmin/TSO1-like CXC domain-containing protein [Oxytricha
trifallax]
Length = 600
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 54/115 (46%), Positives = 71/115 (61%), Gaps = 10/115 (8%)
Query: 162 RCLKLYCECFAAGIYCD-GCNCVNCHNNVEH-EVARQEAVGATLERNPNAFRPKIAS-SP 218
+CLKLYC+CFA G C C C+NC N ++ E EA + RNP AF+PK+ P
Sbjct: 454 QCLKLYCDCFALGGVCGPDCGCLNCENKEQNQESVVLEARKKIIGRNPEAFQPKVVEVGP 513
Query: 219 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
AQ ++ DA + +H KGC+CK SGC K YCECFQ + C++NCRC+ C N
Sbjct: 514 --AQMSQNDADK-----QHRKGCNCKNSGCQKNYCECFQFGLECNKNCRCIGCSN 561
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
D + +K CNC+NS C K YCECF G+ C+ C C+ C N
Sbjct: 520 DADKQHRKGCNCKNSGCQKNYCECFQFGLECNKNCRCIGCSN 561
>gi|339238897|ref|XP_003381003.1| conserved hypothetical protein [Trichinella spiralis]
gi|316976020|gb|EFV59373.1| conserved hypothetical protein [Trichinella spiralis]
Length = 492
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 63/113 (55%), Gaps = 12/113 (10%)
Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
A + C+C NC NN+ +E+ R A+ + LERNP AF+PKI R D
Sbjct: 275 APVLKSNCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIG-------KGRADTERL- 326
Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
HNKGC+CKKS CLK YCEC++A + C+ C+C+ C+N E R H
Sbjct: 327 ----HNKGCNCKKSSCLKNYCECYEARVSCTVRCKCVGCRNTEADRNHRNRGH 375
Score = 45.1 bits (105), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
K CNC+ S CLK YCEC+ A + C C CV C N
Sbjct: 329 KGCNCKKSSCLKNYCECYEARVSCTVRCKCVGCRN 363
>gi|348678286|gb|EGZ18103.1| hypothetical protein PHYSODRAFT_504652 [Phytophthora sojae]
Length = 713
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 75/142 (52%), Gaps = 14/142 (9%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C C+ CLK+YC CF++ +C GC C +C N +H+ R EA+ L +P AF
Sbjct: 536 CGCKTG-CLKMYCMCFSSRGFCHAGCACDDCKNGRKHQTERVEAIQNYLANDPRAF--SF 592
Query: 215 ASSPHGAQDAREDAREAQLA-AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
AS P A Q + A +GC CKKS CLKKYCECFQ I C+ +CRC+DC N
Sbjct: 593 ASLPQSASTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECFQNGIACTSHCRCMDCSN 652
Query: 274 FEGS---------EERRALFHG 286
S ++R A HG
Sbjct: 653 HSESTAAHQHPHLQKRSATAHG 674
>gi|119626323|gb|EAX05918.1| hypothetical protein DKFZp686L1814, isoform CRA_b [Homo sapiens]
Length = 223
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 60/100 (60%), Gaps = 12/100 (12%)
Query: 187 NNVEHEVA-RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
NV + V Q A L+RNP AF+PKI G D R H+KGC+CK+
Sbjct: 29 GNVGYAVLPAQYVTQACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKR 77
Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 78 SGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 117
Score = 46.2 bits (108), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 29/49 (59%), Gaps = 3/49 (6%)
Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
GK+G ++ K CNC+ S CLK YCEC+ A I C C C+ C N E
Sbjct: 60 GKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEE 108
>gi|226502887|ref|NP_001148761.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|195621962|gb|ACG32811.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|413949786|gb|AFW82435.1| CXC domain containing TSO1-like protein 1 [Zea mays]
Length = 356
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 72/147 (48%), Gaps = 12/147 (8%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K C C+ SRCLKLYC CFA G YC D C C C N Q L R
Sbjct: 62 KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQ-KRMSM 120
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI P + EDA + + +GC+CKKS CLKKYC+C+Q CS CRC DC+
Sbjct: 121 KINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQ 180
Query: 273 NFEG-------SEERRALFHG---DHN 289
N G + +R L+ G DH+
Sbjct: 181 NPFGKNDGIIADDSKRYLYTGADLDHS 207
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 29/58 (50%), Gaps = 9/58 (15%)
Query: 138 RPRANTD-------GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
RP AN + TP + + CNC+ S CLK YC+C+ G C C C +C N
Sbjct: 125 RPEANAEPMEDAHHSSSSTPPR-RGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQN 181
>gi|242091037|ref|XP_002441351.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
gi|241946636|gb|EES19781.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
Length = 359
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/141 (40%), Positives = 69/141 (48%), Gaps = 9/141 (6%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K C C+ SRCLKLYC CF+ G YC D C C C N Q L R
Sbjct: 67 KHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQ-KRMSM 125
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI P + EDA + + +GC+CKKS CLKKYC+C+Q CS CRC DC+
Sbjct: 126 KINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQ 185
Query: 273 NFEGSEE-------RRALFHG 286
N G E +R L+ G
Sbjct: 186 NPFGKNEGIIADDSKRYLYTG 206
>gi|301097298|ref|XP_002897744.1| exosome complex exonuclease RRP41-like protein [Phytophthora
infestans T30-4]
gi|262106765|gb|EEY64817.1| exosome complex exonuclease RRP41-like protein [Phytophthora
infestans T30-4]
Length = 777
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/123 (43%), Positives = 68/123 (55%), Gaps = 11/123 (8%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C C+ CLK+YC CF++ +C GC C +C N + R EA+ + L +P AF
Sbjct: 540 CGCKTG-CLKMYCMCFSSRGFCHAGCACDDCKNGRNSQTERVEAIQSYLANDPRAF--SF 596
Query: 215 ASSPHGAQDAREDAREAQLAAKHN----KGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
AS P QD L K + +GC CKKS CLKKYCECFQ I C+ +CRC+D
Sbjct: 597 ASLP---QDTNTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECFQNGIACTSHCRCMD 653
Query: 271 CKN 273
C N
Sbjct: 654 CSN 656
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQ 196
+ C C+ S+CLK YCECF GI C C C++C N+ E A Q
Sbjct: 622 RGCRCKKSKCLKKYCECFQNGIACTSHCRCMDCSNHSESTAAHQ 665
Score = 45.4 bits (106), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C CK +GCLK YC CF + C C C DCKN S+ R
Sbjct: 540 CGCK-TGCLKMYCMCFSSRGFCHAGCACDDCKNGRNSQTER 579
>gi|115464805|ref|NP_001056002.1| Os05g0509400 [Oryza sativa Japonica Group]
gi|48475201|gb|AAT44270.1| unknown protein [Oryza sativa Japonica Group]
gi|113579553|dbj|BAF17916.1| Os05g0509400 [Oryza sativa Japonica Group]
gi|215766036|dbj|BAG98264.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218197077|gb|EEC79504.1| hypothetical protein OsI_20571 [Oryza sativa Indica Group]
Length = 374
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/163 (38%), Positives = 78/163 (47%), Gaps = 23/163 (14%)
Query: 149 TPKKQK-----------QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQ 196
TPK+QK C C+ SRCLKLYC CFA G YC + C C C N + Q
Sbjct: 70 TPKRQKVEESADGNGCKHCACKKSRCLKLYCPCFAGGGYCSEKCGCQPCFNKALYAETVQ 129
Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
L R KI +A EDA + + +GC+CKKS CLKKYC+C+
Sbjct: 130 TTRKVLLSRQKR-MSLKINRRSEANAEAVEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 188
Query: 257 QANILCSENCRCLDCKNFEG-------SEERRALFHG---DHN 289
Q CS CRC DC+N G E +R L+ G DH+
Sbjct: 189 QDGTGCSLFCRCEDCRNPFGKNEGIMAEESKRFLYTGADLDHS 231
>gi|224035483|gb|ACN36817.1| unknown [Zea mays]
Length = 356
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/147 (39%), Positives = 72/147 (48%), Gaps = 12/147 (8%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K C C+ SRCLKLYC CFA G YC D C C C + Q L R
Sbjct: 62 KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFSKEAFSETVQTTRKVLLSRQ-KRMSM 120
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI P + EDA + + +GC+CKKS CLKKYC+C+Q CS CRC DC+
Sbjct: 121 KINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQ 180
Query: 273 NFEG-------SEERRALFHG---DHN 289
N G + +R L+ G DH+
Sbjct: 181 NPFGKNDGIIADDSKRYLYTGADLDHS 207
Score = 39.3 bits (90), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 29/58 (50%), Gaps = 9/58 (15%)
Query: 138 RPRANTD-------GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
RP AN + TP + + CNC+ S CLK YC+C+ G C C C +C N
Sbjct: 125 RPEANAEPMEDAHHSSSSTPPR-RGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQN 181
>gi|226507366|ref|NP_001150621.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
gi|195640640|gb|ACG39788.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
gi|219884557|gb|ACL52653.1| unknown [Zea mays]
gi|414868874|tpg|DAA47431.1| TPA: Tesmin/TSO1-like CXC domain containing protein [Zea mays]
Length = 394
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 74/141 (52%), Gaps = 16/141 (11%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
S R R R DG DG K C+C+ SRCLKLYC CFA+G++C + C C CHN + +
Sbjct: 72 SDRKRRRKAEDG-DGC----KTCSCKKSRCLKLYCVCFASGLHCTESCGCEPCHNKPQLQ 126
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
+ A L K + QD E + + K C CKKSGCLKKY
Sbjct: 127 GTLRMATVLPL---------KPVQTFEAGQDIVEQVIRSPMDLIRRK-CTCKKSGCLKKY 176
Query: 253 CECFQANILCSENCRCLDCKN 273
C+C+Q CS NC+C DC+N
Sbjct: 177 CDCYQGGAGCSINCKCDDCRN 197
>gi|226500562|ref|NP_001150787.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|194699308|gb|ACF83738.1| unknown [Zea mays]
gi|195641826|gb|ACG40381.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|224028923|gb|ACN33537.1| unknown [Zea mays]
gi|413945974|gb|AFW78623.1| CXC domain containing TSO1-like protein 1 [Zea mays]
Length = 359
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 56/141 (39%), Positives = 68/141 (48%), Gaps = 9/141 (6%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K C C+ SRCLKLYC CF+ G YC D C C C N L R
Sbjct: 67 KHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFNKEAFAETVHTTRKVLLSRQ-KRMSM 125
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KI P + EDA + + +GC+CKKS CLKKYC+C+Q CS CRC DC+
Sbjct: 126 KINRRPEANTEPMEDAHHSSSSTPPKRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQ 185
Query: 273 NFEGSEE-------RRALFHG 286
N G E +R L+ G
Sbjct: 186 NPFGKNEGIMADDSKRYLYTG 206
Score = 42.0 bits (97), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 138 RPRANTD-------GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
RP ANT+ TP K + CNC+ S CLK YC+C+ G C C C +C N
Sbjct: 130 RPEANTEPMEDAHHSSSSTPPK-RGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQN 186
>gi|145513118|ref|XP_001442470.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409823|emb|CAK75073.1| unnamed protein product [Paramecium tetraurelia]
Length = 176
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 89/169 (52%), Gaps = 22/169 (13%)
Query: 89 QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDG 148
++ RSP ++ + R ++ +N V K PL K S + R T K
Sbjct: 22 RSQFRSPPKSPHRKLERATKLKFINFV---TLKKPL------KYTSDQITNRKRTSSKRR 72
Query: 149 TPKKQKQ-----CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGAT 202
P+ +++ CNC+ S+CLKLYC+CF AG+ C CNC +CHNN EH R+ +
Sbjct: 73 KPQTEQKPKIIVCNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREIVIKQI 132
Query: 203 LERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKK 251
+ERNP+AFRPK+ S + ED E +H KGC+CKKS KK
Sbjct: 133 MERNPSAFRPKVESKSNS-----ED--EQDHKPRHFKGCNCKKSNWSKK 174
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 27/43 (62%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
C+CKKS CLK YC+CF A + C ++C C C N + + R +
Sbjct: 85 CNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREI 127
>gi|390470886|ref|XP_003734371.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Callithrix jacchus]
Length = 466
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 78/121 (64%), Gaps = 11/121 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L +C+CFA+G + + CNC NC NN+ HE+ R +A+ A L R+P AF+P+I G
Sbjct: 305 LAGHCDCFASGDFGNNCNCNNCCNNLHHEIERFKAIKACLGRDPEAFQPQIGKGQLG--- 361
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
++ +H+KGC+C++ GCL+ YCEC +A I+CS C+C+ CKN+E S ER+ L
Sbjct: 362 --------EVRPRHSKGCNCRRLGCLRNYCECQEAKIMCSSICKCIGCKNYEESPERKTL 413
Query: 284 F 284
Sbjct: 414 M 414
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
P+ K CNCR CL+ YCEC A I C C C+ C N E
Sbjct: 365 PRHSKGCNCRRLGCLRNYCECQEAKIMCSSICKCIGCKNYEE 406
>gi|449274469|gb|EMC83611.1| Tesmin, partial [Columba livia]
Length = 289
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 86/156 (55%), Gaps = 18/156 (11%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CFA G +C CNC NC NN HE R A+ A LERNP AF PKI G
Sbjct: 151 YCDCFANGDFCKNCNCDNCCNNQLHETERLTAIKACLERNPEAFLPKIGKRKLG------ 204
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS-EERRALFH 285
++ HNKGC+CK+SGCL+ YCECF+A I+CS C+C CKN+E S +E + L
Sbjct: 205 -----EIKHHHNKGCNCKRSGCLENYCECFEAKIVCSSICKCTGCKNYEESPDENKQL-- 257
Query: 286 GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRK 321
N I +M N V +S + L S K +
Sbjct: 258 ---NVINYMD-IGNKEGNSPVLTSAFNMLPKSRKDR 289
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 22/38 (57%), Gaps = 1/38 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
K CNC+ S CL+ YCECF A I C C C C N E
Sbjct: 212 KGCNCKRSGCLENYCECFEAKIVCSSICKCTGCKNYEE 249
>gi|222632736|gb|EEE64868.1| hypothetical protein OsJ_19725 [Oryza sativa Japonica Group]
Length = 223
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 73/137 (53%), Gaps = 15/137 (10%)
Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
+C+C S C +LYC CF + +C D CNC C+N +E A +E +NPNAF P+
Sbjct: 9 KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68
Query: 214 I--------ASSPHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQAN 259
I A+ P + A D + A KH KGC C+KS C K YCECF+ +
Sbjct: 69 IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128
Query: 260 ILCSENCRCLDCKNFEG 276
+ C+ C+CL+C N G
Sbjct: 129 VGCTAKCKCLECSNSFG 145
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
C C +S C + YC CF++ CS+NC C C N + E+
Sbjct: 10 CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48
>gi|384246119|gb|EIE19610.1| hypothetical protein COCSUDRAFT_9740, partial [Coccomyxa
subellipsoidea C-169]
Length = 90
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 63/95 (66%), Positives = 67/95 (70%), Gaps = 9/95 (9%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS----SPHGAQ 222
YCECFA+G YCDGCNCVNC NN E E RQ AV A LERNPNAFRPKI G
Sbjct: 1 YCECFASGRYCDGCNCVNCCNNKESEQVRQAAVEAILERNPNAFRPKIQVLEPFCTTGTH 60
Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
A + A AA+HNKGC+CKKSGCLKKYCECFQ
Sbjct: 61 VAIQGA-----AARHNKGCNCKKSGCLKKYCECFQ 90
>gi|403347003|gb|EJY72913.1| Transcription factor, putative [Oxytricha trifallax]
Length = 286
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/138 (37%), Positives = 74/138 (53%), Gaps = 12/138 (8%)
Query: 157 NCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 216
NC N+ C+K YC+CF YC CN C NN E+E RQ+A+ L +NP AF K +
Sbjct: 144 NCSNTYCIKRYCQCFKLNKYCTNCNHPECKNNQENEEERQKAIDYILSKNPQAFMTKNKT 203
Query: 217 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC--LDCKNF 274
+ + +++D+ C+C KSGC KYC C + ILCS+NC+C CKN
Sbjct: 204 RKNFIKKSKKDSNLIHT-------CNCSKSGCNNKYCVCLKEGILCSKNCKCNHRKCKNK 256
Query: 275 EGS---EERRALFHGDHN 289
+G ++R + D N
Sbjct: 257 QGDRNQKQRSVIISNDEN 274
>gi|340507878|gb|EGR33739.1| tesmin tso1-like cxc domain protein [Ichthyophthirius multifiliis]
Length = 142
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/128 (42%), Positives = 72/128 (56%), Gaps = 18/128 (14%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNC----HNNVEHEVARQEAVGATLERNPNAF 210
C C+ SRCL+LYCECF GI+C+ C C NC +N + E A+QEA RNP+AF
Sbjct: 15 CKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQIESAKQEAKM----RNPDAF 70
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
K + H KGC+C K+ C KKYCECF A I C+ENC+C +
Sbjct: 71 SQKFLVVKQNQYEG---------IVSHKKGCNCTKTQCTKKYCECFNAGIKCTENCKCEN 121
Query: 271 CKNFEGSE 278
C+N++ +
Sbjct: 122 CENYKDEQ 129
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 35/70 (50%), Gaps = 9/70 (12%)
Query: 127 LPPGKQESPRSRPRANT--------DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC- 177
+ KQE+ P A + + +G +K CNC ++C K YCECF AGI C
Sbjct: 55 IESAKQEAKMRNPDAFSQKFLVVKQNQYEGIVSHKKGCNCTKTQCTKKYCECFNAGIKCT 114
Query: 178 DGCNCVNCHN 187
+ C C NC N
Sbjct: 115 ENCKCENCEN 124
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 28/40 (70%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 280
C CKKS CL+ YCECF I C+++C C +C N E ++++
Sbjct: 15 CKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQ 54
>gi|48475095|gb|AAT44164.1| hypothetical protein, contains Tesmin/TSO1-like CXC domain, PF03638
[Oryza sativa Japonica Group]
Length = 194
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 73/137 (53%), Gaps = 15/137 (10%)
Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
+C+C S C +LYC CF + +C D CNC C+N +E A +E +NPNAF P+
Sbjct: 9 KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68
Query: 214 I--------ASSPHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQAN 259
I A+ P + A D + A KH KGC C+KS C K YCECF+ +
Sbjct: 69 IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128
Query: 260 ILCSENCRCLDCKNFEG 276
+ C+ C+CL+C N G
Sbjct: 129 VGCTAKCKCLECSNSFG 145
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
C C +S C + YC CF++ CS+NC C C N + E+
Sbjct: 10 CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48
>gi|118356010|ref|XP_001011264.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89293031|gb|EAR91019.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 617
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 46/126 (36%), Positives = 66/126 (52%), Gaps = 14/126 (11%)
Query: 167 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
YCECF G YC D C C C NN ++ R++A+ RN +AF K + +
Sbjct: 357 YCECFLRGNYCNDQCICTECGNNDKNLEEREKAIDEAKTRNQDAFNLKFQITSNST---- 412
Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
H KGCHCK++ CLKKYCECF A + C+ NC+C +C+N + + +
Sbjct: 413 ---------VTHKKGCHCKRTHCLKKYCECFNAGLKCTNNCKCEECRNTQDGLKTMGISG 463
Query: 286 GDHNGI 291
G +G+
Sbjct: 464 GGTSGL 469
Score = 44.7 bits (104), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
+ T +K C+C+ + CLK YCECF AG+ C + C C C N
Sbjct: 410 NSTVTHKKGCHCKRTHCLKKYCECFNAGLKCTNNCKCEECRN 451
>gi|219112185|ref|XP_002177844.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410729|gb|EEC50658.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 455
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 51/126 (40%), Positives = 69/126 (54%), Gaps = 22/126 (17%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG--CNCVNCHNNVEHEVARQEAVGATLERNP 207
P Q CNC SRCLKLYC CF AG C C C+ C N+VE R++A+ TL++ P
Sbjct: 332 PAGQVACNCIRSRCLKLYCTCFQAGKMCQPGICTCIACANSVEDHPERKQAIKHTLQKRP 391
Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
+AF+ K P G GC CK + C++KYCECF+ + C++ C
Sbjct: 392 DAFQTK--DRPVGL------------------GCACKNNKCIRKYCECFRNGLSCADKCC 431
Query: 268 CLDCKN 273
CL+C+N
Sbjct: 432 CLNCEN 437
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/133 (38%), Positives = 66/133 (49%), Gaps = 19/133 (14%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD--GCNCVNCHNNVEH---EVARQEAVGATLERNPNAF 210
C+C S+C+ LYC+CF AG C+ CNC C N + + AR A+ A L RNP AF
Sbjct: 260 CSCPKSKCVALYCDCFKAGRRCNPINCNCTACKNTINESGPQGARTNAIRAILARNPRAF 319
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCL 269
G A + Q+A C+C +S CLK YC CFQA +C C C+
Sbjct: 320 ------VTAGVASAVQKLPAGQVA------CNCIRSRCLKLYCTCFQAGKMCQPGICTCI 367
Query: 270 DCKN-FEGSEERR 281
C N E ER+
Sbjct: 368 ACANSVEDHPERK 380
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 240 GCHCKKSGCLKKYCECFQANILCSE-NCRCLDCKN 273
GC C KS C+ YC+CF+A C+ NC C CKN
Sbjct: 259 GCSCPKSKCVALYCDCFKAGRRCNPINCNCTACKN 293
>gi|224007227|ref|XP_002292573.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971435|gb|EED89769.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 886
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 52/127 (40%), Positives = 65/127 (51%), Gaps = 17/127 (13%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVA---RQEAVGATLERN 206
KK C C ++CL LYC+CF I+C + C C++C N VE+ A R AV LER
Sbjct: 6 KKAPSCKCTKTKCLLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLERR 65
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
P AF RE A+ K + GC C K+ CLKKYC CF A + C C
Sbjct: 66 PEAF-------------GRESGAFAERRNKKDGGCTCAKNSCLKKYCVCFGAKMQCRARC 112
Query: 267 RCLDCKN 273
C +CKN
Sbjct: 113 NCSNCKN 119
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 46/126 (36%), Positives = 63/126 (50%), Gaps = 11/126 (8%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNA 209
KK C C + CLK YC CF A + C CNC NC N + ++ V T+E
Sbjct: 82 KKDGGCTCAKNSCLKKYCVCFGAKMQCRARCNCSNCKNPF-GTIRIEDGVTCTIEGGKVN 140
Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 269
A P R+ AK N C C ++ CLK YC+CF+ +++C++NC CL
Sbjct: 141 ALKGAAFVP---------IRKLANLAKGNTKCSCTRAACLKLYCDCFRQSMVCNDNCSCL 191
Query: 270 DCKNFE 275
+CKN E
Sbjct: 192 ECKNSE 197
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 58/129 (44%), Gaps = 24/129 (18%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEV---ARQEAVGATLERN 206
K +C+C + CLKLYC+CF + C D C+C+ C N+ E+ R + LE+
Sbjct: 158 KGNTKCSCTRAACLKLYCDCFRQSMVCNDNCSCLECKNSEEYSGPMGERTLVIKDILEKR 217
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS-EN 265
P+ F+ K A G GC C+K+ C YC+C + C
Sbjct: 218 PHIFQ-KTAKRKAGG------------------GCSCQKNKCRAGYCDCKGSGGKCDPAV 258
Query: 266 CRCLDCKNF 274
C C++C N
Sbjct: 259 CTCINCDNM 267
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 38/77 (49%), Gaps = 15/77 (19%)
Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN--------------FEGSEE 279
AAK C C K+ CL YC+CF NI C E+C C+DCKN EG E
Sbjct: 4 AAKKAPSCKCTKTKCLLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLE 63
Query: 280 RRALFHGDHNGIAFMQR 296
RR G +G AF +R
Sbjct: 64 RRPEAFGRESG-AFAER 79
>gi|428163094|gb|EKX32185.1| hypothetical protein GUITHDRAFT_148793 [Guillardia theta CCMP2712]
Length = 496
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 63/113 (55%), Gaps = 11/113 (9%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
++ YC CF A + C+GC+CV+C+N+ +HE R A+ +P AF ++
Sbjct: 297 VRQYCVCFRAALLCEGCDCVDCYNDGQHEQERLAAIEHIKTSDPLAFADRVR-------- 348
Query: 224 AREDAREAQLAAK---HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
A DA + +K H +GC CK S CLKKYCECF+ + CS C C DC N
Sbjct: 349 AEGDAASVSVESKPKQHVRGCKCKNSKCLKKYCECFEFGVSCSSKCDCKDCMN 401
>gi|242086178|ref|XP_002443514.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
gi|241944207|gb|EES17352.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
Length = 409
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 73/141 (51%), Gaps = 16/141 (11%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
S R R R DG DG K C+C+ SRC+KLYC CFA+G C + C C C N +
Sbjct: 87 SDRKRRRKAEDG-DGC----KTCSCKKSRCMKLYCVCFASGSRCTESCGCEPCENKQPLQ 141
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
VA + A+ L K + QD E + + K C CKKSGCLKKY
Sbjct: 142 VAPRTALILPL---------KPVQTSEAGQDIVEQVIRSPMDLIRRK-CTCKKSGCLKKY 191
Query: 253 CECFQANILCSENCRCLDCKN 273
C+C+Q CS NC+C DC+N
Sbjct: 192 CDCYQGGAGCSINCKCDDCRN 212
>gi|145496358|ref|XP_001434170.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401293|emb|CAK66773.1| unnamed protein product [Paramecium tetraurelia]
Length = 387
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/132 (39%), Positives = 74/132 (56%), Gaps = 15/132 (11%)
Query: 155 QCNCRNSRCLKLYCECFAA-GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
QCNC+ S+CLKLYCECFA G+ CNC C N +++ R +A+ L RN +AF
Sbjct: 194 QCNCKKSKCLKLYCECFANNGVCSQSCNCQECKNRIDNPQERSKAIEEALLRNSDAFAQC 253
Query: 214 IAS--SPHGAQDA----------REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
+ +P Q +E +++ +KGC+CKKSGC KKYCEC+ N+
Sbjct: 254 FTTKGAPQFVQQGIFNFIQDKPIKEPSKDNSSVV--HKGCNCKKSGCKKKYCECYSQNLK 311
Query: 262 CSENCRCLDCKN 273
C++ C+C C N
Sbjct: 312 CNDLCKCEHCLN 323
Score = 41.2 bits (95), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVG 200
KD + K CNC+ S C K YCEC++ + C D C C +C N + ++ + G
Sbjct: 281 KDNSSVVHKGCNCKKSGCKKKYCECYSQNLKCNDLCKCEHCLNKTDAQIQASQEQG 336
>gi|428176775|gb|EKX45658.1| hypothetical protein GUITHDRAFT_152657 [Guillardia theta CCMP2712]
Length = 434
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/307 (28%), Positives = 131/307 (42%), Gaps = 48/307 (15%)
Query: 143 TDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGA 201
TD +K+K C C ++CLK YC CF + C + C CV+CHN+ +HE AR AV
Sbjct: 4 TDNAQKKSEKEKGCRCGRTKCLKQYCACFRNDVRCTNDCVCVDCHNDGKHEQARMMAVRL 63
Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
+P AF+ + ++ K +GC C++S C KKYCECF A +
Sbjct: 64 VRLNDPMAFKGT------SLELENQEVHTPNGTLKTVRGCRCRRSKCQKKYCECFGAGLK 117
Query: 262 CSENCRCLDCKNFEGSEERRALF--HGDHNGIAFMQRA--ANAAIIGAVGSSGY-----G 312
CS NC C C N G++ + + G + +A M A A ++ G + +
Sbjct: 118 CSTNCVCEGCLN--GNDPSKPFYPSGGSASSVARMAPAIPAGFSVSNMAGKTHFDRTSKA 175
Query: 313 TLMTSNKRKSEELLFG---------VAAKDQSV----------IRNPQSQQGN------- 346
+L TS + +L F A K QS+ +RNP S +
Sbjct: 176 SLDTSRASRPPQLSFSRTSAPSAEQKAEKSQSLASRRPPISVEVRNPTSMRVACAVTEIP 235
Query: 347 -NVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEAT 405
+ S P +Q V + + S+L +P AD++ +E SL +SS
Sbjct: 236 LHDSRSRAPVAQEVKLEDKVEVNKENKPLSSLDNPAADLVL---ERESSSLNYYMSSTPQ 292
Query: 406 KTLTSIY 412
LT +
Sbjct: 293 LNLTPTW 299
>gi|218197350|gb|EEC79777.1| hypothetical protein OsI_21187 [Oryza sativa Indica Group]
Length = 214
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/137 (37%), Positives = 72/137 (52%), Gaps = 15/137 (10%)
Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
+C+C S C +LYC CF + +C D CNC C+N +E A +E +NPNAF P+
Sbjct: 9 KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68
Query: 214 I--------ASSPHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQAN 259
I A+ P + A D + A KH KGC C+KS C K YCECF+ +
Sbjct: 69 IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128
Query: 260 ILCSENCRCLDCKNFEG 276
+ C+ C+C +C N G
Sbjct: 129 VGCTAKCKCQECSNSFG 145
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
C C +S C + YC CF++ CS+NC C C N + E+
Sbjct: 10 CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48
>gi|326504624|dbj|BAK06603.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 405
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 77/147 (52%), Gaps = 20/147 (13%)
Query: 139 PRANTDGKDGTPKKQK--------QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNV 189
P A +G KK+K C+C+ SRCLKLYC CFA+G +C + C C C+N
Sbjct: 72 PPAEANGSSDRRKKRKAEDGEGCRTCSCKKSRCLKLYCVCFASGSHCSELCGCDPCYNKP 131
Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
H V + G L+ + + Q++ E AR Q+ K C CKKSGCL
Sbjct: 132 IHGVQQNTPPGLPLQ---------VVRTAEAGQNSAEFAR-YQMDFFRRK-CTCKKSGCL 180
Query: 250 KKYCECFQANILCSENCRCLDCKNFEG 276
KKYC+C+Q CS NC+C DCKN G
Sbjct: 181 KKYCDCYQGGAGCSINCKCDDCKNPYG 207
>gi|149061864|gb|EDM12287.1| rCG47760, isoform CRA_b [Rattus norvegicus]
Length = 371
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 62/96 (64%), Gaps = 15/96 (15%)
Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+PK+ GA
Sbjct: 275 LAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 330
Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
R H+KGC+CK+SGCLK YCEC++A+
Sbjct: 331 LR-----------HSKGCNCKRSGCLKNYCECYEAS 355
>gi|145506479|ref|XP_001439200.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406384|emb|CAK71803.1| unnamed protein product [Paramecium tetraurelia]
Length = 175
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 53/121 (43%), Positives = 64/121 (52%), Gaps = 25/121 (20%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K CNC+ S+CLK YC+CFA G C + CNCV C NN + R+EA R+P AF+
Sbjct: 76 KTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKEAKLQIQTRDPRAFKQ 135
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KGC+CKKSGC KKYCECFQ N+ C+ CRC C
Sbjct: 136 AF------------------------KGCNCKKSGCQKKYCECFQNNLQCTHQCRCEGCV 171
Query: 273 N 273
N
Sbjct: 172 N 172
>gi|145518329|ref|XP_001445042.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412475|emb|CAK77645.1| unnamed protein product [Paramecium tetraurelia]
Length = 190
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 64/121 (52%), Gaps = 25/121 (20%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K CNC+ S+CLK YC+CFA G C + CNCV C NN + R++A R+P AF+
Sbjct: 91 KTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKDAKLQIQARDPGAFKQ 150
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KGC+CKKSGC KKYCECFQ N+ C+ CRC C
Sbjct: 151 AF------------------------KGCNCKKSGCQKKYCECFQNNLQCTHQCRCEGCV 186
Query: 273 N 273
N
Sbjct: 187 N 187
>gi|170044124|ref|XP_001849708.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867389|gb|EDS30772.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 586
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/82 (50%), Positives = 56/82 (68%), Gaps = 1/82 (1%)
Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
G ++ R P T D KK + CNC S+CLKLYC+CFA G +C CNC +C+NN+
Sbjct: 506 GSEDETRKGPAVGTIFPDEAYKK-RPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNL 564
Query: 190 EHEVARQEAVGATLERNPNAFR 211
++E RQ+A+ ATLERNP+AF+
Sbjct: 565 DNEEERQKAIRATLERNPSAFK 586
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C+C KS CLK YC+CF C NC C DC N +EE R
Sbjct: 531 CNCTKSQCLKLYCDCFANGEFC-YNCNCRDCYNNLDNEEER 570
>gi|168049832|ref|XP_001777365.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671214|gb|EDQ57769.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 120
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/118 (45%), Positives = 67/118 (56%), Gaps = 8/118 (6%)
Query: 167 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA----SSPH-- 219
YCECFAAG+YC G C C +C N E+ RNP AF PKI SSP
Sbjct: 1 YCECFAAGLYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIVQGAESSPVPG 60
Query: 220 -GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
+ ++ + +A+H +GC+CKKS CLKKYCEC+QA + CSE CRC C N G
Sbjct: 61 VSSNQKLDEVLDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMNKYG 118
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 47/93 (50%), Gaps = 10/93 (10%)
Query: 105 RPQHVRMVNRVPHPVH-KLPL---PTLPPGKQESPRSRPRAN--TDGKDGTP---KKQKQ 155
+P+++ V + + PL P + G + SP +N D TP + ++
Sbjct: 23 KPEYIETVINTRQQIESRNPLAFAPKIVQGAESSPVPGVSSNQKLDEVLDTPASARHKRG 82
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
CNC+ S CLK YCEC+ AG+ C +GC C C N
Sbjct: 83 CNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMN 115
>gi|149061865|gb|EDM12288.1| rCG47760, isoform CRA_c [Rattus norvegicus]
gi|149061866|gb|EDM12289.1| rCG47760, isoform CRA_c [Rattus norvegicus]
Length = 191
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 61/93 (65%), Gaps = 15/93 (16%)
Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+PK+ GA R
Sbjct: 98 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 152
Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
H+KGC+CK+SGCLK YCEC++A+
Sbjct: 153 ----------HSKGCNCKRSGCLKNYCECYEAS 175
>gi|145525306|ref|XP_001448475.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416019|emb|CAK81078.1| unnamed protein product [Paramecium tetraurelia]
Length = 188
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/121 (42%), Positives = 64/121 (52%), Gaps = 25/121 (20%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K CNC+ S+CLK YC+CFA G C + CNCV C NN + R+EA + R+P AF+
Sbjct: 89 KSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINRDPGAFKQ 148
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KGC+CKKSGC KKYCECF + + C+ CRC C
Sbjct: 149 SF------------------------KGCNCKKSGCQKKYCECFLSGLACTHLCRCDGCL 184
Query: 273 N 273
N
Sbjct: 185 N 185
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 31/45 (68%)
Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
+ K C+CKKS CLK+YC+CF +CSENC C+ C N + E+R
Sbjct: 87 NTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQR 131
>gi|145536949|ref|XP_001454192.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421945|emb|CAK86795.1| unnamed protein product [Paramecium tetraurelia]
Length = 196
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/122 (40%), Positives = 65/122 (53%), Gaps = 25/122 (20%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
++ CNC+ S+CLK YC+CFA G C + CNCV C NN + R+EA + R+P AF+
Sbjct: 96 ERSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINRDPGAFK 155
Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
KGC+CKKSGC KKYCECF + + C+ CRC C
Sbjct: 156 QSF------------------------KGCNCKKSGCQKKYCECFLSGLACTHLCRCDGC 191
Query: 272 KN 273
N
Sbjct: 192 LN 193
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 53/99 (53%), Gaps = 9/99 (9%)
Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK---HNKGCH 242
N +E E +E V +++ F+ ++++ + + Q+ +K + + C+
Sbjct: 47 QNRIEME---EETVETSIQL---VFKEYLSTNQDYYFLLQISQKSLQIQSKLFINERSCN 100
Query: 243 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
CKKS CLK+YC+CF +CSENC C+ C N + E+R
Sbjct: 101 CKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQR 139
>gi|168057921|ref|XP_001780960.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667594|gb|EDQ54220.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 159
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/132 (40%), Positives = 70/132 (53%), Gaps = 22/132 (16%)
Query: 167 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA- 224
YCECFA G+YC G C C +C N E+ RNP AF PKI GA+ +
Sbjct: 1 YCECFALGVYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIV---QGAEPSP 57
Query: 225 -----------------REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
+++A + +A+H +GC+CKKS CLKKYCEC+QA + CSE CR
Sbjct: 58 VPGVTLPAAEACVQLVVQDEALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCR 117
Query: 268 CLDCKNFEGSEE 279
C C N G +E
Sbjct: 118 CEGCMNKYGKKE 129
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
+ + ++ CNC+ S CLK YCEC+ AG+ C +GC C C N
Sbjct: 84 SARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMN 123
>gi|195153923|ref|XP_002017873.1| GL17406 [Drosophila persimilis]
gi|194113669|gb|EDW35712.1| GL17406 [Drosophila persimilis]
Length = 554
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 58/94 (61%)
Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
+SP+ TD G ++K CNC S+CLKLYC+CFA G +C C C +C NN+++E
Sbjct: 456 KSPQLASAPTTDALAGMASRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYE 515
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
V R+ A+ + L+RNP+AF K + A+D +
Sbjct: 516 VERERAIRSCLDRNPSAFNLKHGEDSNNAKDVKR 549
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 38/84 (45%), Gaps = 6/84 (7%)
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
+I SP A DA A K C+C KS CLK YC+CF C ++C C DC
Sbjct: 453 EILKSPQLASAPTTDALAGM--ASRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCF 509
Query: 273 N---FEGSEERRALFHGDHNGIAF 293
N +E ER D N AF
Sbjct: 510 NNLDYEVERERAIRSCLDRNPSAF 533
>gi|145484384|ref|XP_001428202.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395286|emb|CAK60804.1| unnamed protein product [Paramecium tetraurelia]
Length = 164
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/141 (38%), Positives = 75/141 (53%), Gaps = 31/141 (21%)
Query: 136 RSRPRANTDGKDGTP--KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHE 192
R + R T+ P +KQ+QC C+ S+CLK+YC+C A G YC C+C NCHN+ +H
Sbjct: 20 RQQKRVITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDKHP 79
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
R + + +E+N +A QL C+C+KS C KKY
Sbjct: 80 EQRSKVLEQMVEKNSSA----------------------QLK------CNCRKSNCQKKY 111
Query: 253 CECFQANILCSENCRCLDCKN 273
CEC+ A + CSE C+C DCKN
Sbjct: 112 CECYNAGVKCSELCKCDDCKN 132
>gi|302824819|ref|XP_002994049.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
gi|300138103|gb|EFJ04883.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
Length = 144
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 65/118 (55%), Gaps = 8/118 (6%)
Query: 167 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH------ 219
YCECFAAG+YC D C C +C N E RNP AF PKI +
Sbjct: 1 YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKIVQTSDNTPTEG 60
Query: 220 -GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
+ + + + +A+H +GC+CKKS CLKKYCEC+QA + CS+ C+C CKN G
Sbjct: 61 VASFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKNIYG 118
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 28/41 (68%), Gaps = 1/41 (2%)
Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
G+ + ++ CNC+ S CLK YCEC+ AG+ C DGC C C N
Sbjct: 75 GSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKN 115
>gi|145479163|ref|XP_001425604.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392675|emb|CAK58206.1| unnamed protein product [Paramecium tetraurelia]
Length = 164
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 77/145 (53%), Gaps = 31/145 (21%)
Query: 132 QESPRSRPRANTDGKDGTP--KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNN 188
Q R + R T+ P +KQ+QC C+ S+CLK+YC+C A G YC C+C NCHN+
Sbjct: 16 QTPTRQQKRVITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHND 75
Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
+H R + + +E+N +P AQ+ C+C+KS C
Sbjct: 76 DKHPEQRNKVLEQMVEKN----------TP------------AQIK------CNCRKSNC 107
Query: 249 LKKYCECFQANILCSENCRCLDCKN 273
KKYCEC+ A + CSE C+C DCKN
Sbjct: 108 QKKYCECYNAGVKCSELCKCDDCKN 132
>gi|348532345|ref|XP_003453667.1| PREDICTED: hypothetical protein LOC100710529 [Oreochromis
niloticus]
Length = 310
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 52/77 (67%), Gaps = 11/77 (14%)
Query: 203 LERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILC 262
L NP+AFRPKIA GA + HNKGC+CK+SGCLK+YCEC++ANI+C
Sbjct: 172 LGHNPDAFRPKIAGVKSGA-----------VRGWHNKGCNCKRSGCLKRYCECYEANIMC 220
Query: 263 SENCRCLDCKNFEGSEE 279
+ +C+C+ C+N++ +
Sbjct: 221 TSSCKCVGCRNYDAGSQ 237
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
K CNC+ S CLK YCEC+ A I C C CV C N
Sbjct: 197 KGCNCKRSGCLKRYCECYEANIMCTSSCKCVGCRN 231
>gi|223974815|gb|ACN31595.1| unknown [Zea mays]
Length = 391
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/159 (37%), Positives = 78/159 (49%), Gaps = 18/159 (11%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
S R R R DG DG K C+C+ SRCLKLYC C+A+G +C + C C CHN
Sbjct: 61 SDRKRRRKADDG-DGC----KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQ- 114
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
GA +P + Q+ E + + K C CKKSGCLKKY
Sbjct: 115 -------GAPWTAPVLPLKP--VQTLEAGQEIVEQLIRSPMDLIRRK-CTCKKSGCLKKY 164
Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGI 291
C+C+Q CS NC+C DC+N G + + G +G+
Sbjct: 165 CDCYQGGAGCSINCKCDDCRNPFG-RKVGVILDGKSSGL 202
>gi|339238879|ref|XP_003380994.1| conserved hypothetical protein [Trichinella spiralis]
gi|316976043|gb|EFV59390.1| conserved hypothetical protein [Trichinella spiralis]
Length = 238
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 72/136 (52%), Gaps = 20/136 (14%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
++ CNC S+CLKLYC+CFA G +C C+C NC NN+ +E+ R A+ + LERNP AF+P
Sbjct: 55 RRPCNCTKSQCLKLYCDCFANGEFCSNCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQP 114
Query: 213 KIASSPHGAQDAREDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCL 269
KI + + +++ AK GC + +K A + C+ C+C+
Sbjct: 115 KIGN------------KTSEIMAKDELTLNGCT-TRDAIVKSQ----AARVSCTVRCKCV 157
Query: 270 DCKNFEGSEERRALFH 285
C+N E R H
Sbjct: 158 GCRNTEADRNHRNRGH 173
>gi|219362923|ref|NP_001137104.1| uncharacterized protein LOC100217281 [Zea mays]
gi|194698372|gb|ACF83270.1| unknown [Zea mays]
gi|194708474|gb|ACF88321.1| unknown [Zea mays]
gi|414877953|tpg|DAA55084.1| TPA: hypothetical protein ZEAMMB73_574510 [Zea mays]
Length = 394
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 81/162 (50%), Gaps = 24/162 (14%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
S R R R DG DG K C+C+ SRCLKLYC C+A+G +C + C C CHN
Sbjct: 63 SDRKRRRKADDG-DGC----KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQ- 116
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN---KGCHCKKSGCL 249
GA P + P +A ++ E + + + + C CKKSGCL
Sbjct: 117 -------GAPWTA------PVLPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCL 163
Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGI 291
KKYC+C+Q CS NC+C DC+N G + + G +G+
Sbjct: 164 KKYCDCYQGGAGCSINCKCDDCRNPFG-RKVGVILDGKSSGL 204
>gi|303283039|ref|XP_003060811.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
gi|226458282|gb|EEH55580.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
Length = 625
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 56/108 (51%), Gaps = 14/108 (12%)
Query: 166 LYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
LYCECFAAG +C C+C NC N ++ Q RNP AF KI ++
Sbjct: 280 LYCECFAAGAFCQDCSCQNCQNTPDNAALVQMTRTQIELRNPQAFANKIVAT-------- 331
Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
D E H K KS CLKKYCECFQA +LC E C+C CKN
Sbjct: 332 -DGEEKHKKGCHCK-----KSACLKKYCECFQAGVLCQEYCKCEGCKN 373
>gi|145501814|ref|XP_001436887.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404032|emb|CAK69490.1| unnamed protein product [Paramecium tetraurelia]
Length = 159
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 50/121 (41%), Positives = 62/121 (51%), Gaps = 25/121 (20%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K CNC+ S+CLK YC+CFA G C + CNCV C NN + R+ A + R+P AF+
Sbjct: 60 KSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNIEQRKVAKVQIVNRDPGAFKQ 119
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
KGC+CKKSGC KKYCECF + C+ CRC C
Sbjct: 120 SF------------------------KGCNCKKSGCQKKYCECFLNGLTCTHLCRCDGCL 155
Query: 273 N 273
N
Sbjct: 156 N 156
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 32/47 (68%)
Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+ K C+CKKS CLK+YC+CF +CSENC C+ C N + E+R +
Sbjct: 58 NTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNIEQRKV 104
>gi|428185670|gb|EKX54522.1| hypothetical protein GUITHDRAFT_63344, partial [Guillardia theta
CCMP2712]
Length = 125
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 69/130 (53%), Gaps = 21/130 (16%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K++ C C ++CLK YC+CF I C C C +CHN+ +HE R EA+ NP+AF
Sbjct: 9 KERGCRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRHIRMNNPSAF 68
Query: 211 R-------PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 263
+ + ++P G + K +GC CK+S C KKYCECF A I C+
Sbjct: 69 KGTALELEDQEVTTPKGGK-------------KTVRGCRCKRSKCQKKYCECFSAGIPCT 115
Query: 264 ENCRCLDCKN 273
NC C DC N
Sbjct: 116 SNCVCTDCAN 125
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 30/43 (69%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
+GC C ++ CLK+YC+CF+ +I C+ +C C DC N EE+R
Sbjct: 11 RGCRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKR 53
Score = 38.5 bits (88), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
G K + C C+ S+C K YCECF+AGI C C C +C N
Sbjct: 85 GGKKTVRGCRCKRSKCQKKYCECFSAGIPCTSNCVCTDCAN 125
>gi|413949787|gb|AFW82436.1| hypothetical protein ZEAMMB73_293494 [Zea mays]
Length = 341
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/105 (42%), Positives = 53/105 (50%), Gaps = 2/105 (1%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K C C+ SRCLKLYC CFA G YC D C C C N Q L R
Sbjct: 62 KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQ-KRMSM 120
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
KI P + EDA + + +GC+CKKS CLKKYC+C+Q
Sbjct: 121 KINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQ 165
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 22/41 (53%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
K C CKKS CLK YC CF CS+ C C C N E E
Sbjct: 62 KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSE 102
>gi|308808189|ref|XP_003081405.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
gi|116059867|emb|CAL55574.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
Length = 466
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 50/91 (54%), Gaps = 14/91 (15%)
Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 244
C N VE+ ++ RNPNAF KI +D +DAR H KGCHCK
Sbjct: 171 CQNTVENVSVVEKTRSQIEARNPNAFLSKI-------EDDGDDAR-------HTKGCHCK 216
Query: 245 KSGCLKKYCECFQANILCSENCRCLDCKNFE 275
KS CLKKYCECFQA + C E C+C C N E
Sbjct: 217 KSACLKKYCECFQAGVKCQEYCKCEGCMNKE 247
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 27/45 (60%), Gaps = 3/45 (6%)
Query: 144 DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
DG D + K C+C+ S CLK YCECF AG+ C + C C C N
Sbjct: 203 DGDDA--RHTKGCHCKKSACLKKYCECFQAGVKCQEYCKCEGCMN 245
>gi|224092440|ref|XP_002309611.1| predicted protein [Populus trichocarpa]
gi|222855587|gb|EEE93134.1| predicted protein [Populus trichocarpa]
Length = 847
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 55/103 (53%), Gaps = 6/103 (5%)
Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----AAKHNKG 240
C N E+E E RNP AF PKI H + D + L + +H G
Sbjct: 444 CFNRPEYEDTVLETRQQIESRNPLAFAPKIVQ--HVTEFQAIDVEDVDLFTPYSGRHKTG 501
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
C+CK+S C+KKYCEC+QAN+ CS CRC C+N G +E A+
Sbjct: 502 CNCKRSMCVKKYCECYQANVGCSNACRCEGCRNIHGRKEEYAM 544
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 21/33 (63%), Gaps = 1/33 (3%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
CNC+ S C+K YCEC+ A + C + C C C N
Sbjct: 502 CNCKRSMCVKKYCECYQANVGCSNACRCEGCRN 534
>gi|219363019|ref|NP_001136646.1| uncharacterized protein LOC100216775 [Zea mays]
gi|194696492|gb|ACF82330.1| unknown [Zea mays]
Length = 278
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 37/80 (46%), Positives = 50/80 (62%)
Query: 205 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
RNP AF PK+ + + R+ + + +A+H +GC+CKKS CLKKYCECFQ + CS
Sbjct: 20 RNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSI 79
Query: 265 NCRCLDCKNFEGSEERRALF 284
+CRC CKN G E A+
Sbjct: 80 SCRCEGCKNAFGKREGAAVL 99
Score = 45.1 bits (105), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
+ ++ CNC+ S CLK YCECF G+ C C C C N
Sbjct: 51 RHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKN 88
>gi|50787704|emb|CAH04409.1| hypothetical protein [Euplotes vannus]
Length = 482
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 52/125 (41%), Positives = 65/125 (52%), Gaps = 7/125 (5%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
K C C+ S+C+KLYCECF + +C C C +C N E+E + L RNP AF
Sbjct: 357 KGCTCKRSKCIKLYCECFLSKGFCGPSCTCNDCENMEENEEEIVKIREKILSRNPKAFEK 416
Query: 213 KI----ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
K+ AS P G A + L +H KGC CK+S C YCEC Q C+E C C
Sbjct: 417 KLQIEEASQPLGMGPKAASAPSSNL--RHIKGCTCKRSKCSNNYCECHQHGAKCTELCSC 474
Query: 269 LDCKN 273
DC N
Sbjct: 475 TDCNN 479
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
+EA L+ KGC CK+S C+K YCECF + C +C C DC
Sbjct: 349 KEANLS--ECKGCTCKRSKCIKLYCECFLSKGFCGPSCTCNDC 389
>gi|431910169|gb|ELK13242.1| Tesmin [Pteropus alecto]
Length = 484
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 63/165 (38%), Positives = 90/165 (54%), Gaps = 34/165 (20%)
Query: 99 MPQWQARPQHVRMVNRVPHPVHKLPLPTLP-PGK-----QESPRSRPRANTDGKDGTPKK 152
+PQ+Q + +++ + VP P+ L LP P K Q+SP ++ D G K
Sbjct: 241 VPQYQDQNNYLQ--SDVPKPMTTLVGRFLPVPAKLNLITQQSPAEVTESSMDSDVGVRK- 297
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
YC+CFA+G C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 298 --------------YCDCFASGDVCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQP 343
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
KI P DA+ +HN+GC+CK+SGCLK YCEC++
Sbjct: 344 KIG--PGSLGDAK---------PRHNRGCNCKRSGCLKNYCECYE 377
>gi|293336269|ref|NP_001170528.1| uncharacterized protein LOC100384542 [Zea mays]
gi|238005872|gb|ACR33971.1| unknown [Zea mays]
Length = 272
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 49/80 (61%)
Query: 205 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
RNP AF PK+ + D + + + +A+H +GC+CKKS CLKKYCECFQ + CS
Sbjct: 14 RNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSV 73
Query: 265 NCRCLDCKNFEGSEERRALF 284
+CRC CKN G E ++
Sbjct: 74 SCRCEGCKNAFGKREGASVL 93
Score = 45.1 bits (105), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
+ ++ CNC+ S CLK YCECF G+ C C C C N
Sbjct: 45 RHKRGCNCKKSSCLKKYCECFQGGVGCSVSCRCEGCKN 82
>gi|427797635|gb|JAA64269.1| Putative dna-dependent, partial [Rhipicephalus pulchellus]
Length = 184
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 45/64 (70%), Gaps = 1/64 (1%)
Query: 236 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFM 294
+H KGC+CK+SGCLK YCEC++A ILCS C+C+ CKNFE S ER+ L D +
Sbjct: 3 RHTKGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKNFEDSSERKTLMQLADAAEVRVQ 62
Query: 295 QRAA 298
Q+AA
Sbjct: 63 QQAA 66
Score = 45.4 bits (106), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
K CNC+ S CLK YCEC+ A I C C CV C N
Sbjct: 6 KGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKN 40
>gi|159475976|ref|XP_001696090.1| hypothetical protein CHLREDRAFT_104804 [Chlamydomonas reinhardtii]
gi|158275261|gb|EDP01039.1| predicted protein [Chlamydomonas reinhardtii]
Length = 84
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 53/95 (55%), Gaps = 12/95 (12%)
Query: 163 CLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
CLKLYC+CFAAG +C C+C +C N E+ Q+ R+P AF KI +P
Sbjct: 1 CLKLYCDCFAAGQFCGACSCASCQNRPEYADRVQQRREDIAARDPQAFTRKIMDAP---- 56
Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
KH +GC+CK+S CLKKYCECFQ
Sbjct: 57 --------GGGGGKHKRGCNCKRSHCLKKYCECFQ 83
>gi|145523856|ref|XP_001447761.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415283|emb|CAK80364.1| unnamed protein product [Paramecium tetraurelia]
Length = 573
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/66 (53%), Positives = 48/66 (72%), Gaps = 8/66 (12%)
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
AF+PKI++S D++ D KHNKGC CKKSGC+KKYCECFQA + CS+ C+C
Sbjct: 433 AFQPKISTS-----DSKPDPLNF---GKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKC 484
Query: 269 LDCKNF 274
++C+N+
Sbjct: 485 IECRNY 490
Score = 44.7 bits (104), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
K K C C+ S C+K YCECF A + C D C C+ C N
Sbjct: 452 KHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECRN 489
>gi|299115549|emb|CBN75752.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 313
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 62/119 (52%), Gaps = 14/119 (11%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C C+ RCLKLYC+CFA +C C C C N + V Q A A LERNP+AF
Sbjct: 201 CTCKKCRCLKLYCDCFAQSRFCGSACLCKQCCNRL-GAVVVQAARRAVLERNPDAF---- 255
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
H A + R + + GC C +S CLKKYC CFQ ++LC C+C C N
Sbjct: 256 ----HRAHSL--ETRPGSVTS--GAGCGCTRSHCLKKYCVCFQKSLLCGAGCKCEGCLN 306
>gi|413949791|gb|AFW82440.1| hypothetical protein ZEAMMB73_340549 [Zea mays]
Length = 298
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 49/132 (37%), Positives = 61/132 (46%), Gaps = 9/132 (6%)
Query: 163 CLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 221
C+ YC CFA G YC D C C C N Q L R KI P
Sbjct: 13 CICSYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKR-MSMKINRRPEAN 71
Query: 222 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG----- 276
+ EDA + + +GC+CKKS CLKKYC+C+Q CS CRC DC+N G
Sbjct: 72 AEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNDGI 131
Query: 277 --SEERRALFHG 286
+ +R L+ G
Sbjct: 132 IADDSKRYLYTG 143
Score = 38.1 bits (87), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 40/96 (41%), Gaps = 16/96 (16%)
Query: 138 RPRANTD-------GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN-- 187
RP AN + TP + + CNC+ S CLK YC+C+ G C C C +C N
Sbjct: 67 RPEANAEPMEDAHHSSSSTPPR-RGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPF 125
Query: 188 -----NVEHEVARQEAVGATLERNPNAFRPKIASSP 218
+ + R GA L+ + + SP
Sbjct: 126 GKNDGIIADDSKRYLYTGADLDHSEGEHDFVVERSP 161
>gi|194703140|gb|ACF85654.1| unknown [Zea mays]
Length = 204
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 56/141 (39%), Positives = 71/141 (50%), Gaps = 17/141 (12%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
S R R R DG DG K C+C+ SRCLKLYC C+A+G +C + C C CHN
Sbjct: 63 SDRKRRRKADDG-DGC----KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQ- 116
Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
GA +P + Q+ E + + K C CKKSGCLKKY
Sbjct: 117 -------GAPWTAPVLPLKP--VQTLEAGQEIVEQLIRSPMDLIRRK-CTCKKSGCLKKY 166
Query: 253 CECFQANILCSENCRCLDCKN 273
C+C+Q CS NC+C DC+N
Sbjct: 167 CDCYQGGAGCSINCKCDDCRN 187
>gi|384253407|gb|EIE26882.1| hypothetical protein COCSUDRAFT_9768, partial [Coccomyxa
subellipsoidea C-169]
Length = 81
Score = 77.8 bits (190), Expect = 1e-11, Method: Composition-based stats.
Identities = 40/95 (42%), Positives = 53/95 (55%), Gaps = 17/95 (17%)
Query: 163 CLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 221
CLKLYC+CFA G++C D C C +C N + A +A + ++P AF+PK+ + G
Sbjct: 1 CLKLYCDCFATGLFCNDACMCKDCENRTDTLNAVFKARKFIMVKDPTAFKPKVLDASGG- 59
Query: 222 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
H KGC C+KS CLKKYCECF
Sbjct: 60 ---------------HVKGCACRKSRCLKKYCECF 79
Score = 38.9 bits (89), Expect = 5.3, Method: Composition-based stats.
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 248 CLKKYCECFQANILCSENCRCLDCKN 273
CLK YC+CF + C++ C C DC+N
Sbjct: 1 CLKLYCDCFATGLFCNDACMCKDCEN 26
>gi|145546570|ref|XP_001458968.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426790|emb|CAK91571.1| unnamed protein product [Paramecium tetraurelia]
Length = 272
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 59/123 (47%), Gaps = 16/123 (13%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC S CLK YC CF +G C D C C +C N E R EA+ ++
Sbjct: 151 CNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTEFAEERDEAINHVYKK--------- 201
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
+D + E L+ + + GC CK +GC KKYCECF+ +C E C C DC N
Sbjct: 202 -----CHRDKKVPVNEL-LSLQMSYGCKCKSTGCQKKYCECFKRGQICGEQCACEDCLNI 255
Query: 275 EGS 277
S
Sbjct: 256 PFS 258
>gi|145505956|ref|XP_001438944.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406117|emb|CAK71547.1| unnamed protein product [Paramecium tetraurelia]
Length = 272
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 62/129 (48%), Gaps = 16/129 (12%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC S CLK YC CF +G C D C C +C N + R EA+ ++
Sbjct: 151 CNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTDFYEERDEAINHVYKK--------- 201
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
+D + E L+ + + GC CK +GC KKYCECF+ +C E C C C N
Sbjct: 202 -----CHRDKKVPVNEL-LSLQMSYGCKCKTTGCQKKYCECFKRGQICGEQCSCEGCLNI 255
Query: 275 EGSEERRAL 283
++ ++ L
Sbjct: 256 PSAQNQKEL 264
>gi|224014361|ref|XP_002296843.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
CCMP1335]
gi|220968480|gb|EED86827.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
CCMP1335]
Length = 141
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 64/118 (54%), Gaps = 12/118 (10%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C C SRC+KLYC+CF G C+ CNC +C N E R+ + L+ NP AF
Sbjct: 35 CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREEREWKMKEVLKLNPKAF---- 90
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
++D+ + + Q ++ N GC C S CLKKYC CF A+ C++ C C DC+
Sbjct: 91 ------SEDSDKFNTKRQRMSRGN-GCACPSSHCLKKYCSCFGADAGCTDKCSCNDCE 141
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 280
C C KS C+K YC+CF LC+ C C DCKN E EER
Sbjct: 35 CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREER 75
>gi|115489472|ref|NP_001067223.1| Os12g0605300 [Oryza sativa Japonica Group]
gi|77557030|gb|ABA99826.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|113649730|dbj|BAF30242.1| Os12g0605300 [Oryza sativa Japonica Group]
Length = 407
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 55/111 (49%), Gaps = 12/111 (10%)
Query: 167 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
YC CFA+G +C + C C C+N H V R A K + QD+
Sbjct: 114 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTP----------ALPLKAVQTLETGQDSV 163
Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
E + + K C CKKSGCLKKYC+C+Q CS NC+C DCKN G
Sbjct: 164 EQLIRSPMDFGRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKNPYG 213
>gi|125537321|gb|EAY83809.1| hypothetical protein OsI_39026 [Oryza sativa Indica Group]
gi|125579998|gb|EAZ21144.1| hypothetical protein OsJ_36791 [Oryza sativa Japonica Group]
Length = 421
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 55/111 (49%), Gaps = 12/111 (10%)
Query: 167 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
YC CFA+G +C + C C C+N H V R A K + QD+
Sbjct: 128 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTP----------ALPLKAVQTLETGQDSV 177
Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
E + + K C CKKSGCLKKYC+C+Q CS NC+C DCKN G
Sbjct: 178 EQLIRSPMDFGRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKNPYG 227
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 43/93 (46%), Gaps = 11/93 (11%)
Query: 106 PQHVRMVNRVPHPVHKLPLP---TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSR 162
P + + ++ VP LPL TL G Q+S R+ D +++C C+ S
Sbjct: 146 PCYNKSIHGVPRSTPALPLKAVQTLETG-QDSVEQLIRSPMDFG------RRKCTCKKSG 198
Query: 163 CLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVA 194
CLK YC+C+ G C C C +C N +V
Sbjct: 199 CLKKYCDCYQGGAGCSINCKCDDCKNPYGRKVG 231
>gi|449016427|dbj|BAM79829.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 932
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 61/117 (52%), Gaps = 15/117 (12%)
Query: 164 LKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
LK YC+CFAAG +C C C C N+ +++ + A + +R+ S+ A
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQEQVERARQSASQRS---------STSGEAL 869
Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
+ A E + ++GC CK++GCLK+YCECFQA C+ C C C N G E
Sbjct: 870 NGEPSASE-----RSSRGCRCKRTGCLKRYCECFQAGRECTAQCACEGCLNCRGLSE 921
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEER 280
LKKYC+CF A C C C C N + ++E+
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQEQ 850
>gi|432959452|ref|XP_004086297.1| PREDICTED: uncharacterized protein LOC101167937 [Oryzias latipes]
Length = 377
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 38/49 (77%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
+ +K C+C SRCLKLYC+CF++G+ C CNC+NCHNN E+E R EA+
Sbjct: 325 RPRKTCHCTKSRCLKLYCDCFSSGMMCSSCNCINCHNNAENESQRHEAI 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 30/43 (69%), Gaps = 1/43 (2%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
K CHC KS CLK YC+CF + ++CS +C C++C N +E +R
Sbjct: 328 KTCHCTKSRCLKLYCDCFSSGMMCS-SCNCINCHNNAENESQR 369
>gi|195058418|ref|XP_001995449.1| GH22623 [Drosophila grimshawi]
gi|193899655|gb|EDV98521.1| GH22623 [Drosophila grimshawi]
Length = 916
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 59/110 (53%), Gaps = 6/110 (5%)
Query: 90 AHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGT 149
+ +SP Q +P Q+ N++ P L L P +P A+ D G+
Sbjct: 697 SQTKSPQQKLPILQSSSTQD---NKLCVPAKPL-FDILKPAPMPTPSPAASASLDVLGGS 752
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
++K CNC S+CLKLYC+CFA G +C C C +C NN+++EV R+ A+
Sbjct: 753 --RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAI 800
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 8/76 (10%)
Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
+ +P +P A A D L K C+C KS CLK YC+CF C ++C
Sbjct: 728 DILKPAPMPTPSPAASASLDV----LGGSRRKHCNCSKSQCLKLYCDCFANGEFC-QDCT 782
Query: 268 CLDCKN---FEGSEER 280
C DC N +E ER
Sbjct: 783 CKDCFNNLDYEVKRER 798
>gi|145522672|ref|XP_001447180.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414680|emb|CAK79783.1| unnamed protein product [Paramecium tetraurelia]
Length = 263
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 61/122 (50%), Gaps = 17/122 (13%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCV-NCHNNVEHEVARQEAVGATLERNPNA 209
+Q+ CNC+NS CLK YC CF +G C C C +C NN +H+ R A+ + +
Sbjct: 144 QQQPCNCKNSGCLKRYCRCFHSGRMCLKECQCSEDCQNNEQHQEQRNNAI---IHVDQKC 200
Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 269
+R R R+A GC C KS C K+YCEC+ N C++ C+C
Sbjct: 201 YR------------NRRMPRDALFKLDVIYGCSCTKSKCRKRYCECYLRNQKCTDKCKCF 248
Query: 270 DC 271
DC
Sbjct: 249 DC 250
>gi|125580000|gb|EAZ21146.1| hypothetical protein OsJ_36793 [Oryza sativa Japonica Group]
Length = 710
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 86/197 (43%), Gaps = 46/197 (23%)
Query: 92 MRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPK 151
+R+P A+P H+ + VP + + Q SP+ + R D DGT
Sbjct: 383 LRTPPCALPGIGL---HLNALATVPKDKMTAVVAYVGESSQGSPKKK-RHKFDNGDGT-- 436
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN---NVEHEVARQEAVGATLERNP 207
K+C+C+ S+CLKLYCECF AG++C + C+C C N N+E ++ +E + + RNP
Sbjct: 437 SCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNP 493
Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
AF PK+ + G + + CS +CR
Sbjct: 494 LAFAPKVIRTEPGQE---------------------------------LAGGVGCSVSCR 520
Query: 268 CLDCKNFEGSEERRALF 284
C CKN G E AL
Sbjct: 521 CEGCKNAFGRREGVALL 537
>gi|145548172|ref|XP_001459767.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427593|emb|CAK92370.1| unnamed protein product [Paramecium tetraurelia]
Length = 303
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 57/119 (47%), Gaps = 18/119 (15%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC S CLKLYC+CF C + C C++C N H RQ A+ + K+
Sbjct: 183 CNCSKSNCLKLYCQCFHQNKQCTELCKCLDCKNCDYHTQVRQTALE----------KIKM 232
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
S + D + GC C+KS C K YCECF+ N C+ +CRC DC N
Sbjct: 233 KSQRQKHDNDLFDLSKVW-------GCKCQKSQCQKNYCECFRRNQKCNSSCRCKDCAN 284
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 44/77 (57%), Gaps = 6/77 (7%)
Query: 201 ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG----CHCKKSGCLKKYCECF 256
+TLE+ R + ++ Q E +E+ L ++ N+ C+C KS CLK YC+CF
Sbjct: 141 STLEQRSRRIRKQNINA--NLQRNYEYNQESDLNSQSNEHTMNPCNCSKSNCLKLYCQCF 198
Query: 257 QANILCSENCRCLDCKN 273
N C+E C+CLDCKN
Sbjct: 199 HQNKQCTELCKCLDCKN 215
>gi|300176906|emb|CBK25475.2| unnamed protein product [Blastocystis hominis]
Length = 123
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 72/153 (47%), Gaps = 40/153 (26%)
Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGC 180
LP PTLPP P R C C + CLKLYCEC AG C D C
Sbjct: 3 LPEPTLPP-----PSIR-----------------CTCTKTHCLKLYCECLKAGRTCTDLC 40
Query: 181 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 240
CV+C N + R++A+ + NP AF S+ G +D+R
Sbjct: 41 ACVDCLNRAKFP-ERKKALKHIKKTNPKAFLR--VSNLKGHEDSR--------------C 83
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
C C+KS C KKYC C+Q+ C+ C+C+DCKN
Sbjct: 84 CACQKSLCNKKYCSCYQSGRYCTPKCKCVDCKN 116
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER-RALFH-GDHNGIAFMQ 295
C C K+ CLK YCEC +A C++ C C+DC N ER +AL H N AF++
Sbjct: 15 CTCTKTHCLKLYCECLKAGRTCTDLCACVDCLNRAKFPERKKALKHIKKTNPKAFLR 71
>gi|145484801|ref|XP_001428410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395495|emb|CAK61012.1| unnamed protein product [Paramecium tetraurelia]
Length = 240
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 83/202 (41%), Gaps = 46/202 (22%)
Query: 83 SSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRAN 142
SS P + H + S M Q +++ + +P + P P+ G P P
Sbjct: 22 SSSPDMDEH-KDSSLDMSQL------IKLQHTIPDNILISPEPSKKKGSILLPSDDP--- 71
Query: 143 TDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGA 201
CNC S C K+YCEC A G C C C NCHN +++ V
Sbjct: 72 -------------CNCSRSGCRKMYCECLAKGRLCSSFCRCQNCHNKTSNKLVLN--VIE 116
Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
L++N R+ + + GC C+KS CLKKYCECF +
Sbjct: 117 ELDQNR--------------------NRKRFRSRRFKDGCTCQKSMCLKKYCECFNSGKS 156
Query: 262 CSENCRCLDCKNFEGSEERRAL 283
C C+C++C+N+ E + +
Sbjct: 157 CGSGCKCVNCENYVLDEVEKII 178
>gi|340507965|gb|EGR33791.1| hypothetical protein IMG5_037680 [Ichthyophthirius multifiliis]
Length = 96
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 41/81 (50%), Gaps = 9/81 (11%)
Query: 198 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
AV RNPNAF K D H KGC CKKS CLKKYCECF
Sbjct: 2 AVNDAKNRNPNAFNIKFQIIEEKQDDK---------IVIHKKGCTCKKSNCLKKYCECFN 52
Query: 258 ANILCSENCRCLDCKNFEGSE 278
A +LC++NC C CKN + E
Sbjct: 53 AGVLCNQNCVCDSCKNMQKEE 73
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%), Gaps = 1/36 (2%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
+K C C+ S CLK YCECF AG+ C+ C C +C N
Sbjct: 33 KKGCTCKKSNCLKKYCECFNAGVLCNQNCVCDSCKN 68
>gi|145488037|ref|XP_001430023.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397118|emb|CAK62625.1| unnamed protein product [Paramecium tetraurelia]
Length = 255
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 54/119 (45%), Gaps = 18/119 (15%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C C S CLKLYC CF C D C C C N EH R A+ K+
Sbjct: 136 CRCTKSNCLKLYCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNAL------------EKV 183
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
H ++ + +++ GC C+KS C K YCECF N CS +CRC DC N
Sbjct: 184 KQKLHRQKNDDDLFDRSKIW-----GCKCQKSQCQKNYCECFVRNQKCSSSCRCKDCAN 237
>gi|145496915|ref|XP_001434447.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401573|emb|CAK67050.1| unnamed protein product [Paramecium tetraurelia]
Length = 321
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 53/119 (44%), Gaps = 18/119 (15%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C C S CLKLYC+CF C + C C C N +H RQ A+ + KI
Sbjct: 201 CKCTKSNCLKLYCQCFHQNRQCTELCKCFACKNCDDHLPVRQTALE----------KIKI 250
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
S D D GC C+KS C K YCEC+ N CS +CRC DC N
Sbjct: 251 KSQRQKHDDDLFDRSRVW-------GCKCQKSQCQKNYCECYIRNQKCSSSCRCKDCAN 302
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 42/86 (48%), Gaps = 2/86 (2%)
Query: 190 EHEVARQEAVGATLERNPNAFRP-KIASSPHGAQD-AREDAREAQLAAKHNKGCHCKKSG 247
+H+ + TLE+ R K+ S + QD +E +Q + C C KS
Sbjct: 148 DHQKKADDPCSHTLEQRSRRIRKNKLNPSFNTTQDPNQESDLHSQQQEPNTTPCKCTKSN 207
Query: 248 CLKKYCECFQANILCSENCRCLDCKN 273
CLK YC+CF N C+E C+C CKN
Sbjct: 208 CLKLYCQCFHQNRQCTELCKCFACKN 233
>gi|145477875|ref|XP_001424960.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392027|emb|CAK57562.1| unnamed protein product [Paramecium tetraurelia]
Length = 240
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 53/119 (44%), Gaps = 23/119 (19%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
CNC S C K+YCEC A G C C C NC N + ++ + E + N KI
Sbjct: 72 CNCSKSECRKMYCECLAKGRLCSSACRCENCQN----RTSNKKVLHVIKELDLNKNLKKI 127
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
S + GC CKKS CLKKYCECF + C C C +C+N
Sbjct: 128 KS------------------KRFKDGCTCKKSMCLKKYCECFHSGKSCGYGCNCENCQN 168
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 3/63 (4%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C C+ S CLK YCECF +G C GCNC NC N + E Q+ + ++ + F K
Sbjct: 136 CTCKKSMCLKKYCECFHSGKSCGYGCNCENCQNQLLDEA--QKNICTPNQKKTSKFSSKF 193
Query: 215 ASS 217
S+
Sbjct: 194 ESA 196
Score = 41.6 bits (96), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 26/45 (57%)
Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
R+ L ++ C+C KS C K YCEC LCS CRC +C+N
Sbjct: 60 RDDSLLLSSDEPCNCSKSECRKMYCECLAKGRLCSSACRCENCQN 104
>gi|145520829|ref|XP_001446270.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413747|emb|CAK78873.1| unnamed protein product [Paramecium tetraurelia]
Length = 330
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 58/127 (45%), Gaps = 30/127 (23%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPN 208
P ++ C C+ ++CLK YC+CF G +C + C C++C NN E E + + RN N
Sbjct: 210 PLSKQGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNN-EQNKELIEQTRSQINRNLN 268
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
C CKKS CLKKYCEC+ A C +C+C
Sbjct: 269 ----------------------------DEVKCFCKKSKCLKKYCECYNAGKKCGVDCKC 300
Query: 269 LDCKNFE 275
C N+E
Sbjct: 301 EGCMNYE 307
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 28/41 (68%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
+GC CKK+ CLK YC+CFQ C E C CLDC N E ++E
Sbjct: 214 QGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQNKE 254
>gi|224070845|ref|XP_002303261.1| predicted protein [Populus trichocarpa]
gi|222840693|gb|EEE78240.1| predicted protein [Populus trichocarpa]
Length = 163
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/62 (54%), Positives = 40/62 (64%), Gaps = 1/62 (1%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
++CNC+ SRCLKLYCECFAAG YC D C+C NC N E+ A RNP AF P
Sbjct: 81 RRCNCKRSRCLKLYCECFAAGNYCLDTCSCENCFNKSEYADAVLHMRQQIEARNPVAFAP 140
Query: 213 KI 214
K+
Sbjct: 141 KV 142
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
C+CK+S CLK YCECF A C + C C +C F SE A+ H
Sbjct: 83 CNCKRSRCLKLYCECFAAGNYCLDTCSCENC--FNKSEYADAVLH 125
>gi|312380815|gb|EFR26710.1| hypothetical protein AND_07034 [Anopheles darlingi]
Length = 524
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 40/59 (67%), Gaps = 3/59 (5%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K++ CNC S+CLKLYC+CFA G +C CNC +CHN + + ARQ+A+ RN + F
Sbjct: 325 KKRPCNCTKSQCLKLYCDCFANGEFCYNCNCKDCHNTQKDDYARQKAIRC---RNTDQF 380
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C+C KS CLK YC+CF C NC C DC N + + R
Sbjct: 329 CNCTKSQCLKLYCDCFANGEFCY-NCNCKDCHNTQKDDYAR 368
>gi|145494055|ref|XP_001433022.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400138|emb|CAK65625.1| unnamed protein product [Paramecium tetraurelia]
Length = 188
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 61/149 (40%), Gaps = 40/149 (26%)
Query: 155 QCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
+C C S+C K YCECFAAG C+ CNC C+N + + ++ G
Sbjct: 77 KCKCMKSKCQKSYCECFAAGKACNMDCNCQGCNNTICSQSIKKVQTG------------- 123
Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
GC+CKK+GCLKKYCEC+ C+ C C+DC N
Sbjct: 124 --------------------------GCNCKKTGCLKKYCECYLKKQRCTFLCNCIDCCN 157
Query: 274 FEGSEERRALFHGDHNGIAFMQRAANAAI 302
E SE + I Q N I
Sbjct: 158 QEDSESEKENEQLQQQSIDCGQTITNQII 186
>gi|145515421|ref|XP_001443610.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410999|emb|CAK76213.1| unnamed protein product [Paramecium tetraurelia]
Length = 275
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 66/150 (44%), Gaps = 16/150 (10%)
Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN 184
T+ K P+++ + T + + CNC S CLK YC CF +G C D C C +
Sbjct: 123 TIQQQKHSQPKNQIKIKTITNLYKASEIQPCNCSQSSCLKRYCTCFHSGRMCMDECQCKD 182
Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 244
C N R+ A+ ++ +D A E L+ + + GC CK
Sbjct: 183 CKNCDLFSEEREAAINYVFKK--------------CNRDKNVPANEL-LSLQISYGCKCK 227
Query: 245 KSGCLKKYCECFQANILCSENCRCLDCKNF 274
+GC KKYCEC + C + C C DC N
Sbjct: 228 STGCQKKYCECLKRGQACGDLCSCEDCLNI 257
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 30/43 (69%), Gaps = 1/43 (2%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE-GSEERRA 282
C+C +S CLK+YC CF + +C + C+C DCKN + SEER A
Sbjct: 153 CNCSQSSCLKRYCTCFHSGRMCMDECQCKDCKNCDLFSEEREA 195
>gi|145521975|ref|XP_001446837.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414326|emb|CAK79440.1| unnamed protein product [Paramecium tetraurelia]
Length = 285
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 23/127 (18%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN-CHNNVEHEVARQEAVGATLE---RN 206
+Q+ C CRNS CLK YC CF +G C C CV C NN +H R +A+ E RN
Sbjct: 156 EQQPCFCRNSGCLKRYCRCFHSGRMCLKDCQCVEGCLNNHDHLEQRNDAIKHVNEKCHRN 215
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
N ++A K GC+CKK+ C YCECF C+ +C
Sbjct: 216 KNV------------------PKDALFKLKDCFGCNCKKTRCQTGYCECFLRKSKCTMDC 257
Query: 267 RCLDCKN 273
+C +C+N
Sbjct: 258 QCDNCEN 264
>gi|145476147|ref|XP_001424096.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391159|emb|CAK56698.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 63/133 (47%), Gaps = 20/133 (15%)
Query: 146 KDGTPK----KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVG 200
K+ +PK K + C+C + CLK+YC CF G C + C C +C N E+ R EAV
Sbjct: 68 KEESPKLESNKWQPCSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQRHEAVE 127
Query: 201 ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANI 260
I H + +E K GC+CKK+ CLK+YCECF
Sbjct: 128 Y------------IKKKAHRNKKV---TQEKLFETKEVWGCNCKKTRCLKRYCECFIRQK 172
Query: 261 LCSENCRCLDCKN 273
C+ +C C C+N
Sbjct: 173 TCTVDCNCNYCEN 185
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 30/58 (51%), Gaps = 6/58 (10%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAA 298
C C K+ CLK YC CF C E C+C DCKN + +R H + ++++ A
Sbjct: 82 CSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQR------HEAVEYIKKKA 133
>gi|145534021|ref|XP_001452755.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420454|emb|CAK85358.1| unnamed protein product [Paramecium tetraurelia]
Length = 326
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 44/66 (66%), Gaps = 7/66 (10%)
Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 269
FRPK+ S + ED ++ + +H KGC+CKKS CLKKYCEC+Q + CSE C+C
Sbjct: 244 FRPKVESKSNS-----EDEQDHK--PRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCD 296
Query: 270 DCKNFE 275
+CKN E
Sbjct: 297 ECKNCE 302
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 135 PRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
P+ ++N+ D +D P+ K CNC+ S CLK YCEC+ G+ C + C C C N
Sbjct: 246 PKVESKSNSEDEQDHKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDECKN 300
>gi|145524257|ref|XP_001447956.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415489|emb|CAK80559.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 54/119 (45%), Gaps = 18/119 (15%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C C S CLKLYC CF I C + C C +CHN ++ R +A+ + K+
Sbjct: 225 CKCSKSHCLKLYCACFHRNIECSELCQCHDCHNKSDYSQIRTQALE----------KVKV 274
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
D D GC C+KS C K YCECF N CS C+C +C+N
Sbjct: 275 KQQRRKHDDDLFDKDTVW-------GCQCRKSQCKKNYCECFIRNKKCSSLCKCNNCEN 326
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 31/57 (54%), Gaps = 3/57 (5%)
Query: 217 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+ H + +D E L +N C C KS CLK YC CF NI CSE C+C DC N
Sbjct: 204 TTHYREFFDKDLDEHNL---NNSPCKCSKSHCLKLYCACFHRNIECSELCQCHDCHN 257
>gi|219121911|ref|XP_002181300.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407286|gb|EEC47223.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 381
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 50/105 (47%), Gaps = 24/105 (22%)
Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYCDG--CNCVNCHNNVEHEVA---RQEAVGATL 203
PK K C C+N+ CLKLYC CF G +CD C C++C+N E RQEAV L
Sbjct: 272 VPKTVKPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNLREFNETGGKRQEAVSEIL 331
Query: 204 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
R R DA E++ K +GC CKK+ C
Sbjct: 332 LR-------------------RIDAFESRPKKKTGEGCACKKNRC 357
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 236 KHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNF 274
K K C CK + CLK YC CFQ C + C+C+DC N
Sbjct: 274 KTVKPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNL 313
>gi|145491063|ref|XP_001431531.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398636|emb|CAK64133.1| unnamed protein product [Paramecium tetraurelia]
Length = 246
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 60/125 (48%), Gaps = 17/125 (13%)
Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVN-CHNNVEHEVARQEAVGATLERNPN 208
K Q+ CNC++S C+K YC CF +G C C C + C N + R EA+ N
Sbjct: 131 KCQQPCNCKSSGCIKRYCRCFHSGKTCLPECQCQDGCLNKEHNNCERSEAIKHV---NEK 187
Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
+R R+ +EA GC C KS C K+YCECF N C+E C+C
Sbjct: 188 CYR------------NRKIPKEALFKLDVIYGCSCTKSKCRKRYCECFLRNQNCTEKCKC 235
Query: 269 LDCKN 273
DC N
Sbjct: 236 FDCCN 240
>gi|403367645|gb|EJY83644.1| hypothetical protein OXYTRI_18624 [Oxytricha trifallax]
gi|403370830|gb|EJY85281.1| hypothetical protein OXYTRI_16860 [Oxytricha trifallax]
Length = 590
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 59/125 (47%), Gaps = 32/125 (25%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD--GCNCVNCHNNVEHEVARQEAVGATLERNPNA 209
K+ +C+C+ + C + YC+C G CD C+C +C N +E++ R E
Sbjct: 466 KKIKCSCQKNSCTQAYCDCLKNGQACDPSSCSCASCLNTIENQNLRLEI----------- 514
Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRC 268
++ ++ Q AK +GC CK S C K+YCECFQ C C+C
Sbjct: 515 ----------------QEKKQKQGQAK--EGCSCKNSQCQKRYCECFQNGRQCDPSKCKC 556
Query: 269 LDCKN 273
DCKN
Sbjct: 557 KDCKN 561
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 30/49 (61%), Gaps = 8/49 (16%)
Query: 151 KKQKQ------CNCRNSRCLKLYCECFAAGIYCD--GCNCVNCHNNVEH 191
KKQKQ C+C+NS+C K YCECF G CD C C +C NN+ H
Sbjct: 517 KKQKQGQAKEGCSCKNSQCQKRYCECFQNGRQCDPSKCKCKDCKNNLPH 565
>gi|325187539|emb|CCA22077.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 319
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 11/81 (13%)
Query: 195 RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCE 254
R +A+ + LERNP AF+PK+ ++ + H +GC+C+KS CLK YCE
Sbjct: 20 RSQAIASILERNPLAFQPKVVANGSETNNLH-----------HLRGCNCRKSNCLKNYCE 68
Query: 255 CFQANILCSENCRCLDCKNFE 275
C QA + C+ C C C N E
Sbjct: 69 CHQARVSCTNRCACHKCCNTE 89
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAV 199
+ CNCR S CLK YCEC A + C + C C C N E A++ +
Sbjct: 53 RGCNCRKSNCLKNYCECHQARVSCTNRCACHKCCNTEEFHSAKKMLI 99
>gi|194696554|gb|ACF82361.1| unknown [Zea mays]
Length = 372
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 65/136 (47%), Gaps = 19/136 (13%)
Query: 160 NSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 218
NS + YC C+A+G +C + C C CHN GA P + P
Sbjct: 62 NSDRKRSYCVCYASGSHCTELCGCEPCHNKPPQ--------GAPWTA------PVLPLKP 107
Query: 219 HGAQDAREDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
+A ++ E + + + + C CKKSGCLKKYC+C+Q CS NC+C DC+N
Sbjct: 108 VQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRNPF 167
Query: 276 GSEERRALFHGDHNGI 291
G + + G +G+
Sbjct: 168 G-RKVGVILDGKSSGL 182
>gi|145541620|ref|XP_001456498.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424310|emb|CAK89101.1| unnamed protein product [Paramecium tetraurelia]
Length = 262
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/175 (27%), Positives = 73/175 (41%), Gaps = 38/175 (21%)
Query: 103 QARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQ--CNCRN 160
Q + + +VN++ + P L + + +P+ D D K+K+ C+C
Sbjct: 77 QQDEEDISLVNQM-YESQVFSQPILDISQISQIQLQPKTKLDLTDQPKYKKKKVGCSCTK 135
Query: 161 SRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
+C YC C G C C CV+C NN + +++Q V
Sbjct: 136 RKCSSKYCFCAKVGQSCSNLCKCVDCSNN-KSFMSQQIQV-------------------- 174
Query: 220 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
A+ KGC CKK+ CLK YC+CF + CS NC+C+ CKN
Sbjct: 175 -------------TNAQEQKGCKCKKNFCLKGYCDCFAKGMQCSSNCKCISCKNM 216
>gi|145512403|ref|XP_001442118.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409390|emb|CAK74721.1| unnamed protein product [Paramecium tetraurelia]
Length = 278
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 59/127 (46%), Gaps = 23/127 (18%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN-CHNNVEHEVARQEAVGATLE---RN 206
+Q+ C CRNS CLK YC CF +G C C CV C NN +H R A+ E RN
Sbjct: 149 EQQPCFCRNSGCLKRYCRCFHSGRMCLKECQCVEGCLNNEDHLEERNMAIKHVNEKCHRN 208
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
N PK +A K GC CKKS C YCECF C+ +C
Sbjct: 209 KNI--PK----------------DALFKLKDCFGCSCKKSRCQTGYCECFLRKSKCTIDC 250
Query: 267 RCLDCKN 273
C +C+N
Sbjct: 251 ECQNCEN 257
>gi|77415370|gb|AAI06127.1| Lin54 protein [Mus musculus]
Length = 523
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 37/64 (57%), Positives = 48/64 (75%)
Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
+ T + +K CNC S CLKLYC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RN
Sbjct: 451 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 510
Query: 207 PNAF 210
P AF
Sbjct: 511 PEAF 514
>gi|298711558|emb|CBJ32620.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 78
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 30/58 (51%), Positives = 38/58 (65%), Gaps = 1/58 (1%)
Query: 163 CLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
CLKLYC+CFA + C D C C NC+N+ EH V R+ AV L R+P+AF K + PH
Sbjct: 2 CLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRAAVRELLCRSPHAFDAKFKTEPH 59
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%), Gaps = 1/36 (2%)
Query: 248 CLKKYCECFQANILCSENCRCLDCKN-FEGSEERRA 282
CLK YC+CF ++CS+ CRC +C N E ERRA
Sbjct: 2 CLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRA 37
>gi|145490209|ref|XP_001431105.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398208|emb|CAK63707.1| unnamed protein product [Paramecium tetraurelia]
Length = 223
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 55/119 (46%), Gaps = 16/119 (13%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C+C+ + CLK+YC CF G +C C C C N E ++ R +AV ++
Sbjct: 89 CSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTEEFKMKRMQAVDYVKKK--------- 139
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
A ++ +E GC+C K+ C+KKYCECF C+ C C C N
Sbjct: 140 ------AHRNKKVPKEKIFETVEIWGCNCSKTRCVKKYCECFIRGKKCTVECNCDHCDN 192
Score = 41.2 bits (95), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 26/41 (63%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C C+K+ CLK YC CF C ++CRC +C+N E + +R
Sbjct: 89 CSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTEEFKMKR 129
>gi|145517899|ref|XP_001444827.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412260|emb|CAK77430.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 16/119 (13%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C+C + CLK+YC CF G C + C C +C N H R EAV ++
Sbjct: 82 CSCTKTYCLKMYCSCFHNGQICGESCKCEDCKNTNMHFHQRDEAVEYIKKK--------- 132
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
A ++ +E K GC+CKK+ CLK+YCEC+ C+ C C C+N
Sbjct: 133 ------AHRNKKVPQEKLFETKDIWGCNCKKTRCLKRYCECYIRQKTCTVECNCNHCEN 185
>gi|145506933|ref|XP_001439427.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406611|emb|CAK72030.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 16/119 (13%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C+C + CLK+YC CF G C + C C +C N + R +AV ++ A R K
Sbjct: 82 CSCTKTNCLKMYCSCFHNGQTCVESCKCEDCKNTNNNVPQRDKAVEYIKKK---AHRNKK 138
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
S +E K GC+CKK+ CLK+YCEC+ C+ C C C+N
Sbjct: 139 VS------------QETLFETKDIWGCNCKKTRCLKRYCECYIRQKACTVECNCTHCEN 185
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN----NVEHEVARQ 196
CNC+ +RCLK YCEC+ C CNC +C N ++ +E+ RQ
Sbjct: 153 CNCKKTRCLKRYCECYIRQKACTVECNCTHCENGKDEDLYNEIRRQ 198
>gi|145477681|ref|XP_001424863.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391930|emb|CAK57465.1| unnamed protein product [Paramecium tetraurelia]
Length = 259
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 70/149 (46%), Gaps = 19/149 (12%)
Query: 128 PPGKQESPRSRPRANTDGKDG--TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN 184
P Q + R +AN + D + K C+C + CL+LYC CF C C C
Sbjct: 113 PIVNQRTRRKASKANYNVDDSPTMDSEIKPCHCSKTNCLQLYCSCFHNRRQCTQECKCSE 172
Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 244
C N+ +H + V +E+ + K + H D+ D R+ GC CK
Sbjct: 173 CFNDGQHS----DEVLKAIEQ----IKIKEQRASHHDLDSF-DTRQVW-------GCKCK 216
Query: 245 KSGCLKKYCECFQANILCSENCRCLDCKN 273
K+ C+K YCECF + CS +C+C +C+N
Sbjct: 217 KTQCIKGYCECFIRHKKCSSHCQCSNCQN 245
>gi|291001253|ref|XP_002683193.1| predicted protein [Naegleria gruberi]
gi|284096822|gb|EFC50449.1| predicted protein [Naegleria gruberi]
Length = 459
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 33/46 (71%)
Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
++ + + KGC+CKK+GCLK+YCECFQ N C+ CRC CKN+
Sbjct: 186 QQPKFVDNNAKGCNCKKTGCLKRYCECFQNNKRCTIKCRCQGCKNY 231
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
K CNC+ + CLK YCECF C C C C N
Sbjct: 196 KGCNCKKTGCLKRYCECFQNNKRCTIKCRCQGCKN 230
>gi|145524820|ref|XP_001448232.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415776|emb|CAK80835.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 16/119 (13%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C+C + CLK+YC CF G C + C C +C N E+ R AV ++ A R K
Sbjct: 82 CSCSKTNCLKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQRHNAVEYIRKK---AHRNKK 138
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
+ +E K GC+CKK+ C K+YCECF C+ +C C C+N
Sbjct: 139 VT------------QEKLFETKDVWGCNCKKTRCQKRYCECFIRQKTCTVDCNCNHCEN 185
Score = 45.8 bits (107), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 7/77 (9%)
Query: 223 DAREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
D E+ E+Q L + C C K+ CLK YC CF C E C+C DC+N + +R
Sbjct: 63 DLDENKEESQILENTKWQPCSCSKTNCLKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQR 122
Query: 282 ALFHGDHNGIAFMQRAA 298
HN + ++++ A
Sbjct: 123 ------HNAVEYIRKKA 133
>gi|170073469|ref|XP_001870379.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167870088|gb|EDS33471.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 103
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/71 (47%), Positives = 45/71 (63%), Gaps = 12/71 (16%)
Query: 199 VGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
+ A L ++P +PKI GA A EDA +H KGC+CK+SGCLK YCEC++A
Sbjct: 35 ITAGLGQSP--VQPKI-----GATSAAEDATR-----RHTKGCNCKRSGCLKNYCECYEA 82
Query: 259 NILCSENCRCL 269
I CS NC+C+
Sbjct: 83 KIACSGNCKCI 93
Score = 46.2 bits (108), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 30/53 (56%), Gaps = 2/53 (3%)
Query: 141 ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVN-CHNNVEH 191
A + +D T + K CNC+ S CLK YCEC+ A I C G C C+ CH E
Sbjct: 50 ATSAAEDATRRHTKGCNCKRSGCLKNYCECYEAKIACSGNCKCIGECHAEREE 102
>gi|145512279|ref|XP_001442056.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409328|emb|CAK74659.1| unnamed protein product [Paramecium tetraurelia]
Length = 139
Score = 60.5 bits (145), Expect = 1e-06, Method: Composition-based stats.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 21/136 (15%)
Query: 139 PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQE 197
P+ G D K+ C C+ ++CLKLYC+CF G C C C C N++ + R +
Sbjct: 6 PKLQIQGNDSEETKEG-CQCQKTKCLKLYCQCFHDGKCCGQSCGCNGCKNSLNDYLERNK 64
Query: 198 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
A+G + + + ++ K L+ +GC CKKS C YCECF
Sbjct: 65 AIGKIIHKYKDDYKQK-------------------LSGDKGQGCCCKKSKCKLNYCECFI 105
Query: 258 ANILCSENCRCLDCKN 273
C E C C C+N
Sbjct: 106 KGRSCGEQCHCKKCEN 121
>gi|297722381|ref|NP_001173554.1| Os03g0638300 [Oryza sativa Japonica Group]
gi|255674730|dbj|BAH92282.1| Os03g0638300, partial [Oryza sativa Japonica Group]
Length = 113
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 5/78 (6%)
Query: 138 RPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQ 196
R + D D K+C+C+ S+CLKLYCECFA+ +YC + C+C C N+ HE
Sbjct: 7 RQKCQNDNVDSC----KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVL 62
Query: 197 EAVGATLERNPNAFRPKI 214
RNP AF PK+
Sbjct: 63 STRNRIESRNPLAFAPKV 80
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 19 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 59
>gi|145528842|ref|XP_001450215.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417815|emb|CAK82818.1| unnamed protein product [Paramecium tetraurelia]
Length = 244
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/169 (29%), Positives = 69/169 (40%), Gaps = 26/169 (15%)
Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNP 207
+PK Q +C C S+C + YCECFA G C C C NC N + + +G +R
Sbjct: 90 SPKPQIKCQCSKSQCQQSYCECFARGKTCGKHCGCQNCQN-----MQMNKKLGKIQKRLV 144
Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
R K Q ++ KGC C KS C +C C Q C+ C+
Sbjct: 145 KQKRIK------------------QRTPRYLKGCTCGKSKCQNNFCSCHQDGKYCNSECK 186
Query: 268 CLDCKNFEGSEERR--ALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTL 314
C+DCKN + + H + N + IIG SS + L
Sbjct: 187 CVDCKNVYNFCLKIFPEIEHVESNEMESTNANVVLFIIGCENSSNFQNL 235
>gi|118376115|ref|XP_001021240.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89303007|gb|EAS00995.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 379
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 69/132 (52%), Gaps = 19/132 (14%)
Query: 155 QCNCRNSRCLKLYCECFAAGIYC--DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
+CNC+ ++C+K YC C+A +C C C +C N Q++ +T ++ +P
Sbjct: 246 ECNCQTTQCIKKYCSCYANNNFCVPSICKCQSCMNTF------QQSYNSTSQK-----KP 294
Query: 213 KIASSPHGAQDAR----EDAREAQLAAKH-NKGCHCKKSGCLKKYCECFQANILCSENCR 267
K+ SS ++ + ++ ++ C C+KS C K+YC+C+++ CS C+
Sbjct: 295 KLISSQKSLSQVILYQIQNINQLKIQKENATISCTCQKSFCKKQYCDCYKSGKKCSTLCK 354
Query: 268 CLDCKNFEGSEE 279
C +C N +G +E
Sbjct: 355 CQNCFN-QGQKE 365
>gi|195155474|ref|XP_002018629.1| GL25902 [Drosophila persimilis]
gi|194114782|gb|EDW36825.1| GL25902 [Drosophila persimilis]
Length = 296
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 1/68 (1%)
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSEN 265
P+A ++ ++P R R + + KGC CK++ C+K YC+C+Q+ +C E
Sbjct: 5 PSALTTRVIAAPVKTAAGRRQPRTPKTPKESTIKGCACKRTSCIKNYCDCYQSMRICHEF 64
Query: 266 CRCLDCKN 273
C+C+DCKN
Sbjct: 65 CKCVDCKN 72
Score = 45.1 bits (105), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 4/67 (5%)
Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
P K + R +PR K+ T K C C+ + C+K YC+C+ + C + C CV+C N
Sbjct: 16 PVKTAAGRRQPRTPKTPKEST---IKGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKN 72
Query: 188 NVEHEVA 194
+VE +
Sbjct: 73 SVERPLV 79
>gi|198476800|ref|XP_001357481.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
gi|198137857|gb|EAL34551.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
Length = 296
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 1/68 (1%)
Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSEN 265
P+A ++ ++P R R + + KGC CK++ C+K YC+C+Q+ +C E
Sbjct: 5 PSALTTRVIAAPVKTAAGRRQPRTPKTPKESTIKGCACKRTSCIKNYCDCYQSMRICHEF 64
Query: 266 CRCLDCKN 273
C+C+DCKN
Sbjct: 65 CKCVDCKN 72
Score = 45.1 bits (105), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 4/67 (5%)
Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
P K + R +PR K+ T K C C+ + C+K YC+C+ + C + C CV+C N
Sbjct: 16 PVKTAAGRRQPRTPKTPKEST---IKGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKN 72
Query: 188 NVEHEVA 194
+VE +
Sbjct: 73 SVERPLV 79
>gi|145545532|ref|XP_001458450.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426270|emb|CAK91053.1| unnamed protein product [Paramecium tetraurelia]
Length = 220
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 58/134 (43%), Gaps = 22/134 (16%)
Query: 144 DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVG-- 200
+G++ +Q C+C S CL++YC CF G C C C C N ++ + R++AV
Sbjct: 77 EGQEIQQVEQFSCSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDNIIKREKAVNYA 136
Query: 201 -ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
+RN KI + GC+C K+ C+K YCECF
Sbjct: 137 KKKAQRNKKVPEGKIFETTEIW------------------GCNCSKTRCVKNYCECFIRG 178
Query: 260 ILCSENCRCLDCKN 273
CS C C C N
Sbjct: 179 KKCSVECNCQHCDN 192
Score = 41.2 bits (95), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 5/55 (9%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN----NVEHEVARQEAVGATLER 205
CNC +RC+K YCECF G C CNC +C N ++ +E+ RQ + + +
Sbjct: 160 CNCSKTRCVKNYCECFIRGKKCSVECNCQHCDNGKDEDLINEIKRQNQIKKRIRK 214
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 26/41 (63%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C C KS CL+ YC CF +C ++C+C +C+N E + +R
Sbjct: 89 CSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDNIIKR 129
>gi|145531914|ref|XP_001451718.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419384|emb|CAK84321.1| unnamed protein product [Paramecium tetraurelia]
Length = 242
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 57/129 (44%), Gaps = 26/129 (20%)
Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLE 204
KD P Q +C C S+C + YCECFA G C C C NC N + + LE
Sbjct: 89 KDLQP--QIKCQCNKSQCQQSYCECFARGKTCGKHCGCQNCQN---------KQMNKKLE 137
Query: 205 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
+ + R Q +++ KGC C KS C K+C C Q C+
Sbjct: 138 KIQKK--------------LIKQKRIKQRTSRYLKGCTCGKSKCQNKFCSCHQDGKYCNS 183
Query: 265 NCRCLDCKN 273
CRC+DCKN
Sbjct: 184 GCRCVDCKN 192
>gi|145531469|ref|XP_001451501.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419156|emb|CAK84104.1| unnamed protein product [Paramecium tetraurelia]
Length = 192
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 65/162 (40%), Gaps = 41/162 (25%)
Query: 116 PHPVHKLPLPTLPPGKQESPRSRP-RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAG 174
P P+ K P P + + P + N K +C C S+C K YCECFA G
Sbjct: 33 PMPILKFFDPDESPNILSAKKPFPIQKNIVKPKEIEKSIVKCKCLKSKCQKSYCECFAVG 92
Query: 175 IYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL 233
C CNC+ C+N+ + ++ G
Sbjct: 93 RSCSIDCNCLGCNNSNCSQKIKKSQCG--------------------------------- 119
Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
GC+CKK+GC+KKYCEC+ C+ C C+DC N E
Sbjct: 120 ------GCNCKKTGCVKKYCECYLKKQRCTFLCNCIDCCNQE 155
>gi|224123378|ref|XP_002319064.1| predicted protein [Populus trichocarpa]
gi|222857440|gb|EEE94987.1| predicted protein [Populus trichocarpa]
Length = 213
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 48/114 (42%), Gaps = 33/114 (28%)
Query: 164 LKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
L YCECFAAG+YC + C+C++C NN HE E G RNP AF PK+ + A
Sbjct: 8 LYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIESRNPLAFAPKVIRNLDSAS 67
Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
+ F+ + CS CRC CKN G
Sbjct: 68 E--------------------------------FRGGVGCSSYCRCEGCKNTFG 89
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 32/72 (44%), Gaps = 10/72 (13%)
Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSE----ERRALFHGDHNGIAFMQRA-----AN 299
L YCECF A + C E C CLDC N G E E R N +AF + +
Sbjct: 8 LYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIES-RNPLAFAPKVIRNLDSA 66
Query: 300 AAIIGAVGSSGY 311
+ G VG S Y
Sbjct: 67 SEFRGGVGCSSY 78
>gi|229594976|ref|XP_001032555.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|225566469|gb|EAR84892.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 622
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 3/83 (3%)
Query: 131 KQESPRSRPRANTDGKDGTPKKQKQ--CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
K++ P ++ ++ K+G + + C CR S CLKLYCECFA G C C C+ C N
Sbjct: 302 KKQQPNNKLQSIDSSKEGEILRVSKTPCKCRKSLCLKLYCECFARGEICGHACVCLECKN 361
Query: 188 NVEHEVARQEAVGATLERNPNAF 210
+ H R EA+ +NP+AF
Sbjct: 362 SKNHLELRNEAIKVIEAKNPSAF 384
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%)
Query: 218 PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 277
P+ + + ++E ++ C C+KS CLK YCECF +C C CL+CKN +
Sbjct: 306 PNNKLQSIDSSKEGEILRVSKTPCKCRKSLCLKLYCECFARGEICGHACVCLECKNSKNH 365
Query: 278 EERR 281
E R
Sbjct: 366 LELR 369
>gi|145530936|ref|XP_001451240.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418884|emb|CAK83843.1| unnamed protein product [Paramecium tetraurelia]
Length = 263
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 83/200 (41%), Gaps = 27/200 (13%)
Query: 80 LPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPG---KQESPR 136
L SSQ H+ + + P++Q+ + +R +R + + + P G K++ R
Sbjct: 71 LTTSSQHHL-----NEDEWKPEYQSSQKQLRFSDRSQNHIEEENKPKQARGIKDKEQRQR 125
Query: 137 SRPRANTDGKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEV 193
++ R + D K C+C ++CL+LYC CF C C C C N+ H+V
Sbjct: 126 TKRRVDYGVGDSFSNDSDLKPCHCSKTKCLQLYCSCFHKNKACSSKCQCKGCFNDGNHKV 185
Query: 194 ARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYC 253
E V A + +R ++ S + D R+ KG YC
Sbjct: 186 ---EMVKALQKVKLKEYR--VSQSDLDSFDTRQVLGCKCKKTHCKKG-----------YC 229
Query: 254 ECFQANILCSENCRCLDCKN 273
ECF C+ C+C +C N
Sbjct: 230 ECFIRGRKCTSQCKCCECNN 249
>gi|145481073|ref|XP_001426559.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124393634|emb|CAK59161.1| unnamed protein product [Paramecium tetraurelia]
Length = 254
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 50/117 (42%), Gaps = 35/117 (29%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNN-VEHEVARQEAVGATLERNPNAFRPK 213
C C+ ++C+K YCECF G C C C C N V+ + + V T+E
Sbjct: 144 CRCKKTKCIKYYCECFQYGQECSFKCECTGCCNKQVKQSSGKMQLVSPTIEF-------- 195
Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+GC C+K C KKYCEC + N C++ C+C+D
Sbjct: 196 -------------------------QGCKCQKQNCSKKYCECLKRNQRCTDLCKCID 227
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 20/33 (60%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
C CKK+ C+K YCECFQ CS C C C N
Sbjct: 144 CRCKKTKCIKYYCECFQYGQECSFKCECTGCCN 176
>gi|195434044|ref|XP_002065013.1| GK14905 [Drosophila willistoni]
gi|194161098|gb|EDW75999.1| GK14905 [Drosophila willistoni]
Length = 250
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 45/69 (65%), Gaps = 8/69 (11%)
Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
K A S G + AR A++ + KGC CK+S C+K YC+C+Q+ +CS++CRC+ C+
Sbjct: 3 KPAPSGTGKRSARGQAQQPTI-----KGCSCKRSQCIKNYCDCYQSMAVCSKHCRCIGCR 57
Query: 273 NFEGSEERR 281
N ++ER+
Sbjct: 58 N---TQERK 63
Score = 39.3 bits (90), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 15/40 (37%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHE 192
K C+C+ S+C+K YC+C+ + C C C+ C N E +
Sbjct: 24 KGCSCKRSQCIKNYCDCYQSMAVCSKHCRCIGCRNTQERK 63
>gi|145521839|ref|XP_001446769.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414258|emb|CAK79372.1| unnamed protein product [Paramecium tetraurelia]
Length = 151
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 59/133 (44%), Gaps = 20/133 (15%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C C+ ++CLKLYC+CF G C + C+C C N++ R +A+ + +
Sbjct: 27 CQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAIVKIIHK--------- 77
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
+D + + + GC CKKS C YCECF C + C C C+N
Sbjct: 78 ----------YKDESKLKFSGDKGSGCCCKKSKCQLNYCECFIKGRSCGDQCHCKKCQNG 127
Query: 275 EGSEERRALFHGD 287
+ ++ H D
Sbjct: 128 KKQNQKICANHED 140
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 48/113 (42%), Gaps = 12/113 (10%)
Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKN--FEGSEERRALFHGDHNGIAFMQRA 297
GC C+K+ CLK YC+CF C E C C CKN + E +A+ + + +
Sbjct: 26 GCQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAI-------VKIIHKY 78
Query: 298 ANAAIIGAVGSSGYGTLMTSNK---RKSEELLFGVAAKDQSVIRNPQSQQGNN 347
+ + + G G G +K E + G + DQ + Q+ + N
Sbjct: 79 KDESKLKFSGDKGSGCCCKKSKCQLNYCECFIKGRSCGDQCHCKKCQNGKKQN 131
>gi|222628471|gb|EEE60603.1| hypothetical protein OsJ_14005 [Oryza sativa Japonica Group]
Length = 97
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 39/76 (51%), Gaps = 5/76 (6%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
+ + C C S C+KLYC CF+A YC C C NC N +E +E V +NP AF
Sbjct: 5 QHRSCRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAF 64
Query: 211 RPKIASSPHGAQDARE 226
PKI QDA E
Sbjct: 65 DPKIVR----VQDASE 76
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 30/47 (63%)
Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
++ ++ C C KS C+K YC CF A CS+NCRC +CKN E E+
Sbjct: 1 MSNDQHRSCRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYED 47
>gi|395834329|ref|XP_003790159.1| PREDICTED: protein lin-54 homolog [Otolemur garnettii]
Length = 677
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 2/54 (3%)
Query: 234 AAKHNKGCHCKKSGCLKKY--CECFQANILCSENCRCLDCKNFEGSEERRALFH 285
A++ K C+C KS CLK + A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 518 ASRPRKPCNCTKSLCLKLWEVNNISMAKIMCSSICKCIGCKNFEESPERKTLMH 571
>gi|195473860|ref|XP_002089210.1| GE18995 [Drosophila yakuba]
gi|194175311|gb|EDW88922.1| GE18995 [Drosophila yakuba]
Length = 237
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 216 SSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
S+P R + R+A+ KGC CK+S C+K YC+C+Q+ +C+ CRC+DC+N
Sbjct: 2 STPKKKSVDRSEGRKAKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTMFCRCIDCRNT 61
Query: 275 E 275
E
Sbjct: 62 E 62
Score = 37.7 bits (86), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 30/61 (49%), Gaps = 4/61 (6%)
Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
P K+ RS R K P K C C+ S+C+K YC+C+ + C C C++C N
Sbjct: 4 PKKKSVDRSEGR---KAKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTMFCRCIDCRN 60
Query: 188 N 188
Sbjct: 61 T 61
>gi|194759119|ref|XP_001961797.1| GF14749 [Drosophila ananassae]
gi|190615494|gb|EDV31018.1| GF14749 [Drosophila ananassae]
Length = 324
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 35/45 (77%), Gaps = 3/45 (6%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
KGC CK+S C+K YC+C+Q+ ++CS+ CRC+ C+N +E R+A+
Sbjct: 103 KGCCCKRSQCIKNYCDCYQSMVICSKFCRCVGCRN---TEVRKAV 144
>gi|145529794|ref|XP_001450680.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418302|emb|CAK83283.1| unnamed protein product [Paramecium tetraurelia]
Length = 229
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 57/123 (46%), Gaps = 23/123 (18%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K +C C S CLK YC+CFA G C C C CHN ++ R+EA+ +N +
Sbjct: 54 KSLKCKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYLEEREEAINKLKMQNQSIE 113
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+ P G + ++ LK+YC+CFQ N CSE+C+C +
Sbjct: 114 KE----VPVGISCKCKKSKC------------------LKRYCDCFQNNQKCSESCQCNN 151
Query: 271 CKN 273
C N
Sbjct: 152 CSN 154
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
C C KS CLK+YC+CF +C+ C+C C N E E R
Sbjct: 58 CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYLEER 98
>gi|223993335|ref|XP_002286351.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977666|gb|EED95992.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 957
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 40/81 (49%), Gaps = 14/81 (17%)
Query: 143 TDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG--CNCVNCHNNVEHEVARQEAVG 200
T K K K C C+ ++CLKLYC CF+ I CD C+C C N + E+AR G
Sbjct: 305 TASKSNKKKTHKGCTCKKTKCLKLYCACFSHSIACDPSVCSCERC-ANTQDELARDGDNG 363
Query: 201 -----------ATLERNPNAF 210
+ LERNP AF
Sbjct: 364 MAMSAVVTARKSILERNPRAF 384
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 32/49 (65%)
Query: 225 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
R++A ++ K + C CKKS CLKKYCECF ++ C+ C+C +C N
Sbjct: 643 RQEAINSKALTKQTRICRCKKSQCLKKYCECFADSLKCTSACKCENCGN 691
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 47/108 (43%), Gaps = 21/108 (19%)
Query: 99 MPQWQARPQHVRMVN--------------RVPHPVHKLPLPTL----PPGKQESPRSRPR 140
MP W+ R + R V R HP H+ P PP +Q R +
Sbjct: 586 MPSWEGRHYYPREVESRRSYDYYYDPYHPRSSHP-HEPPQHYHGYYPPPHRQYYDEPRRQ 644
Query: 141 ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
+ K T K+ + C C+ S+CLK YCECFA + C C C NC N
Sbjct: 645 EAINSKALT-KQTRICRCKKSQCLKKYCECFADSLKCTSACKCENCGN 691
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 32/59 (54%), Gaps = 6/59 (10%)
Query: 236 KHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNFEGSEERRALFHGDHNGIAF 293
K +KGC CKK+ CLK YC CF +I C + C C C N + R GD NG+A
Sbjct: 313 KTHKGCTCKKTKCLKLYCACFSHSIACDPSVCSCERCANTQDELARD----GD-NGMAM 366
>gi|145490347|ref|XP_001431174.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398277|emb|CAK63776.1| unnamed protein product [Paramecium tetraurelia]
Length = 280
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 73/159 (45%), Gaps = 20/159 (12%)
Query: 119 VHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQ---KQCNCRNSRCLKLYCECFAAGI 175
+ + + +P +Q R + D +P +Q K C+C + CL+LYC CF
Sbjct: 111 IRQFEIQEIPTIQQIQTRRKGNKANYNVDDSPVEQDEMKPCHCTKTHCLQLYCSCFHNRR 170
Query: 176 YC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 234
C C C +C+N+ +HE Q+AV + K + H D+
Sbjct: 171 RCMSECRCNDCYNDGQHEEEVQKAV--------EQIKIKEQRASHHDLDS--------FD 214
Query: 235 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
K GC CKK+ C+K YCECF C+ +C+C +C+N
Sbjct: 215 TKQVWGCKCKKTKCVKGYCECFIRGKKCTSHCQCTECEN 253
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 22/41 (53%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
K CHC K+ CL+ YC CF C CRC DC N EE
Sbjct: 149 KPCHCTKTHCLQLYCSCFHNRRRCMSECRCNDCYNDGQHEE 189
>gi|195388490|ref|XP_002052913.1| GJ17819 [Drosophila virilis]
gi|194149370|gb|EDW65068.1| GJ17819 [Drosophila virilis]
Length = 225
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 33/45 (73%), Gaps = 3/45 (6%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
KGC CK+S C+K YC+C+Q+ +CS C+C+ C+N +EER A+
Sbjct: 25 KGCCCKRSQCIKNYCDCYQSMAICSIYCKCIGCRN---TEERLAV 66
Score = 38.9 bits (89), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 37/77 (48%), Gaps = 7/77 (9%)
Query: 136 RSRPRANTDGKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHE 192
RS N + PK+Q K C C+ S+C+K YC+C+ + C C C+ C N E
Sbjct: 5 RSGNYFNKEQSKRQPKQQAPKGCCCKRSQCIKNYCDCYQSMAICSIYCKCIGCRNTEERL 64
Query: 193 VA----RQEAVGATLER 205
++ A+ A ER
Sbjct: 65 AVATPPKKTALAAKRER 81
>gi|194856723|ref|XP_001968813.1| GG24300 [Drosophila erecta]
gi|190660680|gb|EDV57872.1| GG24300 [Drosophila erecta]
Length = 239
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 29/37 (78%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
KGC CK+S C+K YC+C+Q+ +C++ CRC+ C+N E
Sbjct: 26 KGCCCKRSQCIKNYCDCYQSMAICTKFCRCIGCRNTE 62
Score = 39.7 bits (91), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 33/67 (49%), Gaps = 5/67 (7%)
Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
P K+ R+ R GK P K C C+ S+C+K YC+C+ + C C C+ C N
Sbjct: 4 PKKKNVERTESR---RGKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTKFCRCIGCRN 60
Query: 188 N-VEHEV 193
V H V
Sbjct: 61 TEVRHLV 67
>gi|147832526|emb|CAN77361.1| hypothetical protein VITISV_011036 [Vitis vinifera]
Length = 514
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 46/121 (38%), Gaps = 45/121 (37%)
Query: 164 LKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
L YC+C AAG+YC D C C NC N E+E Q R+P AF P+I
Sbjct: 244 LSSYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRI-------- 295
Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEERR 281
A I CS+ CRC DC+N F +RR
Sbjct: 296 -----------------------------------AGIGCSDGCRCEDCRNSFGIKADRR 320
Query: 282 A 282
A
Sbjct: 321 A 321
>gi|219114349|ref|XP_002176345.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217402591|gb|EEC42581.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 636
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 25/35 (71%)
Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
GC C++S CLKKYCECFQ C NC+C +C+N
Sbjct: 245 GCKCRRSFCLKKYCECFQNASPCGLNCKCTNCQNL 279
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
Q C CR S CLK YCECF C C C NC N
Sbjct: 243 QFGCKCRRSFCLKKYCECFQNASPCGLNCKCTNCQN 278
>gi|145520831|ref|XP_001446271.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413748|emb|CAK78874.1| unnamed protein product [Paramecium tetraurelia]
Length = 229
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 23/123 (18%)
Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
K +C C S CLK YC+CFA G C C C CHN +++ R EA+ +N +
Sbjct: 54 KSLKCKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERGEAINKLKLQNQSIE 113
Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
+ P G + ++ LK+YC+CFQ N C E+C+C +
Sbjct: 114 KE----VPVGISCKCKKSKC------------------LKRYCDCFQNNQKCHESCQCSN 151
Query: 271 CKN 273
C N
Sbjct: 152 CSN 154
Score = 44.7 bits (104), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 25/42 (59%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
C C KS CLK+YC+CF +C+ C+C C N E +E R
Sbjct: 58 CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERG 99
>gi|397568266|gb|EJK46048.1| hypothetical protein THAOC_35308 [Thalassiosira oceanica]
Length = 754
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 24/37 (64%)
Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
H GC C+KS CLKKYCECF + C +C C +C N
Sbjct: 421 HRTGCKCRKSRCLKKYCECFSNGVKCGASCSCENCGN 457
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 238 NKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNFEG-SEERRALF 284
+KGC C K+ CLK YC CF N+LC+++ C C DCKN E S E A+
Sbjct: 231 DKGCSCSKTQCLKLYCSCFANNLLCNKDVCICSDCKNCEAESTEDGAIL 279
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 37/69 (53%), Gaps = 5/69 (7%)
Query: 153 QKQCNCRNSRCLKLYCECFAAGIYC--DGC---NCVNCHNNVEHEVARQEAVGATLERNP 207
K C+C ++CLKLYC CFA + C D C +C NC + A +A + L +NP
Sbjct: 231 DKGCSCSKTQCLKLYCSCFANNLLCNKDVCICSDCKNCEAESTEDGAILKARRSILRKNP 290
Query: 208 NAFRPKIAS 216
AF+ K S
Sbjct: 291 KAFKSKFQS 299
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/33 (60%), Positives = 23/33 (69%), Gaps = 1/33 (3%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
C CR SRCLK YCECF+ G+ C C+C NC N
Sbjct: 425 CKCRKSRCLKKYCECFSNGVKCGASCSCENCGN 457
>gi|195118094|ref|XP_002003575.1| GI21907 [Drosophila mojavensis]
gi|193914150|gb|EDW13017.1| GI21907 [Drosophila mojavensis]
Length = 234
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 33/45 (73%), Gaps = 3/45 (6%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
KGC CK+S C+K YC+C+Q+ +CS C+C+ C+N ++ER A+
Sbjct: 25 KGCCCKRSQCIKNYCDCYQSMAICSIYCKCVGCRN---TQERLAV 66
Score = 39.7 bits (91), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 38/79 (48%), Gaps = 14/79 (17%)
Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVE 190
+E P+ +P+ P+ K C C+ S+C+K YC+C+ + C C CV C N E
Sbjct: 12 KEHPKRQPK---------PQAPKGCCCKRSQCIKNYCDCYQSMAICSIYCKCVGCRNTQE 62
Query: 191 HEVA----RQEAVGATLER 205
++ A+ A ER
Sbjct: 63 RLAVATPPKKTALAAKRER 81
>gi|195034702|ref|XP_001988959.1| GH11450 [Drosophila grimshawi]
gi|193904959|gb|EDW03826.1| GH11450 [Drosophila grimshawi]
Length = 225
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 34/45 (75%), Gaps = 3/45 (6%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
+GC CK+S C+K YC+C+Q+ +CS +C+C+ C+N +E+R A+
Sbjct: 22 RGCCCKRSQCIKNYCDCYQSMAVCSIHCKCIGCRN---TEQRMAV 63
>gi|195342745|ref|XP_002037959.1| GM18017 [Drosophila sechellia]
gi|194132809|gb|EDW54377.1| GM18017 [Drosophila sechellia]
Length = 240
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 27/35 (77%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
KGC CK+S C+K YC+C+Q+ +C++ CRC C+N
Sbjct: 23 KGCSCKRSQCIKNYCDCYQSMAICTKFCRCTACRN 57
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 5/79 (6%)
Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHE 192
SPR + GK K C+C+ S+C+K YC+C+ + C C C C N V E
Sbjct: 3 SPRKKTVQGRRGKGQGAGGVKGCSCKRSQCIKNYCDCYQSMAICTKFCRCTACRNTVVRE 62
Query: 193 VARQEAVGATLERNPNAFR 211
+ +V +N NA +
Sbjct: 63 LVDPNSVA----KNSNAVK 77
>gi|145484609|ref|XP_001428314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395399|emb|CAK60916.1| unnamed protein product [Paramecium tetraurelia]
Length = 264
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 56/138 (40%), Gaps = 17/138 (12%)
Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVAR 195
S+ N D + K C+C + CL+LYC CF C C C +C+N+ +H
Sbjct: 129 SKANYNVDESPTLDSEIKPCHCSKTNCLQLYCSCFHNRRQCTQECKCCDCYNDGQHTDEV 188
Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
+AV + A + S D R+ K KG YCEC
Sbjct: 189 LKAVEQIKIKEQRASHHDLDSF-----DTRQVWGCKCKKTKCVKG-----------YCEC 232
Query: 256 FQANILCSENCRCLDCKN 273
F N CS +C C DC+N
Sbjct: 233 FIRNKKCSSHCHCSDCQN 250
>gi|24581949|ref|NP_608936.1| tombola [Drosophila melanogaster]
gi|7296988|gb|AAF52259.1| tombola [Drosophila melanogaster]
gi|294862322|gb|ADE06681.2| RT06796p1 [Drosophila melanogaster]
Length = 243
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 28/38 (73%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 278
C CK+S C+K YC+C+Q+ +C++ CRC+ C+N E E
Sbjct: 28 CCCKRSQCIKNYCDCYQSMAICTKFCRCVGCRNTEVRE 65
>gi|145508477|ref|XP_001440188.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407394|emb|CAK72791.1| unnamed protein product [Paramecium tetraurelia]
Length = 235
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 50/116 (43%), Gaps = 37/116 (31%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C CR S+C++ YC C A C C C NC N +PK
Sbjct: 138 CKCRRSKCIQNYCVCSANNQECKLNCECYNCSNK----------------------QPK- 174
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
S+P +QL++ GC+C+K C K+YCEC + NI C+ C C +
Sbjct: 175 -STP------------SQLSSIEFNGCNCRKQNCSKRYCECQKRNIKCTSKCNCCE 217
>gi|145545676|ref|XP_001458522.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426342|emb|CAK91125.1| unnamed protein product [Paramecium tetraurelia]
Length = 289
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 65/138 (47%), Gaps = 17/138 (12%)
Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVAR 195
++ N D + K C+C + CL+LYC CF C C C +C+N+ +HE
Sbjct: 139 NKANYNVDNSPVEEDEMKPCHCTKTHCLQLYCSCFHNRRQCMSECKCNDCYNDGQHE--- 195
Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
E V +E+ + K + H D+ K GC CKK+ C+K YCEC
Sbjct: 196 -EDVLKAVEQ----IKMKEQRASHHDLDS--------FDTKQVWGCKCKKTKCVKGYCEC 242
Query: 256 FQANILCSENCRCLDCKN 273
F C+ +C+C +C+N
Sbjct: 243 FIRGKKCTSHCQCTECEN 260
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
K CHC K+ CL+ YC CF C C+C DC N +G E L
Sbjct: 156 KPCHCTKTHCLQLYCSCFHNRRQCMSECKCNDCYN-DGQHEEDVL 199
>gi|145483717|ref|XP_001427881.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394964|emb|CAK60483.1| unnamed protein product [Paramecium tetraurelia]
Length = 219
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 50/122 (40%), Gaps = 38/122 (31%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C CR S+C++ YC C A C C C NC N ++P + PK+
Sbjct: 122 CKCRKSKCIQNYCVCSANNQECKSNCECYNCSN-----------------KSPKSAPPKM 164
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL-DCKN 273
+S GC+C+K C K+YCEC + C+ C C DC+N
Sbjct: 165 SSIEFN-------------------GCNCRKQNCSKRYCECQKRMTRCTSRCNCCEDCEN 205
Query: 274 FE 275
E
Sbjct: 206 QE 207
>gi|145520699|ref|XP_001446205.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413682|emb|CAK78808.1| unnamed protein product [Paramecium tetraurelia]
Length = 226
Score = 46.2 bits (108), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 47/120 (39%), Gaps = 38/120 (31%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C CR S+C++ YC C A C C C NC N +PK
Sbjct: 129 CKCRKSKCIQNYCVCSANNQECSFKCECYNCSNK----------------------QPKS 166
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD-CKN 273
A S + GC+C+K C K YCEC + I C+ CRC D CKN
Sbjct: 167 AISKQSGSEYL--------------GCNCRKQDCSKGYCECQKRKIKCTSKCRCCDECKN 212
>gi|428180254|gb|EKX49122.1| hypothetical protein GUITHDRAFT_151625 [Guillardia theta CCMP2712]
Length = 289
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 23/33 (69%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
C CK+S C+KKYCECFQ + C+ C C DC N
Sbjct: 158 CKCKRSRCMKKYCECFQNSSACTSRCTCNDCGN 190
>gi|145492971|ref|XP_001432482.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124399594|emb|CAK65085.1| unnamed protein product [Paramecium tetraurelia]
Length = 253
Score = 44.7 bits (104), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 69/144 (47%), Gaps = 22/144 (15%)
Query: 136 RSRPRANTDGK--DGTPKKQ---KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNV 189
++R +AN D +P ++ K C+C + CL+LYC CF C + C C +C N+
Sbjct: 101 QTRRKANKTNYNVDDSPTEEDDFKACHCSKTHCLQLYCSCFHNRRPCTNECKCNDCFNDG 160
Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
+HE E + A + K + H D+ K GC CKK+ C+
Sbjct: 161 KHE---DEVIKAV-----EQIKLKEQRASHHDLDS--------FDTKQVWGCKCKKTKCV 204
Query: 250 KKYCECFQANILCSENCRCLDCKN 273
K YCECF C+ +C+C +C+N
Sbjct: 205 KGYCECFIRGKKCTSHCQCTECEN 228
>gi|218194439|gb|EEC76866.1| hypothetical protein OsI_15056 [Oryza sativa Indica Group]
Length = 177
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 30/61 (49%), Gaps = 5/61 (8%)
Query: 167 YCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
YC CF+A YC C C NC N +E +E V +NP AF PKI QDA
Sbjct: 100 YCLCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAFDPKIVR----VQDAS 155
Query: 226 E 226
E
Sbjct: 156 E 156
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 241 CHCKKS--GCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
CH KK G + YC CF A CS+NCRC +CKN E E+
Sbjct: 87 CHFKKRARGKQESYCLCFSAYGYCSQNCRCTNCKNREYYED 127
>gi|145495603|ref|XP_001433794.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400914|emb|CAK66397.1| unnamed protein product [Paramecium tetraurelia]
Length = 253
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 57/121 (47%), Gaps = 17/121 (14%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C+C + CL+LYC CF C + C C +C+N+ +HE +AV + K
Sbjct: 149 CHCAKTHCLQLYCSCFHNRRPCTNECKCNDCYNDGKHEDEVLKAV--------EQIKLKE 200
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
+ H D+ K GC CKK+ C+K YCECF C+ +C+C + K
Sbjct: 201 QRASHHDLDS--------FDTKQVWGCKCKKTKCVKGYCECFIRGKKCTSHCQCTELKTI 252
Query: 275 E 275
E
Sbjct: 253 E 253
>gi|16764321|ref|NP_459936.1| DNA translocase FtsK [Salmonella enterica subsp. enterica serovar
Typhimurium str. LT2]
gi|374981628|ref|ZP_09722952.1| Cell division protein FtsK [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
gi|34395700|sp|Q8ZQD5.1|FTSK_SALTY RecName: Full=DNA translocase FtsK
gi|16419472|gb|AAL19895.1| cell division protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. LT2]
gi|321223284|gb|EFX48353.1| Cell division protein FtsK [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
Length = 1351
Score = 43.9 bits (102), Expect = 0.17, Method: Composition-based stats.
Identities = 30/110 (27%), Positives = 44/110 (40%), Gaps = 11/110 (10%)
Query: 44 PQSHSP-LQPQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQA--------HMRS 94
PQ+ P + P P QP Q A P ++Q Q +P +S P A +
Sbjct: 364 PQTGEPVIAPAPEGYQPHPQ--YAQPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLA 421
Query: 95 PSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTD 144
P + PQWQ P H P P+ P PP ++ + P +T+
Sbjct: 422 PQETQPQWQPEPTHQPTPVYQPEPIAAEPSHMPPPVIEQPVATEPEPDTE 471
>gi|378983520|ref|YP_005246675.1| DNA translocase FtsK [Salmonella enterica subsp. enterica serovar
Typhimurium str. T000240]
gi|312911948|dbj|BAJ35922.1| DNA translocase FtsK [Salmonella enterica subsp. enterica serovar
Typhimurium str. T000240]
Length = 1370
Score = 43.9 bits (102), Expect = 0.17, Method: Composition-based stats.
Identities = 30/110 (27%), Positives = 44/110 (40%), Gaps = 11/110 (10%)
Query: 44 PQSHSP-LQPQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQA--------HMRS 94
PQ+ P + P P QP Q A P ++Q Q +P +S P A +
Sbjct: 383 PQTGEPVIAPAPEGYQPHPQ--YAQPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLA 440
Query: 95 PSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTD 144
P + PQWQ P H P P+ P PP ++ + P +T+
Sbjct: 441 PQETQPQWQPEPTHQPTPVYQPEPIAAEPSHMPPPVIEQPVATEPEPDTE 490
>gi|145529912|ref|XP_001450739.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418361|emb|CAK83342.1| unnamed protein product [Paramecium tetraurelia]
Length = 226
Score = 42.0 bits (97), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 45/120 (37%), Gaps = 38/120 (31%)
Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
C CR S+C+K YC C A C C C NC N +PK
Sbjct: 129 CRCRKSKCIKNYCVCSANNQECTFKCECYNCSNK----------------------QPKS 166
Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD-CKN 273
A S + GC+C++ C K YCEC + C+ C C D CKN
Sbjct: 167 AVSKQSGSEYL--------------GCNCRRQDCSKGYCECQKRKTKCTARCNCCDECKN 212
>gi|145483579|ref|XP_001427812.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394895|emb|CAK60414.1| unnamed protein product [Paramecium tetraurelia]
Length = 261
Score = 41.6 bits (96), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
C C KS CLK+YC+CF +C+ C+C C N
Sbjct: 90 CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHN 122
Score = 41.2 bits (95), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
C C S CLK YC+CFA G C + C C CHN
Sbjct: 90 CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHN 122
>gi|123410061|ref|XP_001303597.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121884989|gb|EAX90667.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 162
Score = 41.6 bits (96), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 29/62 (46%), Gaps = 2/62 (3%)
Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIY-CDGCNCVNCHNNVEHEVARQEAVGATLERNPN 208
PK CNC + CL C CF Y CDGC C C N E+E R + L NP
Sbjct: 6 PKCASCCNCEGN-CLSTTCPCFLHSKYCCDGCKCQKCRNKKEYEQERVASFEQHLLENPL 64
Query: 209 AF 210
AF
Sbjct: 65 AF 66
>gi|76155324|gb|AAX26592.2| SJCHGC03965 protein [Schistosoma japonicum]
Length = 176
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/22 (72%), Positives = 17/22 (77%)
Query: 153 QKQCNCRNSRCLKLYCECFAAG 174
+K CNC S CLKLYCECFA G
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQG 165
>gi|345309828|ref|XP_001507154.2| PREDICTED: protein lin-54 homolog [Ornithorhynchus anatinus]
Length = 182
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 20/28 (71%)
Query: 258 ANILCSENCRCLDCKNFEGSEERRALFH 285
A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 49 AKIMCSSICKCIGCKNFEESPERKTLMH 76
>gi|16759831|ref|NP_455448.1| DNA translocase FtsK [Salmonella enterica subsp. enterica serovar
Typhi str. CT18]
gi|29142396|ref|NP_805738.1| DNA translocase FtsK [Salmonella enterica subsp. enterica serovar
Typhi str. Ty2]
gi|34395698|sp|Q8Z814.1|FTSK_SALTI RecName: Full=DNA translocase FtsK
gi|25512677|pir||AF0611 cell division protein FtsK [imported] - Salmonella enterica subsp.
enterica serovar Typhi (strain CT18)
gi|16502124|emb|CAD05360.1| cell division protein FtsK [Salmonella enterica subsp. enterica
serovar Typhi]
gi|29138026|gb|AAO69587.1| cell division protein FtsK [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
Length = 1343
Score = 40.8 bits (94), Expect = 1.2, Method: Composition-based stats.
Identities = 34/114 (29%), Positives = 52/114 (45%), Gaps = 20/114 (17%)
Query: 44 PQSHSP-LQPQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQA--------HMRS 94
PQ+ P + P P QP Q A P ++Q Q +P +S P A +
Sbjct: 364 PQTGEPVIAPAPEGYQPHPQ--YAQPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLA 421
Query: 95 PSQAMPQWQA-------RPQHVRM-VNRVPHPVHKLPLPTLP-PGKQESPRSRP 139
P + PQWQA +P+ + + +P PV + P+ T P PG +E+ +RP
Sbjct: 422 PQETQPQWQAPDAEQHWQPEPIAAEPSHMPPPVIEQPVATEPEPGIEETRPARP 475
>gi|378960139|ref|YP_005217625.1| DNA translocase ftsK [Salmonella enterica subsp. enterica serovar
Typhi str. P-stx-12]
gi|374354011|gb|AEZ45772.1| DNA translocase ftsK [Salmonella enterica subsp. enterica serovar
Typhi str. P-stx-12]
Length = 1317
Score = 40.8 bits (94), Expect = 1.2, Method: Composition-based stats.
Identities = 34/114 (29%), Positives = 52/114 (45%), Gaps = 20/114 (17%)
Query: 44 PQSHSP-LQPQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQA--------HMRS 94
PQ+ P + P P QP Q A P ++Q Q +P +S P A +
Sbjct: 364 PQTGEPVIAPAPEGYQPHPQ--YAQPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLA 421
Query: 95 PSQAMPQWQA-------RPQHVRM-VNRVPHPVHKLPLPTLP-PGKQESPRSRP 139
P + PQWQA +P+ + + +P PV + P+ T P PG +E+ +RP
Sbjct: 422 PQETQPQWQAPDAEQHWQPEPIAAEPSHMPPPVIEQPVATEPEPGIEETRPARP 475
>gi|115615245|ref|XP_001199031.1| PREDICTED: uncharacterized protein LOC763146 [Strongylocentrotus
purpuratus]
Length = 1617
Score = 39.7 bits (91), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 6/39 (15%)
Query: 241 CHCKKSGCL------KKYCECFQANILCSENCRCLDCKN 273
C CKK+ C K C C Q NI C++ CRC++CKN
Sbjct: 1560 CSCKKTSCEEPKAAGKGRCSCRQHNITCTDLCRCINCKN 1598
>gi|384246535|gb|EIE20025.1| hypothetical protein COCSUDRAFT_58250 [Coccomyxa subellipsoidea
C-169]
Length = 556
Score = 39.7 bits (91), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 8/70 (11%)
Query: 204 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 263
+RNP AF+ KI +G + + ++ H +GCHCKKS C KKYCECFQA + C
Sbjct: 162 QRNPQAFQEKIEKQANG--------QGSGVSGLHKRGCHCKKSHCKKKYCECFQAGVNCG 213
Query: 264 ENCRCLDCKN 273
+C+C +C N
Sbjct: 214 VHCKCDECHN 223
>gi|285026357|dbj|BAI67897.1| liver stage-specific antigen-1 [Plasmodium vivax]
Length = 2149
Score = 39.7 bits (91), Expect = 3.1, Method: Composition-based stats.
Identities = 37/130 (28%), Positives = 54/130 (41%), Gaps = 12/130 (9%)
Query: 34 EQSVKSQ-SSQPQSHSPLQPQPP-------KSQPQVQSKVASPSQSQPLVRLQLLPPSSQ 85
+Q +K Q QP+ H + QP + QP +Q ++ S +Q +P L P +
Sbjct: 698 KQQMKPQLRPQPRPHPQQELQPGAQPEEQLEEQPDLQLELQSEAQPKPQQPQMRLKPQLR 757
Query: 86 PHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDG 145
VQ +A PQ + PQ PHP K P P + P Q P + + N
Sbjct: 758 SKVQMRPHPQMKAHPQIKPNPQMKPNPQMKPHPQMK-PNPQMKPNPQMKPNPQMKPNPQM 816
Query: 146 KDGTPKKQKQ 155
K P+ Q Q
Sbjct: 817 K---PQPQLQ 823
>gi|390344245|ref|XP_003726079.1| PREDICTED: uncharacterized protein LOC764337 [Strongylocentrotus
purpuratus]
Length = 1742
Score = 39.3 bits (90), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 6/39 (15%)
Query: 241 CHCKKSGCL------KKYCECFQANILCSENCRCLDCKN 273
C CKK+ C K C C Q NI C++ CRC++CKN
Sbjct: 1685 CSCKKTSCEEPKAAGKGRCSCRQHNITCTDLCRCINCKN 1723
>gi|195147164|ref|XP_002014550.1| GL18894 [Drosophila persimilis]
gi|194106503|gb|EDW28546.1| GL18894 [Drosophila persimilis]
Length = 1100
Score = 38.9 bits (89), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 9/57 (15%)
Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHN 187
K++ P+ R T G PK CRN+RC C+ +G C C+CV CHN
Sbjct: 817 KEKPPKPICRCGTSGVAVNPK----TTCRNTRC-----PCYKSGNTCTNCHCVGCHN 864
>gi|325191197|emb|CCA25983.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 511
Score = 38.9 bits (89), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 39/88 (44%), Gaps = 8/88 (9%)
Query: 218 PHGAQDAREDAREAQLAAKHNKGCHCKKSG-CLKKYCECFQANILCSENCRCLDCKN-FE 275
P G+ D REDA E NK C+ SG C C C + N +C+ CRC C+N F
Sbjct: 299 PLGSSDRREDALEK------NKVVSCRCSGGCRNGRCACVRENGMCTPACRCTSCQNPFS 352
Query: 276 GSEERRALFHGDHNGIAFMQRAANAAII 303
+ A N FMQ A ++
Sbjct: 353 IVKAAGANMDILKNDSCFMQNACKGRVM 380
>gi|198473707|ref|XP_001356412.2| GA16882 [Drosophila pseudoobscura pseudoobscura]
gi|198138072|gb|EAL33476.2| GA16882 [Drosophila pseudoobscura pseudoobscura]
Length = 1102
Score = 38.9 bits (89), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 9/57 (15%)
Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHN 187
K++ P+ R T G PK CRN+RC C+ +G C C+CV CHN
Sbjct: 815 KEKPPKPICRCGTSGVAVNPK----TTCRNTRC-----PCYKSGNTCTNCHCVGCHN 862
>gi|345481297|ref|XP_003424335.1| PREDICTED: hypothetical protein LOC100679675 [Nasonia vitripennis]
Length = 539
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 25/48 (52%), Gaps = 3/48 (6%)
Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD--CKNFEGSEE 279
K+NKGC CK C K C C + I+C CRC D C N E +E
Sbjct: 103 GTKNNKGCSCK-GNCSNKICGCVKKGIMCQVFCRCNDAQCMNQEPEKE 149
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.128 0.388
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,412,691,960
Number of Sequences: 23463169
Number of extensions: 267316611
Number of successful extensions: 1327024
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 779
Number of HSP's successfully gapped in prelim test: 5435
Number of HSP's that attempted gapping in prelim test: 1261156
Number of HSP's gapped (non-prelim): 45855
length of query: 425
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 280
effective length of database: 8,957,035,862
effective search space: 2507970041360
effective search space used: 2507970041360
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)