BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008269
(571 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255551004|ref|XP_002516550.1| transcription factor, putative [Ricinus communis]
gi|223544370|gb|EEF45891.1| transcription factor, putative [Ricinus communis]
Length = 629
Score = 538 bits (1387), Expect = e-150, Method: Compositional matrix adjust.
Identities = 313/594 (52%), Positives = 381/594 (64%), Gaps = 74/594 (12%)
Query: 1 MRSPSQAMPQ-WQARPQHVRM---VNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKD 56
+ +P + PQ QARPQHV + + R+PHPV KL L TL KQ+SP SRPR N + KD
Sbjct: 79 LETPPKQQPQATQARPQHVMVPPAMRRIPHPVQKLSLHTLQLIKQQSPSSRPRNNIEAKD 138
Query: 57 --GTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
GTPKK K CNC+NSRCLKL YCECFAAG++
Sbjct: 139 NSGTPKKAKHCNCKNSRCLKL-----------------------------YCECFAAGVH 169
Query: 115 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
C GCNC NCHNNVE+E +RQEAVGA LERNP AF+PKIASSPHG++DA+EDA E QL K
Sbjct: 170 CTGCNCTNCHNNVENEASRQEAVGAVLERNPEAFKPKIASSPHGSRDAKEDAMEVQLVGK 229
Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 234
HNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH +HNGI +MQ+
Sbjct: 230 HNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFHDNHNGIVYMQQ 289
Query: 235 AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCS 294
AANAAI GA+GSSGYGT +TSNKRKSEE LFG +DQ R ++QQ N+ +N A S
Sbjct: 290 AANAAICGAIGSSGYGTPLTSNKRKSEE-LFGSTNRDQLGRRVAKTQQENHQRNPAASAS 348
Query: 295 Q-SVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAM 353
SVPV NA+ LGS+K +SPLA ILQPQD+K++CSLLV++S EA K K+ M
Sbjct: 349 PLSVPVPH--NATALGSSKFTYKSPLAGILQPQDVKKMCSLLVVLSQEARKAYAGKMD-M 405
Query: 354 QTEREGSR---SSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDML 410
Q + +R SS +S I+E ED G+ VH +P+D +GN + D +S SG DGG +
Sbjct: 406 QPGGDDNRKLESSSSSSIREREDGLMGNNVHKTVPNDHVNGNTAEKD-SSDSGKDGGGLE 464
Query: 411 NERPLSPGTRALMCDEEDSAFMQAGPQTGLA--SKSTVQK-SNAYASTRIYAEQERLILT 467
N RPLSP LMC E++ F + TG+A S++ +QK S+A + YAEQE+LILT
Sbjct: 465 NGRPLSPEID-LMCHEQEMVFREGESATGMAGLSQNEIQKSSSAQECSEAYAEQEKLILT 523
Query: 468 RFRDFLNRLITCGSIK---------------ETMCSPLVKSETRSQQKAVEDGTSRSGTE 512
FRDFLNRLITCGSIK ET CSPL +S+T S Q+ + ++G E
Sbjct: 524 GFRDFLNRLITCGSIKGKLLWSETFLCSNFHETTCSPLARSKTGSHQEPADGEMIKAGIE 583
Query: 513 VRGYMEPCGYGSAKSAVSVTLEPSCTTTSVSSRQNNDLPMRVGLSNGQRVVDSE 566
G KS V T S N DLP++ L G V+ E
Sbjct: 584 TLNDKNAYASGIVKSPVLAT-----------SASNVDLPIKALLPIGNGEVNPE 626
>gi|255546437|ref|XP_002514278.1| transcription factor, putative [Ricinus communis]
gi|223546734|gb|EEF48232.1| transcription factor, putative [Ricinus communis]
Length = 601
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 282/557 (50%), Positives = 348/557 (62%), Gaps = 41/557 (7%)
Query: 6 QAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPG------KQESPRSRPRANTDGKDGTP 59
Q P A PQ + + P V P LPP K ESP+S+PR N + KDGTP
Sbjct: 58 QQPPATVALPQEPVVAPQQPQAVAIPEAPLLPPQPPVRAIKPESPKSKPRPNAELKDGTP 117
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
K+QKQCNC++SRCLKL YCECFA+G YCDGCN
Sbjct: 118 KRQKQCNCKHSRCLKL-----------------------------YCECFASGTYCDGCN 148
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
CVNC+NNVE+E AR+EAV ATLERNPNAFRPKIASSPHG +D+RE+ E + KHNKGC
Sbjct: 149 CVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREENGEGLILGKHNKGC 208
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANA 238
HCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q+AANA
Sbjct: 209 HCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHTNNMAYIQQAANA 268
Query: 239 AIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSV- 297
AI GA+GSSGY + S KRK +ELLFG KD S R QQ +++ S S S
Sbjct: 269 AITGAIGSSGYVSPPISKKRKGQELLFGPTTKDPSFHRLGHFQQAVHIRPSTSSSSLSSN 328
Query: 298 PVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTER 357
P+++ +++ LG +K RS LADI+QPQD+KELCS+LV++S EA KTL + A +
Sbjct: 329 PIARAGSSATLGPSKFTYRSLLADIIQPQDLKELCSVLVVLSGEAAKTLAGQRNATENWV 388
Query: 358 EG-SRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLS 416
E + S+AS QE S+KG N I D S NQ D G S SDG D+ RP+S
Sbjct: 389 EDQTEPSLASSPQERLQSQKGAGAGNSIAHDCSSANQADKTGHGSSSSDGVDVPKGRPMS 448
Query: 417 PGTRALMCDEEDSAFMQAGPQTGLASK---STVQKSNAYASTRIYAEQERLILTRFRDFL 473
PGT ALMCDEED+ FM A GL + +T Q T IY EQER++LT+FRD L
Sbjct: 449 PGTLALMCDEEDTMFMTAASPNGLTGRGCSTTSQFPCGQGMTEIYTEQERIVLTKFRDCL 508
Query: 474 NRLITCGSIKETMCSPLVKSETRSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVSVTL 533
NRLIT G IKET CS ++E + + V +GT+ + EV P G + + T
Sbjct: 509 NRLITFGEIKETKCSSFARTELGNPKDPVSNGTTNARAEVGNQQVPYSNGVGRHVIPPTS 568
Query: 534 EPSCTTTSVSSRQNNDL 550
+ S S + +NDL
Sbjct: 569 KTSQMVGSTVATSSNDL 585
>gi|225445198|ref|XP_002280793.1| PREDICTED: uncharacterized protein LOC100249023 [Vitis vinifera]
Length = 592
Score = 503 bits (1295), Expect = e-139, Method: Compositional matrix adjust.
Identities = 280/537 (52%), Positives = 347/537 (64%), Gaps = 42/537 (7%)
Query: 26 HPVHKLPLP--TLPPG----KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
HPV +P+P T P K ESPRSRPR N D KDGTPKKQKQCNC++SRCLKL
Sbjct: 65 HPV-TMPVPPQTTHPSVRVVKPESPRSRPRPNIDVKDGTPKKQKQCNCKHSRCLKL---- 119
Query: 80 IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
YCECFA+GIYCDGCNCVNCHNNVE+E AR+EAV
Sbjct: 120 -------------------------YCECFASGIYCDGCNCVNCHNNVENEAARREAVEV 154
Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
TLERNPNAFRPKIASSPHGA+D+RE++ EA + KHNKGCHCKKSGCLKKYCECFQANIL
Sbjct: 155 TLERNPNAFRPKIASSPHGARDSREESGEALVLGKHNKGCHCKKSGCLKKYCECFQANIL 214
Query: 200 CSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKR 258
CSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q+AANAAI GA+GSSG+G+ S KR
Sbjct: 215 CSENCKCMDCKNFEGSEERQALFHGDHANSMAYIQQAANAAITGAIGSSGFGSPPVSKKR 274
Query: 259 KSEELLFGVAAKDQSVIRNPQSQQGNNVQ-NSAVPCSQSVPVSQTANASVLGSTKSALRS 317
K +EL +G +KD S+ R Q QQ + + + S PVS+ N++ G++K RS
Sbjct: 275 KGQELFYGPTSKDPSLHRLAQFQQASLTKASVTTSSLSSTPVSRVPNSAASGTSKFTYRS 334
Query: 318 PLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQT-EREGSRSSVASPIQEGEDSRK 376
LADI+QPQD+KELCS+LV+VS EA +T K ++ E +S AS Q+ S+K
Sbjct: 335 LLADIIQPQDLKELCSVLVVVSEEAARTFADKRNLVEKPAEEQVETSRASSTQDRLQSQK 394
Query: 377 GHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGP 436
++ + DD S NQ D G S SDGGD+ RP+SPGT ALMCDE+D+ FM A
Sbjct: 395 ESDIEKAVADDCSSRNQADKLGPDDSSSDGGDVPKGRPMSPGTLALMCDEQDTMFMAAAS 454
Query: 437 QTGLASKS---TVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKS 493
L S + Q T YAEQER+ILT+FRD LNRLIT G IKET CS L ++
Sbjct: 455 PNVLMGHSCNTSSQLPYGQGITEAYAEQERIILTKFRDCLNRLITFGEIKETKCSSLART 514
Query: 494 ETRSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVSVTLEPSCTTTSVSSRQNNDL 550
E+ Q + +G + + T+ P G +K + S T T+ ++ NN L
Sbjct: 515 ESGGQNEPFSNGPANARTDTGNQQVPVSNGVSKPPFPPATKMSQTVTAAAASPNNTL 571
>gi|297738803|emb|CBI28048.3| unnamed protein product [Vitis vinifera]
Length = 524
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 274/523 (52%), Positives = 341/523 (65%), Gaps = 35/523 (6%)
Query: 34 PTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAH 93
P++ K ESPRSRPR N D KDGTPKKQKQCNC++SRCLKL
Sbjct: 10 PSVRVVKPESPRSRPRPNIDVKDGTPKKQKQCNCKHSRCLKL------------------ 51
Query: 94 LNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
YCECFA+GIYCDGCNCVNCHNNVE+E AR+EAV TLERNPNAFRPKIA
Sbjct: 52 -----------YCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRPKIA 100
Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
SSPHGA+D+RE++ EA + KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFE
Sbjct: 101 SSPHGARDSREESGEALVLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFE 160
Query: 214 GSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQ 272
GSEER+ALFHGDH N +A++Q+AANAAI GA+GSSG+G+ S KRK +EL +G +KD
Sbjct: 161 GSEERQALFHGDHANSMAYIQQAANAAITGAIGSSGFGSPPVSKKRKGQELFYGPTSKDP 220
Query: 273 SVIRNPQSQQGNNVQ-NSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKEL 331
S+ R Q QQ + + + S PVS+ N++ G++K RS LADI+QPQD+KEL
Sbjct: 221 SLHRLAQFQQASLTKASVTTSSLSSTPVSRVPNSAASGTSKFTYRSLLADIIQPQDLKEL 280
Query: 332 CSLLVMVSSEATKTLTKKVGAMQT-EREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQS 390
CS+LV+VS EA +T K ++ E +S AS Q+ S+K ++ + DD S
Sbjct: 281 CSVLVVVSEEAARTFADKRNLVEKPAEEQVETSRASSTQDRLQSQKESDIEKAVADDCSS 340
Query: 391 GNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKS---TVQ 447
NQ D G S SDGGD+ RP+SPGT ALMCDE+D+ FM A L S + Q
Sbjct: 341 RNQADKLGPDDSSSDGGDVPKGRPMSPGTLALMCDEQDTMFMAAASPNVLMGHSCNTSSQ 400
Query: 448 KSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAVEDGTS 507
T YAEQER+ILT+FRD LNRLIT G IKET CS L ++E+ Q + +G +
Sbjct: 401 LPYGQGITEAYAEQERIILTKFRDCLNRLITFGEIKETKCSSLARTESGGQNEPFSNGPA 460
Query: 508 RSGTEVRGYMEPCGYGSAKSAVSVTLEPSCTTTSVSSRQNNDL 550
+ T+ P G +K + S T T+ ++ NN L
Sbjct: 461 NARTDTGNQQVPVSNGVSKPPFPPATKMSQTVTAAAASPNNTL 503
>gi|296081871|emb|CBI20876.3| unnamed protein product [Vitis vinifera]
Length = 676
Score = 500 bits (1287), Expect = e-139, Method: Compositional matrix adjust.
Identities = 284/527 (53%), Positives = 354/527 (67%), Gaps = 46/527 (8%)
Query: 18 VRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDF 77
V +V R+ +P PTL KQESPRS PR+N++ KDGTPKK+K CNC+NSRCLKL
Sbjct: 151 VPVVPRITNPSR----PTLQAVKQESPRSLPRSNSEVKDGTPKKKKHCNCKNSRCLKL-- 204
Query: 78 VEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 137
YCECFA+G+YC+ C+C NC NNV +E AR+EAV
Sbjct: 205 ---------------------------YCECFASGVYCNNCHCSNCQNNVVNEAARKEAV 237
Query: 138 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
G+TL+RNPNAFRPKIA+SPHG+ D E+A + L KHNKGCHCKKSGCLKKYCECFQAN
Sbjct: 238 GSTLDRNPNAFRPKIANSPHGSHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQAN 297
Query: 198 ILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNK 257
ILCSENC+C+DCKNFEGSEERRALFH + N +A++Q+AANAAI GA+GSSGYGT + S K
Sbjct: 298 ILCSENCKCMDCKNFEGSEERRALFHDNGNYMAYIQQAANAAISGAIGSSGYGTAVASKK 357
Query: 258 RKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRS 317
RK +++ F A +QS RN Q QQ N+++ SAV S S+ V+ T NA+VLGS+K RS
Sbjct: 358 RK-QQVSFCTTANNQSNHRNTQFQQDNHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRS 416
Query: 318 PLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKG 377
PLADILQ QD+K+LCSLLV+VS EA KTLT + ER+ +S+ S QE E +
Sbjct: 417 PLADILQLQDVKDLCSLLVVVSQEAGKTLT------EAERDHIETSIISSTQEREHLQTD 470
Query: 378 HEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQ 437
+V +PD S NQ V+ D D+ N R +SPGT ALMCDE+DS FM G
Sbjct: 471 -DVQKPLPDACLSENQ--VERVQTDELDQADVQNGRLMSPGTLALMCDEQDSIFMVRGSP 527
Query: 438 TGLASKS---TVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSE 494
G+ S TV+ S+ A T +Y EQERL+LT+F DFLNRLITCGSIKETMCSPL KS
Sbjct: 528 DGVVGSSQIPTVKSSHGQACTEVYVEQERLVLTKFWDFLNRLITCGSIKETMCSPLAKSH 587
Query: 495 TRSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVSVTLEPSCTTTS 541
T +QQ+ +ED ++ GT + EP G K + + T + S T +
Sbjct: 588 TGNQQEPLEDDATKVGTGKGNHKEPNGNSIVKPSFAATTQMSQTIAA 634
>gi|356522716|ref|XP_003529992.1| PREDICTED: uncharacterized protein LOC100818116 [Glycine max]
Length = 559
Score = 499 bits (1285), Expect = e-138, Method: Compositional matrix adjust.
Identities = 266/496 (53%), Positives = 325/496 (65%), Gaps = 41/496 (8%)
Query: 27 PVHKLPLPTLPP----GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWI 82
PV LPLP P GK ESP+SR R N + KD TPKKQKQCNC++S+CLKL
Sbjct: 52 PVAVLPLPLQAPPARVGKPESPKSRSRPNFEMKDATPKKQKQCNCKHSKCLKL------- 104
Query: 83 NIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLE 142
YCECFA+GIYCDGCNCVNC NNVE+E AR+EAV ATLE
Sbjct: 105 ----------------------YCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLE 142
Query: 143 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 202
RNPNAFRPKIASSPHG +D+REDA E + KHNKGCHCKKSGCLKKYCECFQANILCSE
Sbjct: 143 RNPNAFRPKIASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECFQANILCSE 202
Query: 203 NCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEE 262
NC+C+DCKNFEGSEER+ALFHGD N +A++Q+AANAAI GA+GSSGY + S KRK +E
Sbjct: 203 NCKCMDCKNFEGSEERQALFHGDQNNMAYIQQAANAAITGAIGSSGYSSPPVSKKRKGQE 262
Query: 263 LLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQT-ANASVLGSTKSALRSPLAD 321
L F KD S+ + QQ N+V+ A P S PVS + LG +K RS LAD
Sbjct: 263 LFFWPTTKDPSISK--LGQQVNHVRGPA-PSSSLSPVSGARVGTATLGPSKFMYRSLLAD 319
Query: 322 ILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHEV 380
I+QPQ +KELCS+LV+VS +A KTLT + M+ + E + +S+AS QE S+K +
Sbjct: 320 IIQPQHLKELCSVLVLVSGQAAKTLTDQKILMEKQAEDQTETSLASSSQEQLPSQKEGHI 379
Query: 381 HNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFM-QAGPQTG 439
+ DD S NQ D S SDG D+ RP+SPGT ALMCDE+D+ FM A P
Sbjct: 380 EKTVADDCSSANQTDKISPDNSSSDGADVPKGRPMSPGTLALMCDEQDTMFMTTASPIAP 439
Query: 440 LASKSTVQKSNAYAS--TRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRS 497
+A Y T +Y EQER++LT+FRDFLNR+IT G I ET CS L ++E S
Sbjct: 440 MAHACNTSSQFPYGQGMTEVYPEQERIVLTKFRDFLNRVITMGEINETKCSSLTRNELES 499
Query: 498 QQKAVEDGTSRSGTEV 513
Q+ + +G + +E+
Sbjct: 500 QKDPIINGIGNASSEI 515
>gi|356529628|ref|XP_003533391.1| PREDICTED: uncharacterized protein LOC100777698 [Glycine max]
Length = 559
Score = 496 bits (1277), Expect = e-137, Method: Compositional matrix adjust.
Identities = 273/530 (51%), Positives = 337/530 (63%), Gaps = 42/530 (7%)
Query: 11 WQARPQHVRMVNRVPHPVHKLPLPTLPP----GKQESPRSRPRANTDGKDGTPKKQKQCN 66
+ P+H ++ PV LPL P GK ESP+SR R N + KD TPKKQKQCN
Sbjct: 34 FTGAPEHPQLSQPPQLPVAVLPLQPQAPHARVGKPESPKSRSRPNFEIKDATPKKQKQCN 93
Query: 67 CRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNN 126
C++S+CLKL YCECFA+GIYCDGCNCVNC NN
Sbjct: 94 CKHSKCLKL-----------------------------YCECFASGIYCDGCNCVNCFNN 124
Query: 127 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 186
VE+E AR+EAV ATLERNPNAFRPKIASSPHG +D+RE+A E + KHNKGCHCKKSGC
Sbjct: 125 VENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREEAGEVLILGKHNKGCHCKKSGC 184
Query: 187 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD-HNGIAFMQRAANAAIIGAVG 245
LKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGD +N +A++Q+AANAAI GA+G
Sbjct: 185 LKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQNNNMAYIQQAANAAITGAIG 244
Query: 246 SSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQT-AN 304
SSGY + S KRK +EL F KD S+ + QQ N+V+ A P S PVS
Sbjct: 245 SSGYSSPPVSKKRKGQELFFWPTTKDPSISK--LGQQVNHVRGPA-PSSSLSPVSGARVG 301
Query: 305 ASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREG-SRSS 363
+ LG +K RS LADI+QPQ +KELCS+LV+VS +A KTLT + M+ + E + +S
Sbjct: 302 TATLGPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTLTDQKILMEKQAEDQTETS 361
Query: 364 VASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALM 423
+AS QE S+K V + DD S NQ D S SDG D+ RP+SPGT ALM
Sbjct: 362 LASSSQEQLPSQKEGRVEKTVADDCSSANQTDKISPDNSSSDGADVPKGRPMSPGTLALM 421
Query: 424 CDEEDSAFMQAG-PQTGLASKSTVQKSNAYAS--TRIYAEQERLILTRFRDFLNRLITCG 480
CDE+D+ FM A P +A Y T +YAEQER++LT+FRDFLNR+IT G
Sbjct: 422 CDEQDTMFMTAASPIAPMAHACNTSSQFPYGQEMTEVYAEQERIVLTKFRDFLNRVITMG 481
Query: 481 SIKETMCSPLVKSETRSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVS 530
I ET CS L ++E SQ+ + +G + +E+ G K AVS
Sbjct: 482 EINETKCSSLARNELESQKDPIINGIGNTSSEIVLQHGTTSNGVTKPAVS 531
>gi|356511363|ref|XP_003524396.1| PREDICTED: uncharacterized protein LOC100786353 [Glycine max]
Length = 561
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 272/513 (53%), Positives = 328/513 (63%), Gaps = 40/513 (7%)
Query: 25 PHPVHKLPLP---TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIW 81
P P LPLP ++ GK ESP+SRPR + K+ TPKKQKQCNC++S+CLKL
Sbjct: 52 PQPPVILPLPPSASVRVGKPESPKSRPRPGFEIKEATPKKQKQCNCKHSKCLKL------ 105
Query: 82 INIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATL 141
YCECFA+GIYCDGCNCVNC+NNVE+E AR+EAV ATL
Sbjct: 106 -----------------------YCECFASGIYCDGCNCVNCYNNVENEAARREAVEATL 142
Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
ERNPNAFRPKIASSPHG +D+REDA E + KHNKGCHCKKSGCLKKYCECFQANILCS
Sbjct: 143 ERNPNAFRPKIASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECFQANILCS 202
Query: 202 ENCRCLDCKNFEGSEERRALFHGD-HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKS 260
ENC+C+DCKNFEGSEER+ALFHGD +N + ++Q+AANAAI GA+GSSGY + S KRK
Sbjct: 203 ENCKCMDCKNFEGSEERQALFHGDQNNNMVYIQQAANAAITGAIGSSGYSSPPISKKRKG 262
Query: 261 EELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLA 320
+ELLFG KD SV R Q QQ NNV+ A S S + LG +K RS LA
Sbjct: 263 QELLFGPTVKDPSVGR--QGQQANNVRVPAPSSSLSPIPGARVGPATLGPSKLMYRSLLA 320
Query: 321 DILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHE 379
DI+QPQ +KELCS+LV+VS +A K T M E + +S AS QE S+K
Sbjct: 321 DIIQPQHLKELCSVLVLVSGQAAKMFTDHKNFMDKHAEDQTETSRASSTQEQLPSQKEAN 380
Query: 380 VHNGIPDDR-QSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQT 438
V + DD S NQ D S SDG D+ RP+SPGT ALMCDE+D+ FM A
Sbjct: 381 VEKAMADDDCSSANQTDNISPDNSCSDGADVPKGRPMSPGTLALMCDEQDTMFMTAASTV 440
Query: 439 G---LASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSET 495
G A +++Q T +YAEQER++LT+FRDFLNR+IT G I ET CS L +SE
Sbjct: 441 GSRAHACNTSLQPPYGQVMTEVYAEQERIVLTKFRDFLNRVITMGEINETKCSSLARSEL 500
Query: 496 RSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSA 528
SQ+ + T + TE + G AK+A
Sbjct: 501 ESQKDPNINCTGNASTETQHQQGATNNGVAKTA 533
>gi|225429991|ref|XP_002281512.1| PREDICTED: uncharacterized protein LOC100243881 [Vitis vinifera]
Length = 579
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 280/527 (53%), Positives = 347/527 (65%), Gaps = 62/527 (11%)
Query: 18 VRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDF 77
V +V R+ +P PTL KQESPRS PR+N++ KDGTPKK+K CNC+NSRCLKL
Sbjct: 70 VPVVPRITNPSR----PTLQAVKQESPRSLPRSNSEVKDGTPKKKKHCNCKNSRCLKL-- 123
Query: 78 VEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 137
YCECFA+G+YC+ C+C NC NNV +E AR+EAV
Sbjct: 124 ---------------------------YCECFASGVYCNNCHCSNCQNNVVNEAARKEAV 156
Query: 138 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
G+TL+RNPNAFRPKIA+SPHG+ D E+A + L KHNKGCHCKKSGCLKKYCECFQAN
Sbjct: 157 GSTLDRNPNAFRPKIANSPHGSHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQAN 216
Query: 198 ILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNK 257
ILCSENC+C+DCKNFEGSEERRALFH + N +A++Q+AANAAI GA+GSSGYGT + S K
Sbjct: 217 ILCSENCKCMDCKNFEGSEERRALFHDNGNYMAYIQQAANAAISGAIGSSGYGTAVASKK 276
Query: 258 RKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRS 317
RK +++ F A +QS RN Q QQ N+++ SAV S S+ V+ T NA+VLGS+K RS
Sbjct: 277 RK-QQVSFCTTANNQSNHRNTQFQQDNHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRS 335
Query: 318 PLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKG 377
PLADILQ QD+K+LCSLLV+VS EA KTL TERE ++
Sbjct: 336 PLADILQLQDVKDLCSLLVVVSQEAGKTL--------TEREHLQTD-------------- 373
Query: 378 HEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQ 437
+V +PD S NQ V+ D D+ N R +SPGT ALMCDE+DS FM G
Sbjct: 374 -DVQKPLPDACLSENQ--VERVQTDELDQADVQNGRLMSPGTLALMCDEQDSIFMVRGSP 430
Query: 438 TGLASKS---TVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSE 494
G+ S TV+ S+ A T +Y EQERL+LT+F DFLNRLITCGSIKETMCSPL KS
Sbjct: 431 DGVVGSSQIPTVKSSHGQACTEVYVEQERLVLTKFWDFLNRLITCGSIKETMCSPLAKSH 490
Query: 495 TRSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVSVTLEPSCTTTS 541
T +QQ+ +ED ++ GT + EP G K + + T + S T +
Sbjct: 491 TGNQQEPLEDDATKVGTGKGNHKEPNGNSIVKPSFAATTQMSQTIAA 537
>gi|224141773|ref|XP_002324239.1| predicted protein [Populus trichocarpa]
gi|222865673|gb|EEF02804.1| predicted protein [Populus trichocarpa]
Length = 682
Score = 470 bits (1210), Expect = e-130, Method: Compositional matrix adjust.
Identities = 272/521 (52%), Positives = 339/521 (65%), Gaps = 63/521 (12%)
Query: 11 WQARP--QH--VRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCN 66
+Q RP QH V + R+PHPVHK+ +P GKQ+SP SR AN DGTPKK KQCN
Sbjct: 60 YQERPVSQHEAVAVAQRIPHPVHKIS--AMPRGKQDSPGSR-LAN----DGTPKKPKQCN 112
Query: 67 CRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNN 126
C+NS CLKL YCECFAAGI+C+GCNC+NC NN
Sbjct: 113 CKNSMCLKL-----------------------------YCECFAAGIHCNGCNCLNCRNN 143
Query: 127 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 186
VE+E R+EAV TL+RNPNAFRPKIASSPHG++D EDA+E Q+ KHNKGC+CK+SGC
Sbjct: 144 VENEKERKEAVETTLQRNPNAFRPKIASSPHGSRDTTEDAQEVQMLGKHNKGCNCKRSGC 203
Query: 187 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGS 246
LKKYCECFQ NI CSENC+CL+CKNFEGS+ERRALFHG + A+ Q+A NAAI GA+GS
Sbjct: 204 LKKYCECFQGNIFCSENCKCLECKNFEGSDERRALFHGSPHATAYAQQATNAAISGAIGS 263
Query: 247 SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQ-SVPVSQTANA 305
SGYGT + S KRKSEE LFG+A KDQ + QGN ++NSA S SVPVS+TAN
Sbjct: 264 SGYGTTLASKKRKSEE-LFGIATKDQPAHSTAKFSQGNTLRNSAAFSSPLSVPVSRTANG 322
Query: 306 SVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAM--QTEREGSRSS 363
+VLGSTK RSPLA +L+PQ +KE+C+LLV +S +AT+ L KVG M Q+ERE +
Sbjct: 323 AVLGSTKPTYRSPLAGVLKPQFVKEICTLLVTISGKATEALAGKVGKMDTQSERENIETY 382
Query: 364 VASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALM 423
ASP QE E S KG +PDD +GN+ + DG+ D GD+ N RP SP LM
Sbjct: 383 SASPTQEKESSLKGWNAIEHMPDDCLNGNKAEKDGS----IDSGDVENARPSSPDID-LM 437
Query: 424 CDEEDSAFMQAGPQTGLAS--KSTVQK-SNAYASTRIYAEQERLILTRFRDFLNRLITCG 480
C EE+ FM+ G G+A +S QK + Y ++++AEQE++ILT F D LN + T G
Sbjct: 438 CHEEEMMFMEMGSPIGVARLCQSKTQKPDDGYECSQLHAEQEKIILTSFLDILNTVSTYG 497
Query: 481 SIKETMC------SPLVKSETRSQQKAVEDGTSRSGTEVRG 515
SIK +C P V+ ++ +E R+G RG
Sbjct: 498 SIK--VCQSGNQKEPAVRDTVKT---GIETVNHRNGYHNRG 533
>gi|224119800|ref|XP_002331164.1| predicted protein [Populus trichocarpa]
gi|222873247|gb|EEF10378.1| predicted protein [Populus trichocarpa]
Length = 519
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 250/451 (55%), Positives = 303/451 (67%), Gaps = 39/451 (8%)
Query: 40 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
K ESP+SRPR N + KDGTPKKQKQCNC++SRCLKL
Sbjct: 90 KPESPKSRPRLNAELKDGTPKKQKQCNCKHSRCLKL------------------------ 125
Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFA+G YCDGCNCVNC+NNVE+E AR+EAV ATLERNPNAFRPKIASSPHG
Sbjct: 126 -----YCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGT 180
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+D+RE+ E + KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+
Sbjct: 181 RDSREETGEGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQ 240
Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
ALFHGDH N +A++Q+AANAAI GA+GSSGY +L S KRK +EL FG KD S R
Sbjct: 241 ALFHGDHGNNMAYIQQAANAAITGAIGSSGYASLPVSRKRKGQELFFGQTVKDPSFHRLG 300
Query: 279 QSQQGNNVQNSAVPCS--QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLV 336
QQ ++++ A P S S PV++ N + LGS+K RS LADI+Q QD+KELCS+LV
Sbjct: 301 HFQQASHIR-PAAPSSPLPSNPVARAGNTTTLGSSKITYRSLLADIIQSQDLKELCSVLV 359
Query: 337 MVSSEATKTLTKKVGAMQTEREGSRSS-VASPIQEGEDSRKGHEVHNGIPDDRQSGNQMD 395
++S EA KTL+ + +M+ E R + +AS QE S+K + + +D S N
Sbjct: 360 VLSGEAAKTLSDQRNSMKKRVEDQRDTFLASSTQERLQSQKEIDADKIVSNDCSSANHE- 418
Query: 396 VDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASK---STVQKSNAY 452
G S DG DM RP+SPGT ALMCDE+D+ FM A GL S+ Q
Sbjct: 419 -GGLDDSSLDGVDMPKGRPMSPGTMALMCDEQDTIFMAAASPNGLMGHGCDSSSQLPCGQ 477
Query: 453 ASTRIYAEQERLILTRFRDFLNRLITCGSIK 483
T +AEQER++L +FRD LNRLIT G IK
Sbjct: 478 GMTEAHAEQERIVLMKFRDCLNRLITFGEIK 508
>gi|224143568|ref|XP_002325000.1| predicted protein [Populus trichocarpa]
gi|222866434|gb|EEF03565.1| predicted protein [Populus trichocarpa]
Length = 545
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 248/468 (52%), Positives = 301/468 (64%), Gaps = 39/468 (8%)
Query: 33 LPTLPPG--KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSP 90
L LPP K ESP+S P NT+ KDGTPKKQ+QCNC++SRCLKL
Sbjct: 102 LRLLPPMFRKPESPKSIPIPNTELKDGTPKKQRQCNCKHSRCLKL--------------- 146
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
YCECFA+G YCDGCNCVNC+NNVE+E AR+EAV ATLERNPNAFRP
Sbjct: 147 --------------YCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRP 192
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KIASSPHG +D+RE+ + + KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCK
Sbjct: 193 KIASSPHGTRDSREETGDGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCK 252
Query: 211 NFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAA 269
NFEGSEER+ALFHGDH N +A++Q+AANAAI GA+GSSGY + S KRK +EL FG
Sbjct: 253 NFEGSEERQALFHGDHGNNMAYIQQAANAAITGAIGSSGYASPPVSRKRKGQELFFGHTV 312
Query: 270 KDQSVIRNPQSQQGNNVQN---SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQ 326
KDQS R QQ N + S P+++ NA LG +K RS LADI+QPQ
Sbjct: 313 KDQSFDRLGHFQQVNGSHTRPPAPSSSLPSNPIARAGNAITLGPSKITYRSLLADIIQPQ 372
Query: 327 DMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSS-VASPIQEGEDSRKGHEVHNGIP 385
D+KELCS+LV++S EA KT + + +M+ E R + +AS QE S K + +
Sbjct: 373 DLKELCSVLVVLSGEAAKTFSDQRNSMEKRVEDQRETLLASSTQERLQSHKESDADKIVA 432
Query: 386 DDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLAS--- 442
+D S N D G S SDG DM RP+SPGT LMCDE+D+ M A +GL
Sbjct: 433 NDCSSANHADKVGPDDSSSDGADMPKGRPMSPGTLELMCDEQDTMLMAAASPSGLMGHGC 492
Query: 443 KSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPL 490
++ Q +AEQER++LT+FRD LNRLIT G IK P
Sbjct: 493 NTSSQLPCGQGMAEAHAEQERIVLTKFRDCLNRLITFGEIKGKFLFPF 540
>gi|449449629|ref|XP_004142567.1| PREDICTED: protein tesmin/TSO1-like CXC 5-like [Cucumis sativus]
Length = 602
Score = 457 bits (1175), Expect = e-126, Method: Compositional matrix adjust.
Identities = 257/537 (47%), Positives = 329/537 (61%), Gaps = 57/537 (10%)
Query: 30 KLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQC---NCRNSRCLKLDFVEIWINIYS 86
K PLP P K +SP+SR R+N + KD TPKKQKQC NC++SRCLKL
Sbjct: 94 KAPLP--PARKPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKL----------- 140
Query: 87 NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
YCECFA+G+YCDGCNC NCHNNVEHE +R+EAV TLERNPN
Sbjct: 141 ------------------YCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPN 182
Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
AFRPKIA+SPHG +++R++ E + KHNKGCHCKKSGCLKKYCECFQANILCSENC+C
Sbjct: 183 AFRPKIANSPHGTRESRDEIGELVMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKC 242
Query: 207 LDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
+DCKNFEGSEER+ALFHGDH N IA++Q+AANAAI GA+GSSGY L S KRK EL F
Sbjct: 243 MDCKNFEGSEERQALFHGDHANNIAYIQQAANAAITGAIGSSGYACLPVSKKRKGPELGF 302
Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
G KD + PQ Q NN+ +S + S P + + G +K + RS LAD++QP
Sbjct: 303 GPVGKDSPLNSIPQLHQANNIMSSGMSTSSPFPAAHVGSPVASGPSKFSFRSLLADLIQP 362
Query: 326 QDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIP 385
D+KELCS+LV+++SE K +Q E+ + P Q S G V +
Sbjct: 363 NDLKELCSVLVVLTSEVAK--------IQAEQRSDEKQINDPQQTSRASSSGDGVQHQKA 414
Query: 386 DDRQ------SGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTG 439
+++ S NQ D S S+ DM RP+SPGT ALMCDE+D+ FM AG G
Sbjct: 415 EEKAADGECGSSNQSDRSVHDSSNSNSSDMTRARPMSPGTLALMCDEQDTMFMGAGLADG 474
Query: 440 LASKSTVQKSNA--YASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRS 497
A+ S+ + + +Y EQER++LT+FRD LN+LIT G IKET + +SE +
Sbjct: 475 SAAHDCNTSSHVPDRSLSEVYIEQERIVLTKFRDCLNKLITLGEIKETKLTSRSESEVGN 534
Query: 498 QQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVSVTLEPSCTTTSVSSRQNNDLPMRV 554
Q ++ + TS SG + R A SA + P+CT + NNDLP+++
Sbjct: 535 QNQS-NNFTSNSGCQQRSISNGVVKNVALSAPRIP--PACT---AAHHPNNDLPLKI 585
>gi|357500623|ref|XP_003620600.1| Lin-54-like protein [Medicago truncatula]
gi|357500779|ref|XP_003620678.1| Lin-54-like protein [Medicago truncatula]
gi|355495615|gb|AES76818.1| Lin-54-like protein [Medicago truncatula]
gi|355495693|gb|AES76896.1| Lin-54-like protein [Medicago truncatula]
Length = 629
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 270/562 (48%), Positives = 342/562 (60%), Gaps = 70/562 (12%)
Query: 6 QAMPQWQARPQHVRMVNRVPHPVHKLPLP-----------TLPPGKQESPRSRPRANTDG 54
Q+ PQ Q Q MV + P V LP+P ++ GKQESP+ R R N +
Sbjct: 61 QSKPQQQVVTQGSVMVQK-PVGVGGLPMPVPAQVQTLQHSSVRVGKQESPKPRSRPNFEV 119
Query: 55 KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
K+GTPKKQ+QCNC++S+CLKL YCECFA+GIY
Sbjct: 120 KEGTPKKQRQCNCKHSKCLKL-----------------------------YCECFASGIY 150
Query: 115 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR---------ED 165
CDGCNCVNC NNV++E AR+EAV ATLERNPNAFRPKIASSP GA+D+R E+
Sbjct: 151 CDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRPKIASSPQGARDSRVSRSSILWQEE 210
Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 225
A E + KH+KGCHCKKSGCLKKYCECFQAN+LCSENCRC+DCKNFEGSEER+ALF GD
Sbjct: 211 AGEGLILIKHHKGCHCKKSGCLKKYCECFQANVLCSENCRCMDCKNFEGSEERQALFRGD 270
Query: 226 HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQ------ 279
N ++Q+AANAAI GA+GS G+ + S KRK +EL AKD S+ + Q
Sbjct: 271 QNNNVYLQQAANAAITGAIGSYGFSSPPASKKRKGQELFLWPTAKDPSISKPGQQVKFAY 330
Query: 280 --SQQG---NNVQNSAVPCSQSVPVSQTANASVLGSTKSAL--RSPLADILQPQDMKELC 332
S+ G N V+ A S S S LG + S L RS L+D++QP +KELC
Sbjct: 331 CTSRTGTGVNLVKGPAPSSSASPVSSARGTNPTLGQSPSKLKYRSLLSDVVQPHHLKELC 390
Query: 333 SLLVMVSSEATKTLT--KKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQS 390
S+LV+VS +A KTL KK +TE + + +S+AS QE S+K +V + DD S
Sbjct: 391 SVLVLVSGQAAKTLADQKKTVEKRTEDQ-TETSLASSTQEQLLSQKEVDVEKAMDDDCSS 449
Query: 391 GNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGP---QTGLASKSTVQ 447
NQ D S SDG D + +RP+SPGT ALMCDE+DS FM A QT A ++ Q
Sbjct: 450 ANQTDKISPGNSCSDGAD-VPKRPMSPGTLALMCDEQDSMFMTAASPIGQTTHACNTSSQ 508
Query: 448 KSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAVEDGTS 507
+ T +YAEQER++LT+FRDFLNR+IT G I ET CS L +SE +++ + + T
Sbjct: 509 FPDGQGVTEVYAEQERIVLTQFRDFLNRVITMGEINETKCSSLARSELENKKDLINNETG 568
Query: 508 RSGTEVRGYMEPCGYGSAKSAV 529
+ TE E G AK+A+
Sbjct: 569 NASTETVHQQEATSNGDAKAAI 590
>gi|356523755|ref|XP_003530500.1| PREDICTED: uncharacterized protein LOC100777327 [Glycine max]
Length = 551
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 259/467 (55%), Positives = 307/467 (65%), Gaps = 37/467 (7%)
Query: 39 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
GK ESP+SRPR + K+ TPKKQKQCNC++S+CLKL
Sbjct: 71 GKPESPKSRPRPGFEIKEATPKKQKQCNCKHSKCLKL----------------------- 107
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YCECFA+GIYCDGCNCVNC NNVE+E AR+EAV ATLERNPNAFRPKIASSPH
Sbjct: 108 ------YCECFASGIYCDGCNCVNCCNNVENEAARREAVEATLERNPNAFRPKIASSPHR 161
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
+D REDA E + KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER
Sbjct: 162 TRDIREDAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEER 221
Query: 219 RALFHGD-HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 277
+ALFHGD +N + ++Q+AANAAI GA+GSSGY + S KRK +ELLFG KD SV R
Sbjct: 222 QALFHGDQNNNLVYIQQAANAAITGAIGSSGYSSPPISKKRKGQELLFGPTVKDPSVGR- 280
Query: 278 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVM 337
Q QQ NNV+ A S S + LG +K RS LADI+QPQ +KELCS+LV+
Sbjct: 281 -QGQQANNVRVPAPSSSLSPIPGARVGPATLGPSKLMYRSLLADIIQPQHLKELCSVLVL 339
Query: 338 VSSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHEVHNGIPDDR-QSGNQMD 395
VS +A KT T + M E + +S AS QE S+K V + DD S NQ D
Sbjct: 340 VSGQAAKTFTDQKNLMDKHAEDQTETSRASSTQEQLPSQKEANVDKAMADDDCSSANQTD 399
Query: 396 VDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGL---ASKSTVQKSNAY 452
S SDG D RP+SPGT ALMCDE+D+ FM A G A +++Q
Sbjct: 400 NISLDNSCSDGADAPKGRPMSPGTLALMCDEQDTMFMTAASPVGSRAHACNTSLQPPYGQ 459
Query: 453 ASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQ 499
T +YAEQER++LT+FRDFLNR+IT G I ET CS L +SE SQ+
Sbjct: 460 VMTEVYAEQERIVLTKFRDFLNRVITMGEINETKCSSLARSELESQK 506
>gi|449519509|ref|XP_004166777.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 5-like
[Cucumis sativus]
Length = 556
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 237/468 (50%), Positives = 296/468 (63%), Gaps = 51/468 (10%)
Query: 30 KLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQC---NCRNSRCLKLDFVEIWINIYS 86
K PLP P K +SP+SR R+N + KD TPKKQKQC NC++SRCLKL
Sbjct: 94 KAPLP--PARKPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKL----------- 140
Query: 87 NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
YCECFA+G+YCDGCNC NCHNNVEHE +R+EAV TLERNPN
Sbjct: 141 ------------------YCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPN 182
Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
AFRPK A+SPHG +++R++ E + KHNKGCHCKKSGCLKKYCECFQANILCSENC+C
Sbjct: 183 AFRPKXANSPHGTRESRDEIGELVMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKC 242
Query: 207 LDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
+DCKNFEGSEER+ALFHGDH N IA++Q+AANAAI GA+GSSGY L S KRK EL F
Sbjct: 243 MDCKNFEGSEERQALFHGDHANNIAYIQQAANAAITGAIGSSGYACLPVSKKRKGPELGF 302
Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
G KD + PQ Q NN+ +S + S P + + G +K + RS LAD++QP
Sbjct: 303 GPVGKDSPLNSIPQLHQANNIMSSGMSTSSPFPAAHVGSPVASGPSKFSFRSLLADLIQP 362
Query: 326 QDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIP 385
D+KELCS+LV+++SE K +Q E+ + P Q S G V +
Sbjct: 363 NDLKELCSVLVVLTSEVAK--------IQAEQRSDEKQINDPQQTSRASSSGDGVQHQKA 414
Query: 386 DDRQ------SGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTG 439
+++ S NQ D S S+ DM RP+SPGT ALMCDE+D+ FM AG G
Sbjct: 415 EEKAADGECGSSNQSDRSVHDSSNSNSSDMTRARPMSPGTLALMCDEQDTMFMGAGLADG 474
Query: 440 LASKSTVQKSNA--YASTRIYAEQERLILTRFRDFLNRLITCGSIKET 485
A+ S+ + + +Y EQER++LT+FRD LN+LIT G IKET
Sbjct: 475 SAAHDCNTSSHVPDRSLSEVYIEQERIVLTKFRDCLNKLITLGEIKET 522
>gi|356510007|ref|XP_003523732.1| PREDICTED: uncharacterized protein LOC100811459 [Glycine max]
Length = 515
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 241/480 (50%), Positives = 303/480 (63%), Gaps = 51/480 (10%)
Query: 11 WQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNS 70
+ RP+ V V R+PHPV K P+ L K ESPRS+PR N + KD TPKKQKQCNC+NS
Sbjct: 79 YSPRPKLVSPVRRIPHPVVKFPVKVLQVVKPESPRSQPRCNVELKDTTPKKQKQCNCKNS 138
Query: 71 RCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHE 130
RCLKL YCECFAAGIYCDGCNCVNCHNNV++E
Sbjct: 139 RCLKL-----------------------------YCECFAAGIYCDGCNCVNCHNNVDNE 169
Query: 131 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 190
ARQEAVG TLERNPNAFRPKIASSP +D++E E ++ KHNKGCHCKKSGCLKKY
Sbjct: 170 AARQEAVGITLERNPNAFRPKIASSPQEQRDSKE--CEIKVIGKHNKGCHCKKSGCLKKY 227
Query: 191 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYG 250
CECFQANILCSENC+C+DCKNFEGS+ER A+FH D+N + M++AANA I GAVGSSGYG
Sbjct: 228 CECFQANILCSENCKCMDCKNFEGSDERIAIFHKDYN-LVHMKQAANATISGAVGSSGYG 286
Query: 251 TLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGS 310
T +T KRK++E+ G +A DQ+V Q Q + S+ +S +N + GS
Sbjct: 287 THITPKKRKNQEMFPGKSAMDQTVNMTAQYLQEIDPMASSPSSLSDSFISDPSNTRISGS 346
Query: 311 TKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAM--QTEREGSRSSVASPI 368
++S RS LAD+L+P ++K LCSL V++S A KT + G + QT+ +SVAS
Sbjct: 347 SRSTYRSVLADVLKPNNVKNLCSLFVVLSRVAAKTNAEMRGKVDQQTKTGNFEASVASSA 406
Query: 369 QEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEED 428
+ +++R +VH DD + ++ + + RPLSP T ALMCDE+D
Sbjct: 407 KSLQETR---DVHQPACDDHVNKDEAEAVDVAHYN---------RPLSPETLALMCDEQD 454
Query: 429 SAFMQAGPQTGLASKST----VQKS-NAYASTRIYAEQERLILTRFRDFLNRLITCGSIK 483
G+A ST +QKS N+ T +Y EQERLILT+F D L L+T GSIK
Sbjct: 455 DMLFGNCSADGVACNSTFQNMIQKSFNSDGCTDVYREQERLILTKFLDILRGLVTHGSIK 514
>gi|15235524|ref|NP_194629.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75213581|sp|Q9SZD1.1|TCX5_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 5; Short=AtTCX5
gi|4972046|emb|CAB43914.1| putative transcription factor [Arabidopsis thaliana]
gi|7269798|emb|CAB79658.1| putative transcription factor [Arabidopsis thaliana]
gi|26452336|dbj|BAC43254.1| putative transcription factor [Arabidopsis thaliana]
gi|29028974|gb|AAO64866.1| At4g29000 [Arabidopsis thaliana]
gi|332660171|gb|AEE85571.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 603
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 241/485 (49%), Positives = 300/485 (61%), Gaps = 51/485 (10%)
Query: 42 ESPRSR-PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
ESP++R PR N +G+DGTP+K+KQCNC++SRCLKL
Sbjct: 110 ESPKARGPRPNVEGRDGTPQKKKQCNCKHSRCLKL------------------------- 144
Query: 101 NLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
YCECFA+G YCDGCNCVNC NNV++E AR+EAV ATLERNP AFRPKIASSPHG +
Sbjct: 145 ----YCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRPKIASSPHGGR 200
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
D RED E L KHNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSEER+A
Sbjct: 201 DKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEERQA 260
Query: 221 LFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN-P 278
LFHG+H N +A++Q+AANAAI GAVGSSG+ +RK +E+LF A KD S + + P
Sbjct: 261 LFHGEHSNHMAYLQQAANAAITGAVGSSGFAPSPAPKRRKGQEILFNQAIKDSSRLSHFP 320
Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
Q G + PVS+ + +K RS LADI+QP D++ LCS+LV V
Sbjct: 321 QVNNGRT--GGPTSGTSPSPVSRAGGNASSVPSKFVYRSLLADIIQPHDVRALCSVLVTV 378
Query: 339 SSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVD 397
+ EA KT T K ++ + + +S+AS Q D +G+ + NQ D
Sbjct: 379 AGEAAKTSTDKRNEIENRVDDQTETSLASSAQ---DQPQGNNNAADVEMVATDHNQADKS 435
Query: 398 GTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRI 457
G S SDG D PLSP T ALMCDE+D+ FM A P + + N+ + I
Sbjct: 436 GPEESNSDGVDASKVTPLSPATLALMCDEQDTIFMVAAPSPNGSVDPNGCRPNSQGQSEI 495
Query: 458 YAEQERLILTRFRDFLNRLITCGSIKETMCSPL------------VKSETRSQQKA-VED 504
YAEQERL+LT+FRD LNRLI+ IKE+ C L VK+E QQ+ + +
Sbjct: 496 YAEQERLVLTKFRDCLNRLISYAEIKESKCLSLARMHIQPPAIATVKTENGIQQQVPIVN 555
Query: 505 GTSRS 509
G SR+
Sbjct: 556 GASRT 560
>gi|297803150|ref|XP_002869459.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
lyrata]
gi|297315295|gb|EFH45718.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
lyrata]
Length = 601
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 246/490 (50%), Positives = 303/490 (61%), Gaps = 51/490 (10%)
Query: 40 KQESPRSR-PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
K ESP++R PR +G+DGTP+K+KQCNC++SRCLKL
Sbjct: 104 KPESPKARGPRPIVEGRDGTPQKKKQCNCKHSRCLKL----------------------- 140
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YCECFA+G YCDGCNCVNC NNV++E AR+EAV ATLERNP AFRPKIASSPHG
Sbjct: 141 ------YCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRPKIASSPHG 194
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
+D RED E L KHNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSEER
Sbjct: 195 VRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEER 254
Query: 219 RALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVI-R 276
+ALFHG+H N +A++Q+AANAAI GAVGSSG+ +RK +E+LF A KD S + +
Sbjct: 255 QALFHGEHANHMAYLQQAANAAITGAVGSSGFAPSPAPKRRKGQEILFNQATKDSSRLGQ 314
Query: 277 NPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLV 336
PQ G S S NAS S K RS LADI+QP D++ LCS+LV
Sbjct: 315 FPQVNSG-RASGPTSGSSPSPVSRAGGNASSAPS-KFVYRSLLADIIQPHDVRALCSVLV 372
Query: 337 MVSSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMD 395
V+ EA KT T K ++ E + +S+AS Q D +G + GNQ D
Sbjct: 373 AVAGEAAKTSTDKRNEIENRVEDQTETSLASSAQ---DQPQGDNDAADMEMVATDGNQAD 429
Query: 396 VDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYAST 455
G S SDG D PLSP T ALMCDE+D+ FM A P A +++N+ +
Sbjct: 430 KSGAEESNSDGADASKGNPLSPATLALMCDEQDTIFMVAAPSPNGAVDPGGRRTNSQGQS 489
Query: 456 RIYAEQERLILTRFRDFLNRLITCGSIKETMCSPL------------VKSETRSQQKA-V 502
IYAEQERL+LT+FRD L+RLI+ IKE+ C L VK+E QQ+ +
Sbjct: 490 EIYAEQERLVLTKFRDCLSRLISYAEIKESKCLSLARMHIQPSATATVKTENGVQQQVPI 549
Query: 503 EDGTSRSGTE 512
+G SR+ ++
Sbjct: 550 VNGASRTNSQ 559
>gi|356518324|ref|XP_003527829.1| PREDICTED: uncharacterized protein LOC100812944 [Glycine max]
Length = 656
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 238/491 (48%), Positives = 302/491 (61%), Gaps = 53/491 (10%)
Query: 31 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSP 90
L + K +SPRS+P+ N + KD TPKKQKQCNC+NSRCLKL
Sbjct: 170 LNFSAIKIVKPKSPRSQPQCNVELKDTTPKKQKQCNCKNSRCLKL--------------- 214
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
YCECFAAGIYCDGCNCVNCHNNV++E ARQEAVG TLERNPNAFRP
Sbjct: 215 --------------YCECFAAGIYCDGCNCVNCHNNVDNEAARQEAVGITLERNPNAFRP 260
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KIASSP +D++E E Q+ KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCK
Sbjct: 261 KIASSPLERRDSKE--CEIQVIGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCK 318
Query: 211 NFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAK 270
NFEGS+ERRA+FH D+N + +++AANA I GSSGYGT +T KRK E+ G +A
Sbjct: 319 NFEGSDERRAIFHKDYN-LVHIKQAANATISRVAGSSGYGTHLTPKKRKIHEMFPGKSAM 377
Query: 271 DQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKE 330
DQ+ Q QQ + S+ VS +N + S++S RS LAD QPQ++K
Sbjct: 378 DQTDNITAQYQQEIDPMASSPSFLSDSFVSDPSNTRISASSRSTYRSVLADFFQPQNVKN 437
Query: 331 LCSLLVMVSSEATKTLTKKVGAM--QTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDR 388
LCSL V++S A KT + +G + QT+ +SVAS + +++R +VH + DD
Sbjct: 438 LCSLFVVLSQVAAKTNAEMIGKVDQQTKTGNFEASVASSSKSLQETR---DVHQLVCDDH 494
Query: 389 QSGNQMDVDGTSGSGSDGGDMLN-ERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQ 447
+ ++ +D D+ N RPLSP T ALMCDE+D G+A ST+Q
Sbjct: 495 VNKDE----------ADAVDIANYNRPLSPETLALMCDEQDDMIFGNSSANGVACNSTLQ 544
Query: 448 K-----SNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAV 502
SN+ T + EQERLILT+F D L L+T GSIKETMCS K RS+++
Sbjct: 545 NMIQKSSNSDGCTDVCREQERLILTKFLDVLRGLVTHGSIKETMCSSSTKKGERSKKEPA 604
Query: 503 EDGTSRSGTEV 513
++ + T V
Sbjct: 605 DNANIGAETNV 615
>gi|15225302|ref|NP_179601.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75266022|sp|Q9SL70.1|TCX6_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 6; Short=AtTCX6
gi|4580462|gb|AAD24386.1| hypothetical protein [Arabidopsis thaliana]
gi|225898124|dbj|BAH30394.1| hypothetical protein [Arabidopsis thaliana]
gi|330251873|gb|AEC06967.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 571
Score = 394 bits (1011), Expect = e-106, Method: Compositional matrix adjust.
Identities = 217/450 (48%), Positives = 282/450 (62%), Gaps = 66/450 (14%)
Query: 40 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
+ ESP S PR + +DGTP+K+KQCNC++SRCLKL
Sbjct: 96 RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 131
Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG
Sbjct: 132 -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 186
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+D RE+ + + A+HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R+
Sbjct: 187 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQ 246
Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
+LFHG+H + +A++Q ANAAI GA+GSSG+ + +RK +E+ F KD S R
Sbjct: 247 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 305
Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
Q+ G + S S+ + LG +K +S LA+I++P D+K LCS+LV V
Sbjct: 306 QANNGRT--------TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAV 357
Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
+ EA KTLT+K A Q E +SVAS +Q+ +GH N+ + G
Sbjct: 358 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 396
Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
S +DG R LSP T ALMCDE D+ M A + + T Q N ++Y
Sbjct: 397 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 450
Query: 459 AEQERLILTRFRDFLNRLITCGSIKETMCS 488
AEQE+++LT+FRD LNR+I+CG +KE+ CS
Sbjct: 451 AEQEKVVLTKFRDCLNRIISCGEVKESNCS 480
>gi|26984067|gb|AAN85197.1| hypothetical protein [Arabidopsis thaliana]
Length = 536
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 224/485 (46%), Positives = 295/485 (60%), Gaps = 77/485 (15%)
Query: 40 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
+ ESP S PR + +DGTP+K+KQCNC++SRCLKL
Sbjct: 61 RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 96
Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG
Sbjct: 97 -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 151
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+D RE+ + + A+HNKGCHCKKSGCLKKYCECFQANILCSENC CLDCKNFEGSE R+
Sbjct: 152 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQ 211
Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
+LFHG+H + +A++Q ANAAI GA+GSSG+ + +RK +E+ F KD S R
Sbjct: 212 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 270
Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
Q+ G + S S+ + LG +K +S LA+I++P D+K LCS+LV V
Sbjct: 271 QANNGRT--------TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAV 322
Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
+ EA KTLT+K A Q E +SVAS +Q+ +GH N+ + G
Sbjct: 323 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 361
Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
S +DG R LSP T ALMCDE D+ M A + + T Q N ++Y
Sbjct: 362 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 415
Query: 459 AEQERLILTRFRDFLNRLITCGSIKETMCS--------PL---VKSETRSQQKAVEDGTS 507
AEQE+++LT+FRD LNR+I+CG +KE+ CS P+ V+ + QQ V +G S
Sbjct: 416 AEQEKVVLTKFRDCLNRIISCGEVKESNCSMSRMDLDTPVQTTVRIDPVVQQAPVANGVS 475
Query: 508 RSGTE 512
++ +
Sbjct: 476 QTAKQ 480
>gi|297836236|ref|XP_002886000.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297331840|gb|EFH62259.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 568
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 223/474 (47%), Positives = 291/474 (61%), Gaps = 74/474 (15%)
Query: 53 DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAG 112
+ +DGTP+K+KQCNC++SRCLKL YCECFA+G
Sbjct: 107 EARDGTPQKKKQCNCKHSRCLKL-----------------------------YCECFASG 137
Query: 113 IYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 172
YCDGCNCVNCHNNVE+E R+EA+G+TLERNPNAFRPKIA+SPHG +D RE+ E L
Sbjct: 138 SYCDGCNCVNCHNNVENEPERREAIGSTLERNPNAFRPKIAASPHGGRDNREEVGEVVLL 197
Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAF 231
+HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R++LFHG+H + +A+
Sbjct: 198 GRHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQSLFHGEHSHNVAY 257
Query: 232 MQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV 291
+Q ANAAI GA+GSSG+ + +RK +E+ F KD S R Q+ G
Sbjct: 258 LQH-ANAAITGAIGSSGFASAPAPKRRKGQEIFFNQGPKDSSTHRLGQANSGRTT----- 311
Query: 292 PCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVG 351
S NAS L +K RS LADI++P+D+K LCS+LV V+ EA KTLT++
Sbjct: 312 --SPKTGSHAGGNAS-LRPSKVVYRSLLADIIKPKDVKALCSVLVSVAGEAAKTLTEERL 368
Query: 352 AMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDR-QSGNQMDVDGTSGSGSDGGDML 410
A Q E +SVAS +Q+ +GH + G ++ SGNQ + G GS
Sbjct: 369 ADQKE-----TSVASSVQD-----QGHSNNKGTEGEKAASGNQAEKSGPDGSKG------ 412
Query: 411 NERPLSPGTRALMCDEEDSAFM-QAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRF 469
R LSP T ALMCDE D+ M A P + S + ++YAEQE+++LT+F
Sbjct: 413 --RSLSPETLALMCDERDTMLMVAASPNCSVEPPSQLPN----GQDQVYAEQEKVVLTKF 466
Query: 470 RDFLNRLITCGSIKETMC--------SPL---VKSETRSQQKAVEDGTSRSGTE 512
RD LNR+I+CG +KE+ C +P+ VK + Q+ V +G S++ +
Sbjct: 467 RDCLNRIISCGEVKESNCLMSRMDLDTPVHTAVKIDPVVDQRPVANGVSQTAKQ 520
>gi|30680884|ref|NP_849995.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|330251872|gb|AEC06966.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 578
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 214/445 (48%), Positives = 278/445 (62%), Gaps = 66/445 (14%)
Query: 40 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
+ ESP S PR + +DGTP+K+KQCNC++SRCLKL
Sbjct: 96 RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 131
Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG
Sbjct: 132 -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 186
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+D RE+ + + A+HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R+
Sbjct: 187 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQ 246
Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
+LFHG+H + +A++Q ANAAI GA+GSSG+ + +RK +E+ F KD S R
Sbjct: 247 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 305
Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
Q+ G + S S+ + LG +K +S LA+I++P D+K LCS+LV V
Sbjct: 306 QANNGRT--------TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAV 357
Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
+ EA KTLT+K A Q E +SVAS +Q+ +GH N+ + G
Sbjct: 358 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 396
Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
S +DG R LSP T ALMCDE D+ M A + + T Q N ++Y
Sbjct: 397 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 450
Query: 459 AEQERLILTRFRDFLNRLITCGSIK 483
AEQE+++LT+FRD LNR+I+CG +K
Sbjct: 451 AEQEKVVLTKFRDCLNRIISCGEVK 475
>gi|26984071|gb|AAN85199.1| hypothetical protein [Arabidopsis thaliana]
Length = 532
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 224/485 (46%), Positives = 294/485 (60%), Gaps = 81/485 (16%)
Query: 40 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
+ ESP S PR + +DGTP+K+KQCNC++SRCLKL
Sbjct: 61 RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 96
Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG
Sbjct: 97 -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 151
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+D RE+ + + A+HNKGCHCKKSGCLKKYCECFQANILCSENC CLDCKNFEGSE R+
Sbjct: 152 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQ 211
Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
+LFHG+H + +A++Q ANAAI GA+GSSG+ + +RK +E+ F KD S R
Sbjct: 212 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 270
Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
Q+ G + S S+ + LG +KS LA+I++P D+K LCS+LV V
Sbjct: 271 QANNGRT--------TSSQTGSRAGGNASLGPSKSL----LANIIKPMDVKALCSVLVAV 318
Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
+ EA KTLT+K A Q E +SVAS +Q+ +GH N+ + G
Sbjct: 319 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 357
Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
S +DG R LSP T ALMCDE D+ M A + + T Q N ++Y
Sbjct: 358 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 411
Query: 459 AEQERLILTRFRDFLNRLITCGSIKETMCS--------PL---VKSETRSQQKAVEDGTS 507
AEQE+++LT+FRD LNR+I+CG +KE+ CS P+ V+ + QQ V +G S
Sbjct: 412 AEQEKVVLTKFRDCLNRIISCGEVKESNCSMSRMDLDTPVQTTVRIDPVVQQAPVANGVS 471
Query: 508 RSGTE 512
++ +
Sbjct: 472 QTAKQ 476
>gi|26984069|gb|AAN85198.1| hypothetical protein [Arabidopsis thaliana]
Length = 543
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 214/445 (48%), Positives = 277/445 (62%), Gaps = 66/445 (14%)
Query: 40 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
+ ESP S PR + +DGTP+K+KQCNC++SRCLKL
Sbjct: 61 RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 96
Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG
Sbjct: 97 -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 151
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+D RE+ + + A+HNKGCHCKKSGCLKKYCECFQANILCSENC CLDCKNFEGSE R+
Sbjct: 152 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQ 211
Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
+LFHG+H + +A++Q ANAAI GA+GSSG+ + +RK +E+ F KD S R
Sbjct: 212 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 270
Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
Q+ G + S S+ + LG +K +S LA+I++P D+K LCS+LV V
Sbjct: 271 QANNGRT--------TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAV 322
Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
+ EA KTLT+K A Q E +SVAS +Q+ +GH N+ + G
Sbjct: 323 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 361
Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
S +DG R LSP T ALMCDE D+ M A + + T Q N ++Y
Sbjct: 362 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 415
Query: 459 AEQERLILTRFRDFLNRLITCGSIK 483
AEQE+++LT+FRD LNR+I+CG +K
Sbjct: 416 AEQEKVVLTKFRDCLNRIISCGEVK 440
>gi|147817113|emb|CAN75367.1| hypothetical protein VITISV_008552 [Vitis vinifera]
Length = 628
Score = 384 bits (985), Expect = e-103, Method: Compositional matrix adjust.
Identities = 209/378 (55%), Positives = 259/378 (68%), Gaps = 12/378 (3%)
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
E+A + L KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH
Sbjct: 215 EEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFH 274
Query: 224 GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQG 283
+ N +A++Q+AANAAI GA+GSSGYGT + S KRK +E+ F A +QS RN Q QQ
Sbjct: 275 DNGNYMAYIQQAANAAISGAIGSSGYGTAVASKKRKQQEVSFCTTANNQSNHRNTQFQQD 334
Query: 284 NNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEAT 343
N+++ SAV S S+ V+ T NA+VLGS+K RSPLADILQPQD+K+LCSLLV+VS EA
Sbjct: 335 NHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQPQDVKDLCSLLVVVSQEAG 394
Query: 344 KTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSG 403
KTLT + ER+ +S+ S E E + +V +PD S NQ V+
Sbjct: 395 KTLT------EAERDHIETSIISSTHEREHLQTD-DVQKPLPDACLSENQ--VERVQTDE 445
Query: 404 SDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKS---TVQKSNAYASTRIYAE 460
D D+ N R +SPGT ALMCDE+DS FM G G+ S TV+ S+ A T +Y E
Sbjct: 446 LDQADVQNGRLMSPGTLALMCDEQDSIFMARGSPDGVVGSSQIPTVKSSHGQACTEVYVE 505
Query: 461 QERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAVEDGTSRSGTEVRGYMEPC 520
QERL+LT+F DFLNRLITCGSIKETMCSPL KSET +QQ+ +ED ++ T + EP
Sbjct: 506 QERLVLTKFWDFLNRLITCGSIKETMCSPLAKSETGNQQEPLEDDATKVRTGKGNHKEPN 565
Query: 521 GYGSAKSAVSVTLEPSCT 538
G K + + T + S T
Sbjct: 566 GNSIVKPSFAATTQMSQT 583
>gi|218189097|gb|EEC71524.1| hypothetical protein OsI_03825 [Oryza sativa Indica Group]
Length = 576
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 215/475 (45%), Positives = 282/475 (59%), Gaps = 60/475 (12%)
Query: 40 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
K ESP+ PRA +GKDGTP K+K CNC++S+CLKL
Sbjct: 89 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKL----------------------- 124
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YCECFA+G+YCDGCNC NC NNV++E AR+EA+ ATLERNP+AFRPKI SSPH
Sbjct: 125 ------YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHA 178
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
++ E A + L KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER
Sbjct: 179 NRNNMEAAGDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEER 238
Query: 219 RALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
+ALF GDH MQ+AANAA+ GA+G++ + + TS KRK + +AK+ R
Sbjct: 239 KALFQGDHKNSIHMQQAANAAVNGAIGAAAFSSPSTSRKRKHIDPSLDHSAKENGAHRTN 298
Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
Q Q + +P +P++Q + LG K R LADI+QP D+KELC LLV+V
Sbjct: 299 HVPQV-FYQKNGMPSDGPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLVVV 357
Query: 339 SSEATKT----------LTKKVGAMQTEREGS-RSSVASPIQEGEDSRKGHEVHNGIPDD 387
S A K + +K M +REG S+AS + E++ K DD
Sbjct: 358 SGNAAKAYAGKKSQEDKVVEKEDEMGGKREGEIGGSLASTNHDREENNKDPNNQKASADD 417
Query: 388 RQSG----NQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASK 443
+G + ++ + +D D + RP+SPGT ALMCDE+D+ FM S+
Sbjct: 418 PSTGGTRAGKASLEESKHDCND--DQKSNRPMSPGTLALMCDEQDTMFM--------TSQ 467
Query: 444 STVQKSNAYAS---TRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSET 495
+ V + A + + +YAEQER +LT FRD L++L+T G +KE S +KSE
Sbjct: 468 NVVPQQPAPVNQNQSELYAEQERCVLTEFRDCLHKLVTFGRMKEEKFSMAIKSEV 522
>gi|326532502|dbj|BAK05180.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 615
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 220/475 (46%), Positives = 283/475 (59%), Gaps = 48/475 (10%)
Query: 27 PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
PV + P+ P K ESP+ R R DGKDGTP K+K CNCR+S+CLKL
Sbjct: 128 PVQRPPVAV--PLKPESPKPRGRL-YDGKDGTPTKKKCCNCRHSKCLKL----------- 173
Query: 87 NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
YCECFA+GIYCDGCNC NC NNVE+EVAR+EA+ ATLERNP+
Sbjct: 174 ------------------YCECFASGIYCDGCNCTNCFNNVENEVARREAIEATLERNPD 215
Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
AFRPKI SSPH ++ E A + L KHNKGCHCKKSGCLKKYCECFQANILCSENC+C
Sbjct: 216 AFRPKIGSSPHTNRNNMEVAGDLPLIGKHNKGCHCKKSGCLKKYCECFQANILCSENCKC 275
Query: 207 LDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFG 266
DCKNF+GSE+R+ L DH + MQ+AANAA+ GA+G++ + + TS KRK +
Sbjct: 276 RDCKNFDGSEDRKHL-SLDHKNVVHMQQAANAAVNGAIGATAFSSPSTSKKRKHIDSSLD 334
Query: 267 VAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQ 326
++ K+ +RN Q Q Q +A+ S+PV Q+ + ++G K R LADI+QP
Sbjct: 335 LSTKEHLAMRNGQLSQ-VIYQKNAMASDGSLPVGQSVHPHMMGPFKVTYRPLLADIIQPG 393
Query: 327 DMKELCSLLVMVSSEATKTLTKKVGAMQTER-----EGSRSSVASPIQEGEDSRKGHEVH 381
D+KELC LLV VS +A K T + +Q E+ +G+ S+AS + E +
Sbjct: 394 DVKELCKLLVTVSGQAAKAYTGR--KIQEEKVAEKEDGTGGSLASTNHDREGKTQDQNHR 451
Query: 382 NGIPDDRQS-GNQMDVDGTSGSGSD-GGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTG 439
DDR S G M S D D + RP+SPGT ALMCDE+D+ F A Q
Sbjct: 452 KTSIDDRSSVGTDMGKASLDDSRPDCNDDQKSSRPMSPGTLALMCDEQDAMF--ATSQNA 509
Query: 440 LASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSE 494
+A + N + +YAEQER +LT FRD L +L+ G +KE S ++SE
Sbjct: 510 VAPPAVAADQN---RSELYAEQERCVLTEFRDCLRKLVQYGRMKEEKYSMAIRSE 561
>gi|242058765|ref|XP_002458528.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
gi|241930503|gb|EES03648.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
Length = 550
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 233/535 (43%), Positives = 295/535 (55%), Gaps = 108/535 (20%)
Query: 14 RPQHVRMVNRVPHPVHKLPL----PTL--------------PPGKQESPRSRPRANTDGK 55
RP V V VP P H +P+ P L P K ESP+ R R +GK
Sbjct: 48 RPHQV--VGHVPLPRHAVPMAVPVPQLRPMPPQPVQRPPVAVPLKPESPKPRARL-YEGK 104
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
D TP K+K CNCRNSRCLKL YCECFA+G +C
Sbjct: 105 DSTPTKKKCCNCRNSRCLKL-----------------------------YCECFASGTHC 135
Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
DGCNC NC NN E+EVAR+EA+ ATLERNP+AFRPKI SSPH ++ E + + L KH
Sbjct: 136 DGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIGSSPHTNRN-NEVSSDLPLVGKH 194
Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRA 235
NKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERR LF GDH MQ+A
Sbjct: 195 NKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAINMQQA 254
Query: 236 ANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQ 295
NAA+ GA+G++G+ + TS KRK + + K+ RN Q + Q +AVP
Sbjct: 255 TNAAVNGAIGATGFPSPSTSRKRKHIDPSLDNSNKEHVAHRNYHISQ-SVYQKNAVP-DG 312
Query: 296 SVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEAT------KTLTKK 349
+P+SQ+A+ LG K R LADI+Q +D+KELC LLV+VS EA KT+ ++
Sbjct: 313 PIPISQSAHPPNLGPFKVTYRPLLADIVQEEDIKELCKLLVVVSGEAAKTYAGRKTMEER 372
Query: 350 VGAMQTEREGSRSSVASPIQEGEDSRKG--------HEVHNGIPDDRQSGNQMDVDGTSG 401
VG + ER G + ED R G E +N PD + S +D S
Sbjct: 373 VGEKEDERGGQK----------EDDRAGSLALTNHDREGNNQDPDQKAS-----IDDHSS 417
Query: 402 SGSDGG-------------DMLNERPLSPGTRALMCDEE-------DSAFMQAGPQTGLA 441
G+ G D + RP+SPGT AL+CDEE D+ F + Q +
Sbjct: 418 RGTHTGKAILEESRPNCADDQKSNRPMSPGTLALLCDEEVKMCDEQDTVFTTS--QNAVP 475
Query: 442 SKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPL-VKSET 495
++ N A +YAEQER++LT FRD L + + G KE S + ++SET
Sbjct: 476 QQTITVNQNQSA---LYAEQERVVLTEFRDCLRKFVERGRNKERRYSSMAIESET 527
>gi|115440083|ref|NP_001044321.1| Os01g0761100 [Oryza sativa Japonica Group]
gi|57900356|dbj|BAD87346.1| tesmin-like [Oryza sativa Japonica Group]
gi|113533852|dbj|BAF06235.1| Os01g0761100 [Oryza sativa Japonica Group]
gi|215768567|dbj|BAH00796.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 619
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 213/474 (44%), Positives = 279/474 (58%), Gaps = 63/474 (13%)
Query: 40 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
K ESP+ PRA +GKDGTP K+K CNC++S+CLKL
Sbjct: 137 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKL----------------------- 172
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YCECFA+G+YCDGCNC NC NNV++E AR+EA+ ATLERNP+AFRPKI SSPH
Sbjct: 173 ------YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHA 226
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
++ E A + L KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER
Sbjct: 227 NRNNMEAAGDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEER 286
Query: 219 RALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
+ALF GDH MQ+AANAA+ GA+G++ + + S KRK + +AK+ R
Sbjct: 287 KALFQGDHKNSIHMQQAANAAVNGAIGAAAFSSPSASRKRKHIDPSLDHSAKENGAHRTN 346
Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
Q N +P +P++Q + LG K R LADI+QP D+KELC LLV+V
Sbjct: 347 HVPQKN-----GMPSDGPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLVVV 401
Query: 339 SSEATKT---------LTKKVGAMQTEREGS-RSSVASPIQEGEDSRKGHEVHNGIPDDR 388
S A K + +K M +REG S+AS + E++ DD
Sbjct: 402 SGNAAKAYAGKKSQDKVVEKEDEMGGKREGEIGGSLASTNHDREENNNDPNNQKASADDP 461
Query: 389 QSG----NQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKS 444
+G + ++ + +D D + RP+SPGT ALMCDE+D+ FM S++
Sbjct: 462 STGGTRAGKASLEESKHDCND--DQKSNRPMSPGTLALMCDEQDTMFM--------TSQN 511
Query: 445 TVQKSNAYAS---TRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSET 495
V + A + + +YAEQER +LT FRD L++L+T G +KE S +KSE
Sbjct: 512 VVPQQPAPVNQNQSELYAEQERCVLTEFRDCLHKLVTFGRMKEEKFSMAIKSEV 565
>gi|222619282|gb|EEE55414.1| hypothetical protein OsJ_03534 [Oryza sativa Japonica Group]
Length = 529
Score = 361 bits (926), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 213/474 (44%), Positives = 280/474 (59%), Gaps = 59/474 (12%)
Query: 40 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
K ESP+ PRA +GKDGTP K+K CNC++S+CLKL
Sbjct: 43 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKL----------------------- 78
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YCECFA+G+YCDGCNC NC NNV++E AR+EA+ ATLERNP+AFRPKI SSPH
Sbjct: 79 ------YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHA 132
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
++ E A + L KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER
Sbjct: 133 NRNNMEAAGDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEER 192
Query: 219 RALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
+ALF GDH MQ+AANAA+ GA+G++ + + S KRK + +AK+ R
Sbjct: 193 KALFQGDHKNSIHMQQAANAAVNGAIGAAAFSSPSASRKRKHIDPSLDHSAKENGAHRTN 252
Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
Q Q + +P +P++Q + LG K R LADI+QP D+KELC LLV+V
Sbjct: 253 HVPQV-FYQKNGMPSDGPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLVVV 311
Query: 339 SSEATKT---------LTKKVGAMQTEREGS-RSSVASPIQEGEDSRKGHEVHNGIPDDR 388
S A K + +K M +REG S+AS + E++ DD
Sbjct: 312 SGNAAKAYAGKKSQDKVVEKEDEMGGKREGEIGGSLASTNHDREENNNDPNNQKASADDP 371
Query: 389 QSG----NQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKS 444
+G + ++ + +D D + RP+SPGT ALMCDE+D+ FM S++
Sbjct: 372 STGGTRAGKASLEESKHDCND--DQKSNRPMSPGTLALMCDEQDTMFM--------TSQN 421
Query: 445 TVQKSNAYAS---TRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSET 495
V + A + + +YAEQER +LT FRD L++L+T G +KE S +KSE
Sbjct: 422 VVPQQPAPVNQNQSELYAEQERCVLTEFRDCLHKLVTFGRMKEEKFSMAIKSEV 475
>gi|358344677|ref|XP_003636414.1| Lin-54-like protein [Medicago truncatula]
gi|355502349|gb|AES83552.1| Lin-54-like protein [Medicago truncatula]
Length = 646
Score = 350 bits (897), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 232/576 (40%), Positives = 307/576 (53%), Gaps = 117/576 (20%)
Query: 7 AMPQWQA-RPQHVRMVNRVPHPVHKLPLPTLPP-GKQESPRSRPRANTDGKDGTPKKQKQ 64
MP +Q+ RPQ V V+R+P+ KLP+ + SPRS+ + KD T KKQK+
Sbjct: 90 VMPHFQSPRPQLVSPVHRLPYSTPKLPVKRFQALNESPSPRSQSQNKAGLKDNTLKKQKR 149
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
CNC+NS+CLKL YCEC+AAGIYCDGCNC NCH
Sbjct: 150 CNCKNSKCLKL-----------------------------YCECYAAGIYCDGCNCQNCH 180
Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSP---------------------------- 156
NN+ +E AR+EA+G TLE+NPNAFRPKIASSP
Sbjct: 181 NNLNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVSSSIQLPHVFIVGVLHQSFE 240
Query: 157 ---------HGA--------------QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 193
+GA + +E+ E QL +HNKGCHCKK GCLKKYCEC
Sbjct: 241 LKFGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGRHNKGCHCKK-GCLKKYCEC 299
Query: 194 FQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAV--GSSGYGT 251
F AN+LCSENC+C+DCKNFEGS+ R + + + +++A NAAI GAV G S GT
Sbjct: 300 FHANVLCSENCKCIDCKNFEGSDVWRIVLQ-EECSLVQIRQATNAAINGAVGFGPSISGT 358
Query: 252 LMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGST 311
+T KRK +E G + DQ V Q Q+ + S+ + VS TA + +
Sbjct: 359 HITPKKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPLSLSASFVSDTAYKRI---S 415
Query: 312 KSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVG--AMQTEREGSRSSVASPIQ 369
+ RS LAD+LQ Q++K LCSLLV++S EA KT + G A + + E +S+AS Q
Sbjct: 416 RPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKTNAEMRGKAARKIKTEKYEASIASSSQ 475
Query: 370 EGEDSR---KGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDE 426
+DSR + E H ++ + +D+D ++ RPLSP T LMCDE
Sbjct: 476 LLQDSRNVVRASENH----ANKDVADAVDID------------IHNRPLSPETLKLMCDE 519
Query: 427 EDSAFMQAGPQTGL----ASKSTVQK-SNAYASTRIYAEQERLILTRFRDFLNRLITCGS 481
D F G G+ A ++ +QK SN+ T +YAEQERLILT+FRD L LI GS
Sbjct: 520 LDEMFFGNGSANGVAIDNAYQNMIQKSSNSDGYTAVYAEQERLILTKFRDVLGELIILGS 579
Query: 482 IKETMCSPLVKSETRSQQKAVEDGTSRSGTEVRGYM 517
IKETM S VK + ++ +G SG E +G +
Sbjct: 580 IKETMHSSSVKKDVSIEKTPKNNGD--SGAETKGIL 613
>gi|358344762|ref|XP_003636456.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
gi|355502391|gb|AES83594.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
Length = 649
Score = 343 bits (880), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 231/576 (40%), Positives = 306/576 (53%), Gaps = 114/576 (19%)
Query: 7 AMPQWQA-RPQHVRMVNRVPHPVHKLPLPTLPP-GKQESPRSRPRANTDGKDGTPKKQKQ 64
MP +Q+ RPQ V V+R+P+ KLP+ + SPRS+ + KD T KKQK+
Sbjct: 90 VMPHFQSPRPQLVSPVHRLPYSTPKLPVKRFQALNESPSPRSQSQNKAGLKDNTLKKQKR 149
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
CNC+NS K DF +YCEC+AAGIYCDGCNC NCH
Sbjct: 150 CNCKNS---KSDFAT-----------------------ARYCECYAAGIYCDGCNCQNCH 183
Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSP---------------------------- 156
NN+ +E AR+EA+G TLE+NPNAFRPKIASSP
Sbjct: 184 NNLNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVSSSIQLPHVFIVGVLHQSFE 243
Query: 157 ---------HGA--------------QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 193
+GA + +E+ E QL +HNKGCHCKK GCLKKYCEC
Sbjct: 244 LKFGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGRHNKGCHCKK-GCLKKYCEC 302
Query: 194 FQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAV--GSSGYGT 251
F AN+LCSENC+C+DCKNFEGS+ R + + + +++A NAAI GAV G S GT
Sbjct: 303 FHANVLCSENCKCIDCKNFEGSDVWRIVLQ-EECSLVQIRQATNAAINGAVGFGPSISGT 361
Query: 252 LMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGST 311
+T KRK +E G + DQ V Q Q+ + S+ + VS TA + +
Sbjct: 362 HITPKKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPLSLSASFVSDTAYKRI---S 418
Query: 312 KSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVG--AMQTEREGSRSSVASPIQ 369
+ RS LAD+LQ Q++K LCSLLV++S EA KT + G A + + E +S+AS Q
Sbjct: 419 RPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKTNAEMRGKAARKIKTEKYEASIASSSQ 478
Query: 370 EGEDSR---KGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDE 426
+DSR + E H ++ + +D+D ++ RPLSP T LMCDE
Sbjct: 479 LLQDSRNVVRASENH----ANKDVADAVDID------------IHNRPLSPETLKLMCDE 522
Query: 427 EDSAFMQAGPQTGL----ASKSTVQK-SNAYASTRIYAEQERLILTRFRDFLNRLITCGS 481
D F G G+ A ++ +QK SN+ T +YAEQERLILT+FRD L LI GS
Sbjct: 523 LDEMFFGNGSANGVAIDNAYQNMIQKSSNSDGYTAVYAEQERLILTKFRDVLGELIILGS 582
Query: 482 IKETMCSPLVKSETRSQQKAVEDGTSRSGTEVRGYM 517
IKETM S VK + ++ +G SG E +G +
Sbjct: 583 IKETMHSSSVKKDVSIEKTPKNNGD--SGAETKGIL 616
>gi|293335083|ref|NP_001169566.1| uncharacterized protein LOC100383445 [Zea mays]
gi|224030123|gb|ACN34137.1| unknown [Zea mays]
gi|323433699|gb|ADX60083.1| CPP transcription factor [Zea mays]
Length = 607
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 214/491 (43%), Positives = 269/491 (54%), Gaps = 89/491 (18%)
Query: 38 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQ 97
P K ESP+ R R +GKD TP K+K NCRNS+CLKL
Sbjct: 126 PLKPESPKLRARL-YEGKDSTPTKKKCYNCRNSKCLKL---------------------- 162
Query: 98 FKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
YCECFA+G YCDGCNC NC NN ++EVAR+EAV A L+RNP+AFRPKI SSPH
Sbjct: 163 -------YCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIGSSPH 215
Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
++ E + L KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEE
Sbjct: 216 MHRN-NEVPSDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEE 274
Query: 218 RRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 277
RR LF GDH MQ+ NAA+ GA+G++G+ + TS KRK + K+ +N
Sbjct: 275 RRNLFQGDHKNAINMQQVTNAAVNGAIGATGFPSPSTSRKRKHNDPSLDHFNKEHIAHKN 334
Query: 278 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVM 337
Q N AVP S+P+SQ+A+ LG + LAD +Q +D+KELC LLV+
Sbjct: 335 YHIPQKN-----AVP-DGSIPISQSAHPPTLGPF--TYKPLLADTVQAEDIKELCKLLVV 386
Query: 338 VSSEATKTLT------KKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSG 391
VS EA KT ++VG + E G + ED R G DR+
Sbjct: 387 VSGEAAKTYAGQKTKEERVGEKEDEIGGQK----------EDDRVGSLALTN--HDREGN 434
Query: 392 NQ-----MDVDGTSGSGSDGGDMLNE-------------RPLSPGTRALMCDEE------ 427
NQ +D S G+ G + E RP+SPGT ALMCDEE
Sbjct: 435 NQDQDQKAPIDDHSSRGTHTGKAIIEESRPNCVDDHKTHRPMSPGTLALMCDEEAFEKMC 494
Query: 428 ---DSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKE 484
D+ F + Q ++ + N A +YAEQE ++LT FRD L + + G +KE
Sbjct: 495 DEQDTMFTTS--QNAAPQQTVIVNHNQSA---LYAEQENVVLTEFRDCLRKFVRYGRMKE 549
Query: 485 TMCSPLVKSET 495
S +KSET
Sbjct: 550 ERYSMAIKSET 560
>gi|79521003|ref|NP_197951.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|395455100|sp|F4JY84.1|TCX7_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 7; Short=AtTCX7
gi|332006105|gb|AED93488.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 459
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 184/437 (42%), Positives = 245/437 (56%), Gaps = 68/437 (15%)
Query: 41 QESPRSRPRANTDGKDGTP-KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
Q+ SR + K+ TP K+QK CNC+NS+CLKL
Sbjct: 72 QDPSTSRRHNEVESKENTPNKQQKHCNCKNSKCLKL------------------------ 107
Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFA+G YC+GCNCVNCHN +E+E +RQ A+ LERNP+AF+PKIA SPHG
Sbjct: 108 -----YCECFASGSYCNGCNCVNCHNKLENESSRQVAISGILERNPDAFKPKIAGSPHGM 162
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+D +E+ ++ L KH+KGCHC+KSGCLKKYCEC+QANILCSENCRC DCKNFEGSEER+
Sbjct: 163 KDLQENVQQVLLIGKHSKGCHCRKSGCLKKYCECYQANILCSENCRCQDCKNFEGSEERK 222
Query: 220 ALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQ 279
AL HG ++Q+ NAA+ A+ S Y S KRKS+++ V + Q
Sbjct: 223 ALLHGSQVSDTYIQQMTNAAVNRAIDMSAYLYPPESRKRKSKDISDSVVSS-----YGVQ 277
Query: 280 SQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVS 339
Q+ N+V+ + S+P N +V GST SA RS ++ QP ++ELCSLLV S
Sbjct: 278 YQRANHVRRNGENSLFSLP----NNKAVSGSTTSAYRSSWSNTFQPHHVRELCSLLVSNS 333
Query: 340 SEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGT 399
+ L+ K R + P R +E+ N PD N+MD
Sbjct: 334 VDVANKLSDK----------GRKNDKGPSSLDGAQRVANEI-NDSPDCVLDANRMD---- 378
Query: 400 SGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYA 459
E+P+SP TRALMCDEE + + +T K++ +K + S+ IY
Sbjct: 379 ------------EKPISPATRALMCDEEHE--IGSEKETSARVKTSQEKEDTDTSSGIYL 424
Query: 460 EQERLILTRFRDFLNRL 476
EQER IL+ FRD+L +L
Sbjct: 425 EQERQILSIFRDYLIQL 441
>gi|413952361|gb|AFW85010.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
Length = 412
Score = 307 bits (787), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 163/308 (52%), Positives = 200/308 (64%), Gaps = 37/308 (12%)
Query: 38 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQ 97
P K +SP+ R +GKD TP K+K CNCRNSRCLKL
Sbjct: 131 PLKPDSPKPWARL-YEGKDSTPTKKKCCNCRNSRCLKL---------------------- 167
Query: 98 FKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
YCECFA+G +CDGCNC NC NN E+EVAR+EA+ ATLERNP+AFRPKI SSPH
Sbjct: 168 -------YCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIGSSPH 220
Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
++ E + + L KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEE
Sbjct: 221 TNRN-NEVSSDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEE 279
Query: 218 RRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 277
RR LF GDH +Q+A NAA+ GA+G++G+ + TS KRK + + K+ + RN
Sbjct: 280 RRNLFQGDHKNAINIQQATNAAVNGAIGTTGFPSPSTSRKRKHIDPSLDHSNKEHAAHRN 339
Query: 278 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVM 337
Q N V + S+P+SQ+A+ LG K R LADI+Q +D+KELC LLV+
Sbjct: 340 CHIPQKNAVPDG------SIPISQSAHPPTLGPFKVTYRPLLADIVQAEDVKELCKLLVV 393
Query: 338 VSSEATKT 345
VS EA KT
Sbjct: 394 VSGEAAKT 401
>gi|168001515|ref|XP_001753460.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695339|gb|EDQ81683.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 405
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 205/447 (45%), Positives = 257/447 (57%), Gaps = 71/447 (15%)
Query: 40 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
K ESPR R R+ D KDGTPKK KQCNC+NSRCLKL
Sbjct: 4 KHESPRQRSRS-FDAKDGTPKKCKQCNCKNSRCLKL------------------------ 38
Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFA+G YC+GCNCVNC NNVE+E+ RQEAV ATLERNPNAFRPKI SSP G
Sbjct: 39 -----YCECFASGTYCEGCNCVNCCNNVENEIVRQEAVEATLERNPNAFRPKIFSSP-GI 92
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+D+ ED E LA KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNF+GSEERR
Sbjct: 93 RDSGEDVGEHPLAGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCVDCKNFDGSEERR 152
Query: 220 ALFHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTS---NKRKSEELLF-GVAAKDQSV 274
ALF DHN + FMQ + ++ S+ L S KR++ EL+F G K+Q
Sbjct: 153 ALFQADHNMSVNFMQPLGGSGVL----SNSPAYLSPSPLLKKRRTHELVFSGPGLKEQLT 208
Query: 275 IRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSL 334
+ + +S + S P SQ + +V K +S LA ++Q + + ELC L
Sbjct: 209 SGGGTAPSLSLALSSTI---ASTPSSQGVSPAV-PPAKPMHKSLLAGVVQIEAIHELCKL 264
Query: 335 LVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQM 394
LV+VSSE K + +G + ++ S S++ + Q + R V PDD
Sbjct: 265 LVVVSSETQKEI---LGMLLSDVANS-STLRAFSQSFQCDRA---VTTTGPDDE------ 311
Query: 395 DVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQ-AGPQTGLASKSTVQKSNAYA 453
D D +RP+SPGT ALMCDE++ F P GLAS+ + S +
Sbjct: 312 DADKP------------QRPMSPGTLALMCDEKEPLFTAPPSPIGGLASEE-LSTSGSPQ 358
Query: 454 STRIYAEQERLILTRFRDFLNRLITCG 480
++AEQER IL FRD L R+ + G
Sbjct: 359 IALLHAEQERAILLEFRDCLRRITSVG 385
>gi|224059104|ref|XP_002299717.1| predicted protein [Populus trichocarpa]
gi|222846975|gb|EEE84522.1| predicted protein [Populus trichocarpa]
Length = 497
Score = 275 bits (704), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 181/459 (39%), Positives = 246/459 (53%), Gaps = 46/459 (10%)
Query: 28 VHKLPLPTLPPGKQESPR-SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
+H LP L K ESPR S P+A T +GTPK +QC+C+ S+CLKL
Sbjct: 57 LHVLPRSPLSFKKSESPRISLPQARTKIINGTPKSHRQCHCKQSKCLKL----------- 105
Query: 87 NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
YCECFA+G YCD C+C NCHNNVE+E R+EA LERNPN
Sbjct: 106 ------------------YCECFASGSYCDECSCANCHNNVENEDVRREAAECILERNPN 147
Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
AF+PKI SP D + A++ + AKH KGCHCK++GCLKKYCECFQANILCSENC+C
Sbjct: 148 AFKPKITGSPCTPPDDGDAAKDVLVMAKHIKGCHCKRTGCLKKYCECFQANILCSENCKC 207
Query: 207 LDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
+ CKN EG E + +H A++Q+ + V SS + S KR E LF
Sbjct: 208 VSCKNLEGRELGMVVTSENHCKTKAYIQQENANSSSAVVSSSHIFS-QESRKRTFRE-LF 265
Query: 266 GVAAKDQSVIRNPQSQQGNNV-QNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQ 324
+D + + N V + ++P ++ V Q A+++ LGS+K RS LAD +
Sbjct: 266 DSNMRDTEIHEFTKHHAVNPVGVHVSIPTLRADCVCQIASSAALGSSKLTYRSLLADAIH 325
Query: 325 PQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGI 384
P D +ELCSLLV+V SEA K L + Q +E +A ++ +D +K +VH +
Sbjct: 326 PLDTRELCSLLVVV-SEAAKFLADEHCKAQI-KEVKEYQMACDLE--KDCKKEPDVHQVV 381
Query: 385 PDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKS 444
P ++ G++ D+ S S DM + P S LMCD++D F + S S
Sbjct: 382 PHNQFPGSRTDITVMDNSRSCDIDMQDGGPFSSEEDYLMCDKKDKLFTR--------SAS 433
Query: 445 TVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIK 483
Q N + E ERL+LT RD L +LI G++K
Sbjct: 434 LDQNLNHGCNENFTKEHERLVLTCLRDCLGKLIAFGTMK 472
>gi|168052317|ref|XP_001778597.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670051|gb|EDQ56627.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 912
Score = 274 bits (701), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 197/477 (41%), Positives = 253/477 (53%), Gaps = 96/477 (20%)
Query: 37 PPGKQESPR---SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAH 93
P K SPR SR R+ DGKDGTPKK KQCNC+NSRCLKL
Sbjct: 440 PSIKHSSPRRPVSRSRSALDGKDGTPKKCKQCNCKNSRCLKL------------------ 481
Query: 94 LNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
YCECFA+G YCDGCNCVNC NN EHEV RQEAV ATLERNPNAFRPKIA
Sbjct: 482 -----------YCECFASGTYCDGCNCVNCCNNSEHEVVRQEAVEATLERNPNAFRPKIA 530
Query: 154 SSPHGAQDAREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
+SP D +ED + AAKHNKGCHCKKSGCLKKYCEC+QANILCSENC+C+DC+NF
Sbjct: 531 NSPRILHDGKEDVVDRNGSAAKHNKGCHCKKSGCLKKYCECYQANILCSENCKCVDCRNF 590
Query: 213 EGSEERRALFHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKD 271
S+ERRALF GD G++F+QR + A SG+ + + S KRK++++ +
Sbjct: 591 ITSDERRALFPGDPALGLSFVQRPSGCA-------SGHMSPL-SRKRKTQDVEY------ 636
Query: 272 QSVIRNPQSQQGN--NVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMK 329
++ Q+ + N V+N A + + +A K RS LAD+L ++
Sbjct: 637 --IMPGLQTNKSNLVKVENHAATTPKMSIYTASAEHIPEPMVKVVNRSLLADVLHTDVIQ 694
Query: 330 ELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDR- 388
+LC LLV+VS++ ++ G + + +S P ED G E DR
Sbjct: 695 DLCKLLVIVSADVQHEVS---GIPAVDNGTNNASTVDPTNT-EDCPSGKEPSAPAAPDRL 750
Query: 389 ---QSGNQM------------DVDGTSGSGSDGGDMLN----------ERPLSPGTRALM 423
+G M V + S L+ ERP+SPGT ALM
Sbjct: 751 QAESTGTTMAEYVRNDPTSTPPVAAVRTAPSPSNQALDVVAKREETRKERPMSPGTLALM 810
Query: 424 CDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCG 480
CDE+D T S A+ +++ QER+IL+ FRD L R+I+ G
Sbjct: 811 CDEKDPLL-------------TAPSSPAH-ENQLHVVQERVILSEFRDCLRRIISVG 853
>gi|357520979|ref|XP_003630778.1| Lin-54-like protein, partial [Medicago truncatula]
gi|355524800|gb|AET05254.1| Lin-54-like protein, partial [Medicago truncatula]
Length = 220
Score = 274 bits (701), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 135/210 (64%), Positives = 152/210 (72%), Gaps = 35/210 (16%)
Query: 24 VPHPVHKLPLPTLPP------GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDF 77
P P H P PTLPP GK ESP+S+ R N + KD TPKKQKQCNC++SRCLKL
Sbjct: 38 APLPEHPQPSPTLPPLPATKLGKPESPKSKSRPNFETKDATPKKQKQCNCKHSRCLKL-- 95
Query: 78 VEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 137
YCECFA+GIYCDGCNCVNC NNV++E AR+EAV
Sbjct: 96 ---------------------------YCECFASGIYCDGCNCVNCFNNVDNEAARREAV 128
Query: 138 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
ATLERNPNAFRPKIASSPHG +D +E+ E ++ KHNKGCHCKKSGCLKKYCECFQAN
Sbjct: 129 EATLERNPNAFRPKIASSPHGTRDNKEETGEVKVLVKHNKGCHCKKSGCLKKYCECFQAN 188
Query: 198 ILCSENCRCLDCKNFEGSEERRALFHGDHN 227
ILCSENC+C+DCKNFEGSEER+ALFHGD N
Sbjct: 189 ILCSENCKCMDCKNFEGSEERQALFHGDQN 218
>gi|50725423|dbj|BAD32896.1| tesmin-like [Oryza sativa Japonica Group]
gi|50725495|dbj|BAD32965.1| tesmin-like [Oryza sativa Japonica Group]
Length = 473
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 178/478 (37%), Positives = 247/478 (51%), Gaps = 60/478 (12%)
Query: 23 RVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWI 82
++P P P+ +PP +S P + K++K CNC+NS+CLKL I
Sbjct: 51 QMPTP-RPWPVAFIPPKPVAEIKSTP---------STKRKKHCNCKNSQCLKLACSSIL- 99
Query: 83 NIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLE 142
L T D L YCECFAAG+YCDGC+C C N V +E ARQEA+ +T +
Sbjct: 100 ----------DLFTGQIDQLYGYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQ 149
Query: 143 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 202
RNP AF+PKI + + ++DA KHNKGCHCKKSGCLKKYCECFQANILCS+
Sbjct: 150 RNPKAFQPKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSK 209
Query: 203 NCRCLDCKNFEGSEERRALFHGDHNGIA-FMQRAANAAIIGAVGSSGYGTLMTSNKRKSE 261
NC+C DCKNFEGSEE R + GD++ +Q AAN A+ GA+GSSGY KR E
Sbjct: 210 NCKCQDCKNFEGSEELRLITQGDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPE 269
Query: 262 ELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLAD 321
+ L+ + S ++ Q Q+ N+V +S + S + + ++ + RS LA+
Sbjct: 270 DHLYQKLNGEGSTMQT-QFQEANHVDSSEITSSTGL---EGCYSNYQSRSNVVYRSALAN 325
Query: 322 ILQPQDMKELCSLLVMV---SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGH 378
+ P D L LV+V +++A T + G M+ ERE
Sbjct: 326 TISPTDATGLAKHLVIVCRKAADAFLTTAENKGEMEVERE-------------------- 365
Query: 379 EVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQT 438
I D MD G + + RP SPGT+ALMCDE+DS F G
Sbjct: 366 -----IHTDSDGATNMDQQNGGDFGPCCNSLEDSRPASPGTQALMCDEQDSTF---GTDY 417
Query: 439 GLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETR 496
++ + + ++ + A+QE+ +LT FRD+L +IT G I T + + S R
Sbjct: 418 RISFPVALHDQD---TSELNAQQEKAVLTGFRDYLRTVITRGKINGTEFAKIFLSLKR 472
>gi|357139249|ref|XP_003571196.1| PREDICTED: uncharacterized protein LOC100845699 [Brachypodium
distachyon]
Length = 487
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 186/490 (37%), Positives = 248/490 (50%), Gaps = 74/490 (15%)
Query: 24 VPHPVHKLPLPTLPPGKQES----PRSRPRANT------DGKDGTPKKQKQCNCRNSRCL 73
VP + P + P +Q+ PR P A T + K+ TPKK+K CNC+NSRCL
Sbjct: 19 VPAAAGGMKAPVVAPQQQKPAAPVPRPWPVAFTPMKPAAEVKNVTPKKRKHCNCKNSRCL 78
Query: 74 KLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVAR 133
KL YCECFAA + CDGCNC NC NN E+E R
Sbjct: 79 KL-----------------------------YCECFAARVSCDGCNCSNCGNNSENEKIR 109
Query: 134 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 193
+ A+ ATL RNP AF+PKI + P ++++ L KHNKGCHCKKSGCLKKYCEC
Sbjct: 110 KGAIEATLLRNPLAFQPKIENGPSTVNVRKDNSGAVPLRPKHNKGCHCKKSGCLKKYCEC 169
Query: 194 FQANILCSENCRCLDCKNFEGSEERRALFHGDHNGI-AFMQRAANAAIIGAVGSSGYGTL 252
FQANILCS+NCRC+DCKNFEGSEERRAL GD+ +Q+AAN A+ GA+GSSGY
Sbjct: 170 FQANILCSKNCRCMDCKNFEGSEERRALVQGDNTSDRNNIQQAANTALNGAIGSSGYKCS 229
Query: 253 MTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTK 312
K E+ L + Q QQ N S + S + S S+K
Sbjct: 230 PARRKISQEDSLSEM-----------QFQQVNLSDVSQLTPSCTGFGGHNGCYSQSNSSK 278
Query: 313 SALRSPLADILQPQDMKELCSLLVMV---SSEATKTLTKKVGAMQTEREGSRSSVASPIQ 369
RSPLA+ + D+ +L +V+ ++EA T+ G V ++
Sbjct: 279 MIYRSPLANTIHLSDVNDLVKHVVIACRKAAEAFPTIADNKG------------VEVEVE 326
Query: 370 EGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDS 429
+ + G + D + + MD+ + + D RP SPGT+ALMCDE+D+
Sbjct: 327 KEFHANYGLNNNRSKVQDLKEASPMDIHSKACTDQHNVD---PRPASPGTQALMCDEQDT 383
Query: 430 AFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSP 489
+F ++ A+ S + + I A QE L+LT FRD+L LIT G I E S
Sbjct: 384 SFGNDIYRSSFAATSCDKN-----ISEINALQENLVLTGFRDYLRVLITRGKINEAKSSS 438
Query: 490 LVKSETRSQQ 499
E +Q+
Sbjct: 439 EAAMELNAQR 448
>gi|297605751|ref|NP_001057547.2| Os06g0332700 [Oryza sativa Japonica Group]
gi|255677011|dbj|BAF19461.2| Os06g0332700 [Oryza sativa Japonica Group]
Length = 448
Score = 265 bits (678), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 171/465 (36%), Positives = 236/465 (50%), Gaps = 80/465 (17%)
Query: 24 VPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWIN 83
P P P+ +PP +S P + K++K CNC+NS+CLKL
Sbjct: 3 TPRP---WPVAFIPPKPVAEIKSTP---------STKRKKHCNCKNSQCLKL-------- 42
Query: 84 IYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLER 143
YCECFAAG+YCDGC+C C N V +E ARQEA+ +T +R
Sbjct: 43 ---------------------YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQR 81
Query: 144 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 203
NP AF+PKI + + ++DA KHNKGCHCKKSGCLKKYCECFQANILCS+N
Sbjct: 82 NPKAFQPKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKN 141
Query: 204 CRCLDCKNFEGSEERRALFHGDHNGIA-FMQRAANAAIIGAVGSSGYGTLMTSNKRKSEE 262
C+C DCKNFEGSEE R + GD++ +Q AAN A+ GA+GSSGY KR E+
Sbjct: 142 CKCQDCKNFEGSEELRLITQGDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPED 201
Query: 263 LLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADI 322
L+ + S ++ Q Q+ N+V +S + S + + ++ + RS LA+
Sbjct: 202 HLYQKLNGEGSTMQT-QFQEANHVDSSEITSSTGL---EGCYSNYQSRSNVVYRSALANT 257
Query: 323 LQPQDMKELCSLLVMV---SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHE 379
+ P D L LV+V +++A T + G M+ ERE
Sbjct: 258 ISPTDATGLAKHLVIVCRKAADAFLTTAENKGEMEVERE--------------------- 296
Query: 380 VHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTG 439
I D MD G + + RP SPGT+ALMCDE+DS F G
Sbjct: 297 ----IHTDSDGATNMDQQNGGDFGPCCNSLEDSRPASPGTQALMCDEQDSTF---GTDYR 349
Query: 440 LASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKE 484
++ + + ++ + A+QE+ +LT FRD+L +IT G I E
Sbjct: 350 ISFPVALHDQD---TSELNAQQEKAVLTGFRDYLRTVITRGKINE 391
>gi|302801582|ref|XP_002982547.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
gi|300149646|gb|EFJ16300.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
Length = 460
Score = 261 bits (667), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 190/490 (38%), Positives = 251/490 (51%), Gaps = 120/490 (24%)
Query: 5 SQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQ 64
SQA+P Q P R PV +L ++ P RP + K+ KK KQ
Sbjct: 51 SQALPSSQVPPAQPGEPGR--KPVRQLNFSSILP--------RPPV-YELKEVAAKKCKQ 99
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
CNC+N+RCLKL YCECFA+G YCDGCNC+NC
Sbjct: 100 CNCKNTRCLKL-----------------------------YCECFASGTYCDGCNCLNCC 130
Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
NNVE+E RQEAV TLERNPNAFRPKIA++ H +D+ +D R KHNKGC+CKKS
Sbjct: 131 NNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSADDMR----LGKHNKGCNCKKS 186
Query: 185 GCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD------HNGIAFMQRAANA 238
GCLKKYCECFQANILCS+NC+C+DCKN++ S+ERRAL+ D H G +A
Sbjct: 187 GCLKKYCECFQANILCSDNCKCVDCKNYDSSDERRALYQPDYSQSTYHGGPPPYTTSATG 246
Query: 239 AIIGAVGSSGYGTLMTSNKRKSEELLF-GVAAK-DQSVIRNPQSQQGNNVQNSAVPCSQS 296
++ V SS M + KR++ +L+F G + K Q + R Q Q + S +P + S
Sbjct: 247 SL-ACVSSSP----MPAKKRRTSDLVFVGESLKVQQPIYRTLQGIQDAKLSGSMLPPNSS 301
Query: 297 VPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTE 356
+ S K+ S LA ++QP+ + +LC+LLVM S+EA K +
Sbjct: 302 L------------SAKTIQTSLLAGVVQPETVSDLCNLLVMASAEAAKAM---------- 339
Query: 357 REGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLS 416
I G S++ E NG +GSG D+ +RP+S
Sbjct: 340 -----------IASGPVSKRPAE--NG--------------QEAGSGQGDEDIREKRPMS 372
Query: 417 PGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRL 476
PGT ALMCDE+D F A ++ + AEQERLIL FRD L ++
Sbjct: 373 PGTLALMCDEQDPLFTPA--------------THGDYDKELAAEQERLILEEFRDCLRKI 418
Query: 477 ITCGSIKETM 486
IT G + T+
Sbjct: 419 ITVGRKRGTL 428
>gi|302798629|ref|XP_002981074.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
gi|300151128|gb|EFJ17775.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
Length = 461
Score = 261 bits (666), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 190/490 (38%), Positives = 251/490 (51%), Gaps = 120/490 (24%)
Query: 5 SQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQ 64
SQA+P Q P R PV +L ++ P RP + K+ KK KQ
Sbjct: 51 SQALPSSQVPPAQPGEPGR--KPVRQLNFSSILP--------RPPV-YELKEVAAKKCKQ 99
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
CNC+N+RCLKL YCECFA+G YCDGCNC+NC
Sbjct: 100 CNCKNTRCLKL-----------------------------YCECFASGTYCDGCNCLNCC 130
Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
NNVE+E RQEAV TLERNPNAFRPKIA++ H +D+ +D R KHNKGC+CKKS
Sbjct: 131 NNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSADDMR----LGKHNKGCNCKKS 186
Query: 185 GCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD------HNGIAFMQRAANA 238
GCLKKYCECFQANILCS+NC+C+DCKN++ S+ERRAL+ D H G +A
Sbjct: 187 GCLKKYCECFQANILCSDNCKCVDCKNYDSSDERRALYQPDYSQSTYHGGPPPYTTSATG 246
Query: 239 AIIGAVGSSGYGTLMTSNKRKSEELLF-GVAAK-DQSVIRNPQSQQGNNVQNSAVPCSQS 296
++ V SS M + KR++ +L+F G + K Q + R Q Q + S +P + S
Sbjct: 247 SL-ACVSSSP----MPAKKRRTSDLVFVGESLKVQQPIYRTLQGIQDAKLSGSMLPPNSS 301
Query: 297 VPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTE 356
+ S K+ S LA ++QP+ + +LC+LLVM S+EA K +
Sbjct: 302 L------------SAKTIQTSLLAGVVQPETVSDLCNLLVMASAEAAKAM---------- 339
Query: 357 REGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLS 416
I G S++ E NG +GSG D+ +RP+S
Sbjct: 340 -----------IASGPVSKRPAE--NG--------------QEAGSGQGDEDIREKRPMS 372
Query: 417 PGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRL 476
PGT ALMCDE+D F A ++ + AEQERLIL FRD L ++
Sbjct: 373 PGTLALMCDEQDPLFTPA--------------THGDYDKELAAEQERLILEEFRDCLRKI 418
Query: 477 ITCGSIKETM 486
IT G + T+
Sbjct: 419 ITVGRKRGTL 428
>gi|414880393|tpg|DAA57524.1| TPA: hypothetical protein ZEAMMB73_933477 [Zea mays]
Length = 356
Score = 255 bits (651), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 137/263 (52%), Positives = 165/263 (62%), Gaps = 35/263 (13%)
Query: 38 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQ 97
P K ESP+ R R +GKD TP K+K CNCRNS+CLKL
Sbjct: 126 PLKPESPKLRARL-YEGKDSTPTKKKCCNCRNSKCLKL---------------------- 162
Query: 98 FKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
YCECFA+G YCDGCNC NC NN ++EVAR+EAV A L+RNP+AFRPKI SSPH
Sbjct: 163 -------YCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIGSSPH 215
Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
++ E + L KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEE
Sbjct: 216 MHRN-NEVPSDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEE 274
Query: 218 RRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 277
RR LF GDH MQ+ NAA+ GA+G++G+ + TS KRK + K+ +N
Sbjct: 275 RRNLFQGDHKNAINMQQVTNAAVNGAIGATGFPSPSTSRKRKHNDPSLDHFNKEHIAHKN 334
Query: 278 PQSQQGNNVQNSAVPCSQSVPVS 300
Q + + VPC + VS
Sbjct: 335 YHIPQ----EQAEVPCYITRSVS 353
>gi|222635505|gb|EEE65637.1| hypothetical protein OsJ_21206 [Oryza sativa Japonica Group]
Length = 437
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 156/397 (39%), Positives = 215/397 (54%), Gaps = 39/397 (9%)
Query: 104 KYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
KYCECFAAG+YCDGC+C C N V +E ARQEA+ +T +RNP AF+PKI + + +
Sbjct: 75 KYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRK 134
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
+DA KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R +
Sbjct: 135 DDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQ 194
Query: 224 GDHNGI-AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQ 282
GD++ +Q AAN A+ GA+GSSGY KR E+ L+ + S ++ Q Q+
Sbjct: 195 GDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQE 253
Query: 283 GNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV---S 339
N+V +S + S + + ++ + RS LA+ + P D L LV+V +
Sbjct: 254 ANHVDSSEITSSTGL---EGCYSNYQSRSNVVYRSALANTISPTDATGLAKHLVIVCRKA 310
Query: 340 SEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGT 399
++A T + G M+ ERE I D MD
Sbjct: 311 ADAFLTTAENKGEMEVERE-------------------------IHTDSDGATNMDQQNG 345
Query: 400 SGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYA 459
G + + RP SPGT+ALMCDE+DS F G ++ + + ++ + A
Sbjct: 346 GDFGPCCNSLEDSRPASPGTQALMCDEQDSTF---GTDYRISFPVALHDQD---TSELNA 399
Query: 460 EQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETR 496
+QE+ +LT FRD+L +IT G I T + + S R
Sbjct: 400 QQEKAVLTGFRDYLRTVITRGKINGTEFAKIFLSLKR 436
>gi|242095900|ref|XP_002438440.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
gi|241916663|gb|EER89807.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
Length = 509
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 157/386 (40%), Positives = 219/386 (56%), Gaps = 23/386 (5%)
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
L YCECFAAG YCDGCNC C N VE+E RQ+A+ T RNPNAF+PKI + P
Sbjct: 81 LKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQDAINNTKLRNPNAFQPKIENGPIPPS- 139
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
R+DA KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C+DCKN+EGSEE R+
Sbjct: 140 VRKDAGALPSLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCMDCKNYEGSEELRST 199
Query: 222 FHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQS 280
GD++ +Q+AAN A+ GA+GSSGY KR +E+ + + S+++ Q
Sbjct: 200 IQGDNSCDRNNLQQAANVALNGAIGSSGYRFSPVRRKRPAEDPHYQRLNVEGSMMQT-QF 258
Query: 281 QQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSS 340
Q+ NNV S + S + + G +K RSPLA+ + D+ +L + LV+V
Sbjct: 259 QEANNVDVSQITSSTG---RDGSTGNFQGRSKLVYRSPLANTIHVTDVNDLANHLVIVCR 315
Query: 341 EATKTLTKKV---GAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVD 397
+A + T V M+ +RE ++ E +D +K G + +Q VD
Sbjct: 316 KAAERFTTIVDNKAEMEIDREICTNTYQK-FDENKDVQKDALSQLG---KVTNIDQQIVD 371
Query: 398 GTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKS-NAYASTR 456
+ SD + + RP SPGT+AL+CDE+D F G A +S + + + +
Sbjct: 372 DSILYCSDTQE--DARPASPGTQALLCDEQDLTF-------GTAHRSPIPIALHDQDISE 422
Query: 457 IYAEQERLILTRFRDFLNRLITCGSI 482
++A QE +L FR++L +I G I
Sbjct: 423 LHAAQENAVLKEFRNYLRLIIARGKI 448
>gi|259489914|ref|NP_001159253.1| uncharacterized protein LOC100304343 [Zea mays]
gi|223943031|gb|ACN25599.1| unknown [Zea mays]
gi|413936375|gb|AFW70926.1| hypothetical protein ZEAMMB73_955245 [Zea mays]
Length = 530
Score = 248 bits (632), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 179/521 (34%), Positives = 255/521 (48%), Gaps = 66/521 (12%)
Query: 5 SQAMPQWQARPQHVRMVNRVPHPV--HKLPLPTLPPGKQESPRSRP------RANTDGKD 56
+Q PQ + P + + + P + P P P PR P +++T+ K
Sbjct: 6 NQQHPQQLSSPPVLAAASEIKAPAVPSQQPKPAAP--VMSVPRQWPIAFNPLKSSTEEKS 63
Query: 57 GTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD 116
TPKK+K CNCRNS+CLK+ YCECF YCD
Sbjct: 64 VTPKKKKHCNCRNSKCLKM-----------------------------YCECFQELQYCD 94
Query: 117 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 176
GCNC NC N V +E AR EA+ A +RNP+AF+PKI + + ++++ L AKH+
Sbjct: 95 GCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHH 154
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRA 235
KGCHCKKSGCLKKYCEC+QAN+ CS+NCRC+ CKN EG+++ DH + +++A
Sbjct: 155 KGCHCKKSGCLKKYCECYQANVFCSKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQA 214
Query: 236 ANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV-PCS 294
AN A G+VGSSGY + + ++++S E V + + Q QQGN+ S + PCS
Sbjct: 215 ANIAFNGSVGSSGY--ICSLSRKRSHEDTQCVRINSEGSMPETQYQQGNHADVSMLPPCS 272
Query: 295 QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQ 354
AN RSPLA+ + ++ +L LVMV A T+ +
Sbjct: 273 TGFDGHNAANTKPYNPN---YRSPLANTIHLSEVNDLVKQLVMVCRMAAATIADDETDLI 329
Query: 355 TEREGSRSSVASPIQEGEDSRKGH-EVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNER 413
E +GS+S+ + G ++ E + + +Q ++D T S D R
Sbjct: 330 AEEKGSQSNNG--LSNGNCKQQDFKEASSMVILSEGCSDQPNIDETGSHRSKTLDF--SR 385
Query: 414 PLSPGTRALMCDEEDSAFMQ--AGPQTG--LASKSTVQKSNAYAS-----------TRIY 458
P SP T+ALMCDE+D+ F A P T + + N Y S + I
Sbjct: 386 PASPTTQALMCDEQDATFGNEPASPTTQALMCEEQNTTFGNDYRSSFPLVSCDQDISEIN 445
Query: 459 AEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQ 499
A QE L+LT R++L +IT G I E S E +QQ
Sbjct: 446 AAQENLVLTGLREYLRVIITRGKINEHKSSLEAAMELDAQQ 486
>gi|413936376|gb|AFW70927.1| hypothetical protein ZEAMMB73_955245, partial [Zea mays]
Length = 485
Score = 244 bits (624), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 174/504 (34%), Positives = 250/504 (49%), Gaps = 66/504 (13%)
Query: 5 SQAMPQWQARPQHVRMVNRVPHPV--HKLPLPTLPPGKQESPRSRP------RANTDGKD 56
+Q PQ + P + + + P + P P P PR P +++T+ K
Sbjct: 6 NQQHPQQLSSPPVLAAASEIKAPAVPSQQPKPAAP--VMSVPRQWPIAFNPLKSSTEEKS 63
Query: 57 GTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD 116
TPKK+K CNCRNS+CLK+ YCECF YCD
Sbjct: 64 VTPKKKKHCNCRNSKCLKM-----------------------------YCECFQELQYCD 94
Query: 117 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 176
GCNC NC N V +E AR EA+ A +RNP+AF+PKI + + ++++ L AKH+
Sbjct: 95 GCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHH 154
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRA 235
KGCHCKKSGCLKKYCEC+QAN+ CS+NCRC+ CKN EG+++ DH + +++A
Sbjct: 155 KGCHCKKSGCLKKYCECYQANVFCSKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQA 214
Query: 236 ANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV-PCS 294
AN A G+VGSSGY + + ++++S E V + + Q QQGN+ S + PCS
Sbjct: 215 ANIAFNGSVGSSGY--ICSLSRKRSHEDTQCVRINSEGSMPETQYQQGNHADVSMLPPCS 272
Query: 295 QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQ 354
AN RSPLA+ + ++ +L LVMV A T+ +
Sbjct: 273 TGFDGHNAANTKPYNPN---YRSPLANTIHLSEVNDLVKQLVMVCRMAAATIADDETDLI 329
Query: 355 TEREGSRSSVASPIQEGEDSRKGH-EVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNER 413
E +GS+S+ + + G ++ E + + +Q ++D T S D R
Sbjct: 330 AEEKGSQSN--NGLSNGNCKQQDFKEASSMVILSEGCSDQPNIDETGSHRSKTLDF--SR 385
Query: 414 PLSPGTRALMCDEEDSAFMQ--AGP--QTGLASKSTVQKSNAYAS-----------TRIY 458
P SP T+ALMCDE+D+ F A P Q + + N Y S + I
Sbjct: 386 PASPTTQALMCDEQDATFGNEPASPTTQALMCEEQNTTFGNDYRSSFPLVSCDQDISEIN 445
Query: 459 AEQERLILTRFRDFLNRLITCGSI 482
A QE L+LT R++L +IT G I
Sbjct: 446 AAQENLVLTGLREYLRVIITRGKI 469
>gi|293336528|ref|NP_001167886.1| uncharacterized protein LOC100381595 [Zea mays]
gi|223944643|gb|ACN26405.1| unknown [Zea mays]
gi|413944443|gb|AFW77092.1| hypothetical protein ZEAMMB73_278537 [Zea mays]
Length = 499
Score = 234 bits (598), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 154/389 (39%), Positives = 215/389 (55%), Gaps = 28/389 (7%)
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
L YCECFAAG YCDGCNC C N VE+E RQEA+ T RNPNAF+PKI + P +
Sbjct: 83 LKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQEAINNTKLRNPNAFQPKIENGPIPS-S 141
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
+DA KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C+DCKN+EGSEE R+
Sbjct: 142 VWKDAGALPSHPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCMDCKNYEGSEELRST 201
Query: 222 FHGDH----NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 277
GD+ N I Q+AAN A+ GA+GSSGY KR E+ + S+++
Sbjct: 202 TQGDNSCDRNNI---QQAANVALNGAIGSSGYRFSPVRRKRPPEDPHCQTLNVEGSMMQT 258
Query: 278 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVM 337
Q ++ NNV S + S + + G++K RSPLA+ + D+ +L + LV+
Sbjct: 259 -QFRETNNVDVSQMTSSTG---RDGSTVNFQGTSKLVYRSPLANTIHITDVNDLANHLVI 314
Query: 338 VSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMD-- 395
+A + T V + E E + + Q+ ++++KG V +D
Sbjct: 315 ACRKAAERFTTIVDN-KVEMEIDKEICTNTDQKFDENKKG--VQKAALSQLGKVTDIDQQ 371
Query: 396 -VDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKS-NAYA 453
VD + SD D + RP SPGT+AL+CDE+D F G +S++ + +
Sbjct: 372 IVDDSRLHCSDTQD--DARPASPGTQALLCDEQDLTF-------GTVHRSSIPVAMHDQV 422
Query: 454 STRIYAEQERLILTRFRDFLNRLITCGSI 482
+ ++A QE +L R+ L +I G +
Sbjct: 423 ISELHAAQESAVLKELRNNLRLIIARGKL 451
>gi|357141227|ref|XP_003572141.1| PREDICTED: uncharacterized protein LOC100829506 [Brachypodium
distachyon]
Length = 540
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 160/458 (34%), Positives = 224/458 (48%), Gaps = 87/458 (18%)
Query: 51 NTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFA 110
+++ +GTP++ K C C+NS+CLKL YC+CFA
Sbjct: 138 SSESINGTPRR-KNCKCKNSKCLKL-----------------------------YCDCFA 167
Query: 111 AGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP-HGAQDAREDARE 168
+G YC D CNC NC N+V HE ARQ+A+ A +ERNP AF PKI + P H AQ+ A E
Sbjct: 168 SGRYCNDDCNCKNCCNDVSHETARQDAINAVMERNPVAFMPKIGNIPRHAAQNREYRAAE 227
Query: 169 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNG 228
KH KGC CK+S CLKKYCECFQ+N+LCSENC+C DCKN+E SE+ + +
Sbjct: 228 GPRVGKHMKGCQCKRSECLKKYCECFQSNVLCSENCKCTDCKNYESSEDMKEMRRMTQQH 287
Query: 229 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN 288
++ N A+ G +G S ++ K L +DQ + N SQ
Sbjct: 288 GVYVHHVQNLALKGMIGPSA---VLPRAAEKFSGLSVASLGRDQPINNNDSSQ------- 337
Query: 289 SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTK 348
P SVP T ++ LG R+ LADI+Q +D+ LC +LV+VS
Sbjct: 338 VLSPLLTSVPTEDTESSVRLGRHGVTYRTLLADIIQIEDVNVLCKVLVLVS--------- 388
Query: 349 KVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPD---DRQS------------GNQ 393
R+ + + + S +E + +K + IP DR++ GN
Sbjct: 389 --------RQAAGAFLDSGFKENTNIKKLDRAESHIPSTNHDRKAVQKQHDEQLCSLGNS 440
Query: 394 MDV----DGTSGSG-SDGGDML--NERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTV 446
+ +G +G SD D N R +SPGT+ALMC+E+D F+ G +T
Sbjct: 441 LIAVPLSEGRAGMPRSDPSDTWKDNRRSVSPGTQALMCNEQD-MFVHTSSVAGAILSTT- 498
Query: 447 QKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKE 484
K N + IY EQE+ +LT DFL L G ++E
Sbjct: 499 -KENL---SEIYIEQEKRVLTNLLDFLRELAKSGRLQE 532
>gi|47848051|dbj|BAD21836.1| putative tesmin/TSO1-like CXC domain-containing protein [Oryza
sativa Japonica Group]
Length = 508
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 157/438 (35%), Positives = 222/438 (50%), Gaps = 65/438 (14%)
Query: 55 KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
K+ +PKK+K CNC+NS YCECFAA +Y
Sbjct: 115 KNASPKKRKHCNCKNS----------------------------------YCECFAARVY 140
Query: 115 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
CDGC+C C N +E+E R+EA+ L RNP AF+PKI +SP+ ++++ K
Sbjct: 141 CDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTVTVRKDNSEAIPPIPK 200
Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH----GDHNGIA 230
HNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+AL D N I
Sbjct: 201 HNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERKALVQMKNASDRNHI- 259
Query: 231 FMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSA 290
Q AAN A+ GA GS GY K+ E L + ++ Q Q ++ S
Sbjct: 260 --QEAANFALNGATGSLGYKNSPVRRKKYQENSL------GEQILSEAQFQPTDHADVSQ 311
Query: 291 VPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKV 350
+ S + A S++ RSPLA+ + ++ +L ++ +A + K
Sbjct: 312 LASSCTGFGGDIAINYQSKSSEMIYRSPLANTIPLIEVNDLVKNALVACRKAPEAFLTKA 371
Query: 351 G---AMQTEREGSRSSVASPIQEGEDSRKG---HEVHNGIPDDRQSGNQMDVDGTSGSGS 404
G MQ E++ + + + E + K ++ N DRQ +++GT +
Sbjct: 372 GNKVEMQAEKQYQTNDEINNDKTKEQNLKEASLKDIQNKACIDRQ-----NINGTEPHLA 426
Query: 405 DGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERL 464
+ + RP SPGT+ALMCDE D+ F ++ S Q + + A+QER+
Sbjct: 427 NS--FKDSRPASPGTQALMCDEMDTTFGNEDYRSPFVVPSRDQD-----ISELNADQERI 479
Query: 465 ILTRFRDFLNRLITCGSI 482
+LT RD+L LIT G+I
Sbjct: 480 VLTGLRDYLRVLITRGNI 497
>gi|242064794|ref|XP_002453686.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
gi|241933517|gb|EES06662.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
Length = 497
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 162/487 (33%), Positives = 241/487 (49%), Gaps = 90/487 (18%)
Query: 34 PTLPPGKQESPRSRP-RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
P +P +Q P + +T+ K TPKK+K CNCRNS+CLK+
Sbjct: 38 PVMPVPRQWPMAFNPLKPSTEVKSVTPKKKKHCNCRNSKCLKM----------------- 80
Query: 93 HLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 152
YCECF YCDGCNC NC N V +E AR EA+ A +RNP+AF+PKI
Sbjct: 81 ------------YCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKI 128
Query: 153 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
+ P+ ++++ + AKH+KGCHCKKSGCLKKYCEC+QAN+ CS+NCRC+ CKN
Sbjct: 129 ENGPNTLNVRKDNSGAVPVVAKHHKGCHCKKSGCLKKYCECYQANVFCSKNCRCMHCKNS 188
Query: 213 EGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKD 271
EG+E+ DH + + +Q+AAN A G VGSSGY + + ++++S E GV K
Sbjct: 189 EGNEDTETSSQRDHASDRSHIQQAANVAFNGTVGSSGY--ICSLSRKRSHEDALGVRIKS 246
Query: 272 QSVIRNPQSQ--QGNNVQNSAV-PCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDM 328
+ + Q+Q QGN+ S + PCS AN+ S RSPLA+ + +++
Sbjct: 247 EGSMPETQTQYQQGNHADVSPLAPCSTGFDRHNAANSK---SNNPIYRSPLANTIHLREV 303
Query: 329 KELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDR 388
+L LVMV A + ++ G S + +++ + +D ++ + D
Sbjct: 304 NDLVKHLVMVCRMAEAKIAEEKGFQ------SNNGLSNGNCKQQDFKETSSI-----DIL 352
Query: 389 QSG--NQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQ--AGPQTG----- 439
G +Q +++ S+ ++ RP SP T++LMCDE+++ F A P T
Sbjct: 353 SKGCSHQPNINEMGFHWSETSNV--SRPASPTTQSLMCDEQNTTFGNEPASPTTQALMCE 410
Query: 440 ------------------LASKSTVQKSNAYAS-----------TRIYAEQERLILTRFR 470
+ + N Y S + I A QE L+LT R
Sbjct: 411 EQNTTFGNEPASPTTQALMCEEQNTTFGNDYKSSFPLVSRGQDISEINAVQENLVLTGLR 470
Query: 471 DFLNRLI 477
+L +I
Sbjct: 471 QYLCTII 477
>gi|222640454|gb|EEE68586.1| hypothetical protein OsJ_27092 [Oryza sativa Japonica Group]
Length = 612
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 159/476 (33%), Positives = 231/476 (48%), Gaps = 63/476 (13%)
Query: 52 TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
++ KD T ++ C+C+ S+CLKL YCECF
Sbjct: 161 SETKDVTLTRKTNCSCKYSKCLKL-----------------------------YCECFEK 191
Query: 112 GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL 171
G YC GCNC NC NNV HE RQ+A+ LERNP AF PK+ +S A + A E +
Sbjct: 192 GRYCIGCNCTNCCNNVNHENYRQDAINVALERNPAAFMPKVLNS--TAHNCESKAAEGDI 249
Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGDHNG 228
KH KGC CK++ LKKYCECF+A++ CSENCRC CKNF+ +E+R + H H
Sbjct: 250 VGKHTKGCKCKRTEYLKKYCECFKASVFCSENCRCTGCKNFKSNEDRISQKNTVHAHH-- 307
Query: 229 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELL----FGVAAKDQSVIRNPQSQQGN 284
N A G VG S N + VI + Q +
Sbjct: 308 ------VQNPASSGMVGQSVIIFHAAENDSSLSLAASVSDHSINNNTSHVILLQHTFQIS 361
Query: 285 NVQNSAVPCSQSVPVSQTANASVLGST--KSALRSPLADILQPQDMKELCSLLVMVSSEA 342
Q A+ CS + V++ S++ + R LADI+Q +D+ ELC+LL++VS +A
Sbjct: 362 PFQ--ALSCSLTSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILVSRQA 419
Query: 343 TKTLTKKVGAMQTERE--GSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG-T 399
TL + T+R+ S S ++ + ++ S + +G
Sbjct: 420 GATLLESGMKENTKRKKLNRVDSCLSSTNHDREAIQKEPAKQMCSLEKSSNEVLIAEGRL 479
Query: 400 SGSGSDGGDML--NERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRI 457
+D D+ N+RPLSPGT+ALMCDE+D+ F + + A+ ST +++ I
Sbjct: 480 ELPRADPFDIQKGNKRPLSPGTQALMCDEQDAVFQDS--KAPDATPSTTEEN----LVDI 533
Query: 458 YAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAVEDGTSRSGTEV 513
+ EQE+ +LT FRD+L +L TCG ++E S + S S+Q + G + S T V
Sbjct: 534 FKEQEKRVLTNFRDYLCKLATCGKLQELKLS--LASTNCSEQAFADHGNNSSITRV 587
>gi|218201063|gb|EEC83490.1| hypothetical protein OsI_29022 [Oryza sativa Indica Group]
Length = 586
Score = 191 bits (485), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 155/470 (32%), Positives = 227/470 (48%), Gaps = 77/470 (16%)
Query: 52 TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
++ KD T ++ C+C+ S+CLKL YCECF
Sbjct: 161 SETKDVTLTRKTNCSCKYSKCLKL-----------------------------YCECFEK 191
Query: 112 GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL 171
G YC GCNC NC NNV HE ARQ+A+ LERNP AF PK+++S A + A E +
Sbjct: 192 GRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVSNS--TAHNCESKAAEGDI 249
Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGDHNG 228
KH KGC CK++ CLKKYCECF+A++ CSENCRC CKN++ + +R + H H
Sbjct: 250 VGKHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRISQKNTVHAHH-- 307
Query: 229 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN 288
N A G VG S ++F A D S+ + +
Sbjct: 308 ------VQNPASSGMVGQSV--------------IIFHAAENDSSLSLAASALSCSL--- 344
Query: 289 SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTK 348
+ SV + + R LADI+Q +D+ ELC+LL++VS +A TL +
Sbjct: 345 -----TSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILVSRQAGATLLE 399
Query: 349 KVGAMQTERE--GSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG-TSGSGSD 405
T+R+ S S ++ + ++ S + +G +D
Sbjct: 400 SGMKENTKRKKLNRVDSCLSSTNHDREAIQKEPAKQMCSLEKSSNEVLIAEGRLELPRAD 459
Query: 406 GGDML--NERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQER 463
D+ N+RPLSPGT+ALMCDE+D+ F + + A+ ST +++ I+ EQE+
Sbjct: 460 PFDIQKGNKRPLSPGTQALMCDEQDAVFQDS--KAPDATPSTTEEN----LVDIFKEQEK 513
Query: 464 LILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAVEDGTSRSGTEV 513
+LT FRD+L +L TCG ++E S + S S+Q + G + S T V
Sbjct: 514 RVLTNFRDYLCKLATCGKLQELKLS--LASTNCSEQAFADHGNNSSITRV 561
>gi|307108583|gb|EFN56823.1| hypothetical protein CHLNCDRAFT_144374 [Chlorella variabilis]
Length = 702
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 95/169 (56%), Positives = 107/169 (63%), Gaps = 40/169 (23%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
+K CNC+NSRCLKL YCECFA+G YCDGCNC+
Sbjct: 153 KKHCNCKNSRCLKL-----------------------------YCECFASGRYCDGCNCL 183
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC+NN EHE RQ AV A LERNPNAFRPKIA GA + R A+HNKGC+C
Sbjct: 184 NCYNNREHEATRQSAVEAILERNPNAFRPKIAP---GADNRRAGG------ARHNKGCNC 234
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF--HGDHNG 228
KKSGCLKKYCECFQA I+CSE C+CLDCKN+EGSE R AL G+ G
Sbjct: 235 KKSGCLKKYCECFQAGIVCSEICKCLDCKNYEGSEARDALVSPQGERQG 283
>gi|224166209|ref|XP_002338900.1| predicted protein [Populus trichocarpa]
gi|222873866|gb|EEF10997.1| predicted protein [Populus trichocarpa]
Length = 154
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 94/152 (61%), Positives = 114/152 (75%), Gaps = 4/152 (2%)
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
E+ E + KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFH
Sbjct: 1 EETGEGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFH 60
Query: 224 GDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQ 282
GDH N +A++Q+AANAAI GA+GSSGY +L S KRK +EL FG KD S R QQ
Sbjct: 61 GDHGNNMAYIQQAANAAITGAIGSSGYASLPVSRKRKGQELFFGQTVKDPSFHRLGHFQQ 120
Query: 283 GNNVQNSAVPCS--QSVPVSQTANASVLGSTK 312
++++ A P S S PV++ N + LGS+K
Sbjct: 121 ASHIR-PAAPSSPLPSNPVARAGNTTTLGSSK 151
>gi|218198100|gb|EEC80527.1| hypothetical protein OsI_22807 [Oryza sativa Indica Group]
Length = 426
Score = 181 bits (460), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 131/383 (34%), Positives = 186/383 (48%), Gaps = 44/383 (11%)
Query: 118 CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
CNC N + QEA+ + KI + + ++DA KHNK
Sbjct: 83 CNCKN-----SQCLKLQEAITVRSNGIQKPSKLKIENGSNALNLRKDDAGAPASLPKHNK 137
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIA-FMQRAA 236
GCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R + GD++ +Q AA
Sbjct: 138 GCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQGDNSSDRNNIQHAA 197
Query: 237 NAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQS 296
N A+ GA+GSSGY KR E+ L+ + S ++ Q Q+ N+V +S + S
Sbjct: 198 NVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQEANHVDSSEITSSTG 256
Query: 297 VPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV---SSEATKTLTKKVGAM 353
+ + ++ + RS LA+ + P D L LV+V +++A T + G M
Sbjct: 257 L---EGCYSNYQCRSNVVYRSALANTISPTDATGLAKHLVIVCRKAADAFLTTAENKGEM 313
Query: 354 QTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNER 413
+ ERE I D MD G + + R
Sbjct: 314 EVERE-------------------------IHTDSHGATNMDQQNGGDFGPCCNSLEDSR 348
Query: 414 PLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFL 473
P SPGT+ALMCDE+ S F G ++ + + ++ + A+QE+ +LT FRD+L
Sbjct: 349 PASPGTQALMCDEQGSTF---GTDYRISFPVALHDQD---TSELNAQQEKAVLTGFRDYL 402
Query: 474 NRLITCGSIKETMCSPLVKSETR 496
+IT G I T + + S R
Sbjct: 403 RTVITRGKINGTEFAKIFLSLKR 425
>gi|218190473|gb|EEC72900.1| hypothetical protein OsI_06724 [Oryza sativa Indica Group]
Length = 274
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/169 (55%), Positives = 113/169 (66%), Gaps = 9/169 (5%)
Query: 102 LVK--YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
LVK YCECFAA +YCDGC+C C N +E+E R+EA+ L RNP AF+PKI +SP+
Sbjct: 83 LVKSWYCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTV 142
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
++++ KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+
Sbjct: 143 TVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERK 202
Query: 220 ALFH----GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELL 264
AL D N I Q AAN A+ GA GS GY + K+ E L
Sbjct: 203 ALVQMKNASDRNHI---QEAANFALSGATGSLGYKSSPVRRKKYQENSL 248
>gi|412988366|emb|CCO17702.1| unnamed protein product [Bathycoccus prasinos]
Length = 321
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 94/185 (50%), Positives = 111/185 (60%), Gaps = 44/185 (23%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNC 120
+K CNCRNSRCLKL YCECFA+G YC CNC
Sbjct: 20 RKTCNCRNSRCLKL-----------------------------YCECFASGQYCLQNCNC 50
Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
V C NNVE+ V RQEAV ATLERNPNAFRPK+ + +G + KHNKGCH
Sbjct: 51 VECKNNVENSVQRQEAVEATLERNPNAFRPKVLPTLNGGEGDE----------KHNKGCH 100
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAI 240
CKKSGCLKKYCECFQA+I CS+ CRCLDCKNFEGS+ +A+ FM ++A++
Sbjct: 101 CKKSGCLKKYCECFQASIFCSDMCRCLDCKNFEGSDAHKAIASQQE----FMPAGSDASL 156
Query: 241 IGAVG 245
+ A G
Sbjct: 157 LVAPG 161
>gi|303274438|ref|XP_003056539.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
gi|226462623|gb|EEH59915.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
Length = 255
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 84/159 (52%), Positives = 99/159 (62%), Gaps = 35/159 (22%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
K++K CNCRNSRCLKL YCECFA+G++CD CN
Sbjct: 4 KRKKYCNCRNSRCLKL-----------------------------YCECFASGLHCDSCN 34
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
CVNC NN+ R AV +TLERNPNAFRPKIA + RE A ++HN+GC
Sbjct: 35 CVNCSNNLASAAIRCTAVESTLERNPNAFRPKIAPGVGPLEKRREGA------SRHNRGC 88
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
HCKKS CLKKYCECFQANI CS+ CRC++CKNF GS +R
Sbjct: 89 HCKKSSCLKKYCECFQANIFCSDICRCMECKNFLGSSQR 127
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 28/41 (68%), Gaps = 1/41 (2%)
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
+ +K K C+C+ S CLK YCECF + + C ++C C++C N
Sbjct: 1 MPSKRKKYCNCRNSRCLKLYCECFASGLHC-DSCNCVNCSN 40
>gi|302780747|ref|XP_002972148.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
gi|302791403|ref|XP_002977468.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
gi|300154838|gb|EFJ21472.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
gi|300160447|gb|EFJ27065.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
Length = 116
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 78/116 (67%), Positives = 86/116 (74%), Gaps = 9/116 (7%)
Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD--- 161
YCECFA+G+YCDGCNC+NC NN E+E RQEAV LERNPNAFRPKI S D
Sbjct: 1 YCECFASGVYCDGCNCLNCWNNSENEKVRQEAVELALERNPNAFRPKIGSKSVSVSDFLI 60
Query: 162 -----AREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
+E RE Q A+HNKGC CKKSGCLKKYCECFQANILCSENCRC+DC+N
Sbjct: 61 CVFSLIQELVREPQATVARHNKGCQCKKSGCLKKYCECFQANILCSENCRCVDCRN 116
>gi|302851277|ref|XP_002957163.1| TSO1-like protein [Volvox carteri f. nagariensis]
gi|300257570|gb|EFJ41817.1| TSO1-like protein [Volvox carteri f. nagariensis]
Length = 1133
Score = 154 bits (390), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 68/109 (62%), Positives = 78/109 (71%)
Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
YCECFA+G YC+ CNCV C NN EHE RQ AV A LERNPNAFRPKI S+ A
Sbjct: 144 YCECFASGRYCENCNCVQCFNNREHEATRQSAVEAILERNPNAFRPKIQSNEQAPAAAAA 203
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
A +H KGC+CKKS CLKKYCECFQA I CS+NC+C++CKN+E
Sbjct: 204 VVNNAAAPGRHLKGCNCKKSSCLKKYCECFQAGIYCSDNCKCVECKNYE 252
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 27/67 (40%), Gaps = 30/67 (44%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K CNC+ S CLK KYCECF AGIYC D C CV
Sbjct: 216 KGCNCKKSSCLK-----------------------------KYCECFQAGIYCSDNCKCV 246
Query: 122 NCHNNVE 128
C N E
Sbjct: 247 ECKNYEE 253
>gi|308814306|ref|XP_003084458.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
tauri]
gi|116056343|emb|CAL56726.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
tauri]
Length = 766
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 75/156 (48%), Positives = 90/156 (57%), Gaps = 40/156 (25%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG--CNC 120
K CNC+NSRCLKL YCECFAAG C C C
Sbjct: 16 KGCNCKNSRCLKL-----------------------------YCECFAAGALCASARCRC 46
Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
NC N VEH R A+ L+RNPNAFRPKIA+ ++D +A +HN+GCH
Sbjct: 47 ANCMNTVEHAGERTAAIETVLDRNPNAFRPKIAAVASPSRD---------VALRHNRGCH 97
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
CK+SGCLKKYCECFQA I C+E CRC+ C+N+EGS+
Sbjct: 98 CKRSGCLKKYCECFQAAIYCAETCRCVSCENYEGSD 133
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 40/75 (53%), Gaps = 4/75 (5%)
Query: 169 AQLAAKHNKGCHCKKSGCLKKYCECFQANILC-SENCRCLDCKN-FEGSEERRALFHG-- 224
AQ+ KGC+CK S CLK YCECF A LC S CRC +C N E + ER A
Sbjct: 8 AQIQRGDGKGCNCKNSRCLKLYCECFAAGALCASARCRCANCMNTVEHAGERTAAIETVL 67
Query: 225 DHNGIAFMQRAANAA 239
D N AF + A A
Sbjct: 68 DRNPNAFRPKIAAVA 82
>gi|91088685|ref|XP_974962.1| PREDICTED: similar to Lin54 protein [Tribolium castaneum]
Length = 695
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 78/192 (40%), Positives = 103/192 (53%), Gaps = 39/192 (20%)
Query: 42 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
ES P +N + P+K CNC S+CLKL
Sbjct: 464 ESITMTPESNLESNGIRPRKP--CNCTKSQCLKL-------------------------- 495
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YC+CFA G +C CNC+NC NN+E+E RQ A+ LERNPNAFRPKI + A D
Sbjct: 496 ---YCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLERNPNAFRPKIGKAKDVAGD 552
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
+ KH KGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E S E+++L
Sbjct: 553 SS--------IRKHTKGCNCKRSGCLKNYCECYEAKIACSNNCKCMGCRNIEDSMEKKSL 604
Query: 222 FHGDHNGIAFMQ 233
+G+ +Q
Sbjct: 605 RASIEDGLHNLQ 616
>gi|14587305|dbj|BAB61216.1| P0460E08.26 [Oryza sativa Japonica Group]
Length = 553
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 173/352 (49%), Gaps = 45/352 (12%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDC-KNFEGSEERRALFHG--DHNGIAFMQ 233
K C+CK S CLK YCECF + + C + C C +C N + RR + N AF
Sbjct: 160 KCCNCKHSKCLKLYCECFASGVYC-DGCNCSNCFNNVKNETARREAIDATLERNPDAFRP 218
Query: 234 R-------------AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQS 280
+ AANAA+ GA+G++ + + S KRK + +AK+ R
Sbjct: 219 KIGSSPHANRNNMQAANAAVNGAIGAAAFSSPSASRKRKHIDPSLDHSAKENGAHRTNHV 278
Query: 281 QQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSS 340
Q Q + +P +P++Q + LG K R LADI+QP D+KELC LLV+VS
Sbjct: 279 PQVF-YQKNGMPSDGPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLVVVSG 337
Query: 341 EATKT---------LTKKVGAMQTEREGS-RSSVASPIQEGEDSRKGHEVHNGIPDDRQS 390
A K + +K M +REG S+AS + E++ DD +
Sbjct: 338 NAAKAYAGKKSQDKVVEKEDEMGGKREGEIGGSLASTNHDREENNNDPNNQKASADDPST 397
Query: 391 G----NQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTV 446
G + ++ + +D D + RP+SPGT ALMCDE+D+ FM S++ V
Sbjct: 398 GGTRAGKASLEESKHDCND--DQKSNRPMSPGTLALMCDEQDTMFM--------TSQNVV 447
Query: 447 QKSNAYAS---TRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSET 495
+ A + + +YAEQER +LT FRD L++L+T G +KE S +KSE
Sbjct: 448 PQQPAPVNQNQSELYAEQERCVLTEFRDCLHKLVTFGRMKEEKFSMAIKSEV 499
Score = 125 bits (314), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 66/131 (50%), Positives = 82/131 (62%), Gaps = 31/131 (23%)
Query: 40 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
K ESP+ PRA +GKDGTP K+K CNC++S+CLKL
Sbjct: 137 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKL----------------------- 172
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YCECFA+G+YCDGCNC NC NNV++E AR+EA+ ATLERNP+AFRPKI SSPH
Sbjct: 173 ------YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHA 226
Query: 159 AQDAREDAREA 169
++ + A A
Sbjct: 227 NRNNMQAANAA 237
>gi|270011672|gb|EFA08120.1| hypothetical protein TcasGA2_TC005724 [Tribolium castaneum]
Length = 752
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 78/192 (40%), Positives = 103/192 (53%), Gaps = 39/192 (20%)
Query: 42 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
ES P +N + P+K CNC S+CLKL
Sbjct: 521 ESITMTPESNLESNGIRPRKP--CNCTKSQCLKL-------------------------- 552
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YC+CFA G +C CNC+NC NN+E+E RQ A+ LERNPNAFRPKI + A D
Sbjct: 553 ---YCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLERNPNAFRPKIGKAKDVAGD 609
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
+ KH KGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E S E+++L
Sbjct: 610 SS--------IRKHTKGCNCKRSGCLKNYCECYEAKIACSNNCKCMGCRNIEDSMEKKSL 661
Query: 222 FHGDHNGIAFMQ 233
+G+ +Q
Sbjct: 662 RASIEDGLHNLQ 673
>gi|145356362|ref|XP_001422401.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582643|gb|ABP00718.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 151
Score = 148 bits (373), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 82/170 (48%), Positives = 99/170 (58%), Gaps = 31/170 (18%)
Query: 43 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
SPRS + +D + ++K+CNC+NS+CLKL
Sbjct: 12 SPRSIAARASTPRD-SASQRKRCNCKNSKCLKL--------------------------- 43
Query: 103 VKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS-SPHGAQD 161
YCECFA+G YCD CNC C NN EH RQ A+ TLERNPNAFRPKI + + +G D
Sbjct: 44 --YCECFASGKYCDACNCAGCKNNDEHAHERQSAIEQTLERNPNAFRPKIINQTTNGTLD 101
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
A A+HN+GCHCK+SGCLKKYCECFQA I C E CRC +CKN
Sbjct: 102 GAGAGDGATTEARHNRGCHCKRSGCLKKYCECFQAAIYCVERCRCAECKN 151
Score = 42.4 bits (98), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 44/90 (48%), Gaps = 4/90 (4%)
Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF-EGSEERRALFHG--DHNG 228
+A K C+CK S CLK YCECF + C + C C CKN E + ER++ + N
Sbjct: 26 SASQRKRCNCKNSKCLKLYCECFASGKYC-DACNCAGCKNNDEHAHERQSAIEQTLERNP 84
Query: 229 IAFMQRAANAAIIGAVGSSGYGTLMTSNKR 258
AF + N G + +G G T+ R
Sbjct: 85 NAFRPKIINQTTNGTLDGAGAGDGATTEAR 114
>gi|428168961|gb|EKX37899.1| hypothetical protein GUITHDRAFT_44838, partial [Guillardia theta
CCMP2712]
Length = 138
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 80/161 (49%), Positives = 101/161 (62%), Gaps = 33/161 (20%)
Query: 55 KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
+ GTP K+K CNC+NS+CLKL YCECFA+G+Y
Sbjct: 7 RPGTPAKRKPCNCKNSKCLKL-----------------------------YCECFASGLY 37
Query: 115 CD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS-PHGAQDA-REDAREAQL 171
C+ CNC +CHNNV E RQ AV +TLERNP+AFRPKI + G D+ + AR
Sbjct: 38 CEVQCNCHSCHNNVRFEWDRQAAVVSTLERNPHAFRPKITQTGQQGIVDSPAKGARWLMF 97
Query: 172 A-AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
A+H KGCHCKKSGCLKKYCECFQA ++CS+ C+C++C+N
Sbjct: 98 VKARHMKGCHCKKSGCLKKYCECFQAAVMCSDTCKCIECRN 138
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 34/77 (44%), Gaps = 3/77 (3%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN---FEGSEERRALFHGDHNGIAFMQ 233
K C+CK S CLK YCECF + + C C C C N FE + + + N AF
Sbjct: 15 KPCNCKNSKCLKLYCECFASGLYCEVQCNCHSCHNNVRFEWDRQAAVVSTLERNPHAFRP 74
Query: 234 RAANAAIIGAVGSSGYG 250
+ G V S G
Sbjct: 75 KITQTGQQGIVDSPAKG 91
>gi|328722237|ref|XP_001950656.2| PREDICTED: protein lin-54 homolog [Acyrthosiphon pisum]
Length = 750
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/205 (38%), Positives = 107/205 (52%), Gaps = 52/205 (25%)
Query: 46 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKY 105
++P++ +D K +K CNC S+CLKL Y
Sbjct: 525 TKPKSTSDVN----KTKKPCNCTRSQCLKL-----------------------------Y 551
Query: 106 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
C+CFA G +C CNC +C+NN+EHE R +A+ + L+RNPNAFRPKI S
Sbjct: 552 CDCFANGEFCYQCNCNSCYNNMEHEEDRAQAIRSVLDRNPNAFRPKIKS----------- 600
Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 225
Q +H KGC+CK+SGCLK YCECF+A I CS+NC+C+ C+N E ++ F GD
Sbjct: 601 -FVGQAERQHTKGCNCKRSGCLKNYCECFEARIACSQNCKCIGCRNME--DDSHTSF-GD 656
Query: 226 HNGIA----FMQRAANAAIIGAVGS 246
H G R +A + + GS
Sbjct: 657 HPGSLSVDNLKSRTKSANFVTSFGS 681
>gi|157111432|ref|XP_001651563.1| hypothetical protein AaeL_AAEL005893 [Aedes aegypti]
gi|108878391|gb|EAT42616.1| AAEL005893-PA [Aedes aegypti]
Length = 757
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/175 (42%), Positives = 97/175 (55%), Gaps = 40/175 (22%)
Query: 39 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
G ++ R P T D KK + CNC S+CLKL
Sbjct: 467 GSEDETRKGPAVGTIFPDEAYKK-RPCNCTKSQCLKL----------------------- 502
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YC+CFA G +C CNC +C+NN+++E RQ+A+ ATLERNP+AF+PKI G
Sbjct: 503 ------YCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFKPKI-----G 551
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
A A EDA H KGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 552 AVSADEDALRL-----HTKGCNCKRSGCLKNYCECYEAKIACSANCKCIGCRNTE 601
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 37/77 (48%), Gaps = 10/77 (12%)
Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC-KNFEGSEERRALFHG--DHNGI 229
A + C+C KS CLK YC+CF C NC C DC N + EER+ + N
Sbjct: 486 AYKKRPCNCTKSQCLKLYCDCFANGEFC-YNCNCRDCYNNLDNEEERQKAIRATLERNPS 544
Query: 230 AFMQRAANAAIIGAVGS 246
AF + IGAV +
Sbjct: 545 AFKPK------IGAVSA 555
>gi|427785019|gb|JAA57961.1| Putative dna-dependent [Rhipicephalus pulchellus]
Length = 916
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 83/187 (44%), Positives = 102/187 (54%), Gaps = 46/187 (24%)
Query: 56 DGTP-----KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFA 110
+GTP + +K CNC S+CLKL YC+CFA
Sbjct: 652 NGTPCEDNSRPRKPCNCTKSQCLKL-----------------------------YCDCFA 682
Query: 111 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 170
G +C CNC NC NN+EHE RQ+A+GA LERNPNAFRPKI G + R
Sbjct: 683 NGEFCHSCNCNNCFNNLEHEEERQKAIGACLERNPNAFRPKIGKGKEGDHERR------- 735
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGI 229
H KGC+CK+SGCLK YCEC++A ILCS C+C+ CKNFE S ER+ L D +
Sbjct: 736 ----HTKGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKNFEDSSERKTLMQLADAAEV 791
Query: 230 AFMQRAA 236
Q+AA
Sbjct: 792 RVQQQAA 798
>gi|403340562|gb|EJY69569.1| hypothetical protein OXYTRI_09693 [Oxytricha trifallax]
Length = 596
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 87/193 (45%), Positives = 100/193 (51%), Gaps = 57/193 (29%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
+K CNC+NS+CLKL YCECFA+G YC CNC
Sbjct: 33 KKNCNCKNSKCLKL-----------------------------YCECFASGEYCKNCNCN 63
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIAS----SPHGAQDA--------------- 162
C NN+E+E R+E + LERNPNAFRPKIAS SP Q
Sbjct: 64 GCCNNIENESIRKETIAIILERNPNAFRPKIASLPPQSPQITQKVVGVTGGPINNPSQIN 123
Query: 163 -------REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
+ A KH KGC CKKSGCLKKYCECFQA I CS+NC+C DCKNF+GS
Sbjct: 124 IGLSTPIMTGVNDGVAAGKHAKGCACKKSGCLKKYCECFQAGIFCSDNCKCCDCKNFDGS 183
Query: 216 EERRALFHGDHNG 228
+RR L D NG
Sbjct: 184 LDRRILL--DSNG 194
>gi|79403647|ref|NP_188237.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75136698|sp|Q700D0.1|TCX8_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 8; Short=AtTCX8
gi|45935045|gb|AAS79557.1| tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|46367484|emb|CAG25868.1| hypothetical protein [Arabidopsis thaliana]
gi|332642257|gb|AEE75778.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 368
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 137/457 (29%), Positives = 190/457 (41%), Gaps = 134/457 (29%)
Query: 27 PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
PV +LPL PP +E S + TD + T +K K C C+ S+CLKL
Sbjct: 34 PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKL----------- 78
Query: 87 NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
YC+CFA+G+ C C+CV+CHNN E AR+ A+ L RNPN
Sbjct: 79 ------------------YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPN 120
Query: 147 AFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
AF K S Q A D + L+ +GC CK++ CLKKYCECFQAN+LCS+NC+
Sbjct: 121 AFSEKALGSLTDNQCKAAPDTKPGLLS----RGCKCKRTRCLKKYCECFQANLLCSDNCK 176
Query: 206 CLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
C++CKN SE + HN +R + + LF
Sbjct: 177 CINCKNV--SEAFQPPAFSAHNSPQVYRRRRDRELTEWNSCPAP--------------LF 220
Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
+ D S+ N S + CS +P + S++G T + L
Sbjct: 221 SIP--DNSI---------QNALGSPMSCSPKLPYRK--KRSLMGYTSTLL---------- 257
Query: 326 QDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIP 385
D+ +LCSLLV S AT T ED + P
Sbjct: 258 PDLGDLCSLLVAASESATTT-------------------------AEDQNRIF----TKP 288
Query: 386 DDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCD-----EEDSAFMQAGPQTGL 440
DD+++ ++ +S S S + E S G + D EEDS
Sbjct: 289 DDKEA-----IELSSESESRN---VEEEIQSRGRLIELIDVQYNGEEDSQ---------C 331
Query: 441 ASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLI 477
+K++V + T IY EQER +L FRD L + I
Sbjct: 332 KTKTSVNE------TDIYMEQERAVLETFRDCLQKFI 362
>gi|323457208|gb|EGB13074.1| hypothetical protein AURANDRAFT_60657 [Aureococcus anophagefferens]
Length = 1772
Score = 140 bits (352), Expect = 3e-30, Method: Composition-based stats.
Identities = 72/175 (41%), Positives = 90/175 (51%), Gaps = 49/175 (28%)
Query: 52 TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
T K K+ K CNCRNSRCLKL YC+CFAA
Sbjct: 320 TAAKASATKRFKPCNCRNSRCLKL-----------------------------YCDCFAA 350
Query: 112 GIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 170
G +C+ C CV+CHN+ H R +A+ ATLE+NP AFR K+ +
Sbjct: 351 GRFCEESCKCVDCHNDQAHARDRDDAIKATLEKNPKAFRAKVDAE--------------- 395
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE---RRALF 222
A H GCHCKK+ CLKKYCECF+A I C C+C DC+N+ GS + RR L+
Sbjct: 396 -ATTHQNGCHCKKTKCLKKYCECFEAGITCGAKCKCADCENYPGSLQLAARRGLY 449
>gi|116831214|gb|ABK28561.1| unknown [Arabidopsis thaliana]
Length = 369
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 137/457 (29%), Positives = 190/457 (41%), Gaps = 134/457 (29%)
Query: 27 PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
PV +LPL PP +E S + TD + T +K K C C+ S+CLKL
Sbjct: 34 PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKL----------- 78
Query: 87 NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
YC+CFA+G+ C C+CV+CHNN E AR+ A+ L RNPN
Sbjct: 79 ------------------YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPN 120
Query: 147 AFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
AF K S Q A D + L+ +GC CK++ CLKKYCECFQAN+LCS+NC+
Sbjct: 121 AFSEKALGSLTDNQCKAAPDTKPGLLS----RGCKCKRTRCLKKYCECFQANLLCSDNCK 176
Query: 206 CLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
C++CKN SE + HN +R + + LF
Sbjct: 177 CINCKNV--SEAFQPPAFSAHNSPQVYRRRRDRELTEWNSCPAP--------------LF 220
Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
+ D S+ N S + CS +P + S++G T + L
Sbjct: 221 SIP--DNSI---------QNALGSPMSCSPKLPYRK--KRSLMGYTSTLL---------- 257
Query: 326 QDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIP 385
D+ +LCSLLV S AT T ED + P
Sbjct: 258 PDLGDLCSLLVAASESATTT-------------------------AEDQNRIF----TKP 288
Query: 386 DDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCD-----EEDSAFMQAGPQTGL 440
DD+++ ++ +S S S + E S G + D EEDS
Sbjct: 289 DDKEA-----IELSSESESRN---VEEEIQSRGRLIELIDVQYNGEEDSQ---------C 331
Query: 441 ASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLI 477
+K++V + T IY EQER +L FRD L + I
Sbjct: 332 KTKTSVNE------TDIYMEQERAVLETFRDCLQKFI 362
>gi|297834488|ref|XP_002885126.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297330966|gb|EFH61385.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 421
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 134/469 (28%), Positives = 193/469 (41%), Gaps = 116/469 (24%)
Query: 33 LPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
+P PP +E ++ + +G +K+C C+ S+CLKL
Sbjct: 38 IPPFPPISREHSETKDTTDQEGITW----RKRCRCKQSKCLKL----------------- 76
Query: 93 HLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 152
YC+CFA+G+ C C+CV+CHNN ++ AR AV L RNPNAF K+
Sbjct: 77 ------------YCDCFASGVLCTDCDCVDCHNNSDNYDARDAAVVNVLGRNPNAFNEKL 124
Query: 153 ASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
SS + Q A D R L+ +GC CK++ CLKKYCECFQAN+LCS+NC+C++CKN
Sbjct: 125 FSSINDKQCKAAPDTRPGLLS----RGCKCKRTKCLKKYCECFQANVLCSDNCKCINCKN 180
Query: 212 ---------FEGSEERRALFHGDHNGIAFMQRAANAAIIG----AVGSSGYGTLMTSNKR 258
F R L+ G N + + II +VG + + T N
Sbjct: 181 VSEAFQPSVFAWGLNSRKLYEGFDNPEK--KFVCDLGIISPSEDSVGFNSHNTAGCMNYA 238
Query: 259 KSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSA---- 314
G +A + + + Q NS S+P + NA LGS S+
Sbjct: 239 P------GFSAHNSPQVYRRRRHQELPEWNSCPAPLFSIPDNSIQNA--LGSPMSSSPKL 290
Query: 315 ----LRSPL--ADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPI 368
+ PL A + P D+ ++CSLL+ S AT + +R +
Sbjct: 291 PYRKKKPPLGYASTVVP-DLGDICSLLLAASETAT----------ANAEDQNRICIK--- 336
Query: 369 QEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEED 428
PDD+ +V S SG+ ++ + R L A EE
Sbjct: 337 ----------------PDDKVDNVYTEVLSESESGNVEEEIQSFRRLIELIDAQYNGEE- 379
Query: 429 SAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLI 477
S + + T IY EQER +L FRD L I
Sbjct: 380 --------------HSKCKTETSVHETDIYMEQERAVLETFRDCLQTFI 414
>gi|320163778|gb|EFW40677.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 491
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 92/155 (59%), Gaps = 42/155 (27%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCNC 120
+K CNC+NSRCLKL YCECFA+G YCD C C
Sbjct: 137 KKTCNCKNSRCLKL-----------------------------YCECFASGQYCDPSCKC 167
Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
V+C NN ++ ARQ+AV ATLERNP AFRPKI S+ A E+ A+H+KGCH
Sbjct: 168 VSCSNNESNQEARQQAVDATLERNPAAFRPKINSA----------ATESW--ARHSKGCH 215
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
C+KS C K+YCEC+QA I C+ +C+C+DCKN S
Sbjct: 216 CRKSNCQKRYCECYQAGIQCTAHCKCVDCKNHSHS 250
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 28/43 (65%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
K C+CK S CLK YCECF + C +C+C+ C N E ++E R
Sbjct: 138 KTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEAR 180
>gi|281363313|ref|NP_001163143.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
gi|272432466|gb|ACZ94416.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
Length = 952
Score = 138 bits (348), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 99/194 (51%), Gaps = 54/194 (27%)
Query: 33 LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
LPTLPPG + + ++P D G ++K CNC S+CLKL
Sbjct: 698 LPTLPPGVKANVPAKPLFEVLKPPATAAAAGAVDPLGGMTSRRKHCNCSKSQCLKL---- 753
Query: 80 IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
YC+CFA G +C C C +C NN+++EV R+ A+ +
Sbjct: 754 -------------------------YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRS 788
Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK YCEC++A I
Sbjct: 789 CLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYEAKIP 836
Query: 200 CSENCRCLDCKNFE 213
CS C+C+ C+N E
Sbjct: 837 CSSICKCVGCRNME 850
>gi|24653408|ref|NP_610879.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
gi|190359804|sp|A1Z9E2.1|LIN54_DROME RecName: Full=Protein lin-54 homolog; AltName: Full=Myb complex
protein of 120 kDa
gi|7303305|gb|AAF58365.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
gi|372466647|gb|AEX93136.1| FI18123p1 [Drosophila melanogaster]
Length = 950
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 99/194 (51%), Gaps = 54/194 (27%)
Query: 33 LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
LPTLPPG + + ++P D G ++K CNC S+CLKL
Sbjct: 696 LPTLPPGVKANVPAKPLFEVLKPPATAAAAGAVDPLGGMTSRRKHCNCSKSQCLKL---- 751
Query: 80 IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
YC+CFA G +C C C +C NN+++EV R+ A+ +
Sbjct: 752 -------------------------YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRS 786
Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK YCEC++A I
Sbjct: 787 CLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYEAKIP 834
Query: 200 CSENCRCLDCKNFE 213
CS C+C+ C+N E
Sbjct: 835 CSSICKCVGCRNME 848
>gi|159884199|gb|ABX00778.1| RE32583p [Drosophila melanogaster]
Length = 950
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 99/194 (51%), Gaps = 54/194 (27%)
Query: 33 LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
LPTLPPG + + ++P D G ++K CNC S+CLKL
Sbjct: 696 LPTLPPGVKANVPAKPLFEVLKPPATAAAAGAVDPLGGMTSRRKHCNCSRSQCLKL---- 751
Query: 80 IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
YC+CFA G +C C C +C NN+++EV R+ A+ +
Sbjct: 752 -------------------------YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRS 786
Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK YCEC++A I
Sbjct: 787 CLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYEAKIP 834
Query: 200 CSENCRCLDCKNFE 213
CS C+C+ C+N E
Sbjct: 835 CSSICKCVGCRNME 848
>gi|391347269|ref|XP_003747887.1| PREDICTED: uncharacterized protein LOC100905647 [Metaseiulus
occidentalis]
Length = 609
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/221 (37%), Positives = 106/221 (47%), Gaps = 53/221 (23%)
Query: 6 QAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRP-RANTDGKD---GTPKK 61
QA PQ Q PQ++ + P H + + S P +A+ D D
Sbjct: 341 QASPQSQ-HPQYLVVKQTTPQGQHSFAV--------QPQNSEPAQADYDAYDKQLSHTSS 391
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
+K CNC S+CLKL YC+CFA G +C CNCV
Sbjct: 392 KKPCNCTKSQCLKL-----------------------------YCDCFANGEFCQNCNCV 422
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C+NN+EHE R AV LERNPNAF PKI G ++ R H KGC+C
Sbjct: 423 QCYNNLEHEEERSLAVKLCLERNPNAFHPKIGKYKPGDKERR-----------HTKGCNC 471
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
K+SGCLK YCEC++A ILCS CRC+ C N E + + +L
Sbjct: 472 KRSGCLKNYCECYEARILCSGVCRCVGCHNIEENMDAESLL 512
>gi|195334093|ref|XP_002033719.1| GM21473 [Drosophila sechellia]
gi|194125689|gb|EDW47732.1| GM21473 [Drosophila sechellia]
Length = 949
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 99/194 (51%), Gaps = 54/194 (27%)
Query: 33 LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
LPTLPPG + + ++P D G ++K CNC S+CLKL
Sbjct: 695 LPTLPPGAKANVPAKPLFEVLKPAATPAAAGAVDQLGGMTSRRKHCNCSKSQCLKL---- 750
Query: 80 IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
YC+CFA G +C C C +C NN+++EV R+ A+ +
Sbjct: 751 -------------------------YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRS 785
Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK YCEC++A I
Sbjct: 786 CLDRNPSAFKPKITAPNSG--DMRL----------HNKGCNCKRSGCLKNYCECYEAKIP 833
Query: 200 CSENCRCLDCKNFE 213
C+ C+C+ C+N E
Sbjct: 834 CTSICKCVGCRNME 847
>gi|449676261|ref|XP_002165629.2| PREDICTED: uncharacterized protein LOC100197918, partial [Hydra
magnipapillata]
Length = 530
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/200 (39%), Positives = 104/200 (52%), Gaps = 42/200 (21%)
Query: 42 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
+S +S +AN + +K CNC S+CLKL
Sbjct: 298 DSNKSASKANVGPITESKILKKPCNCTKSQCLKL-------------------------- 331
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YCECFA G +C+ CNC C NN+ HEV R +A+ + L+RNP AF PKI G
Sbjct: 332 ---YCECFANGEFCNNCNCRICFNNIAHEVERSKAIKSCLDRNPYAFHPKI-----GKGK 383
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
+ D +H KGC+C++SGCLK YCEC++A ILCS C+C CKNFE S +R+ L
Sbjct: 384 IKGDTER-----RHTKGCNCRRSGCLKNYCECYEAKILCSTLCKCSGCKNFEESADRKTL 438
Query: 222 FH-GDHNGIAFMQRAANAAI 240
D + +Q+ NAAI
Sbjct: 439 MQLADAAEVRVLQQ--NAAI 456
>gi|194883323|ref|XP_001975752.1| GG20386 [Drosophila erecta]
gi|190658939|gb|EDV56152.1| GG20386 [Drosophila erecta]
Length = 944
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/210 (36%), Positives = 107/210 (50%), Gaps = 45/210 (21%)
Query: 6 QAMPQWQARPQHVRMVNRVP-HPVHKLPLPTLPPGK-QESPRSRPRANTDGKDGTPKKQK 63
Q +P Q+ P V ++P P P+PT P + P + A D G ++K
Sbjct: 676 QQLPTEQSAPTKVE--PKLPTQPPGFNPVPTKPLFDILKQPATGAAATVDALGGMTSRRK 733
Query: 64 QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNC 123
CNC S+CLKL YC+CFA G +C C C +C
Sbjct: 734 HCNCSKSQCLKL-----------------------------YCDCFANGEFCQDCTCKDC 764
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
NN+++EV R+ A+ + L+RNP+AF+PKI + G D R HNKGC+CK+
Sbjct: 765 FNNMDYEVERERAIRSCLDRNPSAFQPKITAPNSG--DMRL----------HNKGCNCKR 812
Query: 184 SGCLKKYCECFQANILCSENCRCLDCKNFE 213
SGCLK YCEC++A I C+ C+C+ C+N E
Sbjct: 813 SGCLKNYCECYEAKIPCTSICKCVGCRNME 842
>gi|308492241|ref|XP_003108311.1| CRE-LIN-54 protein [Caenorhabditis remanei]
gi|308249159|gb|EFO93111.1| CRE-LIN-54 protein [Caenorhabditis remanei]
Length = 433
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 136/302 (45%), Gaps = 51/302 (16%)
Query: 49 RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCEC 108
R T K P ++K CNC S+CLKL YC+C
Sbjct: 159 RLKTKKKVYAPGQRKPCNCTKSQCLKL-----------------------------YCDC 189
Query: 109 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 168
FA G +C CNC +CHNN++++ R +A+ +LERNPNAF+PKI + G D
Sbjct: 190 FANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGGTAD------- 242
Query: 169 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNG 228
+ H KGCHCKKSGCLK YCEC++A + C++ C+C C+N E R G
Sbjct: 243 --IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKCCQNTEAYRMTRFKNSGSLPS 300
Query: 229 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN 288
+ NA+ A SG G++MT A D + PQ +
Sbjct: 301 TNALMSLTNASST-ATPDSGPGSVMTDE-----------AGDDYDDMMLPQKPKAEMDPR 348
Query: 289 SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTK 348
+ V + A ++ + AL + ++ +Q +D K + ++ +V E + L +
Sbjct: 349 RFPWYYMTDEVVEAATMCMVAQAEEALNASFSEKMQTEDDK-IINMEKLVLREFGRCLEQ 407
Query: 349 KV 350
+
Sbjct: 408 MI 409
>gi|341881213|gb|EGT37148.1| CBN-LIN-54 protein [Caenorhabditis brenneri]
Length = 407
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 78/207 (37%), Positives = 104/207 (50%), Gaps = 40/207 (19%)
Query: 49 RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCEC 108
R T K P ++K CNC S+CLKL YC+C
Sbjct: 134 RMKTKKKVYAPGQRKPCNCTKSQCLKL-----------------------------YCDC 164
Query: 109 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 168
FA G +C CNC +CHNN++++ R +A+ +LERNPNAF+PKI + G D
Sbjct: 165 FANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIA-RGTAD------- 216
Query: 169 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG-DHN 227
+ H KGCHCKKSGCLK YCEC++A + C++ C+C C+N E R G
Sbjct: 217 --IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTRFKNSGVSVP 274
Query: 228 GIAFMQRAANAAIIGAVGSSGYGTLMT 254
G + NAA A SG G++MT
Sbjct: 275 GTNALISLTNAASSSATPDSGPGSVMT 301
>gi|268552505|ref|XP_002634235.1| C. briggsae CBR-LIN-54 protein [Caenorhabditis briggsae]
Length = 437
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 77/207 (37%), Positives = 103/207 (49%), Gaps = 39/207 (18%)
Query: 49 RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCEC 108
R T K P ++K CNC S+CLKL YC+C
Sbjct: 161 RMKTKKKVYAPGQRKPCNCTKSQCLKL-----------------------------YCDC 191
Query: 109 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 168
FA G +C CNC +CHNN++++ R +A+ +LERNPNAF+PKI + G D
Sbjct: 192 FANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGGTAD------- 244
Query: 169 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHN- 227
+ H KGCHCKKSGCLK YCEC++A + C++ C+C C+N E R G
Sbjct: 245 --IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTRFKNSGGPGA 302
Query: 228 GIAFMQRAANAAIIGAVGSSGYGTLMT 254
G + NA A SG G++MT
Sbjct: 303 GANALMSLTNANSSTATPDSGPGSVMT 329
>gi|198433102|ref|XP_002124866.1| PREDICTED: similar to lin-54 homolog [Ciona intestinalis]
Length = 605
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 83/154 (53%), Gaps = 42/154 (27%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
K +K CNC S CLKL YCECFA G +CD CN
Sbjct: 375 KIRKPCNCTKSMCLKL-----------------------------YCECFANGHFCDSCN 405
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C+NCHNN+E + R +A+ + LERNP AFRPKI +DA H KGC
Sbjct: 406 CINCHNNLEFDTDRSKAIKSCLERNPMAFRPKIGR----GRDANRT---------HQKGC 452
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+CK+SGCLK YCEC++A I C+ C+C+ CKN E
Sbjct: 453 NCKRSGCLKNYCECYEARIPCTSKCKCIGCKNLE 486
>gi|9294463|dbj|BAB02682.1| unnamed protein product [Arabidopsis thaliana]
Length = 356
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 106/320 (33%), Positives = 144/320 (45%), Gaps = 77/320 (24%)
Query: 27 PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
PV +LPL PP +E S + TD + T +K K C C+ S+CLKL
Sbjct: 34 PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKL----------- 78
Query: 87 NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
YC+CFA+G+ C C+CV+CHNN E AR+ A+ L RNPN
Sbjct: 79 ------------------YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPN 120
Query: 147 AFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
AF K S Q A D + L+ +GC CK++ CLKKYCECFQAN+LCS+NC+
Sbjct: 121 AFSEKALGSLTDNQCKAAPDTKPGLLS----RGCKCKRTRCLKKYCECFQANLLCSDNCK 176
Query: 206 CLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
C++CKN SE + HN +R + + LF
Sbjct: 177 CINCKNV--SEAFQPPAFSAHNSPQVYRRRRDRELTEWNSCPAP--------------LF 220
Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
+ D S+ N S + CS +P + S++G T + L
Sbjct: 221 SIP--DNSI---------QNALGSPMSCSPKLPYRK--KRSLMGYTSTLL---------- 257
Query: 326 QDMKELCSLLVMVSSEATKT 345
D+ +LCSLLV S AT T
Sbjct: 258 PDLGDLCSLLVAASESATTT 277
>gi|195583082|ref|XP_002081353.1| GD10970 [Drosophila simulans]
gi|194193362|gb|EDX06938.1| GD10970 [Drosophila simulans]
Length = 950
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 98/194 (50%), Gaps = 54/194 (27%)
Query: 33 LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
LPTLPPG + ++P D G ++K CNC S+CLKL
Sbjct: 696 LPTLPPGVKADVPAKPLFEVLKPAATPAAAGAVDPLGGMTSRRKHCNCSKSQCLKL---- 751
Query: 80 IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
YC+CFA G +C C C +C NN+++EV R+ A+ +
Sbjct: 752 -------------------------YCDCFANGEFCQDCTCKDCCNNLDYEVERERAIRS 786
Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK YCEC++A I
Sbjct: 787 CLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYEAKIP 834
Query: 200 CSENCRCLDCKNFE 213
C+ C+C+ C+N E
Sbjct: 835 CTSICKCVGCRNME 848
>gi|198458221|ref|XP_001360959.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
gi|198136263|gb|EAL25535.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
Length = 978
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 92/172 (53%), Gaps = 41/172 (23%)
Query: 42 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
+SP+ TD G ++K CNC S+CLKL
Sbjct: 745 KSPQLASAPTTDALAGMASRRKHCNCSKSQCLKL-------------------------- 778
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YC+CFA G +C C C +C NN+++EV R+ A+ + L+RNP+AF+PKI + G D
Sbjct: 779 ---YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSG--D 833
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
R HNKGC+CK+SGCLK YCEC++A I CS C+C+ C+N E
Sbjct: 834 MR----------LHNKGCNCKRSGCLKNYCECYEAKIPCSSICKCVGCRNME 875
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 38/84 (45%), Gaps = 6/84 (7%)
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
+I SP A DA A K C+C KS CLK YC+CF C ++C C DC
Sbjct: 742 EILKSPQLASAPTTDALAGM--ASRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCF 798
Query: 211 N---FEGSEERRALFHGDHNGIAF 231
N +E ER D N AF
Sbjct: 799 NNLDYEVERERAIRSCLDRNPSAF 822
>gi|146182846|ref|XP_001025406.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|146143702|gb|EAS05161.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 546
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 66/149 (44%), Positives = 86/149 (57%), Gaps = 33/149 (22%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
K+CNC+ S+C+KL YCEC+AAG+YC+GC+CV+
Sbjct: 69 KKCNCKQSQCIKL-----------------------------YCECYAAGVYCNGCHCVS 99
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C NN + E R +A+ TL +NP AF PKI+ A + KHNKGC CK
Sbjct: 100 CKNNDQSEGWRNQAIQNTLVKNPTAFLPKISQKQETASKVVDPLN----LGKHNKGCACK 155
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKN 211
+SGC+KKYCECFQA + CSENC+C DC+N
Sbjct: 156 RSGCMKKYCECFQAGVPCSENCKCTDCQN 184
Score = 47.0 bits (110), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 29/43 (67%), Gaps = 1/43 (2%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
K C+CK+S C+K YCEC+ A + C+ C C+ CKN + SE R
Sbjct: 69 KKCNCKQSQCIKLYCECYAAGVYCN-GCHCVSCKNNDQSEGWR 110
>gi|17541142|ref|NP_502544.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
gi|75019534|sp|Q95QD7.1|LIN54_CAEEL RecName: Full=Protein lin-54; AltName: Full=Abnormal cell lineage
protein 54
gi|6425180|emb|CAB05229.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
Length = 435
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 77/213 (36%), Positives = 107/213 (50%), Gaps = 46/213 (21%)
Query: 46 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKY 105
S R T K P ++K CNC S+CLKL Y
Sbjct: 158 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKL-----------------------------Y 188
Query: 106 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
C+CFA G +C CNC +CHNN+E++ R +A+ +LERNPNAF+PKI + G D
Sbjct: 189 CDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKPKIGIARGGITD---- 244
Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG- 224
+ H KGCHCKKSGCLK YCEC++A + C++ C+C C+N E R G
Sbjct: 245 -----IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTRYKNSGG 299
Query: 225 ---DHNGIAFMQRAANAAIIGAVGSSGYGTLMT 254
+ N + + A++ A SG G+++T
Sbjct: 300 AVSNTNALMSLTNASST----ATPDSGPGSVVT 328
>gi|17541144|ref|NP_502545.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
gi|3924783|emb|CAB05228.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
Length = 429
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 77/213 (36%), Positives = 107/213 (50%), Gaps = 46/213 (21%)
Query: 46 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKY 105
S R T K P ++K CNC S+CLKL Y
Sbjct: 152 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKL-----------------------------Y 182
Query: 106 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
C+CFA G +C CNC +CHNN+E++ R +A+ +LERNPNAF+PKI + G D
Sbjct: 183 CDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKPKIGIARGGITD---- 238
Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG- 224
+ H KGCHCKKSGCLK YCEC++A + C++ C+C C+N E R G
Sbjct: 239 -----IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTRYKNSGG 293
Query: 225 ---DHNGIAFMQRAANAAIIGAVGSSGYGTLMT 254
+ N + + A++ A SG G+++T
Sbjct: 294 AVSNTNALMSLTNASST----ATPDSGPGSVVT 322
>gi|158289963|ref|XP_559078.3| AGAP010380-PA [Anopheles gambiae str. PEST]
gi|157018411|gb|EAL41036.3| AGAP010380-PA [Anopheles gambiae str. PEST]
Length = 725
Score = 134 bits (338), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 71/185 (38%), Positives = 96/185 (51%), Gaps = 44/185 (23%)
Query: 42 ESPRSRPRANTDGKDGTPK---KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
++ + PRA T G P K++ CNC S+CLKL
Sbjct: 432 DTSKDSPRALTTTVYGFPDEAYKKRPCNCTKSQCLKL----------------------- 468
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YC+CFA G YC CNC +C N +H+ RQ+A+ +TLERNPNAF+PKI G
Sbjct: 469 ------YCDCFANGEYCYNCNCKDCFNTFDHDNERQKAIRSTLERNPNAFKPKI-----G 517
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
+ + +D H KGC+CK+SGCLK YCEC++ I CS NC+C+ +F +
Sbjct: 518 SIGSTDDGTRL-----HTKGCNCKRSGCLKNYCECYEGKIACSSNCKCVGTPSFHLLQSH 572
Query: 219 RALFH 223
LFH
Sbjct: 573 --LFH 575
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 42/82 (51%), Gaps = 10/82 (12%)
Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEERRALFHGDHNGIAF 231
A + C+C KS CLK YC+CF C NC C DC N F+ ER+ +
Sbjct: 452 AYKKRPCNCTKSQCLKLYCDCFANGEYC-YNCNCKDCFNTFDHDNERQKAIR------ST 504
Query: 232 MQRAANA--AIIGAVGSSGYGT 251
++R NA IG++GS+ GT
Sbjct: 505 LERNPNAFKPKIGSIGSTDDGT 526
>gi|195124896|ref|XP_002006919.1| GI21332 [Drosophila mojavensis]
gi|193911987|gb|EDW10854.1| GI21332 [Drosophila mojavensis]
Length = 879
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 86/153 (56%), Gaps = 41/153 (26%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNC 120
++K CNC S+CLKL YC+CFA G +C C C
Sbjct: 663 RRKHCNCSKSQCLKL-----------------------------YCDCFANGEFCQDCTC 693
Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
+C NN+++EV R+EA+ + LERNP+AF+PKI + G D R HNKGC+
Sbjct: 694 KDCFNNLDYEVKREEAIRSCLERNPSAFKPKITAPNSG--DIRL----------HNKGCN 741
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
CK+SGCLK YCEC++A I CS C+C+ C+N E
Sbjct: 742 CKRSGCLKNYCECYEAKIPCSTMCKCVGCRNME 774
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
K C+C KS CLK YC+CF C ++C C DC N E +R
Sbjct: 665 KHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCFNNLDYEVKR 706
>gi|348671359|gb|EGZ11180.1| hypothetical protein PHYSODRAFT_520155 [Phytophthora sojae]
Length = 999
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 78/215 (36%), Positives = 99/215 (46%), Gaps = 73/215 (33%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
K+ CNC+ S+CLKL YCECFA+G YCD CN
Sbjct: 563 KKAPCNCKKSKCLKL-----------------------------YCECFASGGYCDESCN 593
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH-----------GAQDAREDARE 168
C++C N E RQ+A+ A LE+NPNAF+PKI ++P GA +
Sbjct: 594 CLDCSNTPATEEVRQQAIAARLEKNPNAFKPKIGATPGMTPGSARRLSVGASGGSHASPG 653
Query: 169 AQLAAK---------------------HNKGCHCKKSGCLKKYCECFQANILCSENCRCL 207
L+ H GCHCKKS C KKYCECFQA + C ENCRC+
Sbjct: 654 TFLSPSGRLNLQQQHQLLSAGLLSTKMHKHGCHCKKSACQKKYCECFQAGVPCGENCRCI 713
Query: 208 DCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIG 242
DCKN +A NG+A A +A++G
Sbjct: 714 DCKN-------QAPCVAHANGVA----TAGSAVVG 737
>gi|390332563|ref|XP_792482.3| PREDICTED: protein lin-54 homolog [Strongylocentrotus purpuratus]
Length = 705
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 78/171 (45%), Positives = 96/171 (56%), Gaps = 42/171 (24%)
Query: 53 DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAG 112
DG P+K CNC S+CLKL YC+CFA G
Sbjct: 475 DGSGSRPRKP--CNCTKSQCLKL-----------------------------YCDCFANG 503
Query: 113 IYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 172
+C CNC NC NN++HE R +AV A L+RNP+AF PKI HG+Q R
Sbjct: 504 EFCRNCNCNNCLNNLDHEDERTKAVKACLDRNPHAFHPKIGKG-HGSQTNR--------- 553
Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
+HNKGC+CK+SGCLK YCEC++A ILCS C+C+ CKNFE S ER+ L H
Sbjct: 554 -RHNKGCNCKRSGCLKNYCECYEAKILCSNFCKCVGCKNFEESPERKTLMH 603
>gi|359476108|ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261127 [Vitis vinifera]
Length = 1001
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 100/199 (50%), Gaps = 39/199 (19%)
Query: 41 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
Q SP+ R + + DG K+CNC+ S+CLKL
Sbjct: 547 QPSPKKRRKKASSTNDG--DGCKRCNCKKSKCLKL------------------------- 579
Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YC+CFAAGIYC +GC CV C N E++ E RNP AF PKI +G+
Sbjct: 580 ----YCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAPKIVPPVNGS 635
Query: 160 Q-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
++ ED R +A+H +GC+CKKS CLKKYCEC+QAN+ CS CRC CKN G +E
Sbjct: 636 PINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGCKNVYGRKEE 695
Query: 219 RALFHGDHNGIAFMQRAAN 237
F M R AN
Sbjct: 696 YGAFK------EMMSRRAN 708
>gi|428168842|gb|EKX37782.1| hypothetical protein GUITHDRAFT_52378, partial [Guillardia theta
CCMP2712]
Length = 118
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 77/153 (50%), Positives = 92/153 (60%), Gaps = 38/153 (24%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
K+KQCNC+NS+CLKL YCECFA+G YC+ CN
Sbjct: 2 KRKQCNCKNSKCLKL-----------------------------YCECFASGSYCNLSCN 32
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS-PHGAQDAREDAREAQLAAKHNKG 178
C C NN +E R+ A+ ATLERNP AF+PKIASS P G +D+ R H KG
Sbjct: 33 CQLCQNNENYEKQRKSAIDATLERNPLAFQPKIASSAPPGIEDSPVQGR-------HTKG 85
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
CHCKKSGCLKKYCECFQA +LCS C+C +C+N
Sbjct: 86 CHCKKSGCLKKYCECFQAGVLCSAQCKCHECRN 118
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 40/69 (57%), Gaps = 3/69 (4%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE-ERRALFHG--DHNGIAFMQ 233
K C+CK S CLK YCECF + C+ +C C C+N E E +R++ + N +AF
Sbjct: 4 KQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAFQP 63
Query: 234 RAANAAIIG 242
+ A++A G
Sbjct: 64 KIASSAPPG 72
>gi|325183843|emb|CCA18301.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183968|emb|CCA18426.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 983
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/181 (39%), Positives = 89/181 (49%), Gaps = 53/181 (29%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
K+ CNC+ S+CLKL YCECFA G YCD CN
Sbjct: 583 KKAPCNCKKSKCLKL-----------------------------YCECFANGGYCDENCN 613
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK----- 174
C++C N +E RQ AV + LE+NPNAF+PKI S A + +A + +
Sbjct: 614 CLDCSNTLERIDERQAAVASRLEKNPNAFKPKILPSQSSAVKSLYEALSSTKSTSSGVFP 673
Query: 175 ------------------HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
H GCHCKKS C KKYCECFQA +LC +NCRC+DC+N E +
Sbjct: 674 MRRSTADGHRTPTGKKRMHKDGCHCKKSACQKKYCECFQAGVLCGDNCRCIDCRNVEKKK 733
Query: 217 E 217
E
Sbjct: 734 E 734
>gi|301094740|ref|XP_002896474.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109449|gb|EEY67501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 972
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/182 (39%), Positives = 89/182 (48%), Gaps = 60/182 (32%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
K+ CNC+ S+CLKL YCECFA+G YCD CN
Sbjct: 560 KKAPCNCKKSKCLKL-----------------------------YCECFASGGYCDESCN 590
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH-------GAQDAREDAR----- 167
C++C N E RQ+A+ + LE+NPNAF+PKI ++ GA+ D R
Sbjct: 591 CLDCANTTAAEDVRQQAIASRLEKNPNAFKPKIGATSTMVTVTSGGARRLSADRRASTGT 650
Query: 168 ----------------EAQLAAK--HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
A +A H GCHCKKS C KKYCECFQA + C ENCRC+DC
Sbjct: 651 FLSPPGQLSLQQRQLLSAGMATTKMHKHGCHCKKSACQKKYCECFQAGVSCGENCRCIDC 710
Query: 210 KN 211
KN
Sbjct: 711 KN 712
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 30/49 (61%)
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+ A C+CKKS CLK YCECF + C E+C CLDC N +E+ R
Sbjct: 556 IPAPKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAEDVR 604
>gi|194376810|dbj|BAG57551.1| unnamed protein product [Homo sapiens]
Length = 709
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|195400539|ref|XP_002058874.1| GJ19678 [Drosophila virilis]
gi|194156225|gb|EDW71409.1| GJ19678 [Drosophila virilis]
Length = 980
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 85/153 (55%), Gaps = 41/153 (26%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNC 120
++K CNC S+CLKL YC+CFA G +C C C
Sbjct: 765 RRKHCNCSKSQCLKL-----------------------------YCDCFANGEFCQDCTC 795
Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
+C NN+++EV R+ A+ + LERNP+AF+PKI + G D R HNKGC+
Sbjct: 796 KDCFNNLDYEVKRERAIRSCLERNPSAFKPKITAPNSG--DMRL----------HNKGCN 843
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
CK+SGCLK YCEC++A I CS C+C+ C+N E
Sbjct: 844 CKRSGCLKNYCECYEAKIPCSAMCKCVGCRNME 876
Score = 45.4 bits (106), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
L K C+C KS CLK YC+CF C ++C C DC N E +R
Sbjct: 761 LGGSRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCFNNLDYEVKR 808
>gi|195484960|ref|XP_002090893.1| GE12548 [Drosophila yakuba]
gi|194176994|gb|EDW90605.1| GE12548 [Drosophila yakuba]
Length = 950
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 76/220 (34%), Positives = 108/220 (49%), Gaps = 54/220 (24%)
Query: 6 QAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANT------------D 53
QA + + R Q + P V LPTLP G + + ++P + D
Sbjct: 671 QAQAKQRIRQQQLPTEQNAPTKVEP-KLPTLPTGLKANAPAKPLFDILKPPATGAAAAVD 729
Query: 54 GKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGI 113
G ++K CNC S+CLKL YC+CFA G
Sbjct: 730 ALGGMTSRRKHCNCSKSQCLKL-----------------------------YCDCFANGE 760
Query: 114 YCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAA 173
+C C C +C NN+++EV R+ A+ + L+RNP+AF+PKI + G D R
Sbjct: 761 FCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSG--DMRL--------- 809
Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
HNKGC+CK+SGCLK YCEC++A I C+ C+C+ C+N E
Sbjct: 810 -HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 848
>gi|165972329|ref|NP_766302.2| protein lin-54 homolog [Mus musculus]
gi|169234729|ref|NP_001108482.1| protein lin-54 homolog [Mus musculus]
gi|190359887|sp|Q571G4.2|LIN54_MOUSE RecName: Full=Protein lin-54 homolog
gi|74141051|dbj|BAE22096.1| unnamed protein product [Mus musculus]
gi|148688344|gb|EDL20291.1| expressed sequence AI461788, isoform CRA_a [Mus musculus]
Length = 749
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 77/182 (42%), Positives = 100/182 (54%), Gaps = 41/182 (22%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
+ T + +K CNC S CLKL YC+CFA G +C
Sbjct: 516 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 546
Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H
Sbjct: 547 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------H 595
Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+
Sbjct: 596 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 655
Query: 235 AA 236
A
Sbjct: 656 TA 657
>gi|301089591|ref|XP_002895080.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102431|gb|EEY60483.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 972
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 72/182 (39%), Positives = 89/182 (48%), Gaps = 60/182 (32%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
K+ CNC+ S+CLKL YCECFA+G YCD CN
Sbjct: 560 KKAPCNCKKSKCLKL-----------------------------YCECFASGGYCDESCN 590
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH-------GAQDAREDAR----- 167
C++C N E RQ+A+ + LE+NPNAF+PKI ++ GA+ D R
Sbjct: 591 CLDCTNTTAAEDVRQQAIASRLEKNPNAFKPKIGATSTMVTVTSGGARRLSADRRASTGT 650
Query: 168 ----------------EAQLA--AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
A +A H GCHCKKS C KKYCECFQA + C ENCRC+DC
Sbjct: 651 FLSPPGQLSLQQRQLLSAGMANTKMHKHGCHCKKSACQKKYCECFQAGVPCGENCRCIDC 710
Query: 210 KN 211
KN
Sbjct: 711 KN 712
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 30/49 (61%)
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+ A C+CKKS CLK YCECF + C E+C CLDC N +E+ R
Sbjct: 556 IPAPKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAEDVR 604
>gi|326680100|ref|XP_002666845.2| PREDICTED: protein lin-54 homolog [Danio rerio]
Length = 460
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 70/194 (36%), Positives = 96/194 (49%), Gaps = 40/194 (20%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
K ++ CNC S+CLKL YCECFA G +C C
Sbjct: 262 KSKRPCNCTKSQCLKL-----------------------------YCECFANGEFCSSCK 292
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC NN EH R +AV A L+RNP AFRPKI S ++ + H KGC
Sbjct: 293 CTNCFNNTEHVFERSQAVKACLDRNPGAFRPKIGSR-----------KQGNVKGCHTKGC 341
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAA 239
+CK+SGCLK YCEC++A I+C+ C+C+ C+N++ + + D + I F R +
Sbjct: 342 NCKRSGCLKNYCECYEAKIMCTSTCKCVGCRNYDEGCDMKKSSEQDRHDIYFPTRCPVSV 401
Query: 240 IIGAVGSSGYGTLM 253
I V + G L+
Sbjct: 402 ITRDVVEATCGCLL 415
>gi|60359862|dbj|BAD90150.1| mKIAA2037 protein [Mus musculus]
Length = 793
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 77/182 (42%), Positives = 100/182 (54%), Gaps = 41/182 (22%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
+ T + +K CNC S CLKL YC+CFA G +C
Sbjct: 560 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 590
Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H
Sbjct: 591 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------H 639
Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+
Sbjct: 640 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 699
Query: 235 AA 236
A
Sbjct: 700 TA 701
>gi|34365308|emb|CAE45981.1| hypothetical protein [Homo sapiens]
Length = 749
Score = 132 bits (332), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|190689471|gb|ACE86510.1| lin-54 homolog (C. elegans) protein [synthetic construct]
Length = 749
Score = 132 bits (332), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|344284817|ref|XP_003414161.1| PREDICTED: protein lin-54 homolog isoform 1 [Loxodonta africana]
Length = 749
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|169234719|ref|NP_919258.2| protein lin-54 homolog isoform a [Homo sapiens]
gi|114593921|ref|XP_517185.2| PREDICTED: protein lin-54 homolog isoform 3 [Pan troglodytes]
gi|397524648|ref|XP_003832301.1| PREDICTED: protein lin-54 homolog isoform 1 [Pan paniscus]
gi|313104222|sp|Q6MZP7.3|LIN54_HUMAN RecName: Full=Protein lin-54 homolog; AltName: Full=CXC
domain-containing protein 1
gi|119626322|gb|EAX05917.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
gi|119626324|gb|EAX05919.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
gi|355687283|gb|EHH25867.1| CXC domain-containing protein 1 [Macaca mulatta]
gi|355762609|gb|EHH62025.1| CXC domain-containing protein 1 [Macaca fascicularis]
gi|383415851|gb|AFH31139.1| protein lin-54 homolog isoform a [Macaca mulatta]
gi|410222154|gb|JAA08296.1| lin-54 homolog [Pan troglodytes]
gi|410249630|gb|JAA12782.1| lin-54 homolog [Pan troglodytes]
gi|410296420|gb|JAA26810.1| lin-54 homolog [Pan troglodytes]
gi|410329877|gb|JAA33885.1| lin-54 homolog [Pan troglodytes]
Length = 749
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|402869363|ref|XP_003898732.1| PREDICTED: protein lin-54 homolog isoform 2 [Papio anubis]
Length = 749
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|426344811|ref|XP_004038946.1| PREDICTED: protein lin-54 homolog isoform 1 [Gorilla gorilla
gorilla]
Length = 749
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|324510685|gb|ADY44467.1| Protein lin-54 [Ascaris suum]
Length = 448
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 67/163 (41%), Positives = 87/163 (53%), Gaps = 40/163 (24%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
+K CNC S CLKL YC+CFA G +C CNC
Sbjct: 221 KKPCNCTKSMCLKL-----------------------------YCDCFANGEFCKDCNCK 251
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+CHNN+++E R A+ A+LERNPNAF+PKI + G D+ H KGCHC
Sbjct: 252 DCHNNLDNEAERSRAIKASLERNPNAFKPKIGVASRGRVDSER---------LHQKGCHC 302
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG 224
KKS CLK YCEC++A + C+E C+C C+N E +R + F G
Sbjct: 303 KKSNCLKNYCECYEAKVPCTERCKCSSCRNTES--DRASKFRG 343
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 46/92 (50%), Gaps = 11/92 (11%)
Query: 137 VGATLERNPNAFRPKIAS-------SPH-GAQDAREDAREAQLAAKHNKGCHCKKSGCLK 188
V AT R P AFR A+ +P A+ + R++ + K K C+C KS CLK
Sbjct: 176 VQATEPRVPIAFRTAEAAYQQNRLLTPFLAARMVHQKKRQSGIGIK--KPCNCTKSMCLK 233
Query: 189 KYCECFQANILCSENCRCLDCKNFEGSEERRA 220
YC+CF C ++C C DC N +E R+
Sbjct: 234 LYCDCFANGEFC-KDCNCKDCHNNLDNEAERS 264
>gi|291401518|ref|XP_002717112.1| PREDICTED: lin-54 homolog [Oryctolagus cuniculus]
Length = 749
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|149701464|ref|XP_001494323.1| PREDICTED: protein lin-54 homolog isoform 2 [Equus caballus]
Length = 749
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|410903926|ref|XP_003965444.1| PREDICTED: protein lin-54 homolog [Takifugu rubripes]
Length = 807
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 77/190 (40%), Positives = 100/190 (52%), Gaps = 41/190 (21%)
Query: 48 PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCE 107
P T + P+ +K CNC S+CLKL YC+
Sbjct: 566 PLNGTSTTETAPRPRKPCNCTKSQCLKL-----------------------------YCD 596
Query: 108 CFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR 167
CFA G +C+ CNC NC NN+EHE R +A+ L+RNP AF+PKI G D R
Sbjct: 597 CFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRR---- 652
Query: 168 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDH 226
H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D
Sbjct: 653 -------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADA 705
Query: 227 NGIAFMQRAA 236
+ Q+ A
Sbjct: 706 AEVRVQQQTA 715
>gi|332233371|ref|XP_003265875.1| PREDICTED: protein lin-54 homolog isoform 1 [Nomascus leucogenys]
Length = 749
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|426231958|ref|XP_004010003.1| PREDICTED: protein lin-54 homolog isoform 1 [Ovis aries]
Length = 748
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 519 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 549
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 550 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 598
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 599 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 656
>gi|329664578|ref|NP_001192410.1| protein lin-54 homolog [Bos taurus]
gi|296486396|tpg|DAA28509.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
gi|440898648|gb|ELR50096.1| Protein lin-54-like protein [Bos grunniens mutus]
Length = 748
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 519 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 549
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 550 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 598
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 599 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 656
>gi|452819860|gb|EME26911.1| transcription factor [Galdieria sulphuraria]
Length = 284
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 61/117 (52%), Positives = 83/117 (70%), Gaps = 11/117 (9%)
Query: 102 LVKYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YC+CF +G+YC C CV C+NN +H+V R+ A+ + LE+NP+AF+PK+A+ +
Sbjct: 23 LKMYCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIKSVLEKNPDAFQPKVAT----FR 78
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
D+ L KHNKGC+C+K+ CLKKYCECFQ+ +LCSE C+C CKNFEGS E
Sbjct: 79 DS------GGLIVKHNKGCNCRKTACLKKYCECFQSGVLCSELCKCSGCKNFEGSLE 129
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
KGC C KS CLK YC+CF + + C +C C+ C N E + R
Sbjct: 13 KGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVER 55
>gi|355699397|gb|AES01114.1| lin-54-like protein [Mustela putorius furo]
Length = 748
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|403263377|ref|XP_003924012.1| PREDICTED: protein lin-54 homolog isoform 1 [Saimiri boliviensis
boliviensis]
Length = 749
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|354499573|ref|XP_003511883.1| PREDICTED: protein lin-54 homolog [Cricetulus griseus]
gi|344242817|gb|EGV98920.1| Protein lin-54-like [Cricetulus griseus]
Length = 749
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|195431948|ref|XP_002063989.1| GK15963 [Drosophila willistoni]
gi|194160074|gb|EDW74975.1| GK15963 [Drosophila willistoni]
Length = 587
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 78/227 (34%), Positives = 111/227 (48%), Gaps = 63/227 (27%)
Query: 5 SQAMPQWQARPQHVRMVNR------------------VPHPVHKLPLPTLPPGKQESPRS 46
+QA + +AR QH++ + + VP P +K L P S S
Sbjct: 294 AQAQIRQRARQQHLQQLQKAKPQLKGSQLNQSANVKPVPAPANKPNFEILKP--PSSAAS 351
Query: 47 RPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYC 106
P D + ++K CNC S+CLKL YC
Sbjct: 352 TPAV--DAQLNLASRRKHCNCSKSQCLKL-----------------------------YC 380
Query: 107 ECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 166
+CFA G +C C C +C NN+++EV R++A+ + L+RNP+AF+PKI + G D R
Sbjct: 381 DCFANGEFCKDCTCKDCFNNLDNEVEREKAIRSCLDRNPSAFKPKITAPASG--DMRL-- 436
Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
HNKGC+CK+SGCLK YCEC++A I CS C+C+ C+N E
Sbjct: 437 --------HNKGCNCKRSGCLKNYCECYEAKIPCSSMCKCVGCRNME 475
Score = 47.0 bits (110), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 36/74 (48%), Gaps = 5/74 (6%)
Query: 162 AREDAREAQL-AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC-KNFEGSEERR 219
A A +AQL A K C+C KS CLK YC+CF C ++C C DC N + ER
Sbjct: 350 ASTPAVDAQLNLASRRKHCNCSKSQCLKLYCDCFANGEFC-KDCTCKDCFNNLDNEVERE 408
Query: 220 ALFHG--DHNGIAF 231
D N AF
Sbjct: 409 KAIRSCLDRNPSAF 422
>gi|296196136|ref|XP_002745695.1| PREDICTED: protein lin-54 homolog isoform 1 [Callithrix jacchus]
Length = 749
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|118404776|ref|NP_001072590.1| protein lin-54 homolog [Xenopus (Silurana) tropicalis]
gi|123905701|sp|Q0IHV2.1|LIN54_XENTR RecName: Full=Protein lin-54 homolog
gi|114107615|gb|AAI22958.1| lin-54 homolog [Xenopus (Silurana) tropicalis]
Length = 741
Score = 131 bits (330), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 76/182 (41%), Positives = 101/182 (55%), Gaps = 43/182 (23%)
Query: 45 RSRPRANTDG---KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
+++PR +G + + +K CNC S CLKL
Sbjct: 494 QNQPRGPLNGIISSESASRPRKPCNCTKSLCLKL-------------------------- 527
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D
Sbjct: 528 ---YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESD 584
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L
Sbjct: 585 RR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTL 633
Query: 222 FH 223
H
Sbjct: 634 MH 635
>gi|165972337|ref|NP_001094034.1| protein lin-54 homolog [Rattus norvegicus]
gi|190359881|sp|Q641Z1.2|LIN54_RAT RecName: Full=Protein lin-54 homolog
gi|149046795|gb|EDL99569.1| similar to hypothetical protein, isoform CRA_b [Rattus norvegicus]
Length = 749
Score = 131 bits (330), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|301753331|ref|XP_002912513.1| PREDICTED: protein lin-54 homolog [Ailuropoda melanoleuca]
gi|281346804|gb|EFB22388.1| hypothetical protein PANDA_000261 [Ailuropoda melanoleuca]
Length = 749
Score = 131 bits (330), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|348567366|ref|XP_003469470.1| PREDICTED: protein lin-54 homolog [Cavia porcellus]
Length = 749
Score = 131 bits (330), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|359323632|ref|XP_003640147.1| PREDICTED: protein lin-54 homolog isoform 2 [Canis lupus
familiaris]
Length = 749
Score = 131 bits (330), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|410957317|ref|XP_003985276.1| PREDICTED: protein lin-54 homolog isoform 1 [Felis catus]
Length = 749
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|190690833|gb|ACE87191.1| lin-54 homolog (C. elegans) protein [synthetic construct]
Length = 749
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 97/178 (54%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQRAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|297292592|ref|XP_001083154.2| PREDICTED: protein lin-54 homolog, partial [Macaca mulatta]
Length = 813
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 584 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 614
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 615 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 663
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 664 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 721
>gi|431916156|gb|ELK16408.1| Protein lin-54 like protein [Pteropus alecto]
Length = 827
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 598 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 628
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 629 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 677
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 678 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 735
>gi|194757683|ref|XP_001961092.1| GF13699 [Drosophila ananassae]
gi|190622390|gb|EDV37914.1| GF13699 [Drosophila ananassae]
Length = 921
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 85/153 (55%), Gaps = 41/153 (26%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNC 120
++K CNC S+CLKL YC+CFA G +C C C
Sbjct: 706 RRKHCNCSKSQCLKL-----------------------------YCDCFANGEFCQNCTC 736
Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
+C NN+++EV R+ A+ + L+RNP+AF+PKI + G D R HNKGC+
Sbjct: 737 KDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSG--DMRL----------HNKGCN 784
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
CK+SGCLK YCEC++A I C+ C+C+ C+N E
Sbjct: 785 CKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 817
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 32/64 (50%), Gaps = 4/64 (6%)
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN---FEGSEERRALFHGDHN 227
L A K C+C KS CLK YC+CF C +NC C DC N +E ER D N
Sbjct: 702 LGASRRKHCNCSKSQCLKLYCDCFANGEFC-QNCTCKDCFNNLDYEVERERAIRSCLDRN 760
Query: 228 GIAF 231
AF
Sbjct: 761 PSAF 764
>gi|410038272|ref|XP_003950370.1| PREDICTED: protein lin-54 homolog [Pan troglodytes]
gi|426344815|ref|XP_004038948.1| PREDICTED: protein lin-54 homolog isoform 3 [Gorilla gorilla
gorilla]
gi|80475938|gb|AAI09279.1| LIN54 protein [Homo sapiens]
Length = 660
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|402869365|ref|XP_003898733.1| PREDICTED: protein lin-54 homolog isoform 3 [Papio anubis]
Length = 660
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|426231960|ref|XP_004010004.1| PREDICTED: protein lin-54 homolog isoform 2 [Ovis aries]
Length = 659
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 430 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 460
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 461 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 509
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 510 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 567
>gi|296082007|emb|CBI21012.3| unnamed protein product [Vitis vinifera]
Length = 1094
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/177 (41%), Positives = 91/177 (51%), Gaps = 37/177 (20%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YC+CFAAGIYC +GC CV
Sbjct: 574 KRCNCKKSKCLKL-----------------------------YCDCFAAGIYCAEGCACV 604
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCH 180
C N E++ E RNP AF PKI +G+ ++ ED R +A+H +GC+
Sbjct: 605 GCFNRAEYDDRVLETRKQIESRNPLAFAPKIVPPVNGSPINSGEDGRSTPSSARHKRGCN 664
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAAN 237
CKKS CLKKYCEC+QAN+ CS CRC CKN G +E F M R AN
Sbjct: 665 CKKSMCLKKYCECYQANVGCSAGCRCEGCKNVYGRKEEYGAFK------EMMSRRAN 715
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 47/106 (44%), Gaps = 21/106 (19%)
Query: 134 QEAVGATLERNPN----AFRPKIASSPHGAQDAREDAREAQLA---AKHN---------- 176
+ + G RN N A + S HG Q + +A L+ HN
Sbjct: 510 ENSAGTEDRRNENKASIAMSSATSQSSHGIQPTKPKKEKASLSFHVISHNFDFSASSTND 569
Query: 177 ----KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
K C+CKKS CLK YC+CF A I C+E C C+ C N ++R
Sbjct: 570 GDGCKRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDR 615
>gi|441625230|ref|XP_004089055.1| PREDICTED: protein lin-54 homolog [Nomascus leucogenys]
Length = 660
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|149701467|ref|XP_001494305.1| PREDICTED: protein lin-54 homolog isoform 1 [Equus caballus]
Length = 660
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|311262890|ref|XP_003129406.1| PREDICTED: protein lin-54 homolog [Sus scrofa]
Length = 763
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 534 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 564
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 565 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 613
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 614 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 671
>gi|403263381|ref|XP_003924014.1| PREDICTED: protein lin-54 homolog isoform 3 [Saimiri boliviensis
boliviensis]
Length = 660
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|296196138|ref|XP_002745696.1| PREDICTED: protein lin-54 homolog isoform 2 [Callithrix jacchus]
Length = 660
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|340376737|ref|XP_003386888.1| PREDICTED: protein lin-54 homolog [Amphimedon queenslandica]
Length = 408
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/191 (39%), Positives = 99/191 (51%), Gaps = 48/191 (25%)
Query: 35 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
T+PP + +S+P++ + P+K CNC+NS+CLKL
Sbjct: 155 TIPPPSHQ--QSKPKSGANLASNKPRKP--CNCKNSQCLKL------------------- 191
Query: 95 NTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
YC+CFA G +C D CNC NC N+ +HE R AV A LERNPNAF+PK+
Sbjct: 192 ----------YCDCFANGEFCRDSCNCQNCKNSFQHEGDRSRAVKACLERNPNAFKPKVG 241
Query: 154 SSPHG-AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
HG D R +H KGC+CKKS CLK YCEC++A I CS CRC+ C+N
Sbjct: 242 ---HGRVGDER----------RHIKGCNCKKSSCLKNYCECYEAKIPCSHLCRCVGCQNL 288
Query: 213 EGSEERRALFH 223
E + L
Sbjct: 289 ADRPEGKGLMQ 299
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 44/94 (46%), Gaps = 4/94 (4%)
Query: 151 KIASSPHGAQDAREDAREAQLAA-KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
K A+ P + + A LA+ K K C+CK S CLK YC+CF C ++C C +C
Sbjct: 152 KTATIPPPSHQQSKPKSGANLASNKPRKPCNCKNSQCLKLYCDCFANGEFCRDSCNCQNC 211
Query: 210 KN---FEGSEERRALFHGDHNGIAFMQRAANAAI 240
KN EG R + N AF + + +
Sbjct: 212 KNSFQHEGDRSRAVKACLERNPNAFKPKVGHGRV 245
>gi|345795675|ref|XP_003434061.1| PREDICTED: protein lin-54 homolog isoform 1 [Canis lupus
familiaris]
Length = 660
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|410957319|ref|XP_003985277.1| PREDICTED: protein lin-54 homolog isoform 2 [Felis catus]
Length = 660
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|31873949|emb|CAD97902.1| hypothetical protein [Homo sapiens]
Length = 621
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 392 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 422
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 423 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 471
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 472 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 529
>gi|380799221|gb|AFE71486.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
Length = 560
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 331 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 361
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 362 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 410
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 411 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 468
>gi|117306452|gb|AAI25581.1| Lin54 protein [Mus musculus]
Length = 515
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 77/182 (42%), Positives = 100/182 (54%), Gaps = 41/182 (22%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
+ T + +K CNC S CLKL YC+CFA G +C
Sbjct: 282 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 312
Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H
Sbjct: 313 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------H 361
Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+
Sbjct: 362 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 421
Query: 235 AA 236
A
Sbjct: 422 TA 423
>gi|363733235|ref|XP_003641221.1| PREDICTED: protein lin-54 homolog isoform 1 [Gallus gallus]
Length = 804
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 75/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 575 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 605
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN++HE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 606 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 654
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 655 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 712
>gi|126330910|ref|XP_001376980.1| PREDICTED: protein lin-54 homolog [Monodelphis domestica]
Length = 756
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 97/178 (54%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 527 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 557
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 558 CNNCFNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 606
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 607 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 664
>gi|118090184|ref|XP_420554.2| PREDICTED: protein lin-54 homolog isoform 2 [Gallus gallus]
Length = 752
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 75/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 523 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 553
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN++HE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 554 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 602
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 603 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 660
>gi|326918692|ref|XP_003205622.1| PREDICTED: protein lin-54 homolog isoform 1 [Meleagris gallopavo]
Length = 756
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 75/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 527 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 557
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN++HE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 558 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 606
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 607 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 664
>gi|327272952|ref|XP_003221248.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Anolis
carolinensis]
Length = 749
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 75/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN++HE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CANCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|224049388|ref|XP_002189980.1| PREDICTED: protein lin-54 homolog [Taeniopygia guttata]
Length = 751
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 72/164 (43%), Positives = 93/164 (56%), Gaps = 40/164 (24%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 522 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 552
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN++HE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 553 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 601
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 602 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 645
>gi|449276576|gb|EMC85038.1| Protein lin-54 like protein, partial [Columba livia]
Length = 757
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 72/164 (43%), Positives = 93/164 (56%), Gaps = 40/164 (24%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 528 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 558
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN++HE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 559 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 607
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 608 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 651
>gi|432885357|ref|XP_004074681.1| PREDICTED: protein lin-54 homolog [Oryzias latipes]
Length = 821
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 75/182 (41%), Positives = 98/182 (53%), Gaps = 41/182 (22%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
+ T + +K CNC S+CLKL YC+CFA G +C
Sbjct: 588 ETTSRPRKPCNCTKSQCLKL-----------------------------YCDCFANGEFC 618
Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
+ CNC NC NN+EHE R +A+ L+RNP AF+PKI G D R H
Sbjct: 619 NMCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRR-----------H 667
Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+
Sbjct: 668 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 727
Query: 235 AA 236
A
Sbjct: 728 TA 729
>gi|426231962|ref|XP_004010005.1| PREDICTED: protein lin-54 homolog isoform 3 [Ovis aries]
Length = 528
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|426344813|ref|XP_004038947.1| PREDICTED: protein lin-54 homolog isoform 2 [Gorilla gorilla
gorilla]
Length = 528
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|169234721|ref|NP_001108479.1| protein lin-54 homolog isoform b [Homo sapiens]
gi|169234723|ref|NP_001108480.1| protein lin-54 homolog isoform b [Homo sapiens]
gi|197098666|ref|NP_001125469.1| protein lin-54 homolog [Pongo abelii]
gi|332233373|ref|XP_003265876.1| PREDICTED: protein lin-54 homolog isoform 2 [Nomascus leucogenys]
gi|332233375|ref|XP_003265877.1| PREDICTED: protein lin-54 homolog isoform 3 [Nomascus leucogenys]
gi|332233377|ref|XP_003265878.1| PREDICTED: protein lin-54 homolog isoform 4 [Nomascus leucogenys]
gi|397524650|ref|XP_003832302.1| PREDICTED: protein lin-54 homolog isoform 2 [Pan paniscus]
gi|402869361|ref|XP_003898731.1| PREDICTED: protein lin-54 homolog isoform 1 [Papio anubis]
gi|75042058|sp|Q5RBN8.1|LIN54_PONAB RecName: Full=Protein lin-54 homolog
gi|34364698|emb|CAE45799.1| hypothetical protein [Homo sapiens]
gi|55728144|emb|CAH90822.1| hypothetical protein [Pongo abelii]
gi|158258975|dbj|BAF85458.1| unnamed protein product [Homo sapiens]
gi|190689473|gb|ACE86511.1| lin-54 homolog (C. elegans) protein [synthetic construct]
gi|190690835|gb|ACE87192.1| lin-54 homolog (C. elegans) protein [synthetic construct]
gi|410249632|gb|JAA12783.1| lin-54 homolog [Pan troglodytes]
gi|410329875|gb|JAA33884.1| lin-54 homolog [Pan troglodytes]
Length = 528
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|344284819|ref|XP_003414162.1| PREDICTED: protein lin-54 homolog isoform 2 [Loxodonta africana]
Length = 528
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|395850121|ref|XP_003797647.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Otolemur
garnettii]
Length = 544
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 64/161 (39%), Positives = 83/161 (51%), Gaps = 40/161 (24%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
K C C S CLKLD C+CFA G + + CNC +
Sbjct: 345 KPCECVISLCLKLD-----------------------------CDCFANGEFYNNCNCTS 375
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
+N++EHE RQ+A+ A L+RNP AF+PK +E H+KGC+CK
Sbjct: 376 YYNSLEHENERQKAIKACLDRNPEAFKPKTG-----------KGKEEDSDXHHSKGCNCK 424
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
SGCLK YCEC++A I+CS C+C CKNFE ER+ L H
Sbjct: 425 XSGCLKNYCECYEAKIMCSSICKCKGCKNFEEXPERKTLMH 465
>gi|338723424|ref|XP_003364724.1| PREDICTED: protein lin-54 homolog [Equus caballus]
Length = 528
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|380799223|gb|AFE71487.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
Length = 521
Score = 128 bits (322), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 292 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 322
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 323 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 371
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 372 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 429
>gi|403263379|ref|XP_003924013.1| PREDICTED: protein lin-54 homolog isoform 2 [Saimiri boliviensis
boliviensis]
Length = 528
Score = 128 bits (322), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|348535421|ref|XP_003455199.1| PREDICTED: protein lin-54 homolog [Oreochromis niloticus]
Length = 775
Score = 128 bits (322), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 76/190 (40%), Positives = 99/190 (52%), Gaps = 41/190 (21%)
Query: 48 PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCE 107
P + T + +K CNC S+CLKL YC+
Sbjct: 534 PLNGLSAAETTSRPRKPCNCTKSQCLKL-----------------------------YCD 564
Query: 108 CFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR 167
CFA G +C+ CNC NC NN+EHE R +A+ L+RNP AF+PKI G D R
Sbjct: 565 CFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRR---- 620
Query: 168 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDH 226
H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D
Sbjct: 621 -------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADA 673
Query: 227 NGIAFMQRAA 236
+ Q+ A
Sbjct: 674 AEVRVQQQTA 683
>gi|390460682|ref|XP_003732523.1| PREDICTED: protein lin-54 homolog [Callithrix jacchus]
Length = 528
Score = 128 bits (322), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|417402343|gb|JAA48022.1| Putative protein lin-54 [Desmodus rotundus]
Length = 527
Score = 128 bits (322), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 298 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 328
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 329 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 377
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 378 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 435
>gi|359323634|ref|XP_003640148.1| PREDICTED: protein lin-54 homolog isoform 3 [Canis lupus
familiaris]
gi|410957321|ref|XP_003985278.1| PREDICTED: protein lin-54 homolog isoform 3 [Felis catus]
Length = 528
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|66812974|ref|XP_640666.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
discoideum AX4]
gi|60468739|gb|EAL66741.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
discoideum AX4]
Length = 902
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 69/166 (41%), Positives = 87/166 (52%), Gaps = 40/166 (24%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
K+C+C+NS+CLK+ YCECFAA + C+GC C
Sbjct: 360 KKCHCKNSKCLKM-----------------------------YCECFAAKVLCNGCLCFG 390
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C NN + + A TLERNP+AF PKI S + KH+KGCHC+
Sbjct: 391 CQNNEANIEKVERARFQTLERNPDAFNPKIKPS-------NSKINNQIIIDKHSKGCHCR 443
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNG 228
KS CLKKYCECFQA+I C+ENC+C DCKN + L H D+ G
Sbjct: 444 KSSCLKKYCECFQASIPCNENCKCYDCKN----QAHHHLHHPDNGG 485
>gi|291243053|ref|XP_002741422.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
[Saccoglossus kowalevskii]
Length = 946
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 77/182 (42%), Positives = 95/182 (52%), Gaps = 53/182 (29%)
Query: 42 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
E+P +RPR K CNC S+CLKL
Sbjct: 495 EAPGTRPR-------------KPCNCTKSQCLKL-------------------------- 515
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YC+CFA G +C CNC NC NN+EHE R +A+ + LERNP AF PKI G D
Sbjct: 516 ---YCDCFANGEFCSNCNCTNCFNNLEHESERAKAIKSCLERNPLAFHPKIGKGKEGQAD 572
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
R HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S +R+ L
Sbjct: 573 RR-----------HNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPDRKTL 621
Query: 222 FH 223
H
Sbjct: 622 MH 623
>gi|326918694|ref|XP_003205623.1| PREDICTED: protein lin-54 homolog isoform 2 [Meleagris gallopavo]
Length = 667
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/164 (43%), Positives = 93/164 (56%), Gaps = 40/164 (24%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 438 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 468
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN++HE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 469 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 517
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 518 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMH 561
>gi|344295768|ref|XP_003419583.1| PREDICTED: tesmin [Loxodonta africana]
Length = 503
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 57/124 (45%), Positives = 82/124 (66%), Gaps = 11/124 (8%)
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
K L YC+CFA+G +C+ C+ NC NN+ HE+ R +A+ A L+RNP AF+PKI + G
Sbjct: 296 KLTLAGYCDCFASGDFCNNCSRHNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGTGNLG 355
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN++ S ER
Sbjct: 356 -----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDESPER 404
Query: 219 RALF 222
+ L
Sbjct: 405 KTLM 408
>gi|37360936|dbj|BAC98377.1| KIAA2037 protein [Homo sapiens]
Length = 494
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 265 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 295
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 296 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 344
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 345 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 402
>gi|115497062|ref|NP_001070035.1| protein lin-54 homolog [Danio rerio]
gi|123905351|sp|Q08CM4.1|LIN54_DANRE RecName: Full=Protein lin-54 homolog
gi|115313049|gb|AAI24176.1| Zgc:152921 [Danio rerio]
Length = 771
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S+CLKL YC+CFA G +C+ CN
Sbjct: 543 RPRKPCNCTRSQCLKL-----------------------------YCDCFANGEFCNNCN 573
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
CVNC NN++HE R +A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 574 CVNCFNNLDHESERLKAIKACLDRNPVAFKPKIGKGKEGESDRR-----------HSKGC 622
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CKKSGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 623 NCKKSGCLKNYCECYEAKIMCSSICKCMGCKNFEESPERKTLMHLADAAEVRVQQQTA 680
>gi|449471001|ref|XP_004153179.1| PREDICTED: CRC domain-containing protein TSO1-like, partial
[Cucumis sativus]
Length = 581
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 66/161 (40%), Positives = 86/161 (53%), Gaps = 30/161 (18%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ SRCLKL YCECFAAG+YC + C+C
Sbjct: 287 KRCNCKKSRCLKL-----------------------------YCECFAAGVYCIEPCSCQ 317
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N HE E RNP AF PK+ + + R+D+ + +A+H +GC+C
Sbjct: 318 DCFNKPIHEAMVLETRRQIESRNPLAFAPKVIMNCDLISELRDDSNKTPASARHKRGCNC 377
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
KKSGCLKKYCEC+Q + CS NCRC CKN G ++ AL
Sbjct: 378 KKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALL 418
>gi|51980517|gb|AAH82043.1| Lin54 protein [Rattus norvegicus]
Length = 495
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 73/164 (44%), Positives = 93/164 (56%), Gaps = 40/164 (24%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 266 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 296
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 297 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 345
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 346 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 389
>gi|449455360|ref|XP_004145421.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
sativus]
gi|449494974|ref|XP_004159699.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
sativus]
Length = 602
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 66/161 (40%), Positives = 86/161 (53%), Gaps = 30/161 (18%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ SRCLKL YCECFAAG+YC + C+C
Sbjct: 308 KRCNCKKSRCLKL-----------------------------YCECFAAGVYCIEPCSCQ 338
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N HE E RNP AF PK+ + + R+D+ + +A+H +GC+C
Sbjct: 339 DCFNKPIHEAMVLETRRQIESRNPLAFAPKVIMNCDLISELRDDSNKTPASARHKRGCNC 398
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
KKSGCLKKYCEC+Q + CS NCRC CKN G ++ AL
Sbjct: 399 KKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALL 439
>gi|26325238|dbj|BAC26373.1| unnamed protein product [Mus musculus]
gi|148688345|gb|EDL20292.1| expressed sequence AI461788, isoform CRA_b [Mus musculus]
gi|219520492|gb|AAI45026.1| Lin54 protein [Mus musculus]
Length = 419
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 77/182 (42%), Positives = 100/182 (54%), Gaps = 41/182 (22%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
+ T + +K CNC S CLKL YC+CFA G +C
Sbjct: 186 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 216
Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H
Sbjct: 217 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------H 265
Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+
Sbjct: 266 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 325
Query: 235 AA 236
A
Sbjct: 326 TA 327
>gi|297830990|ref|XP_002883377.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
lyrata]
gi|297329217|gb|EFH59636.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
lyrata]
Length = 684
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 89/164 (54%), Gaps = 38/164 (23%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C+
Sbjct: 390 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 420
Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
+C N HE E V AT + RNP AF PK+ S +A +DA + +A+H +
Sbjct: 421 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRSADSIMEAGDDASKTPASARHKR 476
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
GC+CKKS CLKKYCEC+Q + CS NCRC CKN G +E L
Sbjct: 477 GCNCKKSNCLKKYCECYQGGVGCSVNCRCEGCKNVFGRKEGSLL 520
>gi|89111949|ref|NP_001034746.1| tesmin isoform a [Mus musculus]
gi|205371774|sp|Q9WTJ6.2|MTL5_MOUSE RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific; AltName: Full=Testis-specific
metallothionein-like protein
gi|15617422|dbj|BAB64935.1| tesmin [Mus musculus]
gi|19353778|gb|AAH24377.1| Metallothionein-like 5, testis-specific (tesmin) [Mus musculus]
gi|148700990|gb|EDL32937.1| mCG3887, isoform CRA_d [Mus musculus]
Length = 475
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 83/136 (61%), Gaps = 15/136 (11%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G L K L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKITLSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
K+ GA R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACK 368
Query: 211 NFEGSEERRALFHGDH 226
N+E S ER+ L H
Sbjct: 369 NYEESPERKMLMSTPH 384
>gi|71051367|gb|AAH99076.1| Lin54 protein [Rattus norvegicus]
Length = 438
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 73/164 (44%), Positives = 93/164 (56%), Gaps = 40/164 (24%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 209 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 239
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 240 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 288
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 289 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 332
>gi|15617420|dbj|BAB64934.1| tesmin [Mus musculus]
Length = 475
Score = 125 bits (314), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 83/136 (61%), Gaps = 15/136 (11%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G L K L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKITLSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
K+ GA R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACK 368
Query: 211 NFEGSEERRALFHGDH 226
N+E S ER+ L H
Sbjct: 369 NYEESPERKMLMSTPH 384
>gi|443711997|gb|ELU05498.1| hypothetical protein CAPTEDRAFT_229042 [Capitella teleta]
Length = 614
Score = 125 bits (314), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 73/169 (43%), Positives = 93/169 (55%), Gaps = 39/169 (23%)
Query: 55 KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
++ + +K CNC S+CLKL YC+CFA G +
Sbjct: 386 QNAAARPRKACNCTKSQCLKL-----------------------------YCDCFANGEF 416
Query: 115 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
C CNCVNC NN+EHE R ++ LERNP AF PKI S G D R +
Sbjct: 417 CSNCNCVNCFNNLEHEEERSRSIKTCLERNPQAFHPKIGKSKAGQGDIR----------R 466
Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
HNKGC+CK+SGCLK YCEC++A ILC+ C+C+ CKNFE S +R+ L H
Sbjct: 467 HNKGCNCKRSGCLKNYCECYEAKILCTSLCKCVGCKNFEESPDRKTLMH 515
Score = 38.9 bits (89), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 19/30 (63%)
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECF 194
A E AA+ K C+C KS CLK YC+CF
Sbjct: 382 SAAEQNAAARPRKACNCTKSQCLKLYCDCF 411
>gi|219126634|ref|XP_002183557.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404794|gb|EEC44739.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1852
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 77/220 (35%), Positives = 105/220 (47%), Gaps = 57/220 (25%)
Query: 24 VPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWIN 83
P + K PL PGK+ N +G + +CNC+ S+CLKL
Sbjct: 1481 FPSSIVKTPLAV--PGKE---------NPNGITPMSTPRNRCNCKKSQCLKL-------- 1521
Query: 84 IYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLER 143
YCECF+A + C+GCNC +C N R +A+ L +
Sbjct: 1522 ---------------------YCECFSAEVMCEGCNCTDCRNLPAFAAVRAKAMTDCLNK 1560
Query: 144 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 203
N +AF+P+IA++P GA D + H+ GC CKKS CLKKYCECFQA +LC
Sbjct: 1561 NAHAFKPRIAAAP-GAADLQ----------GHSMGCKCKKSECLKKYCECFQAGVLCGLK 1609
Query: 204 CRCLDCKNFEGS----EERRALFHGDHNGIAFMQRAANAA 239
C+C CKN GS ++RR + D +G F R + A
Sbjct: 1610 CKCESCKNIAGSQKLIDKRRQM--KDTSGAEFAMRVSQEA 1647
>gi|298708780|emb|CBJ30741.1| tesmin-like, animal TCX proteins are associated with development of
both male and female reproductiv [Ectocarpus
siliculosus]
Length = 735
Score = 125 bits (313), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 78/155 (50%), Gaps = 43/155 (27%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
CNC+ S+CLKL YCECF YC+GCNCV C
Sbjct: 440 CNCKKSKCLKL-----------------------------YCECFQRQQYCNGCNCVECL 470
Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
N E RQ A+ T+ERNP AF K + R R HN GC+CKKS
Sbjct: 471 NTERTEDLRQLAIQGTIERNPQAFVSKF--------ERRAGKRS------HNAGCNCKKS 516
Query: 185 GCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
CLKKYCECFQA + C NC+C++CKN+EG+ R
Sbjct: 517 ACLKKYCECFQAGVACGTNCKCVNCKNYEGAAGGR 551
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 29/43 (67%), Gaps = 1/43 (2%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
C+CKKS CLK YCECFQ C+ C C++C N E +E+ R L
Sbjct: 440 CNCKKSKCLKLYCECFQRQQYCN-GCNCVECLNTERTEDLRQL 481
>gi|149046796|gb|EDL99570.1| similar to hypothetical protein, isoform CRA_c [Rattus norvegicus]
Length = 419
Score = 125 bits (313), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 73/164 (44%), Positives = 93/164 (56%), Gaps = 40/164 (24%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 190 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 220
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 221 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 269
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 270 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 313
>gi|156544375|ref|XP_001607403.1| PREDICTED: protein lin-54 homolog isoform 1 [Nasonia vitripennis]
gi|345480219|ref|XP_003424108.1| PREDICTED: protein lin-54 homolog isoform 2 [Nasonia vitripennis]
Length = 818
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 73/168 (43%), Positives = 91/168 (54%), Gaps = 41/168 (24%)
Query: 46 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKY 105
SR N+D P+K CNC S+CLKL Y
Sbjct: 583 SRSFNNSDANGIRPRKP--CNCTKSQCLKL-----------------------------Y 611
Query: 106 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
C+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRPKI +D R
Sbjct: 612 CDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIR-- 669
Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+HNKGC+CK+SGCLK YCEC++A I CSENC+C+ C+N E
Sbjct: 670 --------RHNKGCNCKRSGCLKNYCECYEAKIPCSENCKCIGCRNVE 709
>gi|452822459|gb|EME29478.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
sulphuraria]
Length = 542
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 100/206 (48%), Gaps = 43/206 (20%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
K++K C C+NS CLKL YCECFAAG C C+
Sbjct: 228 KERKSCKCKNSMCLKL-----------------------------YCECFAAGQLCSNCS 258
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC N+ +HE EA A L RNP AF PK+ + P +ED LA KH KGC
Sbjct: 259 CQNCLNDEDHEKEVSEARNAILLRNPAAFEPKMTAVP------KEDG---ALAIKHQKGC 309
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFE--GSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGC K YCECF A + CS C+C DC N E SE+ L H +GI R A
Sbjct: 310 NCKRSGCQKNYCECFHAGVFCSNICKCNDCGNREDDNSEKSTNLSHILSDSGIGI--RGA 367
Query: 237 NAAIIGAVGSSGYGTLMTSNKRKSEE 262
+ + G + M S +++ E
Sbjct: 368 LSPLRGETNGYQHLLFMESPRKRWRE 393
>gi|449015362|dbj|BAM78764.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 707
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 92/203 (45%), Gaps = 74/203 (36%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC+NS+CLKL YC+CFAAG YC+GC+
Sbjct: 99 RPRKPCNCKNSKCLKL-----------------------------YCDCFAAGTYCNGCH 129
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKI--------------------------- 152
C NC N ++ RQ A+ +LERNP+AFR K+
Sbjct: 130 CTNCLNLPAYDTLRQNAIRTSLERNPHAFRSKVVTLEYDTSSKDTASRQNCAATAAAAKS 189
Query: 153 -------ASSPHGAQDAREDAREAQLA-----------AKHNKGCHCKKSGCLKKYCECF 194
+S P Q E+A A+H KGC C +S CLKKYCECF
Sbjct: 190 ADVAPEDSSEPVQEQVPNEEASSTSTQPRTTGTTTQRHAQHIKGCSCLRSLCLKKYCECF 249
Query: 195 QANILCSENCRCLDCKNFEGSEE 217
Q + CS +CRC +C+NFEGS E
Sbjct: 250 QNGVYCSASCRCSNCRNFEGSAE 272
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
+A E + K C+CK S CLK YC+CF A C+ C C +C N
Sbjct: 90 NALEGGSVTRPRKPCNCKNSKCLKLYCDCFAAGTYCN-GCHCTNCLNL 136
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 24/44 (54%), Gaps = 4/44 (9%)
Query: 102 LVKYCECFAAGIYCDG-CNCVNCHN---NVEHEVARQEAVGATL 141
L KYCECF G+YC C C NC N + E AR+ V A L
Sbjct: 242 LKKYCECFQNGVYCSASCRCSNCRNFEGSAELSQARERLVLAEL 285
>gi|260803175|ref|XP_002596466.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
gi|229281723|gb|EEN52478.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
Length = 458
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/187 (41%), Positives = 100/187 (53%), Gaps = 44/187 (23%)
Query: 41 QESPRSRPRANTDGKDGT----PKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNT 96
Q + ++PR +G T + +K CNC S+CLKL
Sbjct: 210 QATTATQPRQQVNGTLNTEQVGTRPRKPCNCTKSQCLKL--------------------- 248
Query: 97 QFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
YC+CFA G +C+ CNC NC NN+EHE R +++ A LERNP AF PKI
Sbjct: 249 --------YCDCFANGEFCNNCNCNNCFNNLEHEAERAKSIKACLERNPYAFHPKIGKGK 300
Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
G D R HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S
Sbjct: 301 EGEGDRR-----------HNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESP 349
Query: 217 ERRALFH 223
ER+ L H
Sbjct: 350 ERKTLMH 356
>gi|349501047|ref|NP_001012069.2| tesmin [Rattus norvegicus]
gi|205829210|sp|Q5XHX9.2|MTL5_RAT RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific
Length = 475
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 82/136 (60%), Gaps = 15/136 (11%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G L K L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKIALAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
K+ GA R H+KGC+CK+SGCLK YCEC++A I CS C+C+ CK
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACK 368
Query: 211 NFEGSEERRALFHGDH 226
N+E S ER+ L H
Sbjct: 369 NYEESPERKMLMSTPH 384
>gi|53733621|gb|AAH83920.1| Metallothionein-like 5, testis-specific (tesmin) [Rattus
norvegicus]
Length = 467
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 82/136 (60%), Gaps = 15/136 (11%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G L K L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 256 GPALQGPPKIALAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 311
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
K+ GA R H+KGC+CK+SGCLK YCEC++A I CS C+C+ CK
Sbjct: 312 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACK 360
Query: 211 NFEGSEERRALFHGDH 226
N+E S ER+ L H
Sbjct: 361 NYEESPERKMLMSTPH 376
>gi|55729585|emb|CAH91522.1| hypothetical protein [Pongo abelii]
Length = 528
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/178 (42%), Positives = 97/178 (54%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+C A G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCCANGEFCNNCN 329
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|405973431|gb|EKC38148.1| lin-54-like protein [Crassostrea gigas]
Length = 764
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/144 (47%), Positives = 88/144 (61%), Gaps = 14/144 (9%)
Query: 94 LNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
L+ K N YC+CFA G +C CNC NC NN++HE R A+ + L+RNP AF PKI
Sbjct: 520 LSANLKQN--GYCDCFANGEFCHNCNCNNCANNLDHEEERSRAIKSCLDRNPMAFHPKIG 577
Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
G +D R HNKGC+CK+SGCLK YCEC++A I+CS +C+C+ CKNFE
Sbjct: 578 KGRDGERDRR-----------HNKGCNCKRSGCLKNYCECYEAKIMCSSSCKCIGCKNFE 626
Query: 214 GSEERRALFH-GDHNGIAFMQRAA 236
S ER+ L H D + Q+ A
Sbjct: 627 ESPERKTLMHLADAAEVRVQQQTA 650
>gi|328865133|gb|EGG13519.1| tesmin/TSO1-like protein [Dictyostelium fasciculatum]
Length = 597
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 80/150 (53%), Gaps = 41/150 (27%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
+K C+C+NS+CLKL YCECFA + C+GC+C
Sbjct: 341 RKGCHCKNSKCLKL-----------------------------YCECFANKLLCNGCHCF 371
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
CHNN + Q + LERNP AF PK +E+ ++ KH KGCHC
Sbjct: 372 GCHNNDANIEVVQRSRSNILERNPEAFNPKFKQ--------KEETKQH----KHTKGCHC 419
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
+KS CLKKYCECFQA I C E+C+C+DC+N
Sbjct: 420 RKSECLKKYCECFQAGIPCGEHCKCIDCRN 449
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
KGCHCK S CLK YCECF +LC+ C C C N + + E
Sbjct: 342 KGCHCKNSKCLKLYCECFANKLLCN-GCHCFGCHNNDANIE 381
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 35/92 (38%), Gaps = 33/92 (35%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGC 118
K K C+CR S CLK KYCECF AGI C + C
Sbjct: 412 KHTKGCHCRKSECLK-----------------------------KYCECFQAGIPCGEHC 442
Query: 119 NCVNCHNNVEHEVARQEAVGATL---ERNPNA 147
C++C N E R + L ER P A
Sbjct: 443 KCIDCRNTGEDIKERSKPKEQPLKKEERKPTA 474
>gi|300121347|emb|CBK21727.2| unnamed protein product [Blastocystis hominis]
Length = 305
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 64/172 (37%), Positives = 86/172 (50%), Gaps = 42/172 (24%)
Query: 50 ANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECF 109
+++DG P CNCR SRCLKL YCECF
Sbjct: 174 SSSDGSPHDPNHSHTCNCRKSRCLKL-----------------------------YCECF 204
Query: 110 AAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREA 169
A+ ++C+GCNCVNC NN EH R+E A ++R+P+ F P + +
Sbjct: 205 ASDVFCNGCNCVNCKNNPEHLKEREEVKQALIQRDPHVFEPHV-------------VWDT 251
Query: 170 QLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
+ +GC C+K+ C+KKYC+CFQ I C NCRC+DCKN E + RAL
Sbjct: 252 PATRTNLRGCRCQKTRCVKKYCDCFQNGICCGPNCRCVDCKNTEEAALNRAL 303
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 6/56 (10%)
Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN-----FEGSEERRALFHGD 225
H+ C+C+KS CLK YCECF +++ C+ C C++CKN E E ++AL D
Sbjct: 185 HSHTCNCRKSRCLKLYCECFASDVFCN-GCNCVNCKNNPEHLKEREEVKQALIQRD 239
>gi|115476122|ref|NP_001061657.1| Os08g0369600 [Oryza sativa Japonica Group]
gi|38637224|dbj|BAD03490.1| transcription factor-like [Oryza sativa Japonica Group]
gi|38637277|dbj|BAD03541.1| transcription factor-like [Oryza sativa Japonica Group]
gi|113623626|dbj|BAF23571.1| Os08g0369600 [Oryza sativa Japonica Group]
Length = 369
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 79/194 (40%), Positives = 107/194 (55%), Gaps = 39/194 (20%)
Query: 52 TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
++ KD T ++ C+C+ S+CLKL YCECF
Sbjct: 161 SETKDVTLTRKTNCSCKYSKCLKL-----------------------------YCECFEK 191
Query: 112 GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL 171
G YC GCNC NC NNV HE ARQ+A+ LERNP AF PK+ +S A + A E +
Sbjct: 192 GRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVLNS--TAHNCESKAAEGDI 249
Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGDHNG 228
KH KGC CK++ CLKKYCECF+A++ CSENCRC CKN++ + +R + H H
Sbjct: 250 VGKHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRISQKNTVHAHH-- 307
Query: 229 IAFMQRAANAAIIG 242
+Q A++ ++G
Sbjct: 308 ---VQNPASSGMVG 318
>gi|168046141|ref|XP_001775533.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673088|gb|EDQ59616.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1223
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 81/156 (51%), Gaps = 30/156 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
K+CNC+ S+CLKL YCECFAA +C G C C
Sbjct: 737 KRCNCKKSKCLKL-----------------------------YCECFAAREFCVGSCACR 767
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC N E+E RNP AF PKI +P + ++A + +A+H +GC+C
Sbjct: 768 NCFNKPEYEATVLNTRQQIESRNPLAFAPKIVQAPESSPIPGDEALDTPASARHKRGCNC 827
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
KKS CLKKYCEC+QA + CSE CRC C+N G +E
Sbjct: 828 KKSLCLKKYCECYQAGVGCSEGCRCEGCRNMYGRKE 863
Score = 38.9 bits (89), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 29/117 (24%), Positives = 46/117 (39%), Gaps = 39/117 (33%)
Query: 38 PGKQESPRSRPRANTDGKD--GTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLN 95
P ++P S P + D + + ++ CNC+ S CLK
Sbjct: 796 PKIVQAPESSPIPGDEALDTPASARHKRGCNCKKSLCLK--------------------- 834
Query: 96 TQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNV-------EHEVARQEAVGATLERN 144
KYCEC+ AG+ C +GC C C N E E ++ V T++++
Sbjct: 835 --------KYCECYQAGVGCSEGCRCEGCRNMYGRKEGPEEGEAKEKDQVNLTVDKS 883
>gi|334186533|ref|NP_193213.5| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
gi|395455099|sp|F4JIF5.1|TCX2_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 2; Short=AtTCX2;
AltName: Full=Protein TSO1-like 2; Short=Protein SOL2
gi|332658096|gb|AEE83496.1| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
Length = 674
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 70/184 (38%), Positives = 93/184 (50%), Gaps = 35/184 (19%)
Query: 38 PGKQESPRSRP---RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
P QE S P R D +G + K+CNC+ S+CLKL
Sbjct: 349 PALQELNLSSPKKKRVKLDSGEG--ESCKRCNCKKSKCLKL------------------- 387
Query: 95 NTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
YCECFAAG+YC + C+C++C N HE RNP AF PK+
Sbjct: 388 ----------YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVI 437
Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+ Q+ +DA + +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN
Sbjct: 438 RNSDSVQETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAF 497
Query: 214 GSEE 217
G ++
Sbjct: 498 GRKD 501
>gi|148700988|gb|EDL32935.1| mCG3887, isoform CRA_b [Mus musculus]
Length = 360
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 83/136 (61%), Gaps = 15/136 (11%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G L K L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 149 GPALQGPPKITLSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 204
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
K+ GA R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 205 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACK 253
Query: 211 NFEGSEERRALFHGDH 226
N+E S ER+ L H
Sbjct: 254 NYEESPERKMLMSTPH 269
>gi|145502865|ref|XP_001437410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404560|emb|CAK70013.1| unnamed protein product [Paramecium tetraurelia]
Length = 220
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 64/151 (42%), Positives = 89/151 (58%), Gaps = 38/151 (25%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
K+CNC+NS+C+KL YCEC+ ++CD C+C
Sbjct: 16 KKCNCKNSKCIKL-----------------------------YCECYQNKVFCDNNCHCH 46
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC NN + RQ+A+ TLE+NP+AF+PKI +S D++ D KHNKGC C
Sbjct: 47 NCFNNSSNAKQRQKAMQYTLEKNPSAFQPKITTS-----DSKPDPLNF---GKHNKGCQC 98
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNF 212
KKSGC+KKYCECFQA + CS+ C+C++C+N+
Sbjct: 99 KKSGCMKKYCECFQAKVPCSDQCKCIECRNY 129
Score = 46.2 bits (108), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 55/132 (41%), Gaps = 15/132 (11%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAA 236
K C+CK S C+K YCEC+Q + C NC C +C N + ++R Q+A
Sbjct: 16 KKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFNNSSNAKQR-------------QKAM 62
Query: 237 NAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQS 296
+ S + +T++ K + L FG K ++ ++ + VPCS
Sbjct: 63 QYTL--EKNPSAFQPKITTSDSKPDPLNFGKHNKGCQCKKSGCMKKYCECFQAKVPCSDQ 120
Query: 297 VPVSQTANASVL 308
+ N L
Sbjct: 121 CKCIECRNYGFL 132
>gi|312078614|ref|XP_003141815.1| hypothetical protein LOAG_06231 [Loa loa]
Length = 495
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 102/217 (47%), Gaps = 61/217 (28%)
Query: 3 SPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQ 62
SP A QA Q NRV PV L +P+ RP + +
Sbjct: 228 SPPLAYRAAQASIQR----NRVFTPVLAAKL---------APKKRPLGS----------K 264
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
K C+C S CLKL YC+CFA G +C+ C+C +
Sbjct: 265 KPCHCTRSMCLKL-----------------------------YCDCFANGEFCNDCDCKD 295
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C N +E+E+ R A+ +LERNPNAF+PKI + + R+ + H KGCHCK
Sbjct: 296 CKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATN---------RQVEPERLHQKGCHCK 346
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
KS CLK YCEC++A + C++ C+C+ C+N E R
Sbjct: 347 KSNCLKNYCECYEAKVPCTDRCKCVCCRNTESDRAAR 383
>gi|194389404|dbj|BAG61668.1| unnamed protein product [Homo sapiens]
Length = 348
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 73/164 (44%), Positives = 93/164 (56%), Gaps = 40/164 (24%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 119 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 149
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 150 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 198
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 199 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 242
>gi|321459097|gb|EFX70154.1| hypothetical protein DAPPUDRAFT_217492 [Daphnia pulex]
Length = 209
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/152 (44%), Positives = 83/152 (54%), Gaps = 41/152 (26%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
K +K CNC S+CLKL YC+CFA G +C GCN
Sbjct: 20 KPRKPCNCTKSQCLKL-----------------------------YCDCFANGEFCHGCN 50
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
CV C NN+EHE R A+ + L+RNP+AF+PKI G E R HNKGC
Sbjct: 51 CVCCANNLEHEELRLRAIRSCLDRNPHAFKPKI-----GVGWGPEPRR-------HNKGC 98
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
HCK+SGCLK YCEC++A I CS C+C+ CKN
Sbjct: 99 HCKRSGCLKNYCECYEAKIACSAICKCIGCKN 130
>gi|170582258|ref|XP_001896048.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
gi|158596825|gb|EDP35103.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
Length = 469
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 102/217 (47%), Gaps = 61/217 (28%)
Query: 3 SPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQ 62
SP A QA Q NRV PV L +P+ RP + +
Sbjct: 202 SPPLAYRAAQASVQR----NRVFTPVLAAKL---------APKKRPLGS----------K 238
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
K C+C S CLKL YC+CFA G +C+ C+C +
Sbjct: 239 KPCHCTRSMCLKL-----------------------------YCDCFANGEFCNDCDCKD 269
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C N +E+E+ R A+ +LERNPNAF+PKI + + R+ + H KGCHCK
Sbjct: 270 CKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATN---------RQVEPERLHQKGCHCK 320
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
KS CLK YCEC++A + C++ C+C+ C+N E R
Sbjct: 321 KSNCLKNYCECYEAKVPCTDRCKCICCRNTESDRAAR 357
>gi|393905053|gb|EFO22256.2| hypothetical protein LOAG_06231 [Loa loa]
Length = 452
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 107/210 (50%), Gaps = 48/210 (22%)
Query: 59 PKK-----QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGI 113
PKK +K C+C S CLKL YC+CFA G
Sbjct: 213 PKKRPLGSKKPCHCTRSMCLKL-----------------------------YCDCFANGE 243
Query: 114 YCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAA 173
+C+ C+C +C N +E+E+ R A+ +LERNPNAF+PKI + + R+ +
Sbjct: 244 FCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATN---------RQVEPER 294
Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQ 233
H KGCHCKKS CLK YCEC++A + C++ C+C+ C+N +E RA + +
Sbjct: 295 LHQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCVCCRN---TESDRAARLANTKSATALT 351
Query: 234 RAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
+AA +G+ + + T + S++ +E+
Sbjct: 352 DIRSAATMGS--ADFHKTTLYSDEDSDDEV 379
Score = 47.0 bits (110), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 48/99 (48%), Gaps = 15/99 (15%)
Query: 136 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 195
A A+++RN F P +A A+ ++ L +K K CHC +S CLK YC+CF
Sbjct: 192 AAQASIQRN-RVFTPVLA--------AKLAPKKRPLGSK--KPCHCTRSMCLKLYCDCFA 240
Query: 196 ANILCSENCRCLDCKN---FEGSEERRALFHGDHNGIAF 231
C+ +C C DCKN +E R + N AF
Sbjct: 241 NGEFCN-DCDCKDCKNTIEYEIERTRAIRLSLERNPNAF 278
>gi|354496627|ref|XP_003510427.1| PREDICTED: tesmin-like [Cricetulus griseus]
Length = 380
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 57/128 (44%), Positives = 80/128 (62%), Gaps = 15/128 (11%)
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
K L YC+CFA+G +C C+C NN+ HE+ R +A+ A ++RNP AF+PK+ G
Sbjct: 177 KIALAGYCDCFASGDFCTSCSC----NNLRHELERFKAIKACIDRNPEAFQPKMGKGRLG 232
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER
Sbjct: 233 VSKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPER 281
Query: 219 RALFHGDH 226
+ L + H
Sbjct: 282 KMLMNTPH 289
>gi|297830988|ref|XP_002883376.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
lyrata]
gi|297329216|gb|EFH59635.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
lyrata]
Length = 623
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 73/208 (35%), Positives = 104/208 (50%), Gaps = 38/208 (18%)
Query: 19 RMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFV 78
+++ VP + L + P K+ R ++ G+ G+ K+CNC+ S+CLKL
Sbjct: 302 KVIEVVPKSSGLVELTPISPKKK-----RRKSEQSGEGGS--SCKRCNCKKSKCLKL--- 351
Query: 79 EIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAV 137
YCECFAAG+YC + C+C++C N HE
Sbjct: 352 --------------------------YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATR 385
Query: 138 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
RNP AF PK+ + + EDA + +A+H +GC+CKKS CLKKYCEC+Q
Sbjct: 386 KQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECYQGG 445
Query: 198 ILCSENCRCLDCKNFEGSEERRALFHGD 225
+ CS NCRC CKN G ++ +LF D
Sbjct: 446 VGCSINCRCEGCKNAFGRKD-GSLFEQD 472
>gi|242019702|ref|XP_002430298.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212515413|gb|EEB17560.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 995
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 70/154 (45%), Positives = 90/154 (58%), Gaps = 40/154 (25%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S+CLKL YCECFA G +C CN
Sbjct: 779 RMKKPCNCTRSQCLKL-----------------------------YCECFANGEFCFQCN 809
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE ARQ+++ + LERNP AFRPKI + ED R +HNKGC
Sbjct: 810 CNNCYNNIEHEEARQKSIKSCLERNPCAFRPKIQ-----IGETEEDER------RHNKGC 858
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
HCK+SGCLK YCEC++A I CS++C+C+ C+N E
Sbjct: 859 HCKRSGCLKNYCECYEAKITCSKSCKCIGCRNTE 892
>gi|47228462|emb|CAG05282.1| unnamed protein product [Tetraodon nigroviridis]
Length = 798
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 64/130 (49%), Positives = 84/130 (64%), Gaps = 11/130 (8%)
Query: 94 LNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
+ +Q + L YC+CFA G +C+ CNC NC NN+EHE R +A+ L+RNP AF+PKI
Sbjct: 579 VQSQARMPLNGYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIG 638
Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
G D R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE
Sbjct: 639 KGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFE 687
Query: 214 GSEERRALFH 223
S ER+ L H
Sbjct: 688 ESPERKTLMH 697
>gi|4581559|gb|AAD24666.1|U67176_1 tesmin-1 [Mus musculus]
gi|4581561|gb|AAD24667.1|U77383_1 tesmin-2 [Mus musculus]
gi|21464732|gb|AAM54491.1|AF329359_1 tesmin [Mus musculus]
gi|21464734|gb|AAM54492.1|AF329360_1 tesmin [Mus musculus]
gi|148700989|gb|EDL32936.1| mCG3887, isoform CRA_c [Mus musculus]
gi|148700991|gb|EDL32938.1| mCG3887, isoform CRA_c [Mus musculus]
Length = 295
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 83/136 (61%), Gaps = 15/136 (11%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G L K L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 84 GPALQGPPKITLSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 139
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
K+ GA R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 140 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACK 188
Query: 211 NFEGSEERRALFHGDH 226
N+E S ER+ L H
Sbjct: 189 NYEESPERKMLMSTPH 204
>gi|6739821|gb|AAF27433.1|AF206324_1 putative DNA binding protein [Arabidopsis thaliana]
gi|7767425|gb|AAF69124.1|AF204059_1 CXC domain protein TSO1 [Arabidopsis thaliana]
Length = 695
Score = 122 bits (306), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 88/164 (53%), Gaps = 38/164 (23%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C+
Sbjct: 401 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 431
Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
+C N HE E V AT + RNP AF PK+ + +A +DA + +A+H +
Sbjct: 432 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPASARHKR 487
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
GC+CKKS C+KKYCEC+Q + CS NCRC C N G ++ L
Sbjct: 488 GCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLL 531
>gi|30686784|ref|NP_566718.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75274232|sp|Q9LUI3.1|TSO1_ARATH RecName: Full=CRC domain-containing protein TSO1
gi|9279696|dbj|BAB01253.1| DNA binding protein-like [Arabidopsis thaliana]
gi|332643155|gb|AEE76676.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 695
Score = 122 bits (306), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 88/164 (53%), Gaps = 38/164 (23%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C+
Sbjct: 401 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 431
Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
+C N HE E V AT + RNP AF PK+ + +A +DA + +A+H +
Sbjct: 432 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPASARHKR 487
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
GC+CKKS C+KKYCEC+Q + CS NCRC C N G ++ L
Sbjct: 488 GCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLL 531
>gi|414877954|tpg|DAA55085.1| TPA: hypothetical protein ZEAMMB73_985885 [Zea mays]
Length = 772
Score = 122 bits (305), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 69/183 (37%), Positives = 92/183 (50%), Gaps = 33/183 (18%)
Query: 41 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
Q SP+ + R D DGT K+C+C+ S+CLKL
Sbjct: 443 QGSPKKK-RHKFDSGDGT--SCKRCSCKKSKCLKL------------------------- 474
Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFAAG+YC + C+C+ C NN H RNP AF PK+ +
Sbjct: 475 ----YCECFAAGVYCSEPCSCIGCMNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSEPG 530
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+ R+ + + +A+H +GC+CKKS CLKKYCECFQ + CS +CRC CKN G E
Sbjct: 531 LELRDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKNAFGKREGA 590
Query: 220 ALF 222
A+
Sbjct: 591 AVL 593
>gi|402586637|gb|EJW80574.1| hypothetical protein WUBG_08516, partial [Wuchereria bancrofti]
Length = 391
Score = 122 bits (305), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 84/158 (53%), Gaps = 38/158 (24%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
+K C+C S CLKL YC+CFA G +C+ C+C
Sbjct: 234 KKPCHCTRSMCLKL-----------------------------YCDCFANGEFCNDCDCK 264
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N +E+E+ R A+ +LERNPNAF+PKI + + R+ + H KGCHC
Sbjct: 265 DCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATN---------RQVEPERLHQKGCHC 315
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
KKS CLK YCEC++A + C++ C+C+ C+N E R
Sbjct: 316 KKSNCLKNYCECYEAKVPCTDRCKCICCRNTESDRAAR 353
Score = 45.8 bits (107), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 34/68 (50%), Gaps = 8/68 (11%)
Query: 171 LAAKH----NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN---FEGSEERRALFH 223
LAAK K CHC +S CLK YC+CF C+ +C C DCKN +E R
Sbjct: 225 LAAKRPLGSKKPCHCTRSMCLKLYCDCFANGEFCN-DCDCKDCKNTIEYEIERTRAIRLS 283
Query: 224 GDHNGIAF 231
+ N AF
Sbjct: 284 LERNPNAF 291
>gi|18403510|ref|NP_566717.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75153799|sp|Q8L548.1|TCX3_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 3; Short=AtTCX3;
AltName: Full=Protein TSO1-like 1; Short=Protein SOL1
gi|20466374|gb|AAM20504.1| unknown protein [Arabidopsis thaliana]
gi|22136318|gb|AAM91237.1| unknown protein [Arabidopsis thaliana]
gi|332643153|gb|AEE76674.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 609
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 85/164 (51%), Gaps = 31/164 (18%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG YC + C+C+
Sbjct: 328 KRCNCKKSKCLKL-----------------------------YCECFAAGFYCIEPCSCI 358
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC N H+ RNP AF PK+ + + EDA + +A+H +GC+C
Sbjct: 359 NCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNC 418
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 225
KKS CLKKYCEC+Q + CS NCRC CKN G ++ +LF D
Sbjct: 419 KKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 461
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 8/130 (6%)
Query: 89 SPGAHLNT---QFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQ----EAVGATL 141
S G HLNT KD V F+ I + + + +H++ R+ + G +
Sbjct: 234 SIGLHLNTIAMSSKDKNVANEYSFSGNIKVGVQSSLTPVLHSQHDIVRENESGKDSGQII 293
Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
E P + + +P + R + ++ K C+CKKS CLK YCECF A C
Sbjct: 294 EVVPKSLA-SVDLTPISPKKKRRKSEQSGEGDSSCKRCNCKKSKCLKLYCECFAAGFYCI 352
Query: 202 ENCRCLDCKN 211
E C C++C N
Sbjct: 353 EPCSCINCFN 362
>gi|7767427|gb|AAF69125.1|AF205142_1 CXC domain containing TSO1-like protein 1 [Arabidopsis thaliana]
Length = 609
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 85/164 (51%), Gaps = 31/164 (18%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG YC + C+C+
Sbjct: 328 KRCNCKKSKCLKL-----------------------------YCECFAAGFYCIEPCSCI 358
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC N H+ RNP AF PK+ + + EDA + +A+H +GC+C
Sbjct: 359 NCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNC 418
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 225
KKS CLKKYCEC+Q + CS NCRC CKN G ++ +LF D
Sbjct: 419 KKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 461
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 8/130 (6%)
Query: 89 SPGAHLNT---QFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQ----EAVGATL 141
S G HLNT KD V F+ I + + + +H++ R+ + G +
Sbjct: 234 SIGLHLNTIAMSSKDKNVANEYSFSGNIKVGVQSSLTPVLHSQHDIVRENESGKDSGQII 293
Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
E P + + +P + R + ++ K C+CKKS CLK YCECF A C
Sbjct: 294 EVVPKSLA-SVDLTPISPKKKRRKSEQSGEGDSSCKRCNCKKSKCLKLYCECFAAGFYCI 352
Query: 202 ENCRCLDCKN 211
E C C++C N
Sbjct: 353 EPCSCINCFN 362
>gi|413952362|gb|AFW85011.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
Length = 235
Score = 121 bits (304), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 64/120 (53%), Positives = 74/120 (61%), Gaps = 30/120 (25%)
Query: 38 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQ 97
P K +SP+ R +GKD TP K+K CNCRNSRCLKL
Sbjct: 131 PLKPDSPKPWARL-YEGKDSTPTKKKCCNCRNSRCLKL---------------------- 167
Query: 98 FKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
YCECFA+G +CDGCNC NC NN E+EVAR+EA+ ATLERNP+AFRPKI SSPH
Sbjct: 168 -------YCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIGSSPH 220
Score = 39.7 bits (91), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
K C+C+ S CLK YCECF + C + C C +C N +E R
Sbjct: 155 KCCNCRNSRCLKLYCECFASGTHC-DGCNCTNCFNNPENEVAR 196
>gi|255542340|ref|XP_002512233.1| tso1, putative [Ricinus communis]
gi|223548194|gb|EEF49685.1| tso1, putative [Ricinus communis]
Length = 729
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/155 (40%), Positives = 81/155 (52%), Gaps = 30/155 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YC+CFAAG+YC + C+C
Sbjct: 450 KRCNCKRSKCLKL-----------------------------YCDCFAAGLYCIEPCSCQ 480
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N HE E RNP AF PK+ S D ++ + +A+H +GC+C
Sbjct: 481 DCFNKPAHEDTVLETRKQIESRNPLAFAPKVIRSTDYVSDFGDETNKTPASARHKRGCNC 540
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
KKS CLKKYCECFQ + CS NCRC CKN GS+
Sbjct: 541 KKSNCLKKYCECFQGGVGCSSNCRCEGCKNSFGSK 575
>gi|223995127|ref|XP_002287247.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976363|gb|EED94690.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1174
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 94/203 (46%), Gaps = 54/203 (26%)
Query: 54 GKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGI 113
GK+ + + + C C+NS+CLKL YC+CF+A
Sbjct: 852 GKENS-ENESTCKCKNSKCLKL-----------------------------YCDCFSAEK 881
Query: 114 YCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAA 173
YC GC C++C N E R +A+ T RNPNAF+ K+ +S
Sbjct: 882 YCVGCKCIDCQNTPTFESIRAKAIADTRARNPNAFKTKLNAS----------------VT 925
Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS----EERRALFHGDHNGI 229
+H GC CKKS CLKKYCECF ++C E+C+C +CKN+ GS E RR L D G
Sbjct: 926 EHATGCKCKKSECLKKYCECFANLVVCGESCKCSNCKNYVGSQALIERRRKL--KDAKGT 983
Query: 230 AFMQRAANAAIIGAVGSS--GYG 250
R G++ S YG
Sbjct: 984 ELAMRPVARGWKGSMSDSKVAYG 1006
>gi|328785527|ref|XP_392991.3| PREDICTED: protein lin-54 homolog isoform 1 [Apis mellifera]
Length = 804
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)
Query: 35 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
TL G +S R+ T + + +K CNC S+CLKL
Sbjct: 580 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 620
Query: 95 NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
YC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRPKI
Sbjct: 621 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 670
Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
D R +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 671 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 719
>gi|395542051|ref|XP_003772948.1| PREDICTED: protein lin-54 homolog, partial [Sarcophilus harrisii]
Length = 360
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 97/178 (54%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 131 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 161
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 162 CNNCFNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 210
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 211 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 268
>gi|340722922|ref|XP_003399848.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus terrestris]
Length = 835
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)
Query: 35 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
TL G +S R+ T + + +K CNC S+CLKL
Sbjct: 588 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 628
Query: 95 NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
YC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRPKI
Sbjct: 629 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 678
Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
D R +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 679 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 727
>gi|340722920|ref|XP_003399847.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus terrestris]
Length = 829
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)
Query: 35 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
TL G +S R+ T + + +K CNC S+CLKL
Sbjct: 582 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 622
Query: 95 NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
YC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRPKI
Sbjct: 623 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 672
Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
D R +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 673 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 721
>gi|380022721|ref|XP_003695187.1| PREDICTED: protein lin-54 homolog [Apis florea]
Length = 827
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)
Query: 35 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
TL G +S R+ T + + +K CNC S+CLKL
Sbjct: 580 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 620
Query: 95 NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
YC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRPKI
Sbjct: 621 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 670
Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
D R +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 671 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 719
>gi|350403514|ref|XP_003486824.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus impatiens]
Length = 829
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)
Query: 35 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
TL G +S R+ T + + +K CNC S+CLKL
Sbjct: 582 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 622
Query: 95 NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
YC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRPKI
Sbjct: 623 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 672
Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
D R +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 673 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 721
>gi|350403512|ref|XP_003486823.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus impatiens]
Length = 835
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)
Query: 35 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
TL G +S R+ T + + +K CNC S+CLKL
Sbjct: 588 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 628
Query: 95 NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
YC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRPKI
Sbjct: 629 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 678
Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
D R +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 679 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 727
>gi|343959662|dbj|BAK63688.1| hypothetical protein [Pan troglodytes]
Length = 348
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/119 (53%), Positives = 82/119 (68%), Gaps = 11/119 (9%)
Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
YC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R
Sbjct: 135 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR- 193
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 194 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 242
>gi|15028139|gb|AAK76693.1| putative DNA binding protein [Arabidopsis thaliana]
Length = 526
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/160 (40%), Positives = 87/160 (54%), Gaps = 38/160 (23%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C+
Sbjct: 232 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 262
Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
+C N HE E V AT + RNP AF PK+ + +A +DA + +A+H +
Sbjct: 263 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPASARHKR 318
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
GC+CKKS C+KKYCEC+Q + CS NCRC C N G ++
Sbjct: 319 GCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKD 358
>gi|23297130|gb|AAN13100.1| putative DNA binding protein [Arabidopsis thaliana]
Length = 526
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/160 (40%), Positives = 87/160 (54%), Gaps = 38/160 (23%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C+
Sbjct: 232 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 262
Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
+C N HE E V AT + RNP AF PK+ + +A +DA + +A+H +
Sbjct: 263 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPASARHKR 318
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
GC+CKKS C+KKYCEC+Q + CS NCRC C N G ++
Sbjct: 319 GCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKD 358
>gi|383858991|ref|XP_003704982.1| PREDICTED: protein lin-54 homolog isoform 2 [Megachile rotundata]
Length = 802
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 90/169 (53%), Gaps = 39/169 (23%)
Query: 45 RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
+S R+ T + + +K CNC S+CLKL
Sbjct: 589 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL----------------------------- 619
Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
YC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRPKI D R
Sbjct: 620 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR- 678
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 679 ---------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 718
>gi|326500406|dbj|BAK06292.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 782
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/171 (37%), Positives = 86/171 (50%), Gaps = 30/171 (17%)
Query: 48 PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCE 107
P+ +G + K+C+C+ S+CLKL YCE
Sbjct: 458 PKKRRKSDNGDDESCKRCSCKKSKCLKL-----------------------------YCE 488
Query: 108 CFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 166
CFAAG+YC + C+C C N HE RNP AF PK+ Q+A+ED
Sbjct: 489 CFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKHIEFRNPLAFAPKVIRMSEAGQEAQEDP 548
Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
+ +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G+ +
Sbjct: 549 KNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSNNCRCETCKNTFGTRD 599
>gi|383858989|ref|XP_003704981.1| PREDICTED: protein lin-54 homolog isoform 1 [Megachile rotundata]
Length = 826
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 90/169 (53%), Gaps = 39/169 (23%)
Query: 45 RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
+S R+ T + + +K CNC S+CLKL
Sbjct: 589 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL----------------------------- 619
Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
YC+CFA G +C CNC NC NN+ +E RQ A+ + LERNPNAFRPKI D R
Sbjct: 620 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR- 678
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 679 ---------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 718
>gi|414883728|tpg|DAA59742.1| TPA: hypothetical protein ZEAMMB73_893320 [Zea mays]
Length = 800
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 87/177 (49%), Gaps = 32/177 (18%)
Query: 39 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
G+ SP+ + R +G DG + K+C+C+ S+CLKL
Sbjct: 460 GQSVSPQKKRRKTDNGDDG--EACKRCSCKKSKCLKL----------------------- 494
Query: 99 KDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
YCECFAAG+YC + C+C C N HE RNP AF PK+
Sbjct: 495 ------YCECFAAGVYCSEPCSCQGCMNKPIHEETVLSTRKQIEFRNPLAFAPKVIRMSD 548
Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+ ED +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G
Sbjct: 549 AGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCKNAFG 605
>gi|145479557|ref|XP_001425801.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392873|emb|CAK58403.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 79/238 (33%), Positives = 112/238 (47%), Gaps = 53/238 (22%)
Query: 1 MRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLP-PGKQESPRSRPRANTDGKDGTP 59
RSP ++ + R ++ +N V TL P K S + R T K P
Sbjct: 135 FRSPPKSPHRKLERATKLKFINFV----------TLKKPLKFTSDQVNNRKRTSSKRRKP 184
Query: 60 KKQKQ-----CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
+ +++ CNC+ S+CLKL YC+CFAAG+
Sbjct: 185 QTEQKPKIIVCNCKKSKCLKL-----------------------------YCDCFAAGVP 215
Query: 115 C-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAA 173
C CNC +CHNN +H R+ + +ERNP AFRPK+ S + + + R
Sbjct: 216 CGKDCNCCSCHNNEDHVKERENIIKQIMERNPQAFRPKVESRSNSEDEIDQKPR------ 269
Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAF 231
H KGC+CKKS CLKKYCEC+Q + CSE C+C +CKN E ++ + H +F
Sbjct: 270 -HFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDECKNCEMPVKKESRKRVKHTHSSF 326
>gi|307182142|gb|EFN69485.1| Protein lin-54-like protein [Camponotus floridanus]
Length = 822
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 67/151 (44%), Positives = 82/151 (54%), Gaps = 39/151 (25%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
K CNC S+CLKL YC+CFA G +C CNC N
Sbjct: 603 KPCNCTKSQCLKL-----------------------------YCDCFANGEFCHMCNCNN 633
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C NN+ +E RQ A+ + LERNPNAFRPKI D R +HNKGC+CK
Sbjct: 634 CSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCK 683
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 684 RSGCLKNYCECYEAKIPCSANCKCIGCRNIE 714
>gi|167536200|ref|XP_001749772.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771699|gb|EDQ85361.1| predicted protein [Monosiga brevicollis MX1]
Length = 429
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 79/150 (52%), Gaps = 39/150 (26%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
+ CNC+NS+CLKL YCECFA G YC CNCV
Sbjct: 216 RTCNCKNSKCLKL-----------------------------YCECFANGEYCGPNCNCV 246
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N E R++A+ TL +NP+AFRPK GA +R+ A + H KGC C
Sbjct: 247 SCGNREATETLRKQAIEVTLAKNPDAFRPKF-----GANKSRQSAAPS----AHVKGCRC 297
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
+S C+K+YCECFQA I C+ C+C DC N
Sbjct: 298 ARSKCIKRYCECFQAGIPCTSACQCRDCHN 327
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 26/47 (55%)
Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
A + C+CK S CLK YCECF C NC C+ C N E +E R
Sbjct: 212 ASGGRTCNCKNSKCLKLYCECFANGEYCGPNCNCVSCGNREATETLR 258
>gi|255548982|ref|XP_002515547.1| tso1, putative [Ricinus communis]
gi|223545491|gb|EEF46996.1| tso1, putative [Ricinus communis]
Length = 873
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 65/162 (40%), Positives = 83/162 (51%), Gaps = 31/162 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ ++CLKL YC+CFAAGIYC D C C
Sbjct: 453 KRCNCKKTKCLKL-----------------------------YCDCFAAGIYCADPCACQ 483
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA-AKHNKGCH 180
+C N E+E E RNP AF PKI +RED + + ++H +GC+
Sbjct: 484 DCFNRPEYEDTVLETRQQIESRNPLAFAPKIVQHAKEFAASREDRSSSMPSLSRHKRGCN 543
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
CKKS CLKKYCEC+QAN+ CS CRC CKN G +E +
Sbjct: 544 CKKSMCLKKYCECYQANVGCSSECRCEGCKNGYGRKEEYGII 585
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 30/51 (58%)
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
D E+ + + K C+CKK+ CLK YC+CF A I C++ C C DC N
Sbjct: 437 DNFEEKKSLSIDGDGCKRCNCKKTKCLKLYCDCFAAGIYCADPCACQDCFN 487
>gi|307206123|gb|EFN84203.1| Protein lin-54-like protein [Harpegnathos saltator]
Length = 825
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 67/151 (44%), Positives = 82/151 (54%), Gaps = 39/151 (25%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
K CNC S+CLKL YC+CFA G +C CNC N
Sbjct: 606 KPCNCTKSQCLKL-----------------------------YCDCFANGEFCHMCNCNN 636
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C NN+ +E RQ A+ + LERNPNAFRPKI D R +HNKGC+CK
Sbjct: 637 CSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCK 686
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 687 RSGCLKNYCECYEAKIPCSANCKCIGCRNIE 717
>gi|145539662|ref|XP_001455521.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423329|emb|CAK88124.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 107/221 (48%), Gaps = 55/221 (24%)
Query: 1 MRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLP-PGKQESPRSRPRANTDGKDGTP 59
RSP ++ + R ++ +N V TL P K S + R T K P
Sbjct: 144 FRSPPKSPHRKLERATKLKFINFV----------TLKKPLKFTSDQVNNRKRTSSKRRKP 193
Query: 60 KKQKQ-----CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
+++ CNC+ S+CLKL YC+CFAAG+
Sbjct: 194 LTEQKPKIIVCNCKKSKCLKL-----------------------------YCDCFAAGVP 224
Query: 115 C-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR-EAQLA 172
C CNC +CHNN +H R+ + +ERNP AFRPK+ D+R ++ E
Sbjct: 225 CGKDCNCCSCHNNDDHTKERENIIQQIMERNPQAFRPKV--------DSRSNSEDEIDHK 276
Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+H KGC+CKKS CLKKYCEC+Q + CSE C+C DCKN E
Sbjct: 277 PRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDDCKNCE 317
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
C+CKKS CLK YC+CF A + C ++C C C N + + R
Sbjct: 204 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNDDHTKER 244
>gi|357111377|ref|XP_003557490.1| PREDICTED: uncharacterized protein LOC100836677 [Brachypodium
distachyon]
Length = 787
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 65/172 (37%), Positives = 87/172 (50%), Gaps = 33/172 (19%)
Query: 44 PRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLV 103
P+ + R + +G D + K+C+C+ S+CLKL
Sbjct: 459 PQKKRRKSDNGDD---ESCKRCSCKKSKCLKL---------------------------- 487
Query: 104 KYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 162
YCECFAAG+YC + C+C+ C N HE RNP AF PK+ Q+
Sbjct: 488 -YCECFAAGVYCSEPCSCLGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRLSDAGQET 546
Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+ED +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G
Sbjct: 547 QEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCKNTFG 598
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 26/41 (63%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
K C CKKS CLK YCECF A + CSE C CL C N EE
Sbjct: 475 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCLGCLNKPIHEE 515
>gi|414868877|tpg|DAA47434.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
gi|414868878|tpg|DAA47435.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
Length = 587
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 89/178 (50%), Gaps = 33/178 (18%)
Query: 41 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
Q SP+ + R D DGT K+C+C+ S+CLKL
Sbjct: 441 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKL------------------------- 472
Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFAAG+YC + C+C+ C NN H RNP AF PK+ +
Sbjct: 473 ----YCECFAAGVYCSEPCSCIGCQNNQSHMETVLSTRQQIESRNPLAFAPKVIHTSEPG 528
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
D + + + +A+H +GC+CKKS CLKKYCECFQ + CS +CRC CKN G E
Sbjct: 529 MDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSVSCRCEGCKNAFGKRE 586
>gi|242043084|ref|XP_002459413.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
gi|241922790|gb|EER95934.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
Length = 776
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 87/177 (49%), Gaps = 32/177 (18%)
Query: 39 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
G+ SP+ + R +G DG + K+C+C+ S+CLKL
Sbjct: 442 GQSVSPQKKRRKTDNGDDG--EACKRCSCKKSKCLKL----------------------- 476
Query: 99 KDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
YCECFAAG+YC + C+C C N HE RNP AF PK+
Sbjct: 477 ------YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSD 530
Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+ ED +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G
Sbjct: 531 AGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSTNCRCESCKNTFG 587
>gi|242086180|ref|XP_002443515.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
gi|241944208|gb|EES17353.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
Length = 767
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 68/183 (37%), Positives = 91/183 (49%), Gaps = 33/183 (18%)
Query: 41 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
Q SP+ + R D DGT K+C+C+ S+CLKL
Sbjct: 438 QGSPKKK-RQKFDNGDGT--SCKRCSCKKSKCLKL------------------------- 469
Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFAAG+YC + C+C+ C NN H RNP AF PK+ +
Sbjct: 470 ----YCECFAAGVYCSEPCSCIGCLNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSESG 525
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+ + + + +A+H +GC+CKKS CLKKYCECFQ + CS +CRC CKN G E
Sbjct: 526 LELGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKNAFGKREGA 585
Query: 220 ALF 222
A+
Sbjct: 586 AVL 588
>gi|156378675|ref|XP_001631267.1| predicted protein [Nematostella vectensis]
gi|156218304|gb|EDO39204.1| predicted protein [Nematostella vectensis]
Length = 226
Score = 118 bits (296), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 75/162 (46%), Positives = 90/162 (55%), Gaps = 40/162 (24%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
+K CNC S+CLKL YC+CFA G +C CNCV
Sbjct: 4 RKPCNCTKSQCLKL-----------------------------YCDCFAQGEFCSNCNCV 34
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC NN+EHE R +A+ A LERNP+AF PKI G + R HNKGCHC
Sbjct: 35 NCSNNIEHERERSKAIKACLERNPHAFHPKIGKGKVGESERR-----------HNKGCHC 83
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
K+SGCLK YCEC++A ILC+ C+C CKNFE S ER+ L H
Sbjct: 84 KRSGCLKNYCECYEAKILCTSLCKCTGCKNFEESPERKTLMH 125
>gi|297737091|emb|CBI26292.3| unnamed protein product [Vitis vinifera]
Length = 717
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/183 (37%), Positives = 90/183 (49%), Gaps = 32/183 (17%)
Query: 41 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
Q SP+ + R G G + K+CNC+ S+CLKL
Sbjct: 403 QSSPKKKRRRLEHG--GETEGCKRCNCKKSKCLKL------------------------- 435
Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFAAG+YC + C+C C N HE RNP AF PK+ S
Sbjct: 436 ----YCECFAAGVYCVEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSDSL 491
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+ ++ + +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G ++
Sbjct: 492 PEVGDEPSKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGS 551
Query: 220 ALF 222
AL
Sbjct: 552 ALI 554
>gi|115470839|ref|NP_001059018.1| Os07g0176200 [Oryza sativa Japonica Group]
gi|34393829|dbj|BAC83433.1| putative CXC domain protein TSO1 [Oryza sativa Japonica Group]
gi|113610554|dbj|BAF20932.1| Os07g0176200 [Oryza sativa Japonica Group]
Length = 782
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 87/174 (50%), Gaps = 33/174 (18%)
Query: 43 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
SP+ + R + +G DG + K+C+C+ S+CLKL
Sbjct: 450 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKL--------------------------- 480
Query: 103 VKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YCECFAAG+YC + C+C C N HE RNP AF PK+ QD
Sbjct: 481 --YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQD 538
Query: 162 AR-EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
ED +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G
Sbjct: 539 ITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTFG 592
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 25/41 (60%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
K C CKKS CLK YCECF A + CSE C C C N EE
Sbjct: 468 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEE 508
>gi|322788388|gb|EFZ14059.1| hypothetical protein SINV_04654 [Solenopsis invicta]
Length = 834
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/150 (44%), Positives = 82/150 (54%), Gaps = 39/150 (26%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
K CNC S+CLKL YC+CFA G +C CNC N
Sbjct: 615 KPCNCTKSQCLKL-----------------------------YCDCFANGEFCHMCNCNN 645
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C NN+ +E RQ A+ + LERNPNAFRPKI +D R +HNKGC+CK
Sbjct: 646 CSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIR----------RHNKGCNCK 695
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNF 212
+SGCLK YCEC++A I CS NC+C+ C+N
Sbjct: 696 RSGCLKNYCECYEAKIPCSGNCKCIGCRNI 725
>gi|332021648|gb|EGI62007.1| Protein lin-54-like protein [Acromyrmex echinatior]
Length = 859
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 84/153 (54%), Gaps = 39/153 (25%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S+CLKL YC+CFA G +C CN
Sbjct: 637 RPRKPCNCTKSQCLKL-----------------------------YCDCFANGEFCHMCN 667
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC NN+ +E RQ A+ + LERNPNAFRPKI +D R +HNKGC
Sbjct: 668 CNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIR----------RHNKGC 717
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
+CK+SGCLK YCEC++A I CS NC+C+ C+N
Sbjct: 718 NCKRSGCLKNYCECYEAKIPCSGNCKCIGCRNI 750
>gi|222636534|gb|EEE66666.1| hypothetical protein OsJ_23296 [Oryza sativa Japonica Group]
Length = 782
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 87/174 (50%), Gaps = 33/174 (18%)
Query: 43 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
SP+ + R + +G DG + K+C+C+ S+CLKL
Sbjct: 438 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKL--------------------------- 468
Query: 103 VKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YCECFAAG+YC + C+C C N HE RNP AF PK+ QD
Sbjct: 469 --YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQD 526
Query: 162 AR-EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
ED +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G
Sbjct: 527 ITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTFG 580
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 25/41 (60%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
K C CKKS CLK YCECF A + CSE C C C N EE
Sbjct: 456 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEE 496
>gi|218199174|gb|EEC81601.1| hypothetical protein OsI_25091 [Oryza sativa Indica Group]
Length = 794
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 87/174 (50%), Gaps = 33/174 (18%)
Query: 43 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
SP+ + R + +G DG + K+C+C+ S+CLKL
Sbjct: 450 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKL--------------------------- 480
Query: 103 VKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YCECFAAG+YC + C+C C N HE RNP AF PK+ QD
Sbjct: 481 --YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQD 538
Query: 162 AR-EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
ED +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G
Sbjct: 539 ITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTFG 592
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 25/41 (60%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
K C CKKS CLK YCECF A + CSE C C C N EE
Sbjct: 468 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEE 508
>gi|2244834|emb|CAB10256.1| hypothetical protein [Arabidopsis thaliana]
gi|7268183|emb|CAB78519.1| hypothetical protein [Arabidopsis thaliana]
Length = 658
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 71/114 (62%), Gaps = 1/114 (0%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCECFAAG+YC + C+C++C N HE RNP AF PK+ + Q+
Sbjct: 372 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQETG 431
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
+DA + +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G ++
Sbjct: 432 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD 485
Score = 39.7 bits (91), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 14/24 (58%), Positives = 16/24 (66%)
Query: 188 KKYCECFQANILCSENCRCLDCKN 211
K YCECF A + C E C C+DC N
Sbjct: 370 KSYCECFAAGVYCIEPCSCIDCFN 393
>gi|40241245|emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japonicus]
gi|40241253|emb|CAF02298.1| cysteine-rich polycomb-like protein [Lotus japonicus]
Length = 897
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/157 (40%), Positives = 81/157 (51%), Gaps = 31/157 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YC+CFAAG++C D C+C
Sbjct: 509 KRCNCKKSKCLKL-----------------------------YCDCFAAGVFCLDPCSCQ 539
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR-EAQLAAKHNKGCH 180
+C N E+ E RNP AF PKI S A ED +A+H +GC+
Sbjct: 540 DCFNKPEYGEKVLETRQQIESRNPLAFAPKIVKSATNAPSNMEDVNLTTPSSARHTRGCN 599
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
CK+S CLKKYCEC+Q+N+ CS CRC CKN G +E
Sbjct: 600 CKRSMCLKKYCECYQSNVGCSSGCRCEGCKNVYGKKE 636
>gi|51949820|gb|AAU14844.1| cysteine-rich polycomb-like protein 1 [Lotus japonicus]
Length = 821
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/157 (40%), Positives = 81/157 (51%), Gaps = 31/157 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YC+CFAAG++C D C+C
Sbjct: 433 KRCNCKKSKCLKL-----------------------------YCDCFAAGVFCLDPCSCQ 463
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR-EAQLAAKHNKGCH 180
+C N E+ E RNP AF PKI S A ED +A+H +GC+
Sbjct: 464 DCFNKPEYGEKVLETRQQIESRNPLAFAPKIVKSATNAPSNMEDVNLTTPSSARHTRGCN 523
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
CK+S CLKKYCEC+Q+N+ CS CRC CKN G +E
Sbjct: 524 CKRSMCLKKYCECYQSNVGCSSGCRCEGCKNVYGKKE 560
>gi|359477548|ref|XP_002279154.2| PREDICTED: uncharacterized protein LOC100261336 [Vitis vinifera]
Length = 738
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 81/161 (50%), Gaps = 30/161 (18%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C
Sbjct: 444 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCVEPCSCQ 474
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C N HE RNP AF PK+ S + ++ + +A+H +GC+C
Sbjct: 475 ECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVGDEPSKTPASARHKRGCNC 534
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
KKS CLKKYCEC+Q + CS NCRC CKN G ++ AL
Sbjct: 535 KKSSCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGSALI 575
>gi|357480917|ref|XP_003610744.1| Lin-54-like protein [Medicago truncatula]
gi|355512079|gb|AES93702.1| Lin-54-like protein [Medicago truncatula]
Length = 808
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 66/170 (38%), Positives = 88/170 (51%), Gaps = 22/170 (12%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL +++ I + L F YCECFAAG+YC + C+C
Sbjct: 488 KRCNCKKSKCLKLIDLKMTILVL-------ELTILF---FFSYCECFAAGVYCIEPCSCQ 537
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C N HE + RNP AF PK+ S + D + +A+H +GC+C
Sbjct: 538 ECFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRSADSVPETGIDPNKTPASARHKRGCNC 597
Query: 182 KKSGCLKKYCECFQANIL-----------CSENCRCLDCKNFEGSEERRA 220
KKS CLKKYCEC+Q IL CS +CRC CKN G ++ A
Sbjct: 598 KKSNCLKKYCECYQVLILTISCLPQGGVGCSISCRCEGCKNAFGRKDGSA 647
>gi|147840870|emb|CAN68781.1| hypothetical protein VITISV_018991 [Vitis vinifera]
Length = 806
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 82/156 (52%), Gaps = 30/156 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C
Sbjct: 503 KRCNCKKSKCLKL-----------------------------YCECFAAGLYCVEPCSCQ 533
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N HE + RNP AF PK+ S + +++ + +A+H +GC+C
Sbjct: 534 DCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNC 593
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
KKS CLKKYCECFQ + CS +CRC CKN G ++
Sbjct: 594 KKSSCLKKYCECFQGGVGCSISCRCEGCKNTFGRKD 629
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 4/57 (7%)
Query: 157 HGAQDAREDAREAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
HG+ + R ++ AA++ K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 483 HGS--PKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537
>gi|356538172|ref|XP_003537578.1| PREDICTED: uncharacterized protein LOC100815907 [Glycine max]
Length = 774
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 91/181 (50%), Gaps = 32/181 (17%)
Query: 41 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
Q SPR + R + G + K+CNC+ S+CLKL
Sbjct: 466 QNSPRMKRRKSE--TPGDTEGCKRCNCKKSKCLKL------------------------- 498
Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFAAG+YC + C+C +C N HE + RNP AF PK+ +
Sbjct: 499 ----YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSV 554
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+ +D + +A+H +GC+CKKS CLKKYCEC+Q + CS +CRC CKN G ++
Sbjct: 555 PEIGDDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGS 614
Query: 220 A 220
A
Sbjct: 615 A 615
>gi|298204891|emb|CBI34198.3| unnamed protein product [Vitis vinifera]
Length = 829
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 82/156 (52%), Gaps = 30/156 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C
Sbjct: 503 KRCNCKKSKCLKL-----------------------------YCECFAAGLYCVEPCSCQ 533
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N HE + RNP AF PK+ S + +++ + +A+H +GC+C
Sbjct: 534 DCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNC 593
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
KKS CLKKYCECFQ + CS +CRC CKN G ++
Sbjct: 594 KKSSCLKKYCECFQGGVGCSISCRCEGCKNTFGRKD 629
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 4/57 (7%)
Query: 157 HGAQDAREDAREAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
HG+ + R ++ AA++ K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 483 HGS--PKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537
>gi|356497442|ref|XP_003517569.1| PREDICTED: uncharacterized protein LOC100804447 [Glycine max]
Length = 774
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 94/192 (48%), Gaps = 46/192 (23%)
Query: 46 SRPRANTDGKD---GTPKKQ-------------KQCNCRNSRCLKLDFVEIWINIYSNTS 89
S+P A G+D +PKK+ K+CNC+ S+CLKL
Sbjct: 453 SQPSAYMAGEDFNQNSPKKKRRKSETPGETEGCKRCNCKKSKCLKL-------------- 498
Query: 90 PGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 148
YCECFAAG+YC + C+C +C N HE + RNP AF
Sbjct: 499 ---------------YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAF 543
Query: 149 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 208
PK+ + + +D + +A+H +GC+CKKS CLKKYCEC+Q + CS +CRC
Sbjct: 544 APKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEG 603
Query: 209 CKNFEGSEERRA 220
CKN G ++ A
Sbjct: 604 CKNAFGRKDGSA 615
>gi|359487901|ref|XP_002272709.2| PREDICTED: uncharacterized protein LOC100265943 [Vitis vinifera]
Length = 777
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 82/156 (52%), Gaps = 30/156 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C
Sbjct: 474 KRCNCKKSKCLKL-----------------------------YCECFAAGLYCVEPCSCQ 504
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N HE + RNP AF PK+ S + +++ + +A+H +GC+C
Sbjct: 505 DCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNC 564
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
KKS CLKKYCECFQ + CS +CRC CKN G ++
Sbjct: 565 KKSSCLKKYCECFQGGVGCSISCRCEGCKNTFGRKD 600
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 4/57 (7%)
Query: 157 HGAQDAREDAREAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
HG+ + R ++ AA++ K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 454 HGS--PKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 508
>gi|297800740|ref|XP_002868254.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
lyrata]
gi|297314090|gb|EFH44513.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
lyrata]
Length = 651
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 52/111 (46%), Positives = 68/111 (61%), Gaps = 1/111 (0%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCECFAAG+YC + C+C++C N HE RNP AF PK+ + +
Sbjct: 365 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSESVLETG 424
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+DA + +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G
Sbjct: 425 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFG 475
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 42/91 (46%), Gaps = 6/91 (6%)
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA-QDAREDAREAQLAAKHNKGCH 180
+C NV H+ + + L+ + + + + + + A +DA +D + N
Sbjct: 301 DCKTNVSHDYSCSGKIQVGLQSSISTMQDTLDQTENEAREDADQDVPVEPALQELNLSSP 360
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKN 211
KKS YCECF A + C E C C+DC N
Sbjct: 361 KKKS-----YCECFAAGVYCIEPCSCIDCFN 386
>gi|193237591|dbj|BAG50072.1| transcription factor CPP [Lotus japonicus]
Length = 764
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 68/196 (34%), Positives = 94/196 (47%), Gaps = 43/196 (21%)
Query: 39 GKQESPRSRPRANTDGKDGTPKKQ-------------KQCNCRNSRCLKLDFVEIWINIY 85
G Q + S A+ D +PKK+ K+CNC+ S+CLKL
Sbjct: 439 GVQPAEDSSQAASEDFNQNSPKKKRRKVEPAGDTEGCKRCNCKKSKCLKL---------- 488
Query: 86 SNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERN 144
YCECFAAG+YC + C+C +C N HE + RN
Sbjct: 489 -------------------YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRN 529
Query: 145 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 204
P AF PK+ + + +D + +A+H +GC+CKKS CLKKYCEC+Q + CS +C
Sbjct: 530 PLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSISC 589
Query: 205 RCLDCKNFEGSEERRA 220
RC CKN G ++ A
Sbjct: 590 RCEGCKNAFGRKDGSA 605
>gi|255552089|ref|XP_002517089.1| tso1, putative [Ricinus communis]
gi|223543724|gb|EEF45252.1| tso1, putative [Ricinus communis]
Length = 727
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/159 (38%), Positives = 80/159 (50%), Gaps = 30/159 (18%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C
Sbjct: 435 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCQ 465
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C N HE RNP AF PK+ S + ++ + +A+H +GC+C
Sbjct: 466 ECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSEPVTEIGDEFSKTPASARHKRGCNC 525
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
KKS CLKKYCEC+Q + CS NCRC CKN G ++ A
Sbjct: 526 KKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGSA 564
>gi|334332647|ref|XP_001371812.2| PREDICTED: tesmin [Monodelphis domestica]
Length = 728
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 86/132 (65%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G++L K L YC+CFA G +C+ CNC NC+NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 513 GSNLQGPAKITLAGYCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQP 572
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 573 KIGKGKLG-----------DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 621
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 622 NYEESPERKTLL 633
>gi|345309503|ref|XP_001516587.2| PREDICTED: tesmin-like [Ornithorhynchus anatinus]
Length = 243
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 87/132 (65%), Gaps = 11/132 (8%)
Query: 98 FKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
+D +YC+CFA+G +C+ CNC NC+NN HE R +A+ A L RNP AF+PKI
Sbjct: 45 LEDGGFRYCDCFASGDFCNNCNCNNCYNNQSHEAERCKAIKACLNRNPEAFQPKIGKGKL 104
Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
G ++ +HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E S +
Sbjct: 105 G-----------EVKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSTCKCISCKNYEESPQ 153
Query: 218 RRALFHGDHNGI 229
RR L++G H +
Sbjct: 154 RRILWNGPHPAV 165
>gi|118384765|ref|XP_001025522.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89307289|gb|EAS05277.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1084
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/147 (40%), Positives = 78/147 (53%), Gaps = 30/147 (20%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
CNC+ ++CLKL YCECFAA C+GC C C
Sbjct: 533 CNCKRTKCLKL-----------------------------YCECFAASRMCEGCTCQGCF 563
Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
N E E R+EA A LERN +AF PKI S Q E+ + + H+KGC+CKKS
Sbjct: 564 NKPEFEAMRKEARQAILERNNSAFDPKIDSQSKLKQLCNEEGVDNEQKV-HSKGCNCKKS 622
Query: 185 GCLKKYCECFQANILCSENCRCLDCKN 211
CLKKYCEC+Q + C++ C+C +C+N
Sbjct: 623 NCLKKYCECYQLGVKCTKLCKCDNCRN 649
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 34/58 (58%), Gaps = 8/58 (13%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN---FEG--SEERRALFHGDHNGIAF 231
C+CK++ CLK YCECF A+ +C E C C C N FE E R+A+ + N AF
Sbjct: 533 CNCKRTKCLKLYCECFAASRMC-EGCTCQGCFNKPEFEAMRKEARQAIL--ERNNSAF 587
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 34/87 (39%), Gaps = 31/87 (35%)
Query: 51 NTDGKDGTPK-KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECF 109
N +G D K K CNC+ S CLK KYCEC+
Sbjct: 602 NEEGVDNEQKVHSKGCNCKKSNCLK-----------------------------KYCECY 632
Query: 110 AAGIYCDG-CNCVNCHNNVEHEVARQE 135
G+ C C C NC NN+ H + +E
Sbjct: 633 QLGVKCTKLCKCDNCRNNINHPHSHEE 659
>gi|358332933|dbj|GAA51516.1| protein lin-54 homolog [Clonorchis sinensis]
Length = 638
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 78/153 (50%), Gaps = 42/153 (27%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
++C+C S CLKL YCECFAAG++C C+CV
Sbjct: 163 RRCSCSRSFCLKL-----------------------------YCECFAAGVFCSDCSCVG 193
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C+N +HE RQ+A+ + R P+AF+ KIA S + A H +GC+CK
Sbjct: 194 CYNLAQHENYRQKAIMRIVNRKPDAFQSKIAHS-------------LENATVHARGCNCK 240
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
+SGCLK YCEC++A + C+ CRC C N +
Sbjct: 241 RSGCLKNYCECYEARVRCTSRCRCQYCYNMPAA 273
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
C C +S CLK YCECF A + CS+ C C+ C N E R
Sbjct: 165 CSCSRSFCLKLYCECFAAGVFCSD-CSCVGCYNLAQHENYR 204
>gi|356575873|ref|XP_003556061.1| PREDICTED: uncharacterized protein LOC100811662 [Glycine max]
Length = 774
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 89/179 (49%), Gaps = 32/179 (17%)
Query: 40 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
+Q SP+ + R +G + K+CNC+ S+CLKL
Sbjct: 467 QQNSPKKKRRKLEPAGEG--ESCKRCNCKKSKCLKL------------------------ 500
Query: 100 DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YCECFAAG+YC + C C +C N H + RNP AF PK+ S
Sbjct: 501 -----YCECFAAGVYCIEPCACRDCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDS 555
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
+ +D + +A+H +GC+CKKS CLKKYCEC+Q + CS +CRC CKN G ++
Sbjct: 556 VSEIGDDLNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSVSCRCEGCKNTYGRKD 614
>gi|395851609|ref|XP_003798345.1| PREDICTED: tesmin [Otolemur garnettii]
Length = 509
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/133 (47%), Positives = 87/133 (65%), Gaps = 11/133 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 294 GSALPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQP 353
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 354 KIGKGKLG-----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 402
Query: 211 NFEGSEERRALFH 223
N+E S ER++L +
Sbjct: 403 NYEESPERKSLIN 415
>gi|224099825|ref|XP_002311635.1| predicted protein [Populus trichocarpa]
gi|222851455|gb|EEE89002.1| predicted protein [Populus trichocarpa]
Length = 749
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 85/151 (56%), Gaps = 5/151 (3%)
Query: 76 DFVEIWINIYSNTSPGA--HLNTQFK--DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHE 130
D V ++ +S TSP H+ + + L YCECFAAG+YC + C C +C N HE
Sbjct: 437 DNVVPFVEDFSQTSPKKKRHVGRRSRGLQALQLYCECFAAGVYCIEPCACQDCFNKPIHE 496
Query: 131 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 190
RNP AF PK+ S A + +++ + +A+H +GC+CKKS CLKKY
Sbjct: 497 DTVLATRKQIESRNPLAFAPKVIRSSEPAPEIGDESSKTPASARHKRGCNCKKSSCLKKY 556
Query: 191 CECFQANILCSENCRCLDCKNFEGSEERRAL 221
CEC+Q + CS NCRC CKN G ++ AL
Sbjct: 557 CECYQGGVGCSLNCRCEGCKNAFGRKDGSAL 587
>gi|357436899|ref|XP_003588725.1| Lin-54-like protein [Medicago truncatula]
gi|355477773|gb|AES58976.1| Lin-54-like protein [Medicago truncatula]
Length = 921
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/178 (37%), Positives = 84/178 (47%), Gaps = 40/178 (22%)
Query: 45 RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
R + + +TD G K CNC+ S+CLKL
Sbjct: 506 RKKKKTSTDDASGP----KTCNCKKSKCLKL----------------------------- 532
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YC+CF AGI+C DGC C C N VE + E RNP AF PKI P A
Sbjct: 533 YCDCFGAGIFCGDGCACEGCGNRVEFQDKVVETKQQIESRNPQAFAPKIV--PCAADVPP 590
Query: 164 EDAREAQL----AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
+ + + +A+H +GC+CK+S C KKYCECFQAN+ CS CRC C N G E
Sbjct: 591 NNMEDVNMTTPASARHKRGCNCKRSKCTKKYCECFQANVGCSTGCRCDGCMNAFGKRE 648
>gi|297833182|ref|XP_002884473.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
lyrata]
gi|297330313|gb|EFH60732.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
lyrata]
Length = 611
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 80/153 (52%), Gaps = 30/153 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+C C+ S+CLKL YCECFAAG+YC + C+C
Sbjct: 431 KRCKCKKSQCLKL-----------------------------YCECFAAGLYCVEPCSCQ 461
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC N HE ++ RNP AF PK+ S+ A D + + +A+H +GC+C
Sbjct: 462 NCFNKPIHEDLVMKSREVIEARNPLAFAPKVVSTSDTAIDLWVENSKTPASARHTRGCNC 521
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+KSGCLKKYCEC + CS +CRC+ CKN G
Sbjct: 522 RKSGCLKKYCECNLMGVRCSSSCRCIGCKNVFG 554
>gi|311247083|ref|XP_003122474.1| PREDICTED: tesmin [Sus scrofa]
Length = 511
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/133 (47%), Positives = 86/133 (64%), Gaps = 11/133 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 296 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQP 355
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 356 KIGKGKLG-----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 404
Query: 211 NFEGSEERRALFH 223
N+E S ER+ L +
Sbjct: 405 NYEESPERKTLMN 417
>gi|449522630|ref|XP_004168329.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 3-like
[Cucumis sativus]
Length = 788
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 83/161 (51%), Gaps = 30/161 (18%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C
Sbjct: 493 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCQ 523
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N HE RNP AF PK+ + + +++ + +A+H +GC+C
Sbjct: 524 DCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNC 583
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
KKS CLKKYCEC+Q + CS +CRC CKN G ++ ++
Sbjct: 584 KKSSCLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGSSIL 624
>gi|426252144|ref|XP_004019777.1| PREDICTED: tesmin [Ovis aries]
Length = 512
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/124 (48%), Positives = 82/124 (66%), Gaps = 11/124 (8%)
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
K L YC+CFA+G +C+ CNC NC N++ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 305 KITLAGYCDCFASGDFCNNCNCNNCCNDLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG 364
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER
Sbjct: 365 -----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPER 413
Query: 219 RALF 222
+ L
Sbjct: 414 KTLM 417
>gi|357157337|ref|XP_003577764.1| PREDICTED: uncharacterized protein LOC100829642 [Brachypodium
distachyon]
Length = 771
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/186 (37%), Positives = 96/186 (51%), Gaps = 39/186 (20%)
Query: 41 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
Q SP+ + R D DGT K+C+C+ S+CLKL
Sbjct: 446 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKL------------------------- 477
Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSP 156
YCECF AG++C + C+C C N N+E ++ +E + + RNP AF PK+ +
Sbjct: 478 ----YCECFHAGVFCSEPCSCQGCLNKPSNMETVLSTREQIES---RNPLAFAPKVIRTS 530
Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
Q+ E + + +A+H +GC+CKKS CLKKYCECFQ + CS +CRC CKN G
Sbjct: 531 EPGQELGEQSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKNAFGRR 590
Query: 217 ERRALF 222
E L
Sbjct: 591 EGVVLL 596
>gi|449432678|ref|XP_004134126.1| PREDICTED: protein tesmin/TSO1-like CXC 3-like [Cucumis sativus]
Length = 788
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 83/161 (51%), Gaps = 30/161 (18%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C
Sbjct: 493 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCQ 523
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N HE RNP AF PK+ + + +++ + +A+H +GC+C
Sbjct: 524 DCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNC 583
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
KKS CLKKYCEC+Q + CS +CRC CKN G ++ ++
Sbjct: 584 KKSSCLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGSSIL 624
>gi|218187206|gb|EEC69633.1| hypothetical protein OsI_39028 [Oryza sativa Indica Group]
Length = 780
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/186 (37%), Positives = 99/186 (53%), Gaps = 40/186 (21%)
Query: 41 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
Q SP+ + R D DGT K+C+C+ S+CLKL
Sbjct: 458 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKL------------------------- 489
Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSP 156
YCECF AG++C + C+C C N N+E ++ +E + + RNP AF PK+ +
Sbjct: 490 ----YCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNPLAFAPKVIRTE 542
Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
G Q+ +D+ + +++H +GC+CKKS CLKKYCEC+Q + CS +CRC CKN G
Sbjct: 543 PG-QELADDSNKTPASSRHKRGCNCKKSSCLKKYCECYQGGVGCSVSCRCEGCKNAFGRR 601
Query: 217 ERRALF 222
E AL
Sbjct: 602 EGVALL 607
>gi|395544681|ref|XP_003774236.1| PREDICTED: tesmin [Sarcophilus harrisii]
Length = 548
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 86/132 (65%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G++L K L YC+CFA G +C+ CNC NC+NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 333 GSNLQGPAKITLAGYCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQP 392
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 393 KIGKGKLG-----------DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 441
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 442 NYEESPERKTLL 453
>gi|410974670|ref|XP_003993766.1| PREDICTED: tesmin [Felis catus]
Length = 502
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/148 (43%), Positives = 89/148 (60%), Gaps = 15/148 (10%)
Query: 86 SNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNP 145
S G+ L K +L YC+CFA+G +C CNC NC NN+ HEV R +A+ A L+RNP
Sbjct: 283 STVPSGSTLPGPPKISLAGYCDCFASGDFCHNCNCNNCRNNLHHEVQRFKAMKACLDRNP 342
Query: 146 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
AF+PKI R + +HN+GC+C++SGCLK YCEC++A I+CS C+
Sbjct: 343 EAFQPKIGK-----------GRPGDVTPRHNRGCNCRRSGCLKNYCECYEAKIMCSSICK 391
Query: 206 CLDCKNFEGSEERRAL----FHGDHNGI 229
C+ CKN++ ER+ L +H D G
Sbjct: 392 CIGCKNYDEGPERKTLMSMPYHPDAGGF 419
>gi|351726712|ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine max]
gi|4218187|emb|CAA09028.1| cysteine-rich polycomb-like protein [Glycine max]
Length = 896
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/162 (38%), Positives = 77/162 (47%), Gaps = 31/162 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YC+CFAAG YC D C C
Sbjct: 474 KRCNCKKSKCLKL-----------------------------YCDCFAAGTYCTDPCACQ 504
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR-EAQLAAKHNKGCH 180
C N E+ E RNP AF PKI +D +A+H +GC+
Sbjct: 505 GCLNRPEYVETVVETKQQIESRNPIAFAPKIVQPTTDISSHMDDENLTTPSSARHKRGCN 564
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
CK+S CLKKYCEC+QAN+ CS CRC CKN G +E F
Sbjct: 565 CKRSMCLKKYCECYQANVGCSSGCRCEGCKNVHGKKEDYVAF 606
>gi|255084730|ref|XP_002504796.1| tso1-like transcription factor [Micromonas sp. RCC299]
gi|226520065|gb|ACO66054.1| tso1-like transcription factor [Micromonas sp. RCC299]
Length = 620
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 63/109 (57%), Gaps = 13/109 (11%)
Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
YCECFAAG++C C+C NC N ++ Q RNPNAF KI +S G
Sbjct: 278 YCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKIVASGDGG----- 332
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+H KGCHCKKS CLKKYCECFQA +LC + C+C CKN E
Sbjct: 333 --------GQHKKGCHCKKSACLKKYCECFQAGVLCQDYCKCDGCKNKE 373
>gi|297721065|ref|NP_001172895.1| Os02g0274400 [Oryza sativa Japonica Group]
gi|255670792|dbj|BAH91624.1| Os02g0274400, partial [Oryza sativa Japonica Group]
Length = 140
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/95 (63%), Positives = 65/95 (68%), Gaps = 7/95 (7%)
Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH----GDHNGI 229
KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+AL D N I
Sbjct: 23 KHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERKALVQMKNASDRNHI 82
Query: 230 AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELL 264
Q AAN A+ GA GS GY K+ E L
Sbjct: 83 ---QEAANFALNGATGSLGYKNSPVRRKKYQENSL 114
>gi|42563508|ref|NP_187136.3| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75146756|sp|Q84JZ8.1|TCX4_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 4; Short=AtTCX4
gi|28393078|gb|AAO41973.1| unknown protein [Arabidopsis thaliana]
gi|28827398|gb|AAO50543.1| unknown protein [Arabidopsis thaliana]
gi|332640623|gb|AEE74144.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 639
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/153 (38%), Positives = 79/153 (51%), Gaps = 30/153 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+C CR S+CLKL YCECF+AG++C + C+C
Sbjct: 453 KRCKCRKSQCLKL-----------------------------YCECFSAGLFCGEPCSCQ 483
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC N HE ++ RNP AF PK+ S+ D + + +A+H +GC+C
Sbjct: 484 NCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLWVENSKTPASARHKRGCNC 543
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+KSGC KKYCECF + CS NCRC+ CKN G
Sbjct: 544 RKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFG 576
>gi|89953384|gb|ABD83288.1| Fgenesh protein 45 [Beta vulgaris]
Length = 665
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 88/174 (50%), Gaps = 39/174 (22%)
Query: 49 RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCEC 108
+A+T D T K+ CNC+ ++CLKL YC+C
Sbjct: 476 KASTSTDDDTTKR---CNCKKTKCLKL-----------------------------YCDC 503
Query: 109 FAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI----ASSPHGAQDAR 163
FAAG YC + C C C N E+E E RNP AF PK+ +SP +
Sbjct: 504 FAAGFYCGESCACQGCFNRPEYEDTVLETRQQIESRNPLAFAPKVMQTVTASPTTGKVVE 563
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
+ + + +A+H +GC+CKKS CLKKYCEC+QAN+ CSE CRC CKN G E
Sbjct: 564 KLSTPS--SARHKRGCNCKKSMCLKKYCECYQANVGCSEGCRCEGCKNIYGVRE 615
>gi|115489476|ref|NP_001067225.1| Os12g0605500 [Oryza sativa Japonica Group]
gi|77557032|gb|ABA99828.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|77557033|gb|ABA99829.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|113649732|dbj|BAF30244.1| Os12g0605500 [Oryza sativa Japonica Group]
gi|215707264|dbj|BAG93724.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 760
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 69/186 (37%), Positives = 99/186 (53%), Gaps = 40/186 (21%)
Query: 41 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
Q SP+ + R D DGT K+C+C+ S+CLKL
Sbjct: 438 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKL------------------------- 469
Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSP 156
YCECF AG++C + C+C C N N+E ++ +E + + RNP AF PK+ +
Sbjct: 470 ----YCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNPLAFAPKVIRTE 522
Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
G Q+ +D+ + +++H +GC+CKKS CLKKYCEC+Q + CS +CRC CKN G
Sbjct: 523 PG-QELADDSNKTPASSRHKRGCNCKKSSCLKKYCECYQGGVGCSVSCRCEGCKNAFGRR 581
Query: 217 ERRALF 222
E AL
Sbjct: 582 EGVALL 587
>gi|332249871|ref|XP_003274078.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Nomascus leucogenys]
Length = 517
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 62/124 (50%), Positives = 81/124 (65%), Gaps = 11/124 (8%)
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
K L +YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+PKI G
Sbjct: 314 KIALARYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLG 373
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
R HNKGC C++SGCLK YCEC++A I+CS C+C+DCKN+E S ER
Sbjct: 374 DVKPR-----------HNKGCSCRRSGCLKNYCECYEAQIMCSSICKCIDCKNYEESPER 422
Query: 219 RALF 222
+ L
Sbjct: 423 KTLM 426
>gi|224131306|ref|XP_002328506.1| predicted protein [Populus trichocarpa]
gi|222838221|gb|EEE76586.1| predicted protein [Populus trichocarpa]
Length = 414
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 53/111 (47%), Positives = 67/111 (60%), Gaps = 1/111 (0%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCECFAAG+YC + C+C+ C NN HE E RNP AF PK+ + +
Sbjct: 141 YCECFAAGLYCIEPCSCLECSNNPAHEDTVLETRRQIESRNPLAFAPKVIRNSDSVSEFG 200
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
E+ + +A+H +GC+CKKS CLKKYCECFQ + CS CRC CKN G
Sbjct: 201 EETNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSSYCRCEGCKNTFG 251
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 187 LKKYCECFQANILCSENCRCLDCKNFEGSEE 217
+K YCECF A + C E C CL+C N E+
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHED 168
>gi|145350962|ref|XP_001419860.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580092|gb|ABO98153.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 510
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 53/107 (49%), Positives = 63/107 (58%), Gaps = 13/107 (12%)
Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
YCECFAAG +C C+C C N E+E + +RNP AF KI + DA +
Sbjct: 208 YCECFAAGAFCKDCSCQQCQNTTENEAIVTKTRQQIEQRNPYAFESKIMA------DAGD 261
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
DAR H KGCHCKKS CLKKYCECFQA + C + C+C CKN
Sbjct: 262 DAR-------HTKGCHCKKSACLKKYCECFQAGVKCQDYCKCEGCKN 301
Score = 38.5 bits (88), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 16/26 (61%), Positives = 17/26 (65%), Gaps = 1/26 (3%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNN 126
L KYCECF AG+ C D C C C NN
Sbjct: 277 LKKYCECFQAGVKCQDYCKCEGCKNN 302
>gi|301787847|ref|XP_002929338.1| PREDICTED: tesmin-like [Ailuropoda melanoleuca]
gi|281340625|gb|EFB16209.1| hypothetical protein PANDA_019494 [Ailuropoda melanoleuca]
Length = 510
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 63/140 (45%), Positives = 87/140 (62%), Gaps = 22/140 (15%)
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
K L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 303 KITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIQRFKAIKACLDRNPEAFQPKIGKGKLG 362
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER
Sbjct: 363 -----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPER 411
Query: 219 RAL-----------FHGDHN 227
+ L F G+H+
Sbjct: 412 KTLMNMPNYMEIGGFEGNHH 431
>gi|403352890|gb|EJY75972.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
trifallax]
Length = 1042
Score = 112 bits (280), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 63/161 (39%), Positives = 79/161 (49%), Gaps = 49/161 (30%)
Query: 52 TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
+D K P K CNC+ S+CLKL YC+CFA
Sbjct: 642 SDNKSTQPLKTT-CNCKKSKCLKL-----------------------------YCDCFAY 671
Query: 112 GIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 170
G+ C CNC +C N E R++A+ ERNPNAF+PKI Q
Sbjct: 672 GLGCSPDCNCADC-ANTEGNEERKQAMDQITERNPNAFKPKI-----------------Q 713
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
H+KGCHCKKSGCLKKYCEC+Q+ ++C+ C C CKN
Sbjct: 714 EKGFHSKGCHCKKSGCLKKYCECYQSGVVCTNLCACEGCKN 754
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
C+CKKS CLK YC+CF + CS +C C DC N EG+EER+
Sbjct: 654 CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANTEGNEERK 694
>gi|356534189|ref|XP_003535640.1| PREDICTED: uncharacterized protein LOC100820491 [Glycine max]
Length = 760
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/153 (38%), Positives = 77/153 (50%), Gaps = 30/153 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C C
Sbjct: 475 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCACH 505
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N H + RNP AF PK+ S + +D + +A+H +GC+C
Sbjct: 506 DCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSEIGDDPNKTPASARHKRGCNC 565
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
KKS CLKKYCEC+Q + CS +CRC CKN G
Sbjct: 566 KKSSCLKKYCECYQGGVGCSISCRCEGCKNTYG 598
>gi|194218566|ref|XP_001492479.2| PREDICTED: tesmin-like [Equus caballus]
Length = 383
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 63/133 (47%), Positives = 86/133 (64%), Gaps = 11/133 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 168 GSALPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQP 227
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 228 KIGKGKLG-----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 276
Query: 211 NFEGSEERRALFH 223
N+E S ER+ L +
Sbjct: 277 NYEESPERKTLMN 289
>gi|255553069|ref|XP_002517577.1| tso1, putative [Ricinus communis]
gi|223543209|gb|EEF44741.1| tso1, putative [Ricinus communis]
Length = 348
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 76/153 (49%), Gaps = 30/153 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC D C C
Sbjct: 20 KRCNCKRSKCLKL-----------------------------YCECFAAGVYCLDSCACE 50
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N E+E + RNP AF PK+ + E +A+H +GC+C
Sbjct: 51 DCFNRPEYEDTVLDTRQQIEARNPLAFAPKVVKQATNSPANEEGNWTTPSSARHKRGCNC 110
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
KKS CLKKYCEC+QA + CS CRC CKN G
Sbjct: 111 KKSKCLKKYCECYQAGVGCSSGCRCEGCKNSYG 143
>gi|452819762|gb|EME26815.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
sulphuraria]
Length = 444
Score = 112 bits (279), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 61/148 (41%), Positives = 78/148 (52%), Gaps = 34/148 (22%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCVNC 123
CNC+ S+CLKL YCECFA+G YC C C C
Sbjct: 318 CNCKRSQCLKL-----------------------------YCECFASGSYCTSKCKCNGC 348
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
NN ++ + A TLERNP AF PKI+ S + + R A H++GC+CK+
Sbjct: 349 KNNGDNANHVKTARERTLERNPRAFSPKISESTAAVTEEGQVLR----VAAHHRGCNCKR 404
Query: 184 SGCLKKYCECFQANILCSENCRCLDCKN 211
S C KKYCECFQA + C +NC+C+DCKN
Sbjct: 405 SNCRKKYCECFQAGVPCGDNCKCVDCKN 432
>gi|302843447|ref|XP_002953265.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
nagariensis]
gi|300261362|gb|EFJ45575.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
nagariensis]
Length = 1987
Score = 111 bits (278), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 61/168 (36%), Positives = 82/168 (48%), Gaps = 54/168 (32%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
K C C+ S+CLKL YC+CFAAG YC C+C++
Sbjct: 788 KSCRCKKSQCLKL-----------------------------YCDCFAAGQYCGSCSCIS 818
Query: 123 CHNNVEHE---VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
CHN EH + R+E + A R+P AF KI +P+G KH +GC
Sbjct: 819 CHNRPEHADRVLQRREDIAA---RDPQAFTRKIQLAPNGN-------------GKHKRGC 862
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNF------EGSEERRAL 221
+C+KS CLKKYCEC+Q + C C C++C+N EG+ R AL
Sbjct: 863 NCRKSHCLKKYCECYQGGVKCGIQCTCMECENMDVGSSQEGAGARGAL 910
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 33/74 (44%), Gaps = 1/74 (1%)
Query: 138 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
G NPNA R A R + + +K C CKKS CLK YC+CF A
Sbjct: 749 GINANANPNAHRSSGAGGGLTQIQGGGPNRGRRTSENSSKSCRCKKSQCLKLYCDCFAAG 808
Query: 198 ILCSENCRCLDCKN 211
C +C C+ C N
Sbjct: 809 QYCG-SCSCISCHN 821
>gi|402892588|ref|XP_003909492.1| PREDICTED: tesmin [Papio anubis]
Length = 515
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 84/132 (63%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+P
Sbjct: 300 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQP 359
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 360 KIGKGQLG-----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 408
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 409 NYEESPERKTLM 420
>gi|440894221|gb|ELR46727.1| Tesmin [Bos grunniens mutus]
Length = 511
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/124 (49%), Positives = 81/124 (65%), Gaps = 11/124 (8%)
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
K L YC+CFA G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 304 KITLAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG 363
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER
Sbjct: 364 -----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPER 412
Query: 219 RALF 222
+ L
Sbjct: 413 KTLM 416
>gi|329663862|ref|NP_001192832.1| tesmin [Bos taurus]
gi|296471372|tpg|DAA13487.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
Length = 511
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/124 (49%), Positives = 81/124 (65%), Gaps = 11/124 (8%)
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
K L YC+CFA G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI G
Sbjct: 304 KITLAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG 363
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER
Sbjct: 364 -----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPER 412
Query: 219 RALF 222
+ L
Sbjct: 413 KTLM 416
>gi|357606575|gb|EHJ65118.1| hypothetical protein KGM_22197 [Danaus plexippus]
Length = 844
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/159 (43%), Positives = 91/159 (57%), Gaps = 38/159 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S+CLKL YC+CFA G +C+ CN
Sbjct: 626 RPRKACNCTKSQCLKL-----------------------------YCDCFANGEFCNRCN 656
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NCHNN+E+E RQ+A+ L+RNPNAFRPKI S G ++ +HNKGC
Sbjct: 657 CNNCHNNLENEELRQKAIRGCLDRNPNAFRPKIGKSKAGG---------PEIIRRHNKGC 707
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
+CK+SGCLK YCEC++A I CS C+C+ C+N E + ER
Sbjct: 708 NCKRSGCLKNYCECYEAKIACSSICKCVGCRNVEETLER 746
>gi|291415497|ref|XP_002723989.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
[Oryctolagus cuniculus]
Length = 450
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/137 (48%), Positives = 86/137 (62%), Gaps = 16/137 (11%)
Query: 86 SNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNP 145
S+T PG K L YC+CFA+G +C+ CNC NC NN HE+ R +A+ A L+RNP
Sbjct: 236 SSTLPGPP-----KIALSGYCDCFASGDFCNNCNCNNCCNNWRHEIERFKAIKACLDRNP 290
Query: 146 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
AF+PKI G+ R HNKGC+CK+SGCLK YCEC++A I+CS C+
Sbjct: 291 EAFQPKIGKGRPGSAKPR-----------HNKGCNCKRSGCLKNYCECYEAKIMCSSICK 339
Query: 206 CLDCKNFEGSEERRALF 222
C+ CKN+E S ER+ L
Sbjct: 340 CVGCKNYEESPERKTLM 356
>gi|313238970|emb|CBY13958.1| unnamed protein product [Oikopleura dioica]
Length = 394
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 71/111 (63%), Gaps = 11/111 (9%)
Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YCECF G YC C+C CHN ++ RQ+A+ + L+RN +AF+PK+A S
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS----- 259
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
++ KH +GC+CK+SGCLKKYCEC+QA + C++ C C++CKN
Sbjct: 260 -----GKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVECKN 305
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 32/87 (36%)
Query: 47 RPRANTDGK--DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
+P+ GK + K + CNC+ S CLK K
Sbjct: 253 KPKVAKSGKQVNNKGKHLRGCNCKRSGCLK-----------------------------K 283
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHE 130
YCEC+ AG+ C D CNCV C N +H+
Sbjct: 284 YCECYQAGVPCNDICNCVECKNRDDHD 310
>gi|224143100|ref|XP_002324849.1| predicted protein [Populus trichocarpa]
gi|222866283|gb|EEF03414.1| predicted protein [Populus trichocarpa]
Length = 561
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 95/190 (50%), Gaps = 45/190 (23%)
Query: 41 QESPRSRPR----ANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNT 96
Q SP+ + + ++TDG DG K+CNC+ +RCLK
Sbjct: 405 QSSPKKKRQDHKVSSTDG-DGC----KRCNCKKTRCLK---------------------- 437
Query: 97 QFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS 155
+YC+CFAAGIYC + C C C N +E+E EA NP AF PKI
Sbjct: 438 -------RYCDCFAAGIYCAETCACQGCFNRLEYEDTVLEARQQKESHNPLAFAPKIVQ- 489
Query: 156 PHGAQDAREDAREAQL----AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
H + + +A L + +H GC+CK+S C+KKYCEC+QAN+ CS CRC CKN
Sbjct: 490 -HVTEFHAINVEDASLFTPSSGRHKTGCNCKRSMCVKKYCECYQANVGCSNACRCEGCKN 548
Query: 212 FEGSEERRAL 221
G +E L
Sbjct: 549 IHGRKEGEKL 558
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN---FEGSEERR 219
R+D + + K C+CKK+ CLK+YC+CF A I C+E C C C N +E +
Sbjct: 412 RQDHKVSSTDGDGCKRCNCKKTRCLKRYCDCFAAGIYCAETCACQGCFNRLEYEDTVLEA 471
Query: 220 ALFHGDHNGIAF 231
HN +AF
Sbjct: 472 RQQKESHNPLAF 483
>gi|355566221|gb|EHH22600.1| Metallothionein-like 5, testis-specific [Macaca mulatta]
Length = 508
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQP 352
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 353 KIGKGQLG-----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCK 401
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 402 NYEESPERKTLM 413
>gi|414871812|tpg|DAA50369.1| TPA: hypothetical protein ZEAMMB73_080577 [Zea mays]
Length = 662
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 71/219 (32%), Positives = 100/219 (45%), Gaps = 63/219 (28%)
Query: 43 SPRSRPRANTDGKDGTPKKQKQ-----------CNCRNSRCLKLDFVEIWINIYSNTSPG 91
S ++ P A+ G+ KK+++ CNCR S+CLKL
Sbjct: 334 SQKTMPNADKSGQQIHKKKRRKLQKDDGESCRHCNCRRSKCLKL---------------- 377
Query: 92 AHLNTQFKDNLVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLER----NPN 146
YC CFAA +YC C+C C NN HE E V T +R NP
Sbjct: 378 -------------YCPCFAAKVYCSSLCSCQGCSNNHAHE----ELVSCTRKRTESHNPL 420
Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
AF P + + + +D+ + +A+H +GC+C+KS CLKKYCECFQ+ + C +CRC
Sbjct: 421 AFAPTVTHTYGTVLEFGDDSNKTPASARHKRGCNCRKSSCLKKYCECFQSGVGCCISCRC 480
Query: 207 LDCKNFEGSEERRALF--------------HGDHNGIAF 231
+ CKN G E L HG+ + +AF
Sbjct: 481 MSCKNSFGKREGVLLLTTEKLEKGGKAKGTHGEEDKLAF 519
>gi|345784018|ref|XP_854573.2| PREDICTED: tesmin [Canis lupus familiaris]
Length = 516
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/148 (43%), Positives = 90/148 (60%), Gaps = 22/148 (14%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 301 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIQRFKAIKACLDRNPEAFQP 360
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 361 KIGKGKLG-----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCK 409
Query: 211 NFEGSEERRAL-----------FHGDHN 227
N+E S E++ L F G+H+
Sbjct: 410 NYEESPEQKTLMNMPNYMEIGGFEGNHH 437
>gi|12322849|gb|AAG51411.1|AC009465_11 hypothetical protein; 69435-72676 [Arabidopsis thaliana]
Length = 601
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 69/112 (61%), Gaps = 1/112 (0%)
Query: 104 KYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 162
+YCECF+AG++C + C+C NC N HE ++ RNP AF PK+ S+ D
Sbjct: 427 RYCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDL 486
Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+ + +A+H +GC+C+KSGC KKYCECF + CS NCRC+ CKN G
Sbjct: 487 WVENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFG 538
>gi|313245432|emb|CBY40166.1| unnamed protein product [Oikopleura dioica]
Length = 394
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 70/111 (63%), Gaps = 11/111 (9%)
Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YCECF G YC C+C CHN ++ RQ+A+ + L+RN +AF+PK+A S
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS----- 259
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
++ KH +GC+CK+SGCLKKYCEC+QA + C++ C C+ CKN
Sbjct: 260 -----GKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVGCKN 305
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 32/87 (36%)
Query: 47 RPRANTDGK--DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
+P+ GK + K + CNC+ S CLK K
Sbjct: 253 KPKVAKSGKQVNNKGKHLRGCNCKRSGCLK-----------------------------K 283
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHE 130
YCEC+ AG+ C D CNCV C N +H+
Sbjct: 284 YCECYQAGVPCNDICNCVGCKNRDDHD 310
>gi|355751890|gb|EHH56010.1| Metallothionein-like 5, testis-specific, partial [Macaca
fascicularis]
Length = 490
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+P
Sbjct: 275 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQP 334
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 335 KIGKGQLG-----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCK 383
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 384 NYEESPERKTLM 395
>gi|403301068|ref|XP_003941221.1| PREDICTED: tesmin [Saimiri boliviensis boliviensis]
Length = 263
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 85/132 (64%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN++HE+ R +A+ A L RNP AF+P
Sbjct: 39 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLQHEIERFKAIKACLGRNPEAFQP 98
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI +S G R H+KGC C++SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 99 KIGTSQLGDVKPR-----------HSKGCSCRRSGCLKNYCECYEAKIMCSSICKCIGCK 147
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 148 NYEESPERKTLM 159
>gi|326427032|gb|EGD72602.1| hypothetical protein PTSG_04338 [Salpingoeca sp. ATCC 50818]
Length = 584
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 58/140 (41%), Positives = 74/140 (52%), Gaps = 41/140 (29%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
+ CNC SRCLKL YCEC+AAG YC C C
Sbjct: 407 RSCNCGKSRCLKL-----------------------------YCECYAAGTYCGPQCKCT 437
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC NN+E+E RQ AV +TL++NP+AFRPKI Q+ + + A H +GC C
Sbjct: 438 NCQNNLENEGRRQLAVLSTLDKNPDAFRPKIK------QETTTNRK-----ANHLRGCKC 486
Query: 182 KKSGCLKKYCECFQANILCS 201
KS CL++YCECFQA + C+
Sbjct: 487 TKSKCLQRYCECFQAGVPCT 506
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
+ C+C KS CLK YCEC+ A C C+C +C+N +E RR L
Sbjct: 407 RSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQL 451
>gi|118352258|ref|XP_001009402.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89291169|gb|EAR89157.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 805
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 94/177 (53%), Gaps = 36/177 (20%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
+G+ KK + CNC+ ++CLKL YC+CFAAG +C
Sbjct: 471 NGSNKKTRICNCKKTKCLKL-----------------------------YCDCFAAGEFC 501
Query: 116 DG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
CNC C N V ++ AR + V LE+NP AF K +Q + + ++ LA+K
Sbjct: 502 GAECNCCECSNTVANKEARNKIVEGLLEKNPFAFNVKDIEIEEPSQLSLKAQKK--LASK 559
Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAF 231
KGC+C++SGCLKKYC+C+Q + C E+C+C C+N E + + LF + N ++
Sbjct: 560 --KGCNCRRSGCLKKYCQCYQDGLQCGEHCKCNGCENCE--KPAKKLFLAEQNSLSL 612
>gi|348564752|ref|XP_003468168.1| PREDICTED: tesmin-like [Cavia porcellus]
Length = 391
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 60/125 (48%), Positives = 81/125 (64%), Gaps = 11/125 (8%)
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
K L YC+CFA+G +C+ CNC NC NN+ HE+ R A+ L+RNP AF+PKI
Sbjct: 180 KITLAGYCDCFASGDFCNNCNCNNCCNNLHHEMERFNAIKVCLDRNPEAFQPKIGK---- 235
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
R + +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER
Sbjct: 236 -------GRSGDVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPER 288
Query: 219 RALFH 223
+ L +
Sbjct: 289 KTLMN 293
>gi|119595124|gb|EAW74718.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_a
[Homo sapiens]
Length = 529
Score = 108 bits (271), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 314 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 373
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 374 KIGKGQLG-----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCK 422
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 423 NYEESPERKTLM 434
>gi|412990962|emb|CCO18334.1| predicted protein [Bathycoccus prasinos]
Length = 671
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 64/108 (59%), Gaps = 15/108 (13%)
Query: 105 YCECFAAGIYCDGCNCVNCHNNV-EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YC+CFAAG++C C+C +C N + EV RQ RNPNAF KI
Sbjct: 359 YCDCFAAGVFCRDCSCQSCSNTEGDLEVVRQTRY-QIESRNPNAFANKIVD--------- 408
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
+ + AKH KGCHCKKS CLKKYCECFQAN+ C + C+C CKN
Sbjct: 409 ----DDSVDAKHAKGCHCKKSACLKKYCECFQANVRCQDYCKCEGCKN 452
>gi|359482667|ref|XP_003632803.1| PREDICTED: uncharacterized protein LOC100855293 [Vitis vinifera]
Length = 613
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 65/168 (38%), Positives = 84/168 (50%), Gaps = 34/168 (20%)
Query: 58 TPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-D 116
T +K CNC+ ++CLKL YC+C AAG+YC D
Sbjct: 282 TESDRKHCNCKRTQCLKL-----------------------------YCDCLAAGVYCTD 312
Query: 117 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD--AREDAREAQ-LAA 173
C C NC N E+E Q R+P AF P+I + D +ED +A
Sbjct: 313 SCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRIVNPDTDTTDNVLQEDGNWTTPSSA 372
Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEERRA 220
+H +GC+CKKS C KKYCEC+QA I CS+ CRC DC+N F +RRA
Sbjct: 373 RHKRGCNCKKSMCQKKYCECYQAGIGCSDGCRCEDCRNSFGIKADRRA 420
>gi|40806200|ref|NP_004914.2| tesmin isoform a [Homo sapiens]
gi|205371773|sp|Q9Y4I5.2|MTL5_HUMAN RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific; AltName: Full=Testis-specific
metallothionein-like protein
gi|40555823|gb|AAH64579.1| Metallothionein-like 5, testis-specific (tesmin) [Homo sapiens]
gi|119595125|gb|EAW74719.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_b
[Homo sapiens]
Length = 508
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 352
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 353 KIGKGQLG-----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCK 401
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 402 NYEESPERKTLM 413
>gi|426369505|ref|XP_004051728.1| PREDICTED: tesmin [Gorilla gorilla gorilla]
Length = 508
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 352
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 353 KIGKGQLG-----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCK 401
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 402 NYEESPERKTLM 413
>gi|351709791|gb|EHB12710.1| Tesmin [Heterocephalus glaber]
Length = 283
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 61/125 (48%), Positives = 83/125 (66%), Gaps = 11/125 (8%)
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
K L YC+CFA+G +C+ C+C NC NN+ HE+ R A+ A L+RNP AF+PKI S
Sbjct: 71 KITLAGYCDCFASGDFCNNCHCNNCCNNLHHEIERFNAIKACLDRNPEAFQPKIGKS--- 127
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
R + +HNKGC+CK+SGCLK YCEC++A I+CS C+C+DC N+E S ER
Sbjct: 128 --------RLGDVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIDCNNYEESPER 179
Query: 219 RALFH 223
+ L +
Sbjct: 180 KTLMN 184
>gi|119595126|gb|EAW74720.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_c
[Homo sapiens]
Length = 463
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 248 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 307
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 308 KIGKGQLG-----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCK 356
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 357 NYEESPERKTLM 368
>gi|387192746|gb|AFJ68672.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 510
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 72/151 (47%), Gaps = 44/151 (29%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
CNC+ SRCLKL YC+CFAA +C CNC NC
Sbjct: 225 CNCKRSRCLKL-----------------------------YCQCFAAQAFCIPSCNCQNC 255
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
N R EA+ +ERNPNAF+ K P+G EDA H GC C+K
Sbjct: 256 LNTAGQNHLRAEAIRQIMERNPNAFQTKF--RPNG-----EDA-------VHKMGCKCRK 301
Query: 184 SGCLKKYCECFQANILCSENCRCLDCKNFEG 214
S CLKKYCECF A CS+ C C+ C+N G
Sbjct: 302 SACLKKYCECFNAGARCSDKCSCVGCQNVAG 332
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%)
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
GC+CK+S CLK YC+CF A C +C C +C N G RA
Sbjct: 224 GCNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRA 266
>gi|37718857|gb|AAR01728.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 766
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 88/176 (50%), Gaps = 35/176 (19%)
Query: 41 QESP-RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
QE+P R R + D D K+C+C+ S+CLKL
Sbjct: 452 QENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKL------------------------ 483
Query: 100 DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YCECFA+ +YC + C+C C N+ HE RNP AF PK+ +
Sbjct: 484 -----YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGP 538
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+ ED+ +++H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CKN G
Sbjct: 539 GLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 594
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 471 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 511
>gi|327260145|ref|XP_003214896.1| PREDICTED: tesmin-like [Anolis carolinensis]
Length = 377
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/139 (45%), Positives = 85/139 (61%), Gaps = 21/139 (15%)
Query: 103 VKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 162
V YC+CFA G +C CNC NC+NN +HE+ R A+ A L+RNP AF PKI G
Sbjct: 174 VGYCDCFANGDFCCNCNCNNCYNNPQHEIERFNAIKACLDRNPEAFLPKIGKGKLG---- 229
Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
+ +HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E S ER+ L
Sbjct: 230 -------DIKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSICKCVGCKNYEESPERKTLI 282
Query: 223 ----------HGDHNGIAF 231
H +++ ++F
Sbjct: 283 NMPNDMEIKSHEENDAVSF 301
>gi|108710010|gb|ABF97805.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
Length = 757
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 88/176 (50%), Gaps = 35/176 (19%)
Query: 41 QESP-RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
QE+P R R + D D K+C+C+ S+CLKL
Sbjct: 443 QENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKL------------------------ 474
Query: 100 DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YCECFA+ +YC + C+C C N+ HE RNP AF PK+ +
Sbjct: 475 -----YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGP 529
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+ ED+ +++H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CKN G
Sbjct: 530 GLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 585
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 462 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 502
>gi|299472731|emb|CBN80299.2| TSO1; transcription factor (Partial) [Ectocarpus siliculosus]
Length = 792
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 77/172 (44%), Gaps = 48/172 (27%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCVNC 123
CNC+ S+CLKL YC+CFA + C CNC +C
Sbjct: 26 CNCKRSKCLKL-----------------------------YCQCFAMQVVCTASCNCHHC 56
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
+NN E R +A+ LERNP AF DA+ E A H GC C+K
Sbjct: 57 YNNANQESMRLQAINGVLERNPAAF------------DAKFRETEKHDAVVHKTGCKCRK 104
Query: 184 SGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRA 235
S CLKKYCECF + CSE C C+ C+N AL+ G A +RA
Sbjct: 105 SACLKKYCECFNKGVACSEKCNCVGCRNTV------ALYSEYRGGGAAAERA 150
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
A GC+CK+S CLK YC+CF ++C+ +C C C N E R
Sbjct: 20 APTTDGCNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYNNANQESMR 66
>gi|422294067|gb|EKU21367.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 511
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 72/151 (47%), Gaps = 44/151 (29%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
CNC+ SRCLKL YC+CFAA +C CNC NC
Sbjct: 225 CNCKRSRCLKL-----------------------------YCQCFAAQAFCIPSCNCQNC 255
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
N R EA+ +ERNPNAF+ K P+G EDA H GC C+K
Sbjct: 256 LNTAGQNHLRAEAIRQIMERNPNAFQTKF--RPNG-----EDA-------VHKMGCKCRK 301
Query: 184 SGCLKKYCECFQANILCSENCRCLDCKNFEG 214
S CLKKYCECF A CS+ C C+ C+N G
Sbjct: 302 SACLKKYCECFNAGARCSDKCSCVGCQNVAG 332
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%)
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
GC+CK+S CLK YC+CF A C +C C +C N G RA
Sbjct: 224 GCNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRA 266
>gi|222625427|gb|EEE59559.1| hypothetical protein OsJ_11844 [Oryza sativa Japonica Group]
Length = 644
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 88/176 (50%), Gaps = 35/176 (19%)
Query: 41 QESP-RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
QE+P R R + D D K+C+C+ S+CLKL
Sbjct: 330 QENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKL------------------------ 361
Query: 100 DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YCECFA+ +YC + C+C C N+ HE RNP AF PK+ +
Sbjct: 362 -----YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGP 416
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+ ED+ +++H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CKN G
Sbjct: 417 GLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 472
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 349 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 389
>gi|422292803|gb|EKU20105.1| tesmin tso1-like cxc domain-containing protein [Nannochloropsis
gaditana CCMP526]
Length = 365
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 75/213 (35%), Positives = 96/213 (45%), Gaps = 73/213 (34%)
Query: 41 QESPRSRPRANTDGKDGTP-----------KKQKQCNCRNSRCLKLDFVEIWINIYSNTS 89
QE+ RS P + G GTP + K+CNCR S+CLKL
Sbjct: 129 QEARRSPPLNGSGG--GTPQGRGTSGLAATRSSKKCNCRKSKCLKL-------------- 172
Query: 90 PGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 148
YCECFA G YC D C C++C+N + E R +AV + L RNP+AF
Sbjct: 173 ---------------YCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAF 217
Query: 149 RPKI------ASSP-----------HGAQDAREDAREAQL-------------AAKHNKG 178
KI A+ P G Q + D E + A H +G
Sbjct: 218 TEKIQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRG 277
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
C+CKKS C+KKYCECF A + C ENC+C C+N
Sbjct: 278 CNCKKSLCIKKYCECFFAGVYCGENCQCEGCQN 310
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 30/48 (62%)
Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
A + +K C+C+KS CLK YCECF C ++C CLDC N E+ R
Sbjct: 155 ATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVR 202
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 19/28 (67%), Gaps = 1/28 (3%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVE 128
+ KYCECF AG+YC + C C C NN E
Sbjct: 286 IKKYCECFFAGVYCGENCQCEGCQNNPE 313
>gi|397517193|ref|XP_003828803.1| PREDICTED: tesmin [Pan paniscus]
Length = 508
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K NL YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKINLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 352
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+C++SGCLK YCEC++A I+CS C+C+ C+
Sbjct: 353 KIGKGQLG-----------NVKPRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCE 401
Query: 211 NFEGSEERRALF 222
N E S ER+ L
Sbjct: 402 NCEESPERKTLM 413
>gi|332837107|ref|XP_001154282.2| PREDICTED: tesmin [Pan troglodytes]
Length = 508
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K NL YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKINLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 352
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+C++SGCLK YCEC++A I+CS C+C+ C+
Sbjct: 353 KIGKGQLG-----------NVKPRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCE 401
Query: 211 NFEGSEERRALF 222
N E S ER+ L
Sbjct: 402 NCEESPERKTLM 413
>gi|358340153|dbj|GAA48108.1| protein lin-54 homolog [Clonorchis sinensis]
Length = 450
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 71/161 (44%), Positives = 82/161 (50%), Gaps = 43/161 (26%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
+K CNC S CLKL YCECFA G C CNC
Sbjct: 144 RKPCNCTKSHCLKL-----------------------------YCECFAQGQLCQNCNCH 174
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC NN+ +E R A+ TLERNP AF PKI E R KH KGC+C
Sbjct: 175 NCMNNLAYEEERGRAIKMTLERNPTAFHPKIGRG--------EGER------KHTKGCNC 220
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
K+SGCLK YCEC++A I CS+ CRC C+N E S ERR+L
Sbjct: 221 KRSGCLKNYCECYEAKISCSDLCRCQGCRNTEDSVERRSLM 261
>gi|196011968|ref|XP_002115847.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
gi|190581623|gb|EDV21699.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
Length = 316
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/133 (48%), Positives = 82/133 (61%), Gaps = 13/133 (9%)
Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
YCECFA G +C CNC NCHNN+EHE R A+ A LERNP+AFRPKI + G +
Sbjct: 138 YCECFANGEFCKNCNCTNCHNNIEHENERSRAIKACLERNPHAFRPKIGKAKEGNER--- 194
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH- 223
+H+KGC+CK+SGCLK YCEC++A I C+ C+C +CKN E ER+ L
Sbjct: 195 ---------RHSKGCNCKRSGCLKNYCECYEAKIPCTVLCKCTNCKNREDCLERKNLMQL 245
Query: 224 GDHNGIAFMQRAA 236
D I Q+ A
Sbjct: 246 ADAADIRVKQQTA 258
>gi|326920244|ref|XP_003206384.1| PREDICTED: tesmin-like [Meleagris gallopavo]
Length = 622
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/126 (48%), Positives = 79/126 (62%), Gaps = 11/126 (8%)
Query: 95 NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
N Q K YCECFA+G +C+ CNC NC+NN HE R +A+ L+RNP AF PKI
Sbjct: 417 NLQVKMTSAGYCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQ 476
Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
S G + HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E
Sbjct: 477 SKLG-----------DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 525
Query: 215 SEERRA 220
S +++
Sbjct: 526 SPDKKT 531
>gi|4581563|gb|AAD24668.1|U86074_1 tesmin [Homo sapiens]
Length = 299
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/124 (47%), Positives = 81/124 (65%), Gaps = 11/124 (8%)
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
K L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI G
Sbjct: 92 KITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG 151
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
+ +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER
Sbjct: 152 -----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPER 200
Query: 219 RALF 222
+ L
Sbjct: 201 KTLM 204
>gi|118349584|ref|XP_001008073.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89289840|gb|EAR87828.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 471
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 80/172 (46%), Gaps = 50/172 (29%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-C 118
KKQK CNC+ ++CLKL YC+CFAAG C G C
Sbjct: 127 KKQKTCNCKKTKCLKL-----------------------------YCDCFAAGELCGGEC 157
Query: 119 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 178
NC C N E R A+ + ++R P+AF PK+ + H KG
Sbjct: 158 NCFGCFNTSNSE-KRAGAIVSIMDRQPDAFGPKVQQNTHK------------------KG 198
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE-GSEERRALFHGDHNGI 229
C+C +SGCLKKYCEC+ + C E C+C CKN + ER AL NGI
Sbjct: 199 CNCTRSGCLKKYCECYTLGVNCGEYCKCTQCKNVDLNHSERLALSMNGSNGI 250
>gi|387219095|gb|AFJ69256.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 365
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 74/213 (34%), Positives = 96/213 (45%), Gaps = 73/213 (34%)
Query: 41 QESPRSRPRANTDGKDGTP-----------KKQKQCNCRNSRCLKLDFVEIWINIYSNTS 89
QE+ RS P + G GTP + K+CNCR S+CLKL
Sbjct: 129 QEARRSPPLNGSGG--GTPQGRGTSGLAATRSSKKCNCRKSKCLKL-------------- 172
Query: 90 PGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 148
YCECFA G YC D C C++C+N + E R +AV + L RNP+AF
Sbjct: 173 ---------------YCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAF 217
Query: 149 RPKI------ASSP-----------HGAQDAREDAREAQL-------------AAKHNKG 178
KI A+ P G Q + D E + A H +G
Sbjct: 218 TEKIQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRG 277
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
C+CKKS C++KYCECF A + C ENC+C C+N
Sbjct: 278 CNCKKSLCIQKYCECFFAGVYCGENCQCEGCQN 310
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 30/48 (62%)
Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
A + +K C+C+KS CLK YCECF C ++C CLDC N E+ R
Sbjct: 155 ATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVR 202
Score = 40.0 bits (92), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 19/28 (67%), Gaps = 1/28 (3%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVE 128
+ KYCECF AG+YC + C C C NN E
Sbjct: 286 IQKYCECFFAGVYCGENCQCEGCQNNPE 313
>gi|307103614|gb|EFN51873.1| hypothetical protein CHLNCDRAFT_139785 [Chlorella variabilis]
Length = 929
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 65/116 (56%), Gaps = 14/116 (12%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YC+CFA G YC C+C +C N VE+ A +RNPNAF KI +
Sbjct: 541 YCDCFANGGYCGPACSCSSCANKVENRAMVAAQREAIKQRNPNAFVQKIEA--------- 591
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+A L +H +GC+CKKS C+KKYCECFQA + C E+C+C C N G RR
Sbjct: 592 ----DAALGGQHRRGCNCKKSHCMKKYCECFQAGVPCGEHCKCESCHNTAGHSGRR 643
>gi|118383712|ref|XP_001025010.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89306777|gb|EAS04765.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1040
Score = 105 bits (261), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 82/184 (44%), Gaps = 54/184 (29%)
Query: 58 TPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG 117
T K++ +CNC+ S+CLKL YC+CF G +C+
Sbjct: 707 TSKQKVKCNCKKSKCLKL-----------------------------YCDCFNQGQFCNS 737
Query: 118 -CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK-- 174
CNC C N ++ R+ A+ ERNP+AF+PKI + Q + L
Sbjct: 738 ECNCTECSNTENNKAERESAIKQLQERNPDAFKPKIQTKEQIIQQYLSKLQIFMLIISSL 797
Query: 175 ----------------------HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
HNKGC+CKKSGC KKYCEC+ + CS+ C+C C+N
Sbjct: 798 LSKNQIQYLIPKIIKDDEEILIHNKGCNCKKSGCEKKYCECYNTGVKCSDQCKCEGCRNR 857
Query: 213 EGSE 216
+ SE
Sbjct: 858 DPSE 861
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 28/50 (56%)
Query: 170 QLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
Q +K C+CKKS CLK YC+CF C+ C C +C N E ++ R
Sbjct: 705 QTTSKQKVKCNCKKSKCLKLYCDCFNQGQFCNSECNCTECSNTENNKAER 754
>gi|118399021|ref|XP_001031837.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89286171|gb|EAR84174.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 742
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 76/157 (48%), Gaps = 48/157 (30%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
+ +P + K CNCRNS+CLK +YC+CFAAG+YC
Sbjct: 493 EKSPTEAKPCNCRNSKCLK-----------------------------RYCDCFAAGLYC 523
Query: 116 DG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
C C CHN E+E RQ+A+ +R +AF P I ++ H
Sbjct: 524 QAECKCEECHNKPEYEDERQKAIQKKKKRIKDAFLPVIQNNSHI---------------- 567
Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
KGCHCK S C KKYC C Q +LCS C+C++C+N
Sbjct: 568 --KGCHCKNSHCQKKYCVCHQNGVLCSSLCQCVECEN 602
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 27/43 (62%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
K C+C+ S CLK+YC+CF A + C C+C +C N E+ R
Sbjct: 500 KPCNCRNSKCLKRYCDCFAAGLYCQAECKCEECHNKPEYEDER 542
>gi|326495114|dbj|BAJ85653.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 766
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 94/181 (51%), Gaps = 39/181 (21%)
Query: 41 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
Q SP+ + R D DGT K+C+C+ S+CLKL
Sbjct: 440 QGSPKKK-RHKFDDGDGT--SCKRCSCKKSKCLKL------------------------- 471
Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSP 156
YCECF AG++C + C+C +C N N+E ++ ++ + + RNP AF PK+ +
Sbjct: 472 ----YCECFHAGVFCSEPCSCQDCLNKPSNMEIVLSTRKQIES---RNPLAFAPKVIRTS 524
Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
Q+ E + +A+H +GC+CKKS CLKKYCEC+Q + CS +CR CKN G
Sbjct: 525 EPGQELGEYSNRTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRYEGCKNAFGRR 584
Query: 217 E 217
E
Sbjct: 585 E 585
>gi|444723239|gb|ELW63898.1| Protein lin-54 like protein [Tupaia chinensis]
Length = 783
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 60/99 (60%), Gaps = 12/99 (12%)
Query: 139 ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANI 198
A L+RNP AF+PKI G D R H+KGC+CK+SGCLK YCEC++A I
Sbjct: 604 ACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKI 652
Query: 199 LCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 653 MCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 691
>gi|118091374|ref|XP_001231493.1| PREDICTED: tesmin [Gallus gallus]
Length = 503
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/126 (47%), Positives = 78/126 (61%), Gaps = 11/126 (8%)
Query: 95 NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
N K YCECFA+G +C+ CNC NC+NN HE R +A+ L+RNP AF PKI
Sbjct: 297 NLHVKMTSAGYCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQ 356
Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
S G + HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E
Sbjct: 357 SELG-----------DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 405
Query: 215 SEERRA 220
S +++
Sbjct: 406 SPDKKT 411
>gi|302789393|ref|XP_002976465.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
gi|300156095|gb|EFJ22725.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
Length = 710
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 90/205 (43%), Gaps = 64/205 (31%)
Query: 45 RSRPRANTDGKDG----TPKKQ------------KQCNCRNSRCLKLDFVEIWINIYSNT 88
R R +D DG TPKK+ K+C+C+ S+CLKL
Sbjct: 356 RKRLSMQSDSMDGEMGRTPKKRRKSLGEKAGDGCKRCHCKKSKCLKL------------- 402
Query: 89 SPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNA 147
YCECFAAG+YC D C C +C N E RNP A
Sbjct: 403 ----------------YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLA 446
Query: 148 FRPKIAS--------------SPHGAQDA----REDAREAQLAAKHNKGCHCKKSGCLKK 189
F PKI S H ++ + + + +A+H +GC+CKKS CLKK
Sbjct: 447 FAPKIVQTSDNTPTEGVGIWFSGHAMTNSLFALQPNCPDTTGSARHKRGCNCKKSLCLKK 506
Query: 190 YCECFQANILCSENCRCLDCKNFEG 214
YCEC+QA + CS+ C+C CKN G
Sbjct: 507 YCECYQAGVGCSDGCKCNGCKNIYG 531
>gi|159478621|ref|XP_001697401.1| hypothetical protein CHLREDRAFT_105644 [Chlamydomonas reinhardtii]
gi|158274559|gb|EDP00341.1| predicted protein [Chlamydomonas reinhardtii]
Length = 91
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 52/92 (56%), Positives = 61/92 (66%), Gaps = 1/92 (1%)
Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
YCECFA+ YC+ CNC+ C NN E+E RQ AV A +ERNPNAF+PKI H
Sbjct: 1 YCECFASSRYCEMCNCMQCFNNRENEAVRQSAVEAIMERNPNAFKPKITVRGHETHTPVV 60
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 196
A A + +H KGC+CKKS CLKKYCECFQ
Sbjct: 61 VA-AAGASGRHLKGCNCKKSFCLKKYCECFQV 91
>gi|256089961|ref|XP_002581002.1| tesmin-related [Schistosoma mansoni]
Length = 364
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 71/161 (44%), Positives = 82/161 (50%), Gaps = 43/161 (26%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
+K CNC S CLKL YCECFA G C CNC
Sbjct: 145 RKPCNCTKSHCLKL-----------------------------YCECFAQGQLCQNCNCN 175
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC NN+ +E R A+ TLERNP AF PKI E R KH KGC+C
Sbjct: 176 NCMNNLAYEEERGRAIKMTLERNPTAFHPKIGRG--------EGER------KHTKGCNC 221
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
K+SGCLK YCEC++A I CS+ CRC C+N E S ERR+L
Sbjct: 222 KRSGCLKNYCECYEAKISCSDLCRCQGCRNTEDSVERRSLM 262
Score = 42.0 bits (97), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 42/106 (39%), Gaps = 34/106 (32%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
+G K K CNC+ S CLK YCEC+ A I C
Sbjct: 210 EGERKHTKGCNCKRSGCLK-----------------------------NYCECYEAKISC 240
Query: 116 -DGCNCVNCHN---NVEHE-VARQEAVGATLERNPNAFRPKIASSP 156
D C C C N +VE + R A+GA R P++F+ + P
Sbjct: 241 SDLCRCQGCRNTEDSVERRSLMRLAAMGALRSRQPSSFKKHLVKPP 286
>gi|360044492|emb|CCD82040.1| tesmin-related [Schistosoma mansoni]
Length = 363
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 71/161 (44%), Positives = 82/161 (50%), Gaps = 43/161 (26%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
+K CNC S CLKL YCECFA G C CNC
Sbjct: 144 RKPCNCTKSHCLKL-----------------------------YCECFAQGQLCQNCNCN 174
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC NN+ +E R A+ TLERNP AF PKI E R KH KGC+C
Sbjct: 175 NCMNNLAYEEERGRAIKMTLERNPTAFHPKIGRG--------EGER------KHTKGCNC 220
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
K+SGCLK YCEC++A I CS+ CRC C+N E S ERR+L
Sbjct: 221 KRSGCLKNYCECYEAKISCSDLCRCQGCRNTEDSVERRSLM 261
Score = 42.0 bits (97), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 42/106 (39%), Gaps = 34/106 (32%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
+G K K CNC+ S CLK YCEC+ A I C
Sbjct: 209 EGERKHTKGCNCKRSGCLK-----------------------------NYCECYEAKISC 239
Query: 116 -DGCNCVNCHN---NVEHE-VARQEAVGATLERNPNAFRPKIASSP 156
D C C C N +VE + R A+GA R P++F+ + P
Sbjct: 240 SDLCRCQGCRNTEDSVERRSLMRLAAMGALRSRQPSSFKKHLVKPP 285
>gi|9279694|dbj|BAB01251.1| DNA binding protein-like [Arabidopsis thaliana]
Length = 593
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 69/135 (51%), Gaps = 30/135 (22%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG YC + C+C+
Sbjct: 328 KRCNCKKSKCLKL-----------------------------YCECFAAGFYCIEPCSCI 358
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC N H+ RNP AF PK+ + + EDA + +A+H +GC+C
Sbjct: 359 NCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNC 418
Query: 182 KKSGCLKKYCECFQA 196
KKS CLKKYCEC+Q+
Sbjct: 419 KKSNCLKKYCECYQS 433
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 58/128 (45%), Gaps = 8/128 (6%)
Query: 91 GAHLNT---QFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQ----EAVGATLER 143
G HLNT KD V F+ I + + + +H++ R+ + G +E
Sbjct: 236 GLHLNTIAMSSKDKNVANEYSFSGNIKVGVQSSLTPVLHSQHDIVRENESGKDSGQIIEV 295
Query: 144 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 203
P + + +P + R + ++ K C+CKKS CLK YCECF A C E
Sbjct: 296 VPKSLA-SVDLTPISPKKKRRKSEQSGEGDSSCKRCNCKKSKCLKLYCECFAAGFYCIEP 354
Query: 204 CRCLDCKN 211
C C++C N
Sbjct: 355 CSCINCFN 362
>gi|219109951|ref|XP_002176728.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411263|gb|EEC51191.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 490
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 40/151 (26%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCNCVNC 123
C C+ +RCLKL YC+CF IYC C C++C
Sbjct: 217 CTCKKTRCLKL-----------------------------YCQCFGVKIYCGPNCRCLDC 247
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK---HNKGCH 180
HN E ARQ A+ L RNP+AF K +P D + +K H GC
Sbjct: 248 HNVPAQEDARQNAMKVILSRNPHAFDTKFQKTP-------VDGATVETPSKLLTHKLGCK 300
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKN 211
C+KS C+KKYCEC+ ++ C+ +CRC CKN
Sbjct: 301 CRKSACMKKYCECYAGHVYCNTHCRCTGCKN 331
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/67 (46%), Positives = 37/67 (55%), Gaps = 6/67 (8%)
Query: 159 AQDAREDARE-AQLAAKHNK-----GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
A D RE A +Q+ H + GC CKK+ CLK YC+CF I C NCRCLDC N
Sbjct: 191 ANDGREGATSPSQIETDHLEFVQAVGCTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNV 250
Query: 213 EGSEERR 219
E+ R
Sbjct: 251 PAQEDAR 257
>gi|168010127|ref|XP_001757756.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691032|gb|EDQ77396.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 113
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 67/111 (60%), Gaps = 1/111 (0%)
Query: 105 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCECFAA ++C G C C NC N E+E R+P AF PKI + R
Sbjct: 1 YCECFAARLFCVGSCACRNCFNKPEYEATVLNTRQQIEARDPLAFAPKIVQAAEPTPIPR 60
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+DA + +A+H +GC+CKKS CLKKYCEC+QA + CSE CRC CKN G
Sbjct: 61 DDALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCKNMYG 111
>gi|301095347|ref|XP_002896774.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262108657|gb|EEY66709.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 394
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 89/189 (47%), Gaps = 51/189 (26%)
Query: 59 PKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG- 117
P++ K C+C+ S CLKL YCEC AA CD
Sbjct: 45 PERTKGCSCKKSNCLKL-----------------------------YCECLAAQRMCDHR 75
Query: 118 CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
CNC C N E R+ AV A L+RNP AF+PK+AS +++H +
Sbjct: 76 CNCEGCKNCSETLAERERAVAAILDRNPLAFQPKVASG----------------SSQHLR 119
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG--SEERRALFHG---DHNGIAFM 232
GC+C+KSGC+K YCEC QA + C+ C C C+N E S ++ +F G D N +
Sbjct: 120 GCNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAGVSADDNDLRVT 179
Query: 233 QRAANAAII 241
R + ++
Sbjct: 180 IRPSQRKLV 188
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 3/75 (4%)
Query: 168 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE---GSEERRALFHG 224
+ Q + KGC CKKS CLK YCEC A +C C C CKN ER
Sbjct: 40 QVQTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAIL 99
Query: 225 DHNGIAFMQRAANAA 239
D N +AF + A+ +
Sbjct: 100 DRNPLAFQPKVASGS 114
>gi|428180550|gb|EKX49417.1| hypothetical protein GUITHDRAFT_43196, partial [Guillardia theta
CCMP2712]
Length = 113
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 77/151 (50%), Gaps = 41/151 (27%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNC 120
++K C+C+ SRCLKL YCECFAAG C GC C
Sbjct: 4 RKKVCSCKKSRCLKL-----------------------------YCECFAAGEICSGCKC 34
Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
V+C N+ +HE R +AV +RN NAF PKI Q H +GC
Sbjct: 35 VDCANDGDHEDMRLQAVDTIKQRNNNAFAPKIVDEIQ------------QDKGMHARGCR 82
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKN 211
CKKS CLKKYCEC+QA + C++ C+C +C+N
Sbjct: 83 CKKSHCLKKYCECYQAGVQCTDKCKCEECQN 113
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
K C CKKS CLK YCECF A +CS C+C+DC N E+ R
Sbjct: 6 KVCSCKKSRCLKLYCECFAAGEICS-GCKCVDCANDGDHEDMR 47
>gi|348686835|gb|EGZ26649.1| hypothetical protein PHYSODRAFT_473582 [Phytophthora sojae]
Length = 407
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 81/169 (47%), Gaps = 48/169 (28%)
Query: 59 PKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG- 117
P++ K C+C+ S CLKL YCEC AA CD
Sbjct: 45 PERTKGCSCKKSNCLKL-----------------------------YCECLAAQRMCDHL 75
Query: 118 CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
CNC C N E R+ AV A LERNP AF PK+AS +++H +
Sbjct: 76 CNCEGCKNCQETLAERERAVAAILERNPLAFLPKVASG----------------SSQHLR 119
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG--SEERRALFHG 224
GC+C+KSGC+K YCEC QA + C+ C C C+N E S ++ +F G
Sbjct: 120 GCNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAG 168
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 37/73 (50%), Gaps = 3/73 (4%)
Query: 168 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE---GSEERRALFHG 224
+ Q + KGC CKKS CLK YCEC A +C C C CKN + ER
Sbjct: 40 QVQTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAIL 99
Query: 225 DHNGIAFMQRAAN 237
+ N +AF+ + A+
Sbjct: 100 ERNPLAFLPKVAS 112
>gi|323456808|gb|EGB12674.1| hypothetical protein AURANDRAFT_60658 [Aureococcus anophagefferens]
Length = 1430
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/167 (35%), Positives = 80/167 (47%), Gaps = 43/167 (25%)
Query: 54 GKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGI 113
G+DG+ K CNC+ S+CLKL YCEC+AAG+
Sbjct: 1003 GRDGSGGGFKPCNCKKSKCLKL-----------------------------YCECYAAGV 1033
Query: 114 YCDGCNCVNCHNNVEHEVAR-----QEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 168
+C CNC NC N E A A+G L A + +S AQ R+
Sbjct: 1034 FCRDCNCSNCLNKPEDGEAHPPYPATHALG-RLRHGDVAVPTAVLTS---AQK-----RK 1084
Query: 169 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
Q + GC CKKS CLKKYCECF+A + C +C+C +C+N++GS
Sbjct: 1085 PQSKQRQESGCFCKKSKCLKKYCECFEAGVHCEASCKCENCENYDGS 1131
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 44/88 (50%), Gaps = 21/88 (23%)
Query: 144 NPNAFRPKIASSP-------HGAQDAREDAREAQL---AAKHNKG----------CHCKK 183
P+A P AS+P G+ D+R A A L +A+ KG C+CKK
Sbjct: 959 TPSAMAPPPASTPLCGPPPPPGSADSRGRAAGATLVSASARKRKGRDGSGGGFKPCNCKK 1018
Query: 184 SGCLKKYCECFQANILCSENCRCLDCKN 211
S CLK YCEC+ A + C +C C +C N
Sbjct: 1019 SKCLKLYCECYAAGVFC-RDCNCSNCLN 1045
>gi|146174407|ref|XP_001019363.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|146144792|gb|EAR99118.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1114
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 77/155 (49%), Gaps = 38/155 (24%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
CNC ++CLKL YC CF G+ C D C C +C
Sbjct: 714 CNCGKTKCLKL-----------------------------YCVCFQQGLMCSDQCRCTDC 744
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA---QDA-REDAR---EAQLAAKHN 176
+N + + R+ A+ A +R P+AF K+ P A +DA R DA + QL A H
Sbjct: 745 YNKIAYLEERERAIKAIKQRYPDAFEKKVNFLPELARKIEDAERLDADLELQNQLLA-HK 803
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
KGC+CKKS C KKYCECF A + C+ CRC C+N
Sbjct: 804 KGCNCKKSACKKKYCECFLAGVKCTYLCRCEQCQN 838
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 24/34 (70%)
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
C+C K+ CLK YC CFQ ++CS+ CRC DC N
Sbjct: 713 ACNCGKTKCLKLYCVCFQQGLMCSDQCRCTDCYN 746
>gi|339238897|ref|XP_003381003.1| conserved hypothetical protein [Trichinella spiralis]
gi|316976020|gb|EFV59373.1| conserved hypothetical protein [Trichinella spiralis]
Length = 492
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 63/113 (55%), Gaps = 12/113 (10%)
Query: 111 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 170
A + C+C NC NN+ +E+ R A+ + LERNP AF+PKI R D
Sbjct: 275 APVLKSNCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIG-------KGRADTERL- 326
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
HNKGC+CKKS CLK YCEC++A + C+ C+C+ C+N E R H
Sbjct: 327 ----HNKGCNCKKSSCLKNYCECYEARVSCTVRCKCVGCRNTEADRNHRNRGH 375
>gi|118373056|ref|XP_001019722.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89301489|gb|EAR99477.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 495
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 61/159 (38%), Positives = 75/159 (47%), Gaps = 32/159 (20%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
D K ++ C C+NS+CLKL YCECFA G YC
Sbjct: 247 DSEQKAREGCRCKNSKCLKL-----------------------------YCECFAKGAYC 277
Query: 116 -DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
D C C+ C N E E +EA TL RNP+AF K+ QD +A L K
Sbjct: 278 RDICKCLQCSNTEECENEIKEARKVTLTRNPDAFTSKLEVVGTIVQDEDIEANRGMLGYK 337
Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
KGC CK++ C KKYCEC+ A + CS C C +C N E
Sbjct: 338 --KGCKCKRTYCKKKYCECYNAGVKCSYLCICENCHNQE 374
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 29/56 (51%), Gaps = 9/56 (16%)
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
D+ + ARE GC CK S CLK YCECF C + C+CL C N E E
Sbjct: 247 DSEQKARE---------GCRCKNSKCLKLYCECFAKGAYCRDICKCLQCSNTEECE 293
>gi|119626323|gb|EAX05918.1| hypothetical protein DKFZp686L1814, isoform CRA_b [Homo sapiens]
Length = 223
Score = 98.6 bits (244), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 60/99 (60%), Gaps = 12/99 (12%)
Query: 126 NVEHEV-ARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
NV + V Q A L+RNP AF+PKI G D R H+KGC+CK+S
Sbjct: 30 NVGYAVLPAQYVTQACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRS 78
Query: 185 GCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
GCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 79 GCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 117
>gi|390470886|ref|XP_003734371.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Callithrix jacchus]
Length = 466
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 55/132 (41%), Positives = 82/132 (62%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L +C+CFA+G + + CNC NC NN+ HE+ R +A+ A L R+P AF+P
Sbjct: 294 GSALPGPPKTTLAGHCDCFASGDFGNNCNCNNCCNNLHHEIERFKAIKACLGRDPEAFQP 353
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
+I G ++ +H+KGC+C++ GCL+ YCEC +A I+CS C+C+ CK
Sbjct: 354 QIGKGQLG-----------EVRPRHSKGCNCRRLGCLRNYCECQEAKIMCSSICKCIGCK 402
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 403 NYEESPERKTLM 414
>gi|118355648|ref|XP_001011083.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89292850|gb|EAR90838.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 974
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 76/153 (49%), Gaps = 36/153 (23%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
CNC+ ++CLKL YC+CFA G C C CV C
Sbjct: 636 CNCKKTKCLKL-----------------------------YCDCFAVGELCGQDCKCVEC 666
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK--GCHC 181
NN R +A+ L+RNP AF K P A+ ++++ + L ++K GC+C
Sbjct: 667 CNNEATLNLRNQAIEGILQRNPFAFNVK----PEEAKKSKKELNNSILQNSNSKVIGCNC 722
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+KS CLKKYC C+ + + CSE C+C +CKN E
Sbjct: 723 RKSNCLKKYCLCYHSGMKCSEACKCTECKNLEN 755
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 28/41 (68%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
C+CKK+ CLK YC+CF LC ++C+C++C N E + R
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEATLNLR 676
>gi|449274469|gb|EMC83611.1| Tesmin, partial [Columba livia]
Length = 289
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/169 (42%), Positives = 92/169 (54%), Gaps = 18/169 (10%)
Query: 92 AHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 151
+H++ Q K YC+CFA G +C CNC NC NN HE R A+ A LERNP AF PK
Sbjct: 138 SHMHGQVKMTSEGYCDCFANGDFCKNCNCDNCCNNQLHETERLTAIKACLERNPEAFLPK 197
Query: 152 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
I G ++ HNKGC+CK+SGCL+ YCECF+A I+CS C+C CKN
Sbjct: 198 IGKRKLG-----------EIKHHHNKGCNCKRSGCLENYCECFEAKIVCSSICKCTGCKN 246
Query: 212 FEGS-EERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRK 259
+E S +E + L N I +M N V +S + L S K +
Sbjct: 247 YEESPDENKQL-----NVINYMD-IGNKEGNSPVLTSAFNMLPKSRKDR 289
>gi|224000241|ref|XP_002289793.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975001|gb|EED93330.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 712
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 77/176 (43%), Gaps = 58/176 (32%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCVNC 123
C C+ S+CLKL YC CF+A C C C C
Sbjct: 338 CTCKKSKCLKL-----------------------------YCHCFSANSMCSSFCRCTEC 368
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE-----DAREAQL------A 172
N E R++AV ATL+RNPNAFR K G + A+E A++ QL
Sbjct: 369 MNTTEAGTVREDAVKATLQRNPNAFRSKFVKVTGGEEFAKEWEKNERAKQMQLLNVGDSP 428
Query: 173 AK-----------------HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
AK H GC+C+KS CLK+YCECF A C NC+CL C+N
Sbjct: 429 AKTKTTGVEMAAVAIANPTHKLGCNCRKSFCLKRYCECFGAQTPCGLNCKCLGCQN 484
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 61/153 (39%), Gaps = 23/153 (15%)
Query: 131 VARQEAVGATLERNP--NAFRPKIASSPHGAQDARED----AREAQLAAKHNKG-----C 179
A A ++ E NP N P I S P R A A+L A H C
Sbjct: 279 TAAATAFKSSTETNPPANYPAPTIPSRPSVPVAQRSSIVLQAATARLKAHHYTDNSSVTC 338
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFE--GSEERRALFHGDHNGIAFMQRAAN 237
CKKS CLK YC CF AN +CS CRC +C N G+ A+ A +QR N
Sbjct: 339 TCKKSKCLKLYCHCFSANSMCSSFCRCTECMNTTEAGTVREDAV-------KATLQRNPN 391
Query: 238 A---AIIGAVGSSGYGTLMTSNKRKSEELLFGV 267
A + G + N+R + L V
Sbjct: 392 AFRSKFVKVTGGEEFAKEWEKNERAKQMQLLNV 424
>gi|118355040|ref|XP_001010781.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89292548|gb|EAR90536.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 978
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 55/148 (37%), Positives = 73/148 (49%), Gaps = 38/148 (25%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
CNC+ ++CLKL YCECFA YC +GC+C C
Sbjct: 405 CNCKKTKCLKL-----------------------------YCECFAKLGYCGEGCHCHEC 435
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
N E+E RQ A+ RN +AF+PK S Q+ + A Q+ KGC+C+K
Sbjct: 436 KNRPENEDERQNAIQEAKSRNNDAFQPKTESE----QERKNKANNEQMK----KGCNCRK 487
Query: 184 SGCLKKYCECFQANILCSENCRCLDCKN 211
+ CLKKYCECF A C+ C+C C+N
Sbjct: 488 TKCLKKYCECFNAGTYCNNMCKCDSCEN 515
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 26/41 (63%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
C+CKK+ CLK YCECF C E C C +CKN +E+ R
Sbjct: 405 CNCKKTKCLKLYCECFAKLGYCGEGCHCHECKNRPENEDER 445
>gi|307110312|gb|EFN58548.1| hypothetical protein CHLNCDRAFT_140658 [Chlorella variabilis]
Length = 664
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/157 (35%), Positives = 72/157 (45%), Gaps = 48/157 (30%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGC 118
++Q+ CNC+ S CLK+ YCECFAAG +C C
Sbjct: 260 REQRPCNCKRSMCLKM-----------------------------YCECFAAGGFCAPSC 290
Query: 119 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 178
+C++C N A L +NPNAF K+ + A H +G
Sbjct: 291 SCLSCSNTPAEMGVVMAAREVVLAKNPNAFEVKVTA-----------------ATGHRRG 333
Query: 179 CHCKKSGCLKKYCECFQANILCS-ENCRCLDCKNFEG 214
C CK+S CLKKYCECF A C+ E C+C DC+N EG
Sbjct: 334 CRCKRSKCLKKYCECFHAGARCNPEVCQCEDCRNTEG 370
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 26/39 (66%)
Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
A+ + C+CK+S CLK YCECF A C+ +C CL C N
Sbjct: 259 AREQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSN 297
>gi|118384767|ref|XP_001025523.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89307290|gb|EAS05278.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1022
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 71/147 (48%), Gaps = 35/147 (23%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
C C+ ++CLKL YCECF A YC G NC C
Sbjct: 766 CGCKKTKCLKL-----------------------------YCECFQAMKYCVGTNCQGCL 796
Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
N E+E R+ A+ +RN +AF PK S Q + D+ A H+KGC+CKKS
Sbjct: 797 NKPEYEDQRKHAMELIQQRNSSAFDPKADKS---DQYYKSDSG---TKAIHSKGCNCKKS 850
Query: 185 GCLKKYCECFQANILCSENCRCLDCKN 211
C KKYCEC+Q C+ C+C +CKN
Sbjct: 851 DCRKKYCECYQMGAKCTHLCKCYNCKN 877
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 30/59 (50%), Gaps = 2/59 (3%)
Query: 179 CHCKKSGCLKKYCECFQANILC-SENCR-CLDCKNFEGSEERRALFHGDHNGIAFMQRA 235
C CKK+ CLK YCECFQA C NC+ CL+ +E + N AF +A
Sbjct: 766 CGCKKTKCLKLYCECFQAMKYCVGTNCQGCLNKPEYEDQRKHAMELIQQRNSSAFDPKA 824
>gi|403353185|gb|EJY76132.1| Tso1, putative [Oxytricha trifallax]
Length = 712
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 78/179 (43%), Gaps = 57/179 (31%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
K CNC+ S+CLKL YCECFA +C C C
Sbjct: 449 KSCNCKKSKCLKL-----------------------------YCECFANNKFCGANCACN 479
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR----------EAQL 171
C N+ EH+ R A L RNP AFRPK+ + +D + +A L
Sbjct: 480 GCSNHAEHDDERLHAKEQILMRNPLAFRPKVETQQDETSLFSKDIQITKINIQEQSDALL 539
Query: 172 AA-----------------KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+ +H +GC+CKKS C KKYCECFQ + CS+ C+C +CKN E
Sbjct: 540 SGLSKPQTFGALAVSDKDKRHFRGCNCKKSNCQKKYCECFQQGVNCSDLCKCEECKNNE 598
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 44/97 (45%), Gaps = 7/97 (7%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-----NGIAF 231
K C+CKKS CLK YCECF N C NC C C N ++ R H N +AF
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDER--LHAKEQILMRNPLAF 506
Query: 232 MQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVA 268
+ ++ S + + +S+ LL G++
Sbjct: 507 RPKVETQQDETSLFSKDIQITKINIQEQSDALLSGLS 543
>gi|384246119|gb|EIE19610.1| hypothetical protein COCSUDRAFT_9740, partial [Coccomyxa
subellipsoidea C-169]
Length = 90
Score = 95.5 bits (236), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 64/95 (67%), Positives = 68/95 (71%), Gaps = 9/95 (9%)
Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI-ASSPH---GAQ 160
YCECFA+G YCDGCNCVNC NN E E RQ AV A LERNPNAFRPKI P G
Sbjct: 1 YCECFASGRYCDGCNCVNCCNNKESEQVRQAAVEAILERNPNAFRPKIQVLEPFCTTGTH 60
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 195
A + A AA+HNKGC+CKKSGCLKKYCECFQ
Sbjct: 61 VAIQGA-----AARHNKGCNCKKSGCLKKYCECFQ 90
>gi|242033599|ref|XP_002464194.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
gi|241918048|gb|EER91192.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
Length = 641
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 83/190 (43%), Gaps = 38/190 (20%)
Query: 45 RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
R R N DGK + C+C+ S+CLKL
Sbjct: 344 RRRKLQNDDGKSC-----RHCSCKKSKCLKL----------------------------- 369
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YC CFAA +YC + C+C C NN HE A T RNP AF P + +
Sbjct: 370 YCACFAAKVYCSEFCSCQGCSNNHMHEEAVSHIRKQTESRNPLAFAPTVTRKCGSVSELG 429
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS--ENCRCLDCKNFEGSEERRAL 221
+D+ +A+H +GC+C+KS CLKKYCECFQ +L E L + + E +
Sbjct: 430 DDSNNTPASARHKRGCNCRKSSCLKKYCECFQVLMLLFSLEGVLLLTTEKLKKGGEAKGT 489
Query: 222 FHGDHNGIAF 231
HG +AF
Sbjct: 490 -HGKEEKLAF 498
>gi|330843039|ref|XP_003293472.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
gi|325076199|gb|EGC30006.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
Length = 478
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 55/149 (36%), Positives = 69/149 (46%), Gaps = 60/149 (40%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
K+C+C+NS+CLK+ YCECFA I C+ C C
Sbjct: 103 KKCHCKNSKCLKM-----------------------------YCECFANKILCNNCQCFG 133
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C NN A E +G ++ L KH++GCHC+
Sbjct: 134 CQNNE----ANIEKIGND---------------------------QSVLIDKHSRGCHCR 162
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKN 211
KSGCLKKYCECFQA I C+ENC+C DCKN
Sbjct: 163 KSGCLKKYCECFQAGIPCNENCKCYDCKN 191
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/42 (57%), Positives = 29/42 (69%), Gaps = 1/42 (2%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
K CHCK S CLK YCECF ILC+ NC+C C+N E + E+
Sbjct: 103 KKCHCKNSKCLKMYCECFANKILCN-NCQCFGCQNNEANIEK 143
>gi|428163094|gb|EKX32185.1| hypothetical protein GUITHDRAFT_148793 [Guillardia theta CCMP2712]
Length = 496
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 61/108 (56%), Gaps = 5/108 (4%)
Query: 104 KYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
+YC CF A + C+GC+CV+C+N+ +HE R A+ +P AF ++ + A +
Sbjct: 299 QYCVCFRAALLCEGCDCVDCYNDGQHEQERLAAIEHIKTSDPLAFADRVRAEGDAASVSV 358
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
E +H +GC CK S CLKKYCECF+ + CS C C DC N
Sbjct: 359 ESK-----PKQHVRGCKCKNSKCLKKYCECFEFGVSCSSKCDCKDCMN 401
>gi|340505892|gb|EGR32168.1| hypothetical protein IMG5_093910 [Ichthyophthirius multifiliis]
Length = 260
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 88/182 (48%), Gaps = 44/182 (24%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
+Q+QCNC+ S+CLKL YC+CFA G YC+ C+
Sbjct: 60 QQRQCNCKKSKCLKL-----------------------------YCDCFAIGEYCNPICH 90
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C C NN ++ R+ A+ ERNPNAF +I +++QL H +GC
Sbjct: 91 CFECKNNESNKEKREMALKQNKERNPNAFTSRII-------------QQSQLNN-HQRGC 136
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAA 239
+CKKSGC KKYC+C+ + CS CRC+ C+N E ++ + + Q+ N+
Sbjct: 137 NCKKSGCQKKYCQCYLDGMECSRYCRCIGCENCNIFENKKICLNNKNIEKQIQQQKGNSD 196
Query: 240 II 241
I
Sbjct: 197 YI 198
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 32/51 (62%)
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
L A + C+CKKS CLK YC+CF C+ C C +CKN E ++E+R +
Sbjct: 56 LKALQQRQCNCKKSKCLKLYCDCFAIGEYCNPICHCFECKNNESNKEKREM 106
>gi|149061864|gb|EDM12287.1| rCG47760, isoform CRA_b [Rattus norvegicus]
Length = 371
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/107 (42%), Positives = 65/107 (60%), Gaps = 15/107 (14%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G L K L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKIALAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
K+ GA R H+KGC+CK+SGCLK YCEC++A+
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAS 355
>gi|145489787|ref|XP_001430895.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397996|emb|CAK63497.1| unnamed protein product [Paramecium tetraurelia]
Length = 276
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 72/153 (47%), Gaps = 44/153 (28%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-C 118
++ K CNCR S+CLKL YC+CFA G C C
Sbjct: 144 QEHKPCNCRKSKCLKL-----------------------------YCDCFAVGKLCSSKC 174
Query: 119 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 178
NC C NN + + R + +ERNP AF K+ +E + H KG
Sbjct: 175 NCCGCFNNSSNLLERNSFIQKMIERNPQAFNQKV--------------QEVESKMTHAKG 220
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
C+C+KSGC KKYCEC+Q I CS+NC+C C N
Sbjct: 221 CNCRKSGCQKKYCECYQMGIECSDNCKCDGCLN 253
>gi|145545444|ref|XP_001458406.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426226|emb|CAK91009.1| unnamed protein product [Paramecium tetraurelia]
Length = 218
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 73/150 (48%), Gaps = 44/150 (29%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
K CNC+ S+CLKL YC+CFAAG C CNC
Sbjct: 83 KPCNCKKSKCLKL-----------------------------YCDCFAAGKLCSSKCNCC 113
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C NN + + R + + +ERNP AF K+ +E H+KGC+C
Sbjct: 114 GCFNNSSNLLERNQFIEKMVERNPEAFNQKV--------------KEVDYKLAHSKGCNC 159
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
+KSGC KKYCEC+Q I CS+NC+C C+N
Sbjct: 160 RKSGCKKKYCECYQLGIECSDNCKCDGCQN 189
>gi|145535353|ref|XP_001453415.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421126|emb|CAK86018.1| unnamed protein product [Paramecium tetraurelia]
Length = 348
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 71/149 (47%), Gaps = 30/149 (20%)
Query: 64 QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
QCNC+ S+CLKL YCECF C CNC
Sbjct: 167 QCNCKKSKCLKL-----------------------------YCECFTNNWVCSQSCNCTE 197
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C N +++ R +A+ L RNP AF P + ++ Q +E + + + KGC+CK
Sbjct: 198 CKNRIDNPNERSKAIEEALLRNPEAFAPILTNNGQQPQVIQEQKSQKDIQKETKKGCNCK 257
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKN 211
KS C KKYCEC+ N C+E C+C +C N
Sbjct: 258 KSECKKKYCECYSINQKCTELCKCENCLN 286
>gi|145478369|ref|XP_001425207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392276|emb|CAK57809.1| unnamed protein product [Paramecium tetraurelia]
Length = 373
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 89/175 (50%), Gaps = 39/175 (22%)
Query: 64 QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA-GIYCDGCNCVN 122
QCNC+ S+CLKL YCECFA G+ CNC +
Sbjct: 187 QCNCKKSKCLKL-----------------------------YCECFANNGVCSQSCNCQD 217
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN-----K 177
C N +++ R +A+ L RN +AF + S+ GAQ ++D + + +K N K
Sbjct: 218 CKNRIDNPQERSKAIEEALLRNTDAFV-QCFSTKGGAQFVQQD-KPLKEPSKDNSSVIRK 275
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE--ERRALFHGDHNGIA 230
GC+CKKSGC KKYCEC+ NI C+E C+C C N +E ++ L DHN ++
Sbjct: 276 GCNCKKSGCKKKYCECYSQNIRCNELCKCEHCLNKTDAEIQAQQDLGQEDHNNLS 330
>gi|118356010|ref|XP_001011264.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89293031|gb|EAR91019.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 617
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 59/108 (54%), Gaps = 14/108 (12%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCECF G YC D C C C NN ++ R++A+ RN +AF K + +
Sbjct: 357 YCECFLRGNYCNDQCICTECGNNDKNLEEREKAIDEAKTRNQDAFNLKFQITSNST---- 412
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
H KGCHCK++ CLKKYCECF A + C+ NC+C +C+N
Sbjct: 413 ---------VTHKKGCHCKRTHCLKKYCECFNAGLKCTNNCKCEECRN 451
>gi|300175264|emb|CBK20575.2| unnamed protein product [Blastocystis hominis]
Length = 396
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 71/150 (47%), Gaps = 46/150 (30%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCVNC 123
CNC+ SRCLKL YCECF + YC CNC+NC
Sbjct: 151 CNCKKSRCLKL-----------------------------YCECFKSQEYCSSECNCLNC 181
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
+NN +HE R+EAV RNP+ F + D ++ ++ GC C++
Sbjct: 182 YNNRKHESEREEAVQRMKARNPHCFENHV--------DEKKGVNKS--------GCRCRR 225
Query: 184 SGCLKKYCECFQANILCSENCRCLDCKNFE 213
+ C KKYCECF + C + C+C DCKN E
Sbjct: 226 THCDKKYCECFSHGVPCGDQCQCKDCKNEE 255
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 12/85 (14%)
Query: 145 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG----------CHCKKSGCLKKYCECF 194
P +F ++++ ++DA E A + +AK + C+CKKS CLK YCECF
Sbjct: 109 PTSFSFLLSAA--SSRDALESAFPREKSAKSTRATQPSIPGLVTCNCKKSRCLKLYCECF 166
Query: 195 QANILCSENCRCLDCKNFEGSEERR 219
++ CS C CL+C N E R
Sbjct: 167 KSQEYCSSECNCLNCYNNRKHESER 191
>gi|302753274|ref|XP_002960061.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
gi|300171000|gb|EFJ37600.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
Length = 155
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 80/173 (46%), Gaps = 43/173 (24%)
Query: 64 QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
+C+C+ S+CLKL YCECFA+G YC D C C N
Sbjct: 1 RCSCKKSKCLKL-----------------------------YCECFASGSYCLDSCACGN 31
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----------- 171
C N EHE E RNP AF P++ S + + L
Sbjct: 32 CSNRQEHEDVVSETRQLIQSRNPLAFAPRVISPAEVVRVCQAFVSSNSLIFVSPAAAAIP 91
Query: 172 --AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
+KH +GC+CKKS CLKKYCECFQ+ + CS+ C+C CKN G+++ F
Sbjct: 92 AATSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKNTFGAKQGDDFF 144
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 34/65 (52%), Gaps = 5/65 (7%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE----ERRALFHGDHNGIAFMQR 234
C CKKS CLK YCECF + C ++C C +C N + E E R L N +AF R
Sbjct: 2 CSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQ-SRNPLAFAPR 60
Query: 235 AANAA 239
+ A
Sbjct: 61 VISPA 65
>gi|145526438|ref|XP_001449030.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416596|emb|CAK81633.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 75/156 (48%), Gaps = 30/156 (19%)
Query: 64 QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
QCNC+ S+CLKL YCECFA C CNC
Sbjct: 163 QCNCKKSKCLKL-----------------------------YCECFANNWVCSQSCNCTE 193
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C N +++ R +A+ L RNP AF + ++ Q E + + + + KGC+CK
Sbjct: 194 CKNRIDNPNERSKAIEEALLRNPEAFSTILTNNGQQPQIIPEPKSQKEQSKETKKGCNCK 253
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
KS C KKYCEC+ N C++ C+C +C N E +E+
Sbjct: 254 KSECKKKYCECYSINQKCTDLCKCENCLNKEEPQEQ 289
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 50/95 (52%), Gaps = 15/95 (15%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANA 238
C+CKKS CLK YCECF N +CS++C C +CKN + R+ +A
Sbjct: 164 CNCKKSKCLKLYCECFANNWVCSQSCNCTECKNRIDNPNERS-------------KAIEE 210
Query: 239 AIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQS 273
A++ + T++T+N ++ + + + K+QS
Sbjct: 211 ALLR--NPEAFSTILTNNGQQPQIIPEPKSQKEQS 243
>gi|224054140|ref|XP_002298111.1| predicted protein [Populus trichocarpa]
gi|222845369|gb|EEE82916.1| predicted protein [Populus trichocarpa]
Length = 781
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 62/157 (39%), Positives = 83/157 (52%), Gaps = 32/157 (20%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
++CNC+ SRCLKL YCECFAAGIYC D C+CV
Sbjct: 441 RRCNCKKSRCLKL-----------------------------YCECFAAGIYCLDTCSCV 471
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIAS-SPHGAQDAREDAREAQLAA-KHNKGC 179
NC N E+E + T RNP AF PK+ + + + + E+ + + ++ +H KGC
Sbjct: 472 NCINKPEYEDTVLDMRQQTEARNPLAFAPKVVNNATNSPANMMEEGKWMKTSSSRHKKGC 531
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
+CKKS C KKYCECFQ + C CRC C N G++
Sbjct: 532 NCKKSKCSKKYCECFQGGVGCCNGCRCEGCYNPYGNK 568
>gi|145478035|ref|XP_001425040.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392108|emb|CAK57642.1| unnamed protein product [Paramecium tetraurelia]
Length = 386
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 73/159 (45%), Gaps = 35/159 (22%)
Query: 64 QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
QCNC+ S+CLKL YCECFA C CNC +
Sbjct: 191 QCNCKKSKCLKL-----------------------------YCECFANNWVCSQSCNCQD 221
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN-----K 177
C N +++ R +A+ L RNP+AF ++ + + K N K
Sbjct: 222 CKNRIDNPQERSKAIEEALLRNPDAFAQCFQQKSQTQFSVQQQDKPLKEPTKDNSNITRK 281
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
GC+CKKSGC KKYCEC+ N+ CS+ C+C C N +E
Sbjct: 282 GCNCKKSGCKKKYCECYSQNLKCSDLCKCEQCLNRTDAE 320
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 30/41 (73%), Gaps = 1/41 (2%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 218
C+CKKS CLK YCECF N +CS++C C DCKN + +ER
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQER 232
>gi|301631659|ref|XP_002944915.1| PREDICTED: protein lin-54 homolog, partial [Xenopus (Silurana)
tropicalis]
Length = 183
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 84/154 (54%), Gaps = 40/154 (25%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
K +K CNC S+CLKL YC+CFA G C+ CN
Sbjct: 9 KTKKPCNCTRSQCLKL-----------------------------YCDCFANGDLCNNCN 39
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN EHE+ R +AV A L RNP+AF+PKI E + + +HNKGC
Sbjct: 40 CSNCYNNAEHELERFKAVKACLYRNPDAFQPKI-----------EKGKIGDVKPRHNKGC 88
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+CK+SGC K YCEC++A I CS C+C+ CKN +
Sbjct: 89 NCKRSGCFKNYCECYEAKIACSSVCKCVGCKNVK 122
>gi|145552635|ref|XP_001461993.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124429830|emb|CAK94620.1| unnamed protein product [Paramecium tetraurelia]
Length = 384
Score = 92.0 bits (227), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 73/159 (45%), Gaps = 35/159 (22%)
Query: 64 QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
QCNC+ S+CLKL YCECFA C CNC +
Sbjct: 191 QCNCKKSKCLKL-----------------------------YCECFANNWVCSQSCNCQD 221
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN-----K 177
C N +++ R +A+ L RNP+AF ++ + + K N K
Sbjct: 222 CKNRIDNPQERSKAIEEALLRNPDAFAQCFQQKGQTQFSVQQQDKPLKEPTKDNSNITRK 281
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
GC+CKKSGC KKYCEC+ N+ CS+ C+C C N +E
Sbjct: 282 GCNCKKSGCKKKYCECYSQNLKCSDLCKCEQCLNRTDAE 320
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 30/41 (73%), Gaps = 1/41 (2%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 218
C+CKKS CLK YCECF N +CS++C C DCKN + +ER
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQER 232
>gi|302804648|ref|XP_002984076.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
gi|300148428|gb|EFJ15088.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
Length = 141
Score = 92.0 bits (227), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 60/168 (35%), Positives = 79/168 (47%), Gaps = 43/168 (25%)
Query: 64 QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
+C+C+ S+CLKL YCECFA+G YC D C C N
Sbjct: 1 RCSCKKSKCLKL-----------------------------YCECFASGSYCLDSCACGN 31
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----------- 171
C N EHE E RNP AF P++ S + + L
Sbjct: 32 CSNRQEHEDVVSETRQLIQSRNPLAFAPRVISPAEVVRVCQAFVSSNSLIFVSPAAAAIP 91
Query: 172 --AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
+KH +GC+CKKS CLKKYCECFQ+ + CS+ C+C CKN G+++
Sbjct: 92 AATSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKNTFGAKQ 139
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 34/65 (52%), Gaps = 5/65 (7%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE----ERRALFHGDHNGIAFMQR 234
C CKKS CLK YCECF + C ++C C +C N + E E R L N +AF R
Sbjct: 2 CSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQ-SRNPLAFAPR 60
Query: 235 AANAA 239
+ A
Sbjct: 61 VISPA 65
>gi|403344595|gb|EJY71645.1| Tesmin/TSO1-like CXC domain-containing protein [Oxytricha
trifallax]
Length = 600
Score = 92.0 bits (227), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 67/113 (59%), Gaps = 10/113 (8%)
Query: 102 LVKYCECFAAGIYCD-GCNCVNCHNNVEH-EVARQEAVGATLERNPNAFRPKIAS-SPHG 158
L YC+CFA G C C C+NC N ++ E EA + RNP AF+PK+ P
Sbjct: 456 LKLYCDCFALGGVCGPDCGCLNCENKEQNQESVVLEARKKIIGRNPEAFQPKVVEVGP-- 513
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
AQ ++ DA + +H KGC+CK SGC K YCECFQ + C++NCRC+ C N
Sbjct: 514 AQMSQNDADK-----QHRKGCNCKNSGCQKNYCECFQFGLECNKNCRCIGCSN 561
>gi|348678286|gb|EGZ18103.1| hypothetical protein PHYSODRAFT_504652 [Phytophthora sojae]
Length = 713
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 69/134 (51%), Gaps = 13/134 (9%)
Query: 102 LVKYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YC CF++ +C GC C +C N +H+ R EA+ L +P AF AS P A
Sbjct: 543 LKMYCMCFSSRGFCHAGCACDDCKNGRKHQTERVEAIQNYLANDPRAF--SFASLPQSAS 600
Query: 161 DA-REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS---- 215
+ +A +GC CKKS CLKKYCECFQ I C+ +CRC+DC N S
Sbjct: 601 TTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECFQNGIACTSHCRCMDCSNHSESTAAH 660
Query: 216 -----EERRALFHG 224
++R A HG
Sbjct: 661 QHPHLQKRSATAHG 674
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 52/109 (47%), Gaps = 16/109 (14%)
Query: 157 HGAQDAREDAREAQLAAKHNK-----------GCHCKKSGCLKKYCECFQANILCSENCR 205
H + A+ED R+ +L +H + C CK +GCLK YC CF + C C
Sbjct: 504 HEPELAKEDKRQ-RLLLRHAQQIQASSVSSSTSCGCK-TGCLKMYCMCFSSRGFCHAGCA 561
Query: 206 CLDCKNFEGSEERRALFHGDHNGIAFMQRAAN-AAIIGAVGSSGYGTLM 253
C DCKN G + + N +A RA + A++ + ++G+ L+
Sbjct: 562 CDDCKN--GRKHQTERVEAIQNYLANDPRAFSFASLPQSASTTGFLHLL 608
>gi|149061865|gb|EDM12288.1| rCG47760, isoform CRA_c [Rattus norvegicus]
gi|149061866|gb|EDM12289.1| rCG47760, isoform CRA_c [Rattus norvegicus]
Length = 191
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%), Gaps = 15/99 (15%)
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
K L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+PK+ G
Sbjct: 92 KIALAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLG 147
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
A R H+KGC+CK+SGCLK YCEC++A+
Sbjct: 148 AAKLR-----------HSKGCNCKRSGCLKNYCECYEAS 175
>gi|168049832|ref|XP_001777365.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671214|gb|EDQ57769.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 120
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/118 (45%), Positives = 67/118 (56%), Gaps = 8/118 (6%)
Query: 105 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA----SSPH-- 157
YCECFAAG+YC G C C +C N E+ RNP AF PKI SSP
Sbjct: 1 YCECFAAGLYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIVQGAESSPVPG 60
Query: 158 -GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+ ++ + +A+H +GC+CKKS CLKKYCEC+QA + CSE CRC C N G
Sbjct: 61 VSSNQKLDEVLDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMNKYG 118
>gi|145498644|ref|XP_001435309.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402440|emb|CAK67912.1| unnamed protein product [Paramecium tetraurelia]
Length = 242
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 70/150 (46%), Gaps = 44/150 (29%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
K CNC+ S+CLKL YC+CFAA C CNC
Sbjct: 107 KPCNCKKSKCLKL-----------------------------YCDCFAANKLCSSKCNCC 137
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C NN + + R +ERNP AF K+ +E H+KGC+C
Sbjct: 138 GCFNNSSNLIERNLYREKMVERNPEAFNQKV--------------KEVDQKMAHSKGCNC 183
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
+KSGC KKYCEC+Q I CS+NC+C CKN
Sbjct: 184 RKSGCKKKYCECYQMGIECSDNCKCDGCKN 213
Score = 47.0 bits (110), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 28/46 (60%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
K C+CKKS CLK YC+CF AN LCS C C C N + R L+
Sbjct: 107 KPCNCKKSKCLKLYCDCFAANKLCSSKCNCCGCFNNSSNLIERNLY 152
>gi|168057921|ref|XP_001780960.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667594|gb|EDQ54220.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 159
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/132 (40%), Positives = 70/132 (53%), Gaps = 22/132 (16%)
Query: 105 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA- 162
YCECFA G+YC G C C +C N E+ RNP AF PKI GA+ +
Sbjct: 1 YCECFALGVYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIV---QGAEPSP 57
Query: 163 -----------------REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
+++A + +A+H +GC+CKKS CLKKYCEC+QA + CSE CR
Sbjct: 58 VPGVTLPAAEACVQLVVQDEALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCR 117
Query: 206 CLDCKNFEGSEE 217
C C N G +E
Sbjct: 118 CEGCMNKYGKKE 129
>gi|403340785|gb|EJY69688.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
trifallax]
Length = 710
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 75/160 (46%), Gaps = 42/160 (26%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCNCV 121
K+C C S+CLKL YCECFAAG+ C C C
Sbjct: 525 KKCTCSKSKCLKL-----------------------------YCECFAAGLVCGLDCGCK 555
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
+C N + A Q+A ++R+P AF K++ Q++ D + H KGC C
Sbjct: 556 DCCNTEDSSDAIQKAKDEIMKRDPLAFEIKVS------QNSGTDKLQ------HRKGCTC 603
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
KKSGC K YCECFQ + C++ C+C C N E ++ L
Sbjct: 604 KKSGCQKGYCECFQLGVPCTDFCKCSGCANCEKKSDQPCL 643
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 34/67 (50%)
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
+I + H + ++ Q K C C KS CLK YCECF A ++C +C C DC
Sbjct: 499 QITHNHHSHHNEQKLGNSPQQPGVSTKKCTCSKSKCLKLYCECFAAGLVCGLDCGCKDCC 558
Query: 211 NFEGSEE 217
N E S +
Sbjct: 559 NTEDSSD 565
>gi|302824819|ref|XP_002994049.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
gi|300138103|gb|EFJ04883.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
Length = 144
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 65/118 (55%), Gaps = 8/118 (6%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH------ 157
YCECFAAG+YC D C C +C N E RNP AF PKI +
Sbjct: 1 YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKIVQTSDNTPTEG 60
Query: 158 -GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+ + + + +A+H +GC+CKKS CLKKYCEC+QA + CS+ C+C CKN G
Sbjct: 61 VASFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKNIYG 118
>gi|301097298|ref|XP_002897744.1| exosome complex exonuclease RRP41-like protein [Phytophthora
infestans T30-4]
gi|262106765|gb|EEY64817.1| exosome complex exonuclease RRP41-like protein [Phytophthora
infestans T30-4]
Length = 777
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/115 (42%), Positives = 62/115 (53%), Gaps = 10/115 (8%)
Query: 102 LVKYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YC CF++ +C GC C +C N + R EA+ + L +P AF AS P Q
Sbjct: 547 LKMYCMCFSSRGFCHAGCACDDCKNGRNSQTERVEAIQSYLANDPRAF--SFASLP---Q 601
Query: 161 DAREDAREAQLAAKHN----KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
D L K + +GC CKKS CLKKYCECFQ I C+ +CRC+DC N
Sbjct: 602 DTNTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECFQNGIACTSHCRCMDCSN 656
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
C CK +GCLK YC CF + C C C DCKN S+ R
Sbjct: 540 CGCK-TGCLKMYCMCFSSRGFCHAGCACDDCKNGRNSQTER 579
>gi|281211685|gb|EFA85847.1| hypothetical protein PPL_01079 [Polysphondylium pallidum PN500]
Length = 735
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/123 (38%), Positives = 61/123 (49%), Gaps = 38/123 (30%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
K C+C+NS+CLK+ YCECFA+ C+GC C
Sbjct: 326 KMCHCKNSKCLKM-----------------------------YCECFASKQLCNGCQCFG 356
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
CHNN +H + + A +TLERNP+AF PK S +G E A KH KGCHC+
Sbjct: 357 CHNNEDHIESVENARASTLERNPDAFGPKFKQSFNG---------EKPKAEKHLKGCHCR 407
Query: 183 KSG 185
+SG
Sbjct: 408 RSG 410
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
K CHCK S CLK YCECF + LC+ C+C C N E E
Sbjct: 326 KMCHCKNSKCLKMYCECFASKQLCN-GCQCFGCHNNEDHIE 365
>gi|145511942|ref|XP_001441893.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409154|emb|CAK74496.1| unnamed protein product [Paramecium tetraurelia]
Length = 374
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 73/150 (48%), Gaps = 32/150 (21%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
CNC+ S+CLKL YCECFA C CNC C
Sbjct: 169 CNCKKSKCLKL-----------------------------YCECFANNWVCSQNCNCCEC 199
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA--REAQLAAKHNKGCHC 181
N +++ R +A+ L RNP+AF + ++ Q +E+ +E+Q KGC+C
Sbjct: 200 KNRIDNPNERSKAIEEALLRNPDAFAAILTNNGQQPQIIQEEKSQKESQKDITTRKGCNC 259
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
KKS C KKYCEC+ N C++ C+C +C N
Sbjct: 260 KKSECKKKYCECYSINQRCTDLCKCENCLN 289
>gi|145534959|ref|XP_001453218.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420929|emb|CAK85821.1| unnamed protein product [Paramecium tetraurelia]
Length = 265
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 92/215 (42%), Gaps = 50/215 (23%)
Query: 24 VPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPK------KQK----QCNCRNSRCL 73
+PH H +P PP + NT K+ K QK CNC+ S+CL
Sbjct: 1 MPHYPHPIPYWYRPPVYFPQYDGQQYQNTLSKELQSKVSQLVQYQKVSHFACNCKKSKCL 60
Query: 74 KLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVA 132
KL YCECFA C CNC C N +++
Sbjct: 61 KL-----------------------------YCECFANNWVCSQNCNCCECKNRIDNPNE 91
Query: 133 RQEAVGATLERNPNAFRPKIASS---PHGAQDA-------REDAREAQLAAKHNKGCHCK 182
R +A L RNP+AF + ++ P QD + +E+Q+ KGC+CK
Sbjct: 92 RSKATEEALLRNPDAFAAILTNNGQQPQIIQDKTLILPLEEKSQKESQIDITTRKGCNCK 151
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
KS C KKYCEC+ N C++ C+C +C N S+E
Sbjct: 152 KSECKKKYCECYSINQKCTDLCKCENCLNKAESQE 186
>gi|357133064|ref|XP_003568148.1| PREDICTED: uncharacterized protein LOC100845078 [Brachypodium
distachyon]
Length = 377
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 78/194 (40%), Gaps = 42/194 (21%)
Query: 25 PHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINI 84
P ++L L T P R R DG K C C+ S+CLKL
Sbjct: 62 PSAANQLALATTPK------RQRVEEAADGNGC-----KHCACKKSKCLKL--------- 101
Query: 85 YSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLER 143
YC CFA G YC + C CV C N ++ Q L R
Sbjct: 102 --------------------YCPCFAGGGYCSEKCGCVPCFNKSDYAETVQTTRKVLLSR 141
Query: 144 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 203
KI +A EDA + + +GC+CKKS CLKKYC+C+Q CS
Sbjct: 142 QKR-MSLKINRRSEANAEAMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLF 200
Query: 204 CRCLDCKNFEGSEE 217
CRC DCKN G E
Sbjct: 201 CRCEDCKNPYGKNE 214
>gi|303283039|ref|XP_003060811.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
gi|226458282|gb|EEH55580.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
Length = 625
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 47/107 (43%), Positives = 55/107 (51%), Gaps = 14/107 (13%)
Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
YCECFAAG +C C+C NC N ++ Q RNP AF KI ++
Sbjct: 281 YCECFAAGAFCQDCSCQNCQNTPDNAALVQMTRTQIELRNPQAFANKIVAT--------- 331
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
D E H K KS CLKKYCECFQA +LC E C+C CKN
Sbjct: 332 DGEEKHKKGCHCK-----KSACLKKYCECFQAGVLCQEYCKCEGCKN 373
>gi|222632736|gb|EEE64868.1| hypothetical protein OsJ_19725 [Oryza sativa Japonica Group]
Length = 223
Score = 86.3 bits (212), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 65/125 (52%), Gaps = 15/125 (12%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI--------ASS 155
YC CF + +C D CNC C+N +E A +E +NPNAF P+I A+
Sbjct: 21 YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80
Query: 156 PHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
P + A D + A KH KGC C+KS C K YCECF+ ++ C+ C+CL+C
Sbjct: 81 PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNSVGCTAKCKCLEC 140
Query: 210 KNFEG 214
N G
Sbjct: 141 SNSFG 145
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
C C +S C + YC CF++ CS+NC C C N + E+
Sbjct: 10 CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48
>gi|326491307|dbj|BAK05753.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326502882|dbj|BAJ99069.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 375
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 64/190 (33%), Positives = 78/190 (41%), Gaps = 50/190 (26%)
Query: 58 TPKKQK---------QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCEC 108
TPK+QK C C+ SRCLKL YC C
Sbjct: 66 TPKRQKVEDAADGCKHCACKKSRCLKL-----------------------------YCPC 96
Query: 109 FAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR 167
FA G YC + C CV C N + Q L R KI + EDA
Sbjct: 97 FAGGGYCSEKCGCVPCFNKADFAETVQTTRKVLLSRQ-KRMSLKINRRLEANAETMEDAH 155
Query: 168 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG-------SEERRA 220
+ + +GC+CKKS CLKKYC+C+Q CS CRC DCKN G E +R
Sbjct: 156 NSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCEDCKNPFGKNEGIMADESKRF 215
Query: 221 LFHG---DHN 227
L+ G DH+
Sbjct: 216 LYTGADLDHS 225
>gi|340507878|gb|EGR33739.1| tesmin tso1-like cxc domain protein [Ichthyophthirius multifiliis]
Length = 142
Score = 85.9 bits (211), Expect = 5e-14, Method: Composition-based stats.
Identities = 51/158 (32%), Positives = 71/158 (44%), Gaps = 39/158 (24%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GC 118
+K + C C+ SRCL+L YCECF GI+C+ C
Sbjct: 10 RKVEFCKCKKSRCLQL-----------------------------YCECFVNGIFCNKSC 40
Query: 119 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 178
C NC N ++ + A RNP+AF K + H KG
Sbjct: 41 ICTNCGNTENNKKQIESAKQEAKMRNPDAFSQKFLVVKQNQYEG---------IVSHKKG 91
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
C+C K+ C KKYCECF A I C+ENC+C +C+N++ +
Sbjct: 92 CNCTKTQCTKKYCECFNAGIKCTENCKCENCENYKDEQ 129
Score = 47.0 bits (110), Expect = 0.028, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 28/40 (70%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
C CKKS CL+ YCECF I C+++C C +C N E ++++
Sbjct: 15 CKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQ 54
>gi|48475095|gb|AAT44164.1| hypothetical protein, contains Tesmin/TSO1-like CXC domain, PF03638
[Oryza sativa Japonica Group]
Length = 194
Score = 85.9 bits (211), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 65/125 (52%), Gaps = 15/125 (12%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI--------ASS 155
YC CF + +C D CNC C+N +E A +E +NPNAF P+I A+
Sbjct: 21 YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80
Query: 156 PHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
P + A D + A KH KGC C+KS C K YCECF+ ++ C+ C+CL+C
Sbjct: 81 PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNSVGCTAKCKCLEC 140
Query: 210 KNFEG 214
N G
Sbjct: 141 SNSFG 145
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
C C +S C + YC CF++ CS+NC C C N + E+
Sbjct: 10 CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48
>gi|348532345|ref|XP_003453667.1| PREDICTED: hypothetical protein LOC100710529 [Oreochromis
niloticus]
Length = 310
Score = 85.5 bits (210), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 52/77 (67%), Gaps = 11/77 (14%)
Query: 141 LERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILC 200
L NP+AFRPKIA GA + HNKGC+CK+SGCLK+YCEC++ANI+C
Sbjct: 172 LGHNPDAFRPKIAGVKSGA-----------VRGWHNKGCNCKRSGCLKRYCECYEANIMC 220
Query: 201 SENCRCLDCKNFEGSEE 217
+ +C+C+ C+N++ +
Sbjct: 221 TSSCKCVGCRNYDAGSQ 237
>gi|145525306|ref|XP_001448475.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416019|emb|CAK81078.1| unnamed protein product [Paramecium tetraurelia]
Length = 188
Score = 85.5 bits (210), Expect = 7e-14, Method: Composition-based stats.
Identities = 51/150 (34%), Positives = 65/150 (43%), Gaps = 54/150 (36%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K CNC+ S+CLK +YC+CFA G C + CNCV
Sbjct: 89 KSCNCKKSQCLK-----------------------------QYCDCFANGQMCSENCNCV 119
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C NN + R+EA + R+P AF+ KGC+C
Sbjct: 120 GCFNNTLNMEQRKEAKVQIINRDPGAFKQSF------------------------KGCNC 155
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
KKSGC KKYCECF + + C+ CRC C N
Sbjct: 156 KKSGCQKKYCECFLSGLACTHLCRCDGCLN 185
Score = 55.1 bits (131), Expect = 9e-05, Method: Composition-based stats.
Identities = 23/45 (51%), Positives = 31/45 (68%)
Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+ K C+CKKS CLK+YC+CF +CSENC C+ C N + E+R
Sbjct: 87 NTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQR 131
>gi|297743393|emb|CBI36260.3| unnamed protein product [Vitis vinifera]
Length = 326
Score = 85.5 bits (210), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 68/145 (46%), Gaps = 33/145 (22%)
Query: 58 TPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-D 116
T +K CNC+ ++CLKL YC+C AAG+YC D
Sbjct: 10 TESDRKHCNCKRTQCLKL-----------------------------YCDCLAAGVYCTD 40
Query: 117 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD--AREDAREAQ-LAA 173
C C NC N E+E Q R+P AF P+I + D +ED +A
Sbjct: 41 SCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRIVNPDTDTTDNVLQEDGNWTTPSSA 100
Query: 174 KHNKGCHCKKSGCLKKYCECFQANI 198
+H +GC+CKKS C KKYCEC+Q I
Sbjct: 101 RHKRGCNCKKSMCQKKYCECYQVYI 125
Score = 45.8 bits (107), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 25/35 (71%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
K C+CK++ CLK YC+C A + C+++C C +C N
Sbjct: 15 KHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLN 49
>gi|403347003|gb|EJY72913.1| Transcription factor, putative [Oxytricha trifallax]
Length = 286
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 69/131 (52%), Gaps = 12/131 (9%)
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
+ +YC+CF YC CN C NN E+E RQ+A+ L +NP AF K + + +
Sbjct: 151 IKRYCQCFKLNKYCTNCNHPECKNNQENEEERQKAIDYILSKNPQAFMTKNKTRKNFIKK 210
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC--LDCKNFEGS---E 216
+++D+ C+C KSGC KYC C + ILCS+NC+C CKN +G +
Sbjct: 211 SKKDSNLIHT-------CNCSKSGCNNKYCVCLKEGILCSKNCKCNHRKCKNKQGDRNQK 263
Query: 217 ERRALFHGDHN 227
+R + D N
Sbjct: 264 QRSVIISNDEN 274
Score = 39.3 bits (90), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 36/67 (53%), Gaps = 6/67 (8%)
Query: 153 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
++ H D +D +L K+ C + C+K+YC+CF+ N C+ NC +CKN
Sbjct: 122 STQNHHDHDLDKDMAIIELCCKN-----CSNTYCIKRYCQCFKLNKYCT-NCNHPECKNN 175
Query: 213 EGSEERR 219
+ +EE R
Sbjct: 176 QENEEER 182
>gi|218197350|gb|EEC79777.1| hypothetical protein OsI_21187 [Oryza sativa Indica Group]
Length = 214
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 64/125 (51%), Gaps = 15/125 (12%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI--------ASS 155
YC CF + +C D CNC C+N +E A +E +NPNAF P+I A+
Sbjct: 21 YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80
Query: 156 PHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
P + A D + A KH KGC C+KS C K YCECF+ ++ C+ C+C +C
Sbjct: 81 PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNSVGCTAKCKCQEC 140
Query: 210 KNFEG 214
N G
Sbjct: 141 SNSFG 145
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
C C +S C + YC CF++ CS+NC C C N + E+
Sbjct: 10 CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48
>gi|226502887|ref|NP_001148761.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|195621962|gb|ACG32811.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|413949786|gb|AFW82435.1| CXC domain containing TSO1-like protein 1 [Zea mays]
Length = 356
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 72/176 (40%), Gaps = 41/176 (23%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K C C+ SRCLKL YC CFA G YC D C C
Sbjct: 62 KHCACKKSRCLKL-----------------------------YCPCFAGGGYCSDKCGCQ 92
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C N Q L R KI P + EDA + + +GC+C
Sbjct: 93 PCFNKEAFSETVQTTRKVLLSRQ-KRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNC 151
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG-------SEERRALFHG---DHN 227
KKS CLKKYC+C+Q CS CRC DC+N G + +R L+ G DH+
Sbjct: 152 KKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNDGIIADDSKRYLYTGADLDHS 207
>gi|308808189|ref|XP_003081405.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
gi|116059867|emb|CAL55574.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
Length = 466
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 50/91 (54%), Gaps = 14/91 (15%)
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C N VE+ ++ RNPNAF KI +D +DAR H KGCHCK
Sbjct: 171 CQNTVENVSVVEKTRSQIEARNPNAFLSKI-------EDDGDDAR-------HTKGCHCK 216
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFE 213
KS CLKKYCECFQA + C E C+C C N E
Sbjct: 217 KSACLKKYCECFQAGVKCQEYCKCEGCMNKE 247
>gi|242091037|ref|XP_002441351.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
gi|241946636|gb|EES19781.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
Length = 359
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 69/170 (40%), Gaps = 38/170 (22%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K C C+ SRCLKL YC CF+ G YC D C C
Sbjct: 67 KHCACKKSRCLKL-----------------------------YCPCFSGGGYCSDKCGCQ 97
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C N Q L R KI P + EDA + + +GC+C
Sbjct: 98 PCFNKEAFSETVQTTRKVLLSRQ-KRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNC 156
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE-------RRALFHG 224
KKS CLKKYC+C+Q CS CRC DC+N G E +R L+ G
Sbjct: 157 KKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNEGIIADDSKRYLYTG 206
>gi|224007227|ref|XP_002292573.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971435|gb|EED89769.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 886
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 60/118 (50%), Gaps = 17/118 (14%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVA---RQEAVGATLERNPNAFRPKIASSPH 157
L+ YC+CF I+C + C C++C N VE+ A R AV LER P AF
Sbjct: 19 LLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLERRPEAF--------- 69
Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
RE A+ K + GC C K+ CLKKYC CF A + C C C +CKN G+
Sbjct: 70 ----GRESGAFAERRNKKDGGCTCAKNSCLKKYCVCFGAKMQCRARCNCSNCKNPFGT 123
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 61/114 (53%), Gaps = 13/114 (11%)
Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNP-NAFRPKIASSPHGA 159
L KYC CF A + C CNC NC N + ++ V T+E NA + GA
Sbjct: 95 LKKYCVCFGAKMQCRARCNCSNCKNPF-GTIRIEDGVTCTIEGGKVNALK--------GA 145
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
A R+ AK N C C ++ CLK YC+CF+ +++C++NC CL+CKN E
Sbjct: 146 --AFVPIRKLANLAKGNTKCSCTRAACLKLYCDCFRQSMVCNDNCSCLECKNSE 197
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 38/77 (49%), Gaps = 15/77 (19%)
Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN--------------FEGSEE 217
AAK C C K+ CL YC+CF NI C E+C C+DCKN EG E
Sbjct: 4 AAKKAPSCKCTKTKCLLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLE 63
Query: 218 RRALFHGDHNGIAFMQR 234
RR G +G AF +R
Sbjct: 64 RRPEAFGRESG-AFAER 79
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 48/113 (42%), Gaps = 24/113 (21%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEV---ARQEAVGATLERNPNAFRPKIASSPHGAQ 160
YC+CF + C D C+C+ C N+ E+ R + LE+ P+ F+ K A G
Sbjct: 174 YCDCFRQSMVCNDNCSCLECKNSEEYSGPMGERTLVIKDILEKRPHIFQ-KTAKRKAGG- 231
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRCLDCKNF 212
GC C+K+ C YC+C + C C C++C N
Sbjct: 232 -----------------GCSCQKNKCRAGYCDCKGSGGKCDPAVCTCINCDNM 267
>gi|224092440|ref|XP_002309611.1| predicted protein [Populus trichocarpa]
gi|222855587|gb|EEE93134.1| predicted protein [Populus trichocarpa]
Length = 847
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 55/103 (53%), Gaps = 6/103 (5%)
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----AAKHNKG 178
C N E+E E RNP AF PKI H + D + L + +H G
Sbjct: 444 CFNRPEYEDTVLETRQQIESRNPLAFAPKIVQ--HVTEFQAIDVEDVDLFTPYSGRHKTG 501
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
C+CK+S C+KKYCEC+QAN+ CS CRC C+N G +E A+
Sbjct: 502 CNCKRSMCVKKYCECYQANVGCSNACRCEGCRNIHGRKEEYAM 544
>gi|115464805|ref|NP_001056002.1| Os05g0509400 [Oryza sativa Japonica Group]
gi|48475201|gb|AAT44270.1| unknown protein [Oryza sativa Japonica Group]
gi|113579553|dbj|BAF17916.1| Os05g0509400 [Oryza sativa Japonica Group]
gi|215766036|dbj|BAG98264.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218197077|gb|EEC79504.1| hypothetical protein OsI_20571 [Oryza sativa Indica Group]
Length = 374
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 78/192 (40%), Gaps = 52/192 (27%)
Query: 58 TPKKQK-----------QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYC 106
TPK+QK C C+ SRCLKL YC
Sbjct: 70 TPKRQKVEESADGNGCKHCACKKSRCLKL-----------------------------YC 100
Query: 107 ECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
CFA G YC + C C C N + Q L R KI +A ED
Sbjct: 101 PCFAGGGYCSEKCGCQPCFNKALYAETVQTTRKVLLSRQ-KRMSLKINRRSEANAEAVED 159
Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG-------SEER 218
A + + +GC+CKKS CLKKYC+C+Q CS CRC DC+N G E +
Sbjct: 160 AHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCEDCRNPFGKNEGIMAEESK 219
Query: 219 RALFHG---DHN 227
R L+ G DH+
Sbjct: 220 RFLYTGADLDHS 231
>gi|145523856|ref|XP_001447761.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415283|emb|CAK80364.1| unnamed protein product [Paramecium tetraurelia]
Length = 573
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 35/66 (53%), Positives = 48/66 (72%), Gaps = 8/66 (12%)
Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
AF+PKI++S D++ D KHNKGC CKKSGC+KKYCECFQA + CS+ C+C
Sbjct: 433 AFQPKISTS-----DSKPDPLNF---GKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKC 484
Query: 207 LDCKNF 212
++C+N+
Sbjct: 485 IECRNY 490
>gi|224035483|gb|ACN36817.1| unknown [Zea mays]
Length = 356
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 72/176 (40%), Gaps = 41/176 (23%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K C C+ SRCLKL YC CFA G YC D C C
Sbjct: 62 KHCACKKSRCLKL-----------------------------YCPCFAGGGYCSDKCGCQ 92
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C + Q L R KI P + EDA + + +GC+C
Sbjct: 93 PCFSKEAFSETVQTTRKVLLSRQ-KRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNC 151
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG-------SEERRALFHG---DHN 227
KKS CLKKYC+C+Q CS CRC DC+N G + +R L+ G DH+
Sbjct: 152 KKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNDGIIADDSKRYLYTGADLDHS 207
>gi|219363019|ref|NP_001136646.1| uncharacterized protein LOC100216775 [Zea mays]
gi|194696492|gb|ACF82330.1| unknown [Zea mays]
Length = 278
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 37/80 (46%), Positives = 50/80 (62%)
Query: 143 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 202
RNP AF PK+ + + R+ + + +A+H +GC+CKKS CLKKYCECFQ + CS
Sbjct: 20 RNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSI 79
Query: 203 NCRCLDCKNFEGSEERRALF 222
+CRC CKN G E A+
Sbjct: 80 SCRCEGCKNAFGKREGAAVL 99
>gi|226500562|ref|NP_001150787.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|194699308|gb|ACF83738.1| unknown [Zea mays]
gi|195641826|gb|ACG40381.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|224028923|gb|ACN33537.1| unknown [Zea mays]
gi|413945974|gb|AFW78623.1| CXC domain containing TSO1-like protein 1 [Zea mays]
Length = 359
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 68/170 (40%), Gaps = 38/170 (22%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K C C+ SRCLKL YC CF+ G YC D C C
Sbjct: 67 KHCACKKSRCLKL-----------------------------YCPCFSGGGYCSDKCGCQ 97
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C N L R KI P + EDA + + +GC+C
Sbjct: 98 PCFNKEAFAETVHTTRKVLLSRQ-KRMSMKINRRPEANTEPMEDAHHSSSSTPPKRGCNC 156
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE-------RRALFHG 224
KKS CLKKYC+C+Q CS CRC DC+N G E +R L+ G
Sbjct: 157 KKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNEGIMADDSKRYLYTG 206
>gi|226507366|ref|NP_001150621.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
gi|195640640|gb|ACG39788.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
gi|219884557|gb|ACL52653.1| unknown [Zea mays]
gi|414868874|tpg|DAA47431.1| TPA: Tesmin/TSO1-like CXC domain containing protein [Zea mays]
Length = 394
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 74/170 (43%), Gaps = 45/170 (26%)
Query: 43 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
S R R R DG DG K C+C+ SRCLKL
Sbjct: 72 SDRKRRRKAEDG-DGC----KTCSCKKSRCLKL--------------------------- 99
Query: 103 VKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YC CFA+G++C + C C CHN + + + A L K + QD
Sbjct: 100 --YCVCFASGLHCTESCGCEPCHNKPQLQGTLRMATVLPL---------KPVQTFEAGQD 148
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
E + + K C CKKSGCLKKYC+C+Q CS NC+C DC+N
Sbjct: 149 IVEQVIRSPMDLIRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 197
>gi|428176775|gb|EKX45658.1| hypothetical protein GUITHDRAFT_152657 [Guillardia theta CCMP2712]
Length = 434
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 132/337 (39%), Gaps = 77/337 (22%)
Query: 52 TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
TD +K+K C C ++CLK +YC CF
Sbjct: 4 TDNAQKKSEKEKGCRCGRTKCLK-----------------------------QYCACFRN 34
Query: 112 GIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 170
+ C + C CV+CHN+ +HE AR AV +P AF+ + ++
Sbjct: 35 DVRCTNDCVCVDCHNDGKHEQARMMAVRLVRLNDPMAFKGT------SLELENQEVHTPN 88
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF--HGDHNG 228
K +GC C++S C KKYCECF A + CS NC C C N G++ + + G +
Sbjct: 89 GTLKTVRGCRCRRSKCQKKYCECFGAGLKCSTNCVCEGCLN--GNDPSKPFYPSGGSASS 146
Query: 229 IAFMQRA--ANAAIIGAVGSSGY-----GTLMTSNKRKSEELLFG---------VAAKDQ 272
+A M A A ++ G + + +L TS + +L F A K Q
Sbjct: 147 VARMAPAIPAGFSVSNMAGKTHFDRTSKASLDTSRASRPPQLSFSRTSAPSAEQKAEKSQ 206
Query: 273 SV----------IRNPQSQQGN--------NVQNSAVPCSQSVPVSQTANASVLGSTKSA 314
S+ +RNP S + + S P +Q V + + S+
Sbjct: 207 SLASRRPPISVEVRNPTSMRVACAVTEIPLHDSRSRAPVAQEVKLEDKVEVNKENKPLSS 266
Query: 315 LRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVG 351
L +P AD++ +E SL +SS LT G
Sbjct: 267 LDNPAADLVL---ERESSSLNYYMSSTPQLNLTPTWG 300
>gi|219112185|ref|XP_002177844.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410729|gb|EEC50658.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 455
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 69/155 (44%), Gaps = 51/155 (32%)
Query: 59 PKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG- 117
P Q CNC SRCLKL YC CF AG C
Sbjct: 332 PAGQVACNCIRSRCLKL-----------------------------YCTCFQAGKMCQPG 362
Query: 118 -CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 176
C C+ C N+VE R++A+ TL++ P+AF+ K P G
Sbjct: 363 ICTCIACANSVEDHPERKQAIKHTLQKRPDAFQTK--DRPVGL----------------- 403
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
GC CK + C++KYCECF+ + C++ C CL+C+N
Sbjct: 404 -GCACKNNKCIRKYCECFRNGLSCADKCCCLNCEN 437
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 20/127 (15%)
Query: 105 YCECFAAGIYCD--GCNCVNCHNNVEH---EVARQEAVGATLERNPNAFRPKIASSPHGA 159
YC+CF AG C+ CNC C N + + AR A+ A L RNP AF G
Sbjct: 271 YCDCFKAGRRCNPINCNCTACKNTINESGPQGARTNAIRAILARNPRAFVTA------GV 324
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNF--EGSE 216
A + Q+A C+C +S CLK YC CFQA +C C C+ C N + E
Sbjct: 325 ASAVQKLPAGQVA------CNCIRSRCLKLYCTCFQAGKMCQPGICTCIACANSVEDHPE 378
Query: 217 ERRALFH 223
++A+ H
Sbjct: 379 RKQAIKH 385
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 178 GCHCKKSGCLKKYCECFQANILCSE-NCRCLDCKN 211
GC C KS C+ YC+CF+A C+ NC C CKN
Sbjct: 259 GCSCPKSKCVALYCDCFKAGRRCNPINCNCTACKN 293
>gi|427797635|gb|JAA64269.1| Putative dna-dependent, partial [Rhipicephalus pulchellus]
Length = 184
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 45/64 (70%), Gaps = 1/64 (1%)
Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFM 232
+H KGC+CK+SGCLK YCEC++A ILCS C+C+ CKNFE S ER+ L D +
Sbjct: 3 RHTKGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKNFEDSSERKTLMQLADAAEVRVQ 62
Query: 233 QRAA 236
Q+AA
Sbjct: 63 QQAA 66
>gi|431910169|gb|ELK13242.1| Tesmin [Pteropus alecto]
Length = 484
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 72/116 (62%), Gaps = 13/116 (11%)
Query: 82 INIYSNTSPGAHLNTQFKDNL--VKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
+N+ + SP + ++ KYC+CFA+G C+ CNC NC NN+ HE+ R +A+ A
Sbjct: 273 LNLITQQSPAEVTESSMDSDVGVRKYCDCFASGDVCNNCNCNNCCNNLRHEIERFKAIKA 332
Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 195
L+RNP AF+PKI P DA+ +HN+GC+CK+SGCLK YCEC++
Sbjct: 333 CLDRNPEAFQPKIG--PGSLGDAK---------PRHNRGCNCKRSGCLKNYCECYE 377
>gi|293336269|ref|NP_001170528.1| uncharacterized protein LOC100384542 [Zea mays]
gi|238005872|gb|ACR33971.1| unknown [Zea mays]
Length = 272
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/75 (48%), Positives = 47/75 (62%)
Query: 143 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 202
RNP AF PK+ + D + + + +A+H +GC+CKKS CLKKYCECFQ + CS
Sbjct: 14 RNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSV 73
Query: 203 NCRCLDCKNFEGSEE 217
+CRC CKN G E
Sbjct: 74 SCRCEGCKNAFGKRE 88
>gi|145513118|ref|XP_001442470.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409823|emb|CAK75073.1| unnamed protein product [Paramecium tetraurelia]
Length = 176
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 61/126 (48%), Gaps = 37/126 (29%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
CNC+ S+CLKL YC+CF AG+ C CNC +C
Sbjct: 85 CNCKKSKCLKL-----------------------------YCDCFTAGVTCGKDCNCCSC 115
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
HNN EH R+ + +ERNP+AFRPK+ S + ED E +H KGC+CKK
Sbjct: 116 HNNDEHPKEREIVIKQIMERNPSAFRPKVESKSNS-----ED--EQDHKPRHFKGCNCKK 168
Query: 184 SGCLKK 189
S KK
Sbjct: 169 SNWSKK 174
Score = 46.2 bits (108), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 27/43 (62%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
C+CKKS CLK YC+CF A + C ++C C C N + + R +
Sbjct: 85 CNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREI 127
>gi|326504624|dbj|BAK06603.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 405
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 77/176 (43%), Gaps = 49/176 (27%)
Query: 48 PRANTDGKDGTPKKQK--------QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
P A +G KK+K C+C+ SRCLKL
Sbjct: 72 PPAEANGSSDRRKKRKAEDGEGCRTCSCKKSRCLKL------------------------ 107
Query: 100 DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
YC CFA+G +C + C C C+N H V + G L+ + +
Sbjct: 108 -----YCVCFASGSHCSELCGCDPCYNKPIHGVQQNTPPGLPLQ---------VVRTAEA 153
Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
Q++ E AR Q+ K C CKKSGCLKKYC+C+Q CS NC+C DCKN G
Sbjct: 154 GQNSAEFAR-YQMDFFRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKNPYG 207
Score = 42.4 bits (98), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 31/61 (50%), Gaps = 2/61 (3%)
Query: 153 ASSPHGAQDAREDAREAQLA--AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
A SP + D R+ + A + + C CKKS CLK YC CF + CSE C C C
Sbjct: 69 AVSPPAEANGSSDRRKKRKAEDGEGCRTCSCKKSRCLKLYCVCFASGSHCSELCGCDPCY 128
Query: 211 N 211
N
Sbjct: 129 N 129
>gi|145496358|ref|XP_001434170.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401293|emb|CAK66773.1| unnamed protein product [Paramecium tetraurelia]
Length = 387
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/159 (33%), Positives = 74/159 (46%), Gaps = 40/159 (25%)
Query: 64 QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA-GIYCDGCNCVN 122
QCNC+ S+CLKL YCECFA G+ CNC
Sbjct: 194 QCNCKKSKCLKL-----------------------------YCECFANNGVCSQSCNCQE 224
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIAS--SPHGAQDAREDA---REAQLAAKHN- 176
C N +++ R +A+ L RN +AF + +P Q + + + +K N
Sbjct: 225 CKNRIDNPQERSKAIEEALLRNSDAFAQCFTTKGAPQFVQQGIFNFIQDKPIKEPSKDNS 284
Query: 177 ----KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
KGC+CKKSGC KKYCEC+ N+ C++ C+C C N
Sbjct: 285 SVVHKGCNCKKSGCKKKYCECYSQNLKCNDLCKCEHCLN 323
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 30/41 (73%), Gaps = 1/41 (2%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 218
C+CKKS CLK YCECF N +CS++C C +CKN + +ER
Sbjct: 195 CNCKKSKCLKLYCECFANNGVCSQSCNCQECKNRIDNPQER 235
>gi|242086178|ref|XP_002443514.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
gi|241944207|gb|EES17352.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
Length = 409
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 74/173 (42%), Gaps = 45/173 (26%)
Query: 43 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
S R R R DG DG K C+C+ SRC+KL
Sbjct: 87 SDRKRRRKAEDG-DGC----KTCSCKKSRCMKL--------------------------- 114
Query: 103 VKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YC CFA+G C + C C C N +VA + A+ L K + QD
Sbjct: 115 --YCVCFASGSRCTESCGCEPCENKQPLQVAPRTALILPL---------KPVQTSEAGQD 163
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
E + + K C CKKSGCLKKYC+C+Q CS NC+C DC+N G
Sbjct: 164 IVEQVIRSPMDLIRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRNPFG 215
>gi|413949791|gb|AFW82440.1| hypothetical protein ZEAMMB73_340549 [Zea mays]
Length = 298
Score = 78.6 bits (192), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 60/131 (45%), Gaps = 9/131 (6%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
+ YC CFA G YC D C C C N Q L R KI P
Sbjct: 14 ICSYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQ-KRMSMKINRRPEANA 72
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG------ 214
+ EDA + + +GC+CKKS CLKKYC+C+Q CS CRC DC+N G
Sbjct: 73 EPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNDGII 132
Query: 215 -SEERRALFHG 224
+ +R L+ G
Sbjct: 133 ADDSKRYLYTG 143
>gi|145506479|ref|XP_001439200.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406384|emb|CAK71803.1| unnamed protein product [Paramecium tetraurelia]
Length = 175
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/111 (41%), Positives = 56/111 (50%), Gaps = 25/111 (22%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L +YC+CFA G C + CNCV C NN + R+EA R+P AF+
Sbjct: 86 LKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKEAKLQIQTRDPRAFKQAF-------- 137
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
KGC+CKKSGC KKYCECFQ N+ C+ CRC C N
Sbjct: 138 ----------------KGCNCKKSGCQKKYCECFQNNLQCTHQCRCEGCVN 172
Score = 46.6 bits (109), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 26/35 (74%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
K C+CKKS CLK+YC+CF +CSENC C+ C N
Sbjct: 76 KTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFN 110
>gi|145518329|ref|XP_001445042.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412475|emb|CAK77645.1| unnamed protein product [Paramecium tetraurelia]
Length = 190
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 65/150 (43%), Gaps = 54/150 (36%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K CNC+ S+CLK +YC+CFA G C + CNCV
Sbjct: 91 KTCNCKKSQCLK-----------------------------QYCDCFANGQVCSENCNCV 121
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C NN + R++A R+P AF+ KGC+C
Sbjct: 122 GCFNNSLNNSQRKDAKLQIQARDPGAFKQAF------------------------KGCNC 157
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
KKSGC KKYCECFQ N+ C+ CRC C N
Sbjct: 158 KKSGCQKKYCECFQNNLQCTHQCRCEGCVN 187
>gi|170044124|ref|XP_001849708.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867389|gb|EDS30772.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 586
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 56/111 (50%), Gaps = 30/111 (27%)
Query: 39 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
G ++ R P T D KK + CNC S+CLKL
Sbjct: 506 GSEDETRKGPAVGTIFPDEAYKK-RPCNCTKSQCLKL----------------------- 541
Query: 99 KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 149
YC+CFA G +C CNC +C+NN+++E RQ+A+ ATLERNP+AF+
Sbjct: 542 ------YCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFK 586
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
A + C+C KS CLK YC+CF C NC C DC N +EE R
Sbjct: 525 AYKKRPCNCTKSQCLKLYCDCFANGEFC-YNCNCRDCYNNLDNEEER 570
>gi|115489472|ref|NP_001067223.1| Os12g0605300 [Oryza sativa Japonica Group]
gi|77557030|gb|ABA99826.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|113649730|dbj|BAF30242.1| Os12g0605300 [Oryza sativa Japonica Group]
Length = 407
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 55/111 (49%), Gaps = 12/111 (10%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YC CFA+G +C + C C C+N H V R A K + QD+
Sbjct: 114 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTP----------ALPLKAVQTLETGQDSV 163
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
E + + K C CKKSGCLKKYC+C+Q CS NC+C DCKN G
Sbjct: 164 EQLIRSPMDFGRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKNPYG 213
>gi|125537321|gb|EAY83809.1| hypothetical protein OsI_39026 [Oryza sativa Indica Group]
gi|125579998|gb|EAZ21144.1| hypothetical protein OsJ_36791 [Oryza sativa Japonica Group]
Length = 421
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 55/111 (49%), Gaps = 12/111 (10%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YC CFA+G +C + C C C+N H V R A K + QD+
Sbjct: 128 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTP----------ALPLKAVQTLETGQDSV 177
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
E + + K C CKKSGCLKKYC+C+Q CS NC+C DCKN G
Sbjct: 178 EQLIRSPMDFGRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKNPYG 227
>gi|145536949|ref|XP_001454192.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421945|emb|CAK86795.1| unnamed protein product [Paramecium tetraurelia]
Length = 196
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 66/151 (43%), Gaps = 54/151 (35%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNC 120
++ CNC+ S+CLK +YC+CFA G C + CNC
Sbjct: 96 ERSCNCKKSQCLK-----------------------------QYCDCFANGQMCSENCNC 126
Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
V C NN + R+EA + R+P AF+ KGC+
Sbjct: 127 VGCFNNTLNMEQRKEAKVQIINRDPGAFKQSF------------------------KGCN 162
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKN 211
CKKSGC KKYCECF + + C+ CRC C N
Sbjct: 163 CKKSGCQKKYCECFLSGLACTHLCRCDGCLN 193
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 53/99 (53%), Gaps = 9/99 (9%)
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK---HNKGCH 180
N +E E +E V +++ F+ ++++ + + Q+ +K + + C+
Sbjct: 47 QNRIEME---EETVETSIQL---VFKEYLSTNQDYYFLLQISQKSLQIQSKLFINERSCN 100
Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
CKKS CLK+YC+CF +CSENC C+ C N + E+R
Sbjct: 101 CKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQR 139
>gi|428185670|gb|EKX54522.1| hypothetical protein GUITHDRAFT_63344, partial [Guillardia theta
CCMP2712]
Length = 125
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 61/118 (51%), Gaps = 21/118 (17%)
Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFR-------PKIA 153
L +YC+CF I C C C +CHN+ +HE R EA+ NP+AF+ +
Sbjct: 21 LKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRHIRMNNPSAFKGTALELEDQEV 80
Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
++P G + K +GC CK+S C KKYCECF A I C+ NC C DC N
Sbjct: 81 TTPKGGK-------------KTVRGCRCKRSKCQKKYCECFSAGIPCTSNCVCTDCAN 125
Score = 45.8 bits (107), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 30/43 (69%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+GC C ++ CLK+YC+CF+ +I C+ +C C DC N EE+R
Sbjct: 11 RGCRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKR 53
>gi|449016427|dbj|BAM79829.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 932
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 61/117 (52%), Gaps = 15/117 (12%)
Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L KYC+CFAAG +C C C C N+ +++ + A + +R+ S+ A
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQEQVERARQSASQRS---------STSGEAL 869
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
+ A E + ++GC CK++GCLK+YCECFQA C+ C C C N G E
Sbjct: 870 NGEPSASE-----RSSRGCRCKRTGCLKRYCECFQAGRECTAQCACEGCLNCRGLSE 921
>gi|195153923|ref|XP_002017873.1| GL17406 [Drosophila persimilis]
gi|194113669|gb|EDW35712.1| GL17406 [Drosophila persimilis]
Length = 554
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 58/122 (47%), Gaps = 29/122 (23%)
Query: 42 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
+SP+ TD G ++K CNC S+CLKL
Sbjct: 456 KSPQLASAPTTDALAGMASRRKHCNCSKSQCLKL-------------------------- 489
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YC+CFA G +C C C +C NN+++EV R+ A+ + L+RNP+AF K + A+D
Sbjct: 490 ---YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFNLKHGEDSNNAKD 546
Query: 162 AR 163
+
Sbjct: 547 VK 548
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 38/84 (45%), Gaps = 6/84 (7%)
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
+I SP A DA A K C+C KS CLK YC+CF C ++C C DC
Sbjct: 453 EILKSPQLASAPTTDALAGM--ASRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCF 509
Query: 211 N---FEGSEERRALFHGDHNGIAF 231
N +E ER D N AF
Sbjct: 510 NNLDYEVERERAIRSCLDRNPSAF 533
>gi|145484384|ref|XP_001428202.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395286|emb|CAK60804.1| unnamed protein product [Paramecium tetraurelia]
Length = 164
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 76/174 (43%), Gaps = 60/174 (34%)
Query: 41 QESPRSRPRANTDGKDGTP--KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
Q R + R T+ P +KQ+QC C+ S+CLK+
Sbjct: 16 QTPTRQQKRVITETTIKKPFQQKQRQCTCKKSQCLKM----------------------- 52
Query: 99 KDNLVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
YC+C A G YC C+C NCHN+ +H R + + +E+N +A
Sbjct: 53 ------YCDCLAFGEYCGSECSCQNCHNDDKHPEQRSKVLEQMVEKNSSA---------- 96
Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
QL C+C+KS C KKYCEC+ A + CSE C+C DCKN
Sbjct: 97 ------------QLK------CNCRKSNCQKKYCECYNAGVKCSELCKCDDCKN 132
>gi|223974815|gb|ACN31595.1| unknown [Zea mays]
Length = 391
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 78/186 (41%), Gaps = 56/186 (30%)
Query: 37 PPGKQE----SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
PP E S R R R DG DG K C+C+ SRCLKL
Sbjct: 51 PPAPAEANGNSDRKRRRKADDG-DGC----KACSCKKSRCLKL----------------- 88
Query: 93 HLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 151
YC C+A+G +C + C C CHN GA P
Sbjct: 89 ------------YCVCYASGSHCTELCGCEPCHNKPPQ--------GAPWTA------PV 122
Query: 152 IASSPHGAQDAREDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCLD 208
+ P +A ++ E + + + + C CKKSGCLKKYC+C+Q CS NC+C D
Sbjct: 123 LPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDD 182
Query: 209 CKNFEG 214
C+N G
Sbjct: 183 CRNPFG 188
>gi|219362923|ref|NP_001137104.1| uncharacterized protein LOC100217281 [Zea mays]
gi|194698372|gb|ACF83270.1| unknown [Zea mays]
gi|194708474|gb|ACF88321.1| unknown [Zea mays]
gi|414877953|tpg|DAA55084.1| TPA: hypothetical protein ZEAMMB73_574510 [Zea mays]
Length = 394
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 78/186 (41%), Gaps = 56/186 (30%)
Query: 37 PPGKQE----SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
PP E S R R R DG DG K C+C+ SRCLKL
Sbjct: 53 PPAPAEANGNSDRKRRRKADDG-DGC----KACSCKKSRCLKL----------------- 90
Query: 93 HLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 151
YC C+A+G +C + C C CHN GA P
Sbjct: 91 ------------YCVCYASGSHCTELCGCEPCHNKPPQ--------GAPWTA------PV 124
Query: 152 IASSPHGAQDAREDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCLD 208
+ P +A ++ E + + + + C CKKSGCLKKYC+C+Q CS NC+C D
Sbjct: 125 LPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDD 184
Query: 209 CKNFEG 214
C+N G
Sbjct: 185 CRNPFG 190
>gi|159475976|ref|XP_001696090.1| hypothetical protein CHLREDRAFT_104804 [Chlamydomonas reinhardtii]
gi|158275261|gb|EDP01039.1| predicted protein [Chlamydomonas reinhardtii]
Length = 84
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 50/94 (53%), Gaps = 12/94 (12%)
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
L YC+CFAAG +C C+C +C N E+ Q+ R+P AF KI +P
Sbjct: 2 LKLYCDCFAAGQFCGACSCASCQNRPEYADRVQQRREDIAARDPQAFTRKIMDAP----- 56
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 195
KH +GC+CK+S CLKKYCECFQ
Sbjct: 57 -------GGGGGKHKRGCNCKRSHCLKKYCECFQ 83
>gi|145479163|ref|XP_001425604.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392675|emb|CAK58206.1| unnamed protein product [Paramecium tetraurelia]
Length = 164
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 77/174 (44%), Gaps = 60/174 (34%)
Query: 41 QESPRSRPRANTDGKDGTP--KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
Q R + R T+ P +KQ+QC C+ S+CLK+
Sbjct: 16 QTPTRQQKRVITETTIKKPFQQKQRQCTCKKSQCLKM----------------------- 52
Query: 99 KDNLVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
YC+C A G YC C+C NCHN+ +H R + + +E+N +P
Sbjct: 53 ------YCDCLAFGEYCGSECSCQNCHNDDKHPEQRNKVLEQMVEKN----------TP- 95
Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
AQ+ C+C+KS C KKYCEC+ A + CSE C+C DCKN
Sbjct: 96 -----------AQIK------CNCRKSNCQKKYCECYNAGVKCSELCKCDDCKN 132
>gi|384253407|gb|EIE26882.1| hypothetical protein COCSUDRAFT_9768, partial [Coccomyxa
subellipsoidea C-169]
Length = 81
Score = 71.6 bits (174), Expect = 1e-09, Method: Composition-based stats.
Identities = 36/91 (39%), Positives = 49/91 (53%), Gaps = 17/91 (18%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YC+CFA G++C D C C +C N + A +A + ++P AF+PK+ + G
Sbjct: 5 YCDCFATGLFCNDACMCKDCENRTDTLNAVFKARKFIMVKDPTAFKPKVLDASGG----- 59
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECF 194
H KGC C+KS CLKKYCECF
Sbjct: 60 -----------HVKGCACRKSRCLKKYCECF 79
Score = 39.7 bits (91), Expect = 4.8, Method: Composition-based stats.
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 186 CLKKYCECFQANILCSENCRCLDCKN 211
CLK YC+CF + C++ C C DC+N
Sbjct: 1 CLKLYCDCFATGLFCNDACMCKDCEN 26
>gi|339238879|ref|XP_003380994.1| conserved hypothetical protein [Trichinella spiralis]
gi|316976043|gb|EFV59390.1| conserved hypothetical protein [Trichinella spiralis]
Length = 238
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 72/165 (43%), Gaps = 49/165 (29%)
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
++ CNC S+CLKL YC+CFA G +C C+C
Sbjct: 55 RRPCNCTKSQCLKL-----------------------------YCDCFANGEFCSNCHCT 85
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN---KG 178
NC NN+ +E+ R A+ + LERNP AF+PKI + + +++ AK G
Sbjct: 86 NCLNNLTNELDRSRAIKSCLERNPMAFQPKIGN------------KTSEIMAKDELTLNG 133
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
C + +K A + C+ C+C+ C+N E R H
Sbjct: 134 CT-TRDAIVKSQ----AARVSCTVRCKCVGCRNTEADRNHRNRGH 173
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 32/62 (51%), Gaps = 4/62 (6%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG---DHNGIAFMQRA 235
C+C KS CLK YC+CF CS NC C +C N +E R+ + N +AF +
Sbjct: 58 CNCTKSQCLKLYCDCFANGEFCS-NCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKI 116
Query: 236 AN 237
N
Sbjct: 117 GN 118
>gi|340507965|gb|EGR33791.1| hypothetical protein IMG5_037680 [Ichthyophthirius multifiliis]
Length = 96
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 41/81 (50%), Gaps = 9/81 (11%)
Query: 136 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 195
AV RNPNAF K D H KGC CKKS CLKKYCECF
Sbjct: 2 AVNDAKNRNPNAFNIKFQIIEEKQDDK---------IVIHKKGCTCKKSNCLKKYCECFN 52
Query: 196 ANILCSENCRCLDCKNFEGSE 216
A +LC++NC C CKN + E
Sbjct: 53 AGVLCNQNCVCDSCKNMQKEE 73
>gi|413949787|gb|AFW82436.1| hypothetical protein ZEAMMB73_293494 [Zea mays]
Length = 341
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 53/134 (39%), Gaps = 31/134 (23%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K C C+ SRCLKL YC CFA G YC D C C
Sbjct: 62 KHCACKKSRCLKL-----------------------------YCPCFAGGGYCSDKCGCQ 92
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C N Q L R KI P + EDA + + +GC+C
Sbjct: 93 PCFNKEAFSETVQTTRKVLLSRQ-KRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNC 151
Query: 182 KKSGCLKKYCECFQ 195
KKS CLKKYC+C+Q
Sbjct: 152 KKSSCLKKYCDCYQ 165
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 22/41 (53%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
K C CKKS CLK YC CF CS+ C C C N E E
Sbjct: 62 KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSE 102
>gi|145501814|ref|XP_001436887.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404032|emb|CAK69490.1| unnamed protein product [Paramecium tetraurelia]
Length = 159
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 63/150 (42%), Gaps = 54/150 (36%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K CNC+ S+CLK +YC+CFA G C + CNCV
Sbjct: 60 KSCNCKKSQCLK-----------------------------QYCDCFANGQMCSENCNCV 90
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
C NN + R+ A + R+P AF+ KGC+C
Sbjct: 91 GCFNNTLNIEQRKVAKVQIVNRDPGAFKQSF------------------------KGCNC 126
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
KKSGC KKYCECF + C+ CRC C N
Sbjct: 127 KKSGCQKKYCECFLNGLTCTHLCRCDGCLN 156
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 32/47 (68%)
Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
+ K C+CKKS CLK+YC+CF +CSENC C+ C N + E+R +
Sbjct: 58 NTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNIEQRKV 104
>gi|145534021|ref|XP_001452755.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420454|emb|CAK85358.1| unnamed protein product [Paramecium tetraurelia]
Length = 326
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 44/66 (66%), Gaps = 7/66 (10%)
Query: 148 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 207
FRPK+ S + ED ++ + +H KGC+CKKS CLKKYCEC+Q + CSE C+C
Sbjct: 244 FRPKVESKSNS-----EDEQDHK--PRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCD 296
Query: 208 DCKNFE 213
+CKN E
Sbjct: 297 ECKNCE 302
>gi|194696554|gb|ACF82361.1| unknown [Zea mays]
Length = 372
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 56/114 (49%), Gaps = 18/114 (15%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YC C+A+G +C + C C CHN GA P + P +A
Sbjct: 69 YCVCYASGSHCTELCGCEPCHNKPPQ--------GAPWTA------PVLPLKPVQTLEAG 114
Query: 164 EDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
++ E + + + + C CKKSGCLKKYC+C+Q CS NC+C DC+N G
Sbjct: 115 QEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRNPFG 168
>gi|325187539|emb|CCA22077.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 319
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 11/81 (13%)
Query: 133 RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCE 192
R +A+ + LERNP AF+PK+ ++ + H +GC+C+KS CLK YCE
Sbjct: 20 RSQAIASILERNPLAFQPKVVANGSETNNLH-----------HLRGCNCRKSNCLKNYCE 68
Query: 193 CFQANILCSENCRCLDCKNFE 213
C QA + C+ C C C N E
Sbjct: 69 CHQARVSCTNRCACHKCCNTE 89
>gi|50787704|emb|CAH04409.1| hypothetical protein [Euplotes vannus]
Length = 482
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 65/154 (42%), Gaps = 36/154 (23%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCNCV 121
K C C+ S+C+KL YCECF + +C C C
Sbjct: 357 KGCTCKRSKCIKL-----------------------------YCECFLSKGFCGPSCTCN 387
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKI----ASSPHGAQDAREDAREAQLAAKHNK 177
+C N E+E + L RNP AF K+ AS P G A + L +H K
Sbjct: 388 DCENMEENEEEIVKIREKILSRNPKAFEKKLQIEEASQPLGMGPKAASAPSSNL--RHIK 445
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
GC CK+S C YCEC Q C+E C C DC N
Sbjct: 446 GCTCKRSKCSNNYCECHQHGAKCTELCSCTDCNN 479
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
+EA L+ KGC CK+S C+K YCECF + C +C C DC
Sbjct: 349 KEANLS--ECKGCTCKRSKCIKLYCECFLSKGFCGPSCTCNDC 389
>gi|38637223|dbj|BAD03489.1| hypothetical protein [Oryza sativa Japonica Group]
gi|38637276|dbj|BAD03540.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 141
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 50/73 (68%), Gaps = 6/73 (8%)
Query: 411 NERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFR 470
N+RPLSPGT+ALMCDE+D+ F + + A+ ST +++ I+ EQE+ +LT FR
Sbjct: 66 NKRPLSPGTQALMCDEQDAVFQDS--KAPDATPSTTEEN----LVDIFKEQEKRVLTNFR 119
Query: 471 DFLNRLITCGSIK 483
D+L +L TCG ++
Sbjct: 120 DYLCKLATCGKLQ 132
>gi|299115549|emb|CBN75752.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 313
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 62/148 (41%), Gaps = 43/148 (29%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
C C+ RCLKL YC+CFA +C C C C
Sbjct: 201 CTCKKCRCLKL-----------------------------YCDCFAQSRFCGSACLCKQC 231
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
N + V Q A A LERNP+AF H A + R + + GC C +
Sbjct: 232 CNRL-GAVVVQAARRAVLERNPDAF--------HRAHSL--ETRPGSVTS--GAGCGCTR 278
Query: 184 SGCLKKYCECFQANILCSENCRCLDCKN 211
S CLKKYC CFQ ++LC C+C C N
Sbjct: 279 SHCLKKYCVCFQKSLLCGAGCKCEGCLN 306
Score = 38.9 bits (89), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 42/97 (43%), Gaps = 11/97 (11%)
Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
+R R K A++ D R A AA+ + C CKK CLK YC+CF + C
Sbjct: 164 QRQAAGKRLKKAAAGVKVDDTRSPPPSAAAAAEQHNPCTCKKCRCLKLYCDCFAQSRFCG 223
Query: 202 ENCRCLDCKNFEGS----EERRAL-------FHGDHN 227
C C C N G+ RRA+ FH H+
Sbjct: 224 SACLCKQCCNRLGAVVVQAARRAVLERNPDAFHRAHS 260
>gi|145546570|ref|XP_001458968.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426790|emb|CAK91571.1| unnamed protein product [Paramecium tetraurelia]
Length = 272
Score = 65.5 bits (158), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 54/115 (46%), Gaps = 16/115 (13%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L +YC CF +G C D C C +C N E R EA+ ++ +
Sbjct: 159 LKRYCACFHSGRMCLDECQCKDCKNCTEFAEERDEAINHVYKK--------------CHR 204
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
D + E L+ + + GC CK +GC KKYCECF+ +C E C C DC N S
Sbjct: 205 DKKVPVNEL-LSLQMSYGCKCKSTGCQKKYCECFKRGQICGEQCACEDCLNIPFS 258
>gi|194703140|gb|ACF85654.1| unknown [Zea mays]
Length = 204
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 78/186 (41%), Gaps = 56/186 (30%)
Query: 37 PPGKQE----SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
PP E S R R R DG DG K C+C+ SRCLKL
Sbjct: 53 PPAPAEANGNSDRKRRRKADDG-DGC----KACSCKKSRCLKL----------------- 90
Query: 93 HLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 151
YC C+A+G +C + C C CHN GA P
Sbjct: 91 ------------YCVCYASGSHCTELCGCEPCHNKPPQ--------GAPWTA------PV 124
Query: 152 IASSPHGAQDAREDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCLD 208
+ P +A ++ E + + + + C CKKSGCLKKYC+C+Q CS NC+C D
Sbjct: 125 LPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDD 184
Query: 209 CKNFEG 214
C+N G
Sbjct: 185 CRNPFG 190
>gi|145505956|ref|XP_001438944.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406117|emb|CAK71547.1| unnamed protein product [Paramecium tetraurelia]
Length = 272
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 57/121 (47%), Gaps = 16/121 (13%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L +YC CF +G C D C C +C N + R EA+ ++ +
Sbjct: 159 LKRYCACFHSGRMCLDECQCKDCKNCTDFYEERDEAINHVYKK--------------CHR 204
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
D + E L+ + + GC CK +GC KKYCECF+ +C E C C C N ++ ++
Sbjct: 205 DKKVPVNEL-LSLQMSYGCKCKTTGCQKKYCECFKRGQICGEQCSCEGCLNIPSAQNQKE 263
Query: 221 L 221
L
Sbjct: 264 L 264
Score = 45.8 bits (107), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
C+C +S CLK+YC CF + +C + C+C DCKN E R
Sbjct: 151 CNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTDFYEER 191
>gi|291001253|ref|XP_002683193.1| predicted protein [Naegleria gruberi]
gi|284096822|gb|EFC50449.1| predicted protein [Naegleria gruberi]
Length = 459
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 33/46 (71%)
Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
++ + + KGC+CKK+GCLK+YCECFQ N C+ CRC CKN+
Sbjct: 186 QQPKFVDNNAKGCNCKKTGCLKRYCECFQNNKRCTIKCRCQGCKNY 231
>gi|170073469|ref|XP_001870379.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167870088|gb|EDS33471.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 103
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/71 (47%), Positives = 45/71 (63%), Gaps = 12/71 (16%)
Query: 137 VGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 196
+ A L ++P +PKI GA A EDA +H KGC+CK+SGCLK YCEC++A
Sbjct: 35 ITAGLGQSP--VQPKI-----GATSAAEDATR-----RHTKGCNCKRSGCLKNYCECYEA 82
Query: 197 NILCSENCRCL 207
I CS NC+C+
Sbjct: 83 KIACSGNCKCI 93
>gi|224014361|ref|XP_002296843.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
CCMP1335]
gi|220968480|gb|EED86827.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
CCMP1335]
Length = 141
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 56/107 (52%), Gaps = 12/107 (11%)
Query: 105 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YC+CF G C+ CNC +C N E R+ + L+ NP AF ++D+
Sbjct: 46 YCDCFHGGNLCNSLCNCTDCKNTTEFREEREWKMKEVLKLNPKAF----------SEDSD 95
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
+ + Q ++ N GC C S CLKKYC CF A+ C++ C C DC+
Sbjct: 96 KFNTKRQRMSRGN-GCACPSSHCLKKYCSCFGADAGCTDKCSCNDCE 141
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 218
C C KS C+K YC+CF LC+ C C DCKN E EER
Sbjct: 35 CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREER 75
>gi|195058418|ref|XP_001995449.1| GH22623 [Drosophila grimshawi]
gi|193899655|gb|EDV98521.1| GH22623 [Drosophila grimshawi]
Length = 916
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 58/136 (42%), Gaps = 35/136 (25%)
Query: 2 RSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKK 61
+SP Q +P Q+ N++ P L L P +P A+ D G+ +
Sbjct: 700 KSPQQKLPILQSSSTQD---NKLCVPAKPL-FDILKPAPMPTPSPAASASLDVLGGS--R 753
Query: 62 QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
+K CNC S+CLKL YC+CFA G +C C C
Sbjct: 754 RKHCNCSKSQCLKL-----------------------------YCDCFANGEFCQDCTCK 784
Query: 122 NCHNNVEHEVARQEAV 137
+C NN+++EV R+ A+
Sbjct: 785 DCFNNLDYEVKRERAI 800
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 8/76 (10%)
Query: 146 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
+ +P +P A A D L K C+C KS CLK YC+CF C ++C
Sbjct: 728 DILKPAPMPTPSPAASASLDV----LGGSRRKHCNCSKSQCLKLYCDCFANGEFC-QDCT 782
Query: 206 CLDCKN---FEGSEER 218
C DC N +E ER
Sbjct: 783 CKDCFNNLDYEVKRER 798
>gi|195155474|ref|XP_002018629.1| GL25902 [Drosophila persimilis]
gi|194114782|gb|EDW36825.1| GL25902 [Drosophila persimilis]
Length = 296
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 1/68 (1%)
Query: 145 PNAFRPKIASSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSEN 203
P+A ++ ++P R R + + KGC CK++ C+K YC+C+Q+ +C E
Sbjct: 5 PSALTTRVIAAPVKTAAGRRQPRTPKTPKESTIKGCACKRTSCIKNYCDCYQSMRICHEF 64
Query: 204 CRCLDCKN 211
C+C+DCKN
Sbjct: 65 CKCVDCKN 72
>gi|432959452|ref|XP_004086297.1| PREDICTED: uncharacterized protein LOC101167937 [Oryzias latipes]
Length = 377
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 29/78 (37%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K C+C SRCLKL YC+CF++G+ C CN
Sbjct: 325 RPRKTCHCTKSRCLKL-----------------------------YCDCFSSGMMCSSCN 355
Query: 120 CVNCHNNVEHEVARQEAV 137
C+NCHNN E+E R EA+
Sbjct: 356 CINCHNNAENESQRHEAI 373
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 30/43 (69%), Gaps = 1/43 (2%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
K CHC KS CLK YC+CF + ++CS +C C++C N +E +R
Sbjct: 328 KTCHCTKSRCLKLYCDCFSSGMMCS-SCNCINCHNNAENESQR 369
>gi|198476800|ref|XP_001357481.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
gi|198137857|gb|EAL34551.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
Length = 296
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 1/68 (1%)
Query: 145 PNAFRPKIASSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSEN 203
P+A ++ ++P R R + + KGC CK++ C+K YC+C+Q+ +C E
Sbjct: 5 PSALTTRVIAAPVKTAAGRRQPRTPKTPKESTIKGCACKRTSCIKNYCDCYQSMRICHEF 64
Query: 204 CRCLDCKN 211
C+C+DCKN
Sbjct: 65 CKCVDCKN 72
>gi|224123378|ref|XP_002319064.1| predicted protein [Populus trichocarpa]
gi|222857440|gb|EEE94987.1| predicted protein [Populus trichocarpa]
Length = 213
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 48/114 (42%), Gaps = 33/114 (28%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YCECFAAG+YC + C+C++C NN HE E G RNP AF PK+ + A
Sbjct: 8 LYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIESRNPLAFAPKVIRNLDSAS 67
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+ F+ + CS CRC CKN G
Sbjct: 68 E--------------------------------FRGGVGCSSYCRCEGCKNTFG 89
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 32/72 (44%), Gaps = 10/72 (13%)
Query: 187 LKKYCECFQANILCSENCRCLDCKNFEGSE----ERRALFHGDHNGIAFMQRA-----AN 237
L YCECF A + C E C CLDC N G E E R N +AF + +
Sbjct: 8 LYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIES-RNPLAFAPKVIRNLDSA 66
Query: 238 AAIIGAVGSSGY 249
+ G VG S Y
Sbjct: 67 SEFRGGVGCSSY 78
>gi|300176906|emb|CBK25475.2| unnamed protein product [Blastocystis hominis]
Length = 123
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 56/108 (51%), Gaps = 18/108 (16%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCEC AG C D C CV+C N + R++A+ + NP AF S+ G +D+R
Sbjct: 26 YCECLKAGRTCTDLCACVDCLNRAKFP-ERKKALKHIKKTNPKAFL--RVSNLKGHEDSR 82
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
C C+KS C KKYC C+Q+ C+ C+C+DCKN
Sbjct: 83 --------------CCACQKSLCNKKYCSCYQSGRYCTPKCKCVDCKN 116
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER-RALFH-GDHNGIAFMQ 233
C C K+ CLK YCEC +A C++ C C+DC N ER +AL H N AF++
Sbjct: 15 CTCTKTHCLKLYCECLKAGRTCTDLCACVDCLNRAKFPERKKALKHIKKTNPKAFLR 71
>gi|145494055|ref|XP_001433022.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400138|emb|CAK65625.1| unnamed protein product [Paramecium tetraurelia]
Length = 188
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 54/137 (39%), Gaps = 40/137 (29%)
Query: 105 YCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCECFAAG C+ CNC C+N + + ++ G
Sbjct: 89 YCECFAAGKACNMDCNCQGCNNTICSQSIKKVQTG------------------------- 123
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
GC+CKK+GCLKKYCEC+ C+ C C+DC N E SE +
Sbjct: 124 --------------GCNCKKTGCLKKYCECYLKKQRCTFLCNCIDCCNQEDSESEKENEQ 169
Query: 224 GDHNGIAFMQRAANAAI 240
I Q N I
Sbjct: 170 LQQQSIDCGQTITNQII 186
>gi|145522672|ref|XP_001447180.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414680|emb|CAK79783.1| unnamed protein product [Paramecium tetraurelia]
Length = 263
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 62/151 (41%), Gaps = 46/151 (30%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCN 119
+Q+ CNC+NS CLK +YC CF +G C C
Sbjct: 144 QQQPCNCKNSGCLK-----------------------------RYCRCFHSGRMCLKECQ 174
Query: 120 CV-NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 178
C +C NN +H+ R A+ + + +R R R+A G
Sbjct: 175 CSEDCQNNEQHQEQRNNAI---IHVDQKCYR------------NRRMPRDALFKLDVIYG 219
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDC 209
C C KS C K+YCEC+ N C++ C+C DC
Sbjct: 220 CSCTKSKCRKRYCECYLRNQKCTDKCKCFDC 250
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 34/51 (66%), Gaps = 3/51 (5%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRC-LDCKNFEGSEERR--ALFHGDH 226
C+CK SGCLK+YC CF + +C + C+C DC+N E +E+R A+ H D
Sbjct: 148 CNCKNSGCLKRYCRCFHSGRMCLKECQCSEDCQNNEQHQEQRNNAIIHVDQ 198
>gi|195434044|ref|XP_002065013.1| GK14905 [Drosophila willistoni]
gi|194161098|gb|EDW75999.1| GK14905 [Drosophila willistoni]
Length = 250
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 45/69 (65%), Gaps = 8/69 (11%)
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
K A S G + AR A++ + KGC CK+S C+K YC+C+Q+ +CS++CRC+ C+
Sbjct: 3 KPAPSGTGKRSARGQAQQPTI-----KGCSCKRSQCIKNYCDCYQSMAVCSKHCRCIGCR 57
Query: 211 NFEGSEERR 219
N ++ER+
Sbjct: 58 N---TQERK 63
>gi|403367645|gb|EJY83644.1| hypothetical protein OXYTRI_18624 [Oxytricha trifallax]
gi|403370830|gb|EJY85281.1| hypothetical protein OXYTRI_16860 [Oxytricha trifallax]
Length = 590
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 49/110 (44%), Gaps = 32/110 (29%)
Query: 105 YCECFAAGIYCD--GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 162
YC+C G CD C+C +C N +E++ R E
Sbjct: 481 YCDCLKNGQACDPSSCSCASCLNTIENQNLRLEI-------------------------- 514
Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRCLDCKN 211
++ ++ Q AK +GC CK S C K+YCECFQ C C+C DCKN
Sbjct: 515 -QEKKQKQGQAK--EGCSCKNSQCQKRYCECFQNGRQCDPSKCKCKDCKN 561
>gi|145488037|ref|XP_001430023.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397118|emb|CAK62625.1| unnamed protein product [Paramecium tetraurelia]
Length = 255
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 51/115 (44%), Gaps = 19/115 (16%)
Query: 99 KDNLVK-YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
K N +K YC CF C D C C C N EH R A+ K+
Sbjct: 140 KSNCLKLYCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNAL------------EKVKQKL 187
Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
H ++ + +++ GC C+KS C K YCECF N CS +CRC DC N
Sbjct: 188 HRQKNDDDLFDRSKIW-----GCKCQKSQCQKNYCECFVRNQKCSSSCRCKDCAN 237
Score = 46.2 bits (108), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 30/66 (45%), Gaps = 8/66 (12%)
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKN--------FEGSEERRALFHGDHNGI 229
C C KS CLK YC CF N CS+ C+C +CKN F E+ + H N
Sbjct: 135 ACRCTKSNCLKLYCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNALEKVKQKLHRQKNDD 194
Query: 230 AFMQRA 235
R+
Sbjct: 195 DLFDRS 200
>gi|395834329|ref|XP_003790159.1| PREDICTED: protein lin-54 homolog [Otolemur garnettii]
Length = 677
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 172 AAKHNKGCHCKKSGCLKKY--CECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNG 228
A++ K C+C KS CLK + A I+CS C+C+ CKNFE S ER+ L H D
Sbjct: 518 ASRPRKPCNCTKSLCLKLWEVNNISMAKIMCSSICKCIGCKNFEESPERKTLMHLADAAE 577
Query: 229 IAFMQRAA 236
+ Q+ A
Sbjct: 578 VRVQQQTA 585
>gi|145548172|ref|XP_001459767.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427593|emb|CAK92370.1| unnamed protein product [Paramecium tetraurelia]
Length = 303
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 56/148 (37%), Gaps = 47/148 (31%)
Query: 65 CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
CNC S CLKL YC+CF C + C C++C
Sbjct: 183 CNCSKSNCLKL-----------------------------YCQCFHQNKQCTELCKCLDC 213
Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
N H RQ A+ K+ S + D + GC C+K
Sbjct: 214 KNCDYHTQVRQTALEKI----------KMKSQRQKHDNDLFDLSKVW-------GCKCQK 256
Query: 184 SGCLKKYCECFQANILCSENCRCLDCKN 211
S C K YCECF+ N C+ +CRC DC N
Sbjct: 257 SQCQKNYCECFRRNQKCNSSCRCKDCAN 284
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 43/83 (51%), Gaps = 22/83 (26%)
Query: 133 RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG----CHCKKSGCLK 188
R++ + A L+RN E +E+ L ++ N+ C+C KS CLK
Sbjct: 151 RKQNINANLQRN------------------YEYNQESDLNSQSNEHTMNPCNCSKSNCLK 192
Query: 189 KYCECFQANILCSENCRCLDCKN 211
YC+CF N C+E C+CLDCKN
Sbjct: 193 LYCQCFHQNKQCTELCKCLDCKN 215
>gi|125580000|gb|EAZ21146.1| hypothetical protein OsJ_36793 [Oryza sativa Japonica Group]
Length = 710
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 87/226 (38%), Gaps = 75/226 (33%)
Query: 1 MRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPK 60
+R+P A+P H+ + VP + + Q SP+ + R D DGT
Sbjct: 383 LRTPPCALPGIGL---HLNALATVPKDKMTAVVAYVGESSQGSPKKK-RHKFDNGDGT-- 436
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCN 119
K+C+C+ S+CLKL YCECF AG++C + C+
Sbjct: 437 SCKRCSCKKSKCLKL-----------------------------YCECFHAGVFCSEPCS 467
Query: 120 CVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 176
C C N N+E ++ +E + + RNP AF PK+ + G + A
Sbjct: 468 CQGCLNMPSNMETVLSTREQIES---RNPLAFAPKVIRTEPGQELA-------------- 510
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
+ CS +CRC CKN G E AL
Sbjct: 511 -------------------GGVGCSVSCRCEGCKNAFGRREGVALL 537
>gi|145496915|ref|XP_001434447.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401573|emb|CAK67050.1| unnamed protein product [Paramecium tetraurelia]
Length = 321
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 50/115 (43%), Gaps = 19/115 (16%)
Query: 99 KDNLVK-YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
K N +K YC+CF C + C C C N +H RQ A+ + KI S
Sbjct: 205 KSNCLKLYCQCFHQNRQCTELCKCFACKNCDDHLPVRQTALE----------KIKIKSQR 254
Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
D D GC C+KS C K YCEC+ N CS +CRC DC N
Sbjct: 255 QKHDDDLFDRSRVW-------GCKCQKSQCQKNYCECYIRNQKCSSSCRCKDCAN 302
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 43/86 (50%), Gaps = 2/86 (2%)
Query: 128 EHEVARQEAVGATLERNPNAFRP-KIASSPHGAQDAREDAR-EAQLAAKHNKGCHCKKSG 185
+H+ + TLE+ R K+ S + QD +++ +Q + C C KS
Sbjct: 148 DHQKKADDPCSHTLEQRSRRIRKNKLNPSFNTTQDPNQESDLHSQQQEPNTTPCKCTKSN 207
Query: 186 CLKKYCECFQANILCSENCRCLDCKN 211
CLK YC+CF N C+E C+C CKN
Sbjct: 208 CLKLYCQCFHQNRQCTELCKCFACKN 233
>gi|145520829|ref|XP_001446270.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413747|emb|CAK78873.1| unnamed protein product [Paramecium tetraurelia]
Length = 330
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 48/113 (42%), Gaps = 30/113 (26%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YC+CF G +C + C C++C NN E E + + RN N
Sbjct: 224 LKFYCQCFQQGKFCGETCECLDCCNN-EQNKELIEQTRSQINRNLN-------------- 268
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
C CKKS CLKKYCEC+ A C +C+C C N+E
Sbjct: 269 --------------DEVKCFCKKSKCLKKYCECYNAGKKCGVDCKCEGCMNYE 307
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 28/41 (68%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
+GC CKK+ CLK YC+CFQ C E C CLDC N E ++E
Sbjct: 214 QGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQNKE 254
>gi|194759119|ref|XP_001961797.1| GF14749 [Drosophila ananassae]
gi|190615494|gb|EDV31018.1| GF14749 [Drosophila ananassae]
Length = 324
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 35/45 (77%), Gaps = 3/45 (6%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
KGC CK+S C+K YC+C+Q+ ++CS+ CRC+ C+N +E R+A+
Sbjct: 103 KGCCCKRSQCIKNYCDCYQSMVICSKFCRCVGCRN---TEVRKAV 144
>gi|298711558|emb|CBJ32620.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 78
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
L YC+CFA + C D C C NC+N+ EH V R+ AV L R+P+AF K + PH
Sbjct: 3 LKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRAAVRELLCRSPHAFDAKFKTEPH 59
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%), Gaps = 1/36 (2%)
Query: 186 CLKKYCECFQANILCSENCRCLDCKN-FEGSEERRA 220
CLK YC+CF ++CS+ CRC +C N E ERRA
Sbjct: 2 CLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRA 37
>gi|145477875|ref|XP_001424960.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392027|emb|CAK57562.1| unnamed protein product [Paramecium tetraurelia]
Length = 240
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 46/108 (42%), Gaps = 23/108 (21%)
Query: 105 YCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCEC A G C C C NC N + ++ + E + N KI S
Sbjct: 83 YCECLAKGRLCSSACRCENCQN----RTSNKKVLHVIKELDLNKNLKKIKS--------- 129
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
+ GC CKKS CLKKYCECF + C C C +C+N
Sbjct: 130 ---------KRFKDGCTCKKSMCLKKYCECFHSGKSCGYGCNCENCQN 168
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 26/45 (57%)
Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
R+ L ++ C+C KS C K YCEC LCS CRC +C+N
Sbjct: 60 RDDSLLLSSDEPCNCSKSECRKMYCECLAKGRLCSSACRCENCQN 104
>gi|145484801|ref|XP_001428410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395495|emb|CAK61012.1| unnamed protein product [Paramecium tetraurelia]
Length = 240
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 54/118 (45%), Gaps = 23/118 (19%)
Query: 105 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCEC A G C C C NCHN +++ V L++N
Sbjct: 83 YCECLAKGRLCSSFCRCQNCHNKTSNKLVLN--VIEELDQNR------------------ 122
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
R+ + + GC C+KS CLKKYCECF + C C+C++C+N+ E + +
Sbjct: 123 --NRKRFRSRRFKDGCTCQKSMCLKKYCECFNSGKSCGSGCKCVNCENYVLDEVEKII 178
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 21/33 (63%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
C+C +SGC K YCEC LCS CRC +C N
Sbjct: 72 CNCSRSGCRKMYCECLAKGRLCSSFCRCQNCHN 104
Score = 39.7 bits (91), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 3/55 (5%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS 155
L KYCECF +G C GC CVNC N V EV ++ + + ++ + F K S+
Sbjct: 144 LKKYCECFNSGKSCGSGCKCVNCENYVLDEV--EKIICSPNQKKTSKFSSKFESA 196
>gi|195473860|ref|XP_002089210.1| GE18995 [Drosophila yakuba]
gi|194175311|gb|EDW88922.1| GE18995 [Drosophila yakuba]
Length = 237
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 154 SSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
S+P R + R+A+ KGC CK+S C+K YC+C+Q+ +C+ CRC+DC+N
Sbjct: 2 STPKKKSVDRSEGRKAKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTMFCRCIDCRNT 61
Query: 213 E 213
E
Sbjct: 62 E 62
>gi|145541620|ref|XP_001456498.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424310|emb|CAK89101.1| unnamed protein product [Paramecium tetraurelia]
Length = 262
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 48/110 (43%), Gaps = 37/110 (33%)
Query: 104 KYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 162
KYC C G C C CV+C NN +F
Sbjct: 141 KYCFCAKVGQSCSNLCKCVDCSNN-------------------KSFM------------- 168
Query: 163 REDAREAQLA-AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
+++ Q+ A+ KGC CKK+ CLK YC+CF + CS NC+C+ CKN
Sbjct: 169 ---SQQIQVTNAQEQKGCKCKKNFCLKGYCDCFAKGMQCSSNCKCISCKN 215
>gi|145515421|ref|XP_001443610.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410999|emb|CAK76213.1| unnamed protein product [Paramecium tetraurelia]
Length = 275
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 50/112 (44%), Gaps = 16/112 (14%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L +YC CF +G C D C C +C N R+ A+ ++ +
Sbjct: 161 LKRYCTCFHSGRMCMDECQCKDCKNCDLFSEEREAAINYVFKK--------------CNR 206
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
D A E L+ + + GC CK +GC KKYCEC + C + C C DC N
Sbjct: 207 DKNVPANEL-LSLQISYGCKCKSTGCQKKYCECLKRGQACGDLCSCEDCLNI 257
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 30/43 (69%), Gaps = 1/43 (2%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE-GSEERRA 220
C+C +S CLK+YC CF + +C + C+C DCKN + SEER A
Sbjct: 153 CNCSQSSCLKRYCTCFHSGRMCMDECQCKDCKNCDLFSEEREA 195
>gi|145476147|ref|XP_001424096.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391159|emb|CAK56698.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 52/115 (45%), Gaps = 17/115 (14%)
Query: 99 KDNLVK-YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
K N +K YC CF G C + C C +C N E+ R EAV I
Sbjct: 86 KTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQRHEAVEY------------IKKKA 133
Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
H + +E K GC+CKK+ CLK+YCECF C+ +C C C+N
Sbjct: 134 HRNKKV---TQEKLFETKEVWGCNCKKTRCLKRYCECFIRQKTCTVDCNCNYCEN 185
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 6/59 (10%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAAN 237
C C K+ CLK YC CF C E C+C DCKN + +R H + ++++ A+
Sbjct: 82 CSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQR------HEAVEYIKKKAH 134
>gi|224070845|ref|XP_002303261.1| predicted protein [Populus trichocarpa]
gi|222840693|gb|EEE78240.1| predicted protein [Populus trichocarpa]
Length = 163
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 40/91 (43%), Gaps = 30/91 (32%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
++CNC+ SRCLKL YCECFAAG YC D C+C
Sbjct: 81 RRCNCKRSRCLKL-----------------------------YCECFAAGNYCLDTCSCE 111
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKI 152
NC N E+ A RNP AF PK+
Sbjct: 112 NCFNKSEYADAVLHMRQQIEARNPVAFAPKV 142
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 34/71 (47%), Gaps = 7/71 (9%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-----NGIAFMQ 233
C+CK+S CLK YCECF A C + C C +C F SE A+ H N +AF
Sbjct: 83 CNCKRSRCLKLYCECFAAGNYCLDTCSCENC--FNKSEYADAVLHMRQQIEARNPVAFAP 140
Query: 234 RAANAAIIGAV 244
+ I V
Sbjct: 141 KVVKHTTISPV 151
>gi|145491063|ref|XP_001431531.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398636|emb|CAK64133.1| unnamed protein product [Paramecium tetraurelia]
Length = 246
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 50/112 (44%), Gaps = 17/112 (15%)
Query: 102 LVKYCECFAAGIYC-DGCNCVN-CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
+ +YC CF +G C C C + C N + R EA+ N +R
Sbjct: 144 IKRYCRCFHSGKTCLPECQCQDGCLNKEHNNCERSEAIKHV---NEKCYR---------- 190
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
R+ +EA GC C KS C K+YCECF N C+E C+C DC N
Sbjct: 191 --NRKIPKEALFKLDVIYGCSCTKSKCRKRYCECFLRNQNCTEKCKCFDCCN 240
>gi|312380815|gb|EFR26710.1| hypothetical protein AND_07034 [Anopheles darlingi]
Length = 524
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 53/120 (44%), Gaps = 34/120 (28%)
Query: 61 KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNC 120
K++ CNC S+CLKL YC+CFA G +C CNC
Sbjct: 325 KKRPCNCTKSQCLKL-----------------------------YCDCFANGEFCYNCNC 355
Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
+CHN + + ARQ+A+ RN + F + GA ++ + E ++ K + G +
Sbjct: 356 KDCHNTQKDDYARQKAIRC---RNTDQFLQEFHY--FGASESSRKSLEDGISPKLSPGLY 410
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
C+C KS CLK YC+CF C NC C DC N + + R
Sbjct: 329 CNCTKSQCLKLYCDCFANGEFCY-NCNCKDCHNTQKDDYAR 368
>gi|223993335|ref|XP_002286351.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977666|gb|EED95992.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 957
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 32/49 (65%)
Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
R++A ++ K + C CKKS CLKKYCECF ++ C+ C+C +C N
Sbjct: 643 RQEAINSKALTKQTRICRCKKSQCLKKYCECFADSLKCTSACKCENCGN 691
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 32/59 (54%), Gaps = 6/59 (10%)
Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNFEGSEERRALFHGDHNGIAF 231
K +KGC CKK+ CLK YC CF +I C + C C C N + R GD NG+A
Sbjct: 313 KTHKGCTCKKTKCLKLYCACFSHSIACDPSVCSCERCANTQDELARD----GD-NGMAM 366
>gi|195388490|ref|XP_002052913.1| GJ17819 [Drosophila virilis]
gi|194149370|gb|EDW65068.1| GJ17819 [Drosophila virilis]
Length = 225
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 33/45 (73%), Gaps = 3/45 (6%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
KGC CK+S C+K YC+C+Q+ +CS C+C+ C+N +EER A+
Sbjct: 25 KGCCCKRSQCIKNYCDCYQSMAICSIYCKCIGCRN---TEERLAV 66
>gi|194856723|ref|XP_001968813.1| GG24300 [Drosophila erecta]
gi|190660680|gb|EDV57872.1| GG24300 [Drosophila erecta]
Length = 239
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 29/37 (78%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
KGC CK+S C+K YC+C+Q+ +C++ CRC+ C+N E
Sbjct: 26 KGCCCKRSQCIKNYCDCYQSMAICTKFCRCIGCRNTE 62
>gi|219114349|ref|XP_002176345.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217402591|gb|EEC42581.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 636
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 25/35 (71%)
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
GC C++S CLKKYCECFQ C NC+C +C+N
Sbjct: 245 GCKCRRSFCLKKYCECFQNASPCGLNCKCTNCQNL 279
>gi|145521975|ref|XP_001446837.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414326|emb|CAK79440.1| unnamed protein product [Paramecium tetraurelia]
Length = 285
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 52/115 (45%), Gaps = 23/115 (20%)
Query: 102 LVKYCECFAAGIYC-DGCNCVN-CHNNVEHEVARQEAVGATLE---RNPNAFRPKIASSP 156
L +YC CF +G C C CV C NN +H R +A+ E RN N
Sbjct: 168 LKRYCRCFHSGRMCLKDCQCVEGCLNNHDHLEQRNDAIKHVNEKCHRNKNV--------- 218
Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
++A K GC+CKK+ C YCECF C+ +C+C +C+N
Sbjct: 219 ---------PKDALFKLKDCFGCNCKKTRCQTGYCECFLRKSKCTMDCQCDNCEN 264
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 14/30 (46%), Positives = 23/30 (76%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLD 208
C C+ SGCLK+YC CF + +C ++C+C++
Sbjct: 160 CFCRNSGCLKRYCRCFHSGRMCLKDCQCVE 189
>gi|145524257|ref|XP_001447956.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415489|emb|CAK80559.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 47/108 (43%), Gaps = 18/108 (16%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YC CF I C + C C +CHN ++ R +A+ + K+ D
Sbjct: 236 YCACFHRNIECSELCQCHDCHNKSDYSQIRTQALE----------KVKVKQQRRKHDDDL 285
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
D GC C+KS C K YCECF N CS C+C +C+N
Sbjct: 286 FDKDTVW-------GCQCRKSQCKKNYCECFIRNKKCSSLCKCNNCEN 326
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 31/57 (54%), Gaps = 3/57 (5%)
Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
+ H + +D E L +N C C KS CLK YC CF NI CSE C+C DC N
Sbjct: 204 TTHYREFFDKDLDEHNL---NNSPCKCSKSHCLKLYCACFHRNIECSELCQCHDCHN 257
>gi|145517899|ref|XP_001444827.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412260|emb|CAK77430.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 49/111 (44%), Gaps = 16/111 (14%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YC CF G C + C C +C N H R EAV ++ A
Sbjct: 90 LKMYCSCFHNGQICGESCKCEDCKNTNMHFHQRDEAVEYIKKK---------------AH 134
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
++ +E K GC+CKK+ CLK+YCEC+ C+ C C C+N
Sbjct: 135 RNKKVPQEKLFETKDIWGCNCKKTRCLKRYCECYIRQKTCTVECNCNHCEN 185
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 6/59 (10%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAAN 237
C C K+ CLK YC CF +C E+C+C DCKN FH + ++++ A+
Sbjct: 82 CSCTKTYCLKMYCSCFHNGQICGESCKCEDCKN------TNMHFHQRDEAVEYIKKKAH 134
>gi|147832526|emb|CAN77361.1| hypothetical protein VITISV_011036 [Vitis vinifera]
Length = 514
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 46/121 (38%), Gaps = 45/121 (37%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YC+C AAG+YC D C C NC N E+E Q R+P AF P+I
Sbjct: 244 LSSYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRI-------- 295
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEERR 219
A I CS+ CRC DC+N F +RR
Sbjct: 296 -----------------------------------AGIGCSDGCRCEDCRNSFGIKADRR 320
Query: 220 A 220
A
Sbjct: 321 A 321
>gi|219121911|ref|XP_002181300.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407286|gb|EEC47223.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 381
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 50/134 (37%), Gaps = 53/134 (39%)
Query: 58 TPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG 117
PK K C C+N+ CLKL YC CF G +CD
Sbjct: 272 VPKTVKPCKCKNTHCLKL-----------------------------YCTCFQKGSFCDP 302
Query: 118 --CNCVNCHNNVEHEVA---RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 172
C C++C+N E RQEAV L R R DA E++
Sbjct: 303 DICKCIDCYNLREFNETGGKRQEAVSEILLR-------------------RIDAFESRPK 343
Query: 173 AKHNKGCHCKKSGC 186
K +GC CKK+ C
Sbjct: 344 KKTGEGCACKKNRC 357
Score = 45.4 bits (106), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNF 212
K C CK + CLK YC CFQ C + C+C+DC N
Sbjct: 277 KPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNL 313
>gi|195118094|ref|XP_002003575.1| GI21907 [Drosophila mojavensis]
gi|193914150|gb|EDW13017.1| GI21907 [Drosophila mojavensis]
Length = 234
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 33/45 (73%), Gaps = 3/45 (6%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
KGC CK+S C+K YC+C+Q+ +CS C+C+ C+N ++ER A+
Sbjct: 25 KGCCCKRSQCIKNYCDCYQSMAICSIYCKCVGCRN---TQERLAV 66
>gi|397568266|gb|EJK46048.1| hypothetical protein THAOC_35308 [Thalassiosira oceanica]
Length = 754
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 25/42 (59%)
Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
H GC C+KS CLKKYCECF + C +C C +C N E
Sbjct: 421 HRTGCKCRKSRCLKKYCECFSNGVKCGASCSCENCGNQPAQE 462
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 28/40 (70%), Gaps = 1/40 (2%)
Query: 176 NKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNFEG 214
+KGC C K+ CLK YC CF N+LC+++ C C DCKN E
Sbjct: 231 DKGCSCSKTQCLKLYCSCFANNLLCNKDVCICSDCKNCEA 270
>gi|195034702|ref|XP_001988959.1| GH11450 [Drosophila grimshawi]
gi|193904959|gb|EDW03826.1| GH11450 [Drosophila grimshawi]
Length = 225
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 34/45 (75%), Gaps = 3/45 (6%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
+GC CK+S C+K YC+C+Q+ +CS +C+C+ C+N +E+R A+
Sbjct: 22 RGCCCKRSQCIKNYCDCYQSMAVCSIHCKCIGCRN---TEQRMAV 63
>gi|145506933|ref|XP_001439427.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406611|emb|CAK72030.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 53/115 (46%), Gaps = 17/115 (14%)
Query: 99 KDNLVK-YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
K N +K YC CF G C + C C +C N + R +AV ++ A R K S
Sbjct: 86 KTNCLKMYCSCFHNGQTCVESCKCEDCKNTNNNVPQRDKAVEYIKKK---AHRNKKVS-- 140
Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
+E K GC+CKK+ CLK+YCEC+ C+ C C C+N
Sbjct: 141 ----------QETLFETKDIWGCNCKKTRCLKRYCECYIRQKACTVECNCTHCEN 185
Score = 42.4 bits (98), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 21/33 (63%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
C C K+ CLK YC CF C E+C+C DCKN
Sbjct: 82 CSCTKTNCLKMYCSCFHNGQTCVESCKCEDCKN 114
>gi|195342745|ref|XP_002037959.1| GM18017 [Drosophila sechellia]
gi|194132809|gb|EDW54377.1| GM18017 [Drosophila sechellia]
Length = 240
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 27/35 (77%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
KGC CK+S C+K YC+C+Q+ +C++ CRC C+N
Sbjct: 23 KGCSCKRSQCIKNYCDCYQSMAICTKFCRCTACRN 57
>gi|145524820|ref|XP_001448232.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415776|emb|CAK80835.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 47/111 (42%), Gaps = 16/111 (14%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YC CF G C + C C +C N E+ R AV I H +
Sbjct: 90 LKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQRHNAVEY------------IRKKAHRNK 137
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
+E K GC+CKK+ C K+YCECF C+ +C C C+N
Sbjct: 138 KV---TQEKLFETKDVWGCNCKKTRCQKRYCECFIRQKTCTVDCNCNHCEN 185
Score = 45.4 bits (106), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 7/77 (9%)
Query: 161 DAREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
D E+ E+Q L + C C K+ CLK YC CF C E C+C DC+N + +R
Sbjct: 63 DLDENKEESQILENTKWQPCSCSKTNCLKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQR 122
Query: 220 ALFHGDHNGIAFMQRAA 236
HN + ++++ A
Sbjct: 123 ------HNAVEYIRKKA 133
>gi|145512403|ref|XP_001442118.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409390|emb|CAK74721.1| unnamed protein product [Paramecium tetraurelia]
Length = 278
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 49/115 (42%), Gaps = 23/115 (20%)
Query: 102 LVKYCECFAAGIYC-DGCNCVN-CHNNVEHEVARQEAVGATLE---RNPNAFRPKIASSP 156
L +YC CF +G C C CV C NN +H R A+ E RN N
Sbjct: 161 LKRYCRCFHSGRMCLKECQCVEGCLNNEDHLEERNMAIKHVNEKCHRNKNI--------- 211
Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
++A K GC CKKS C YCECF C+ +C C +C+N
Sbjct: 212 ---------PKDALFKLKDCFGCSCKKSRCQTGYCECFLRKSKCTIDCECQNCEN 257
Score = 42.0 bits (97), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLD-CKNFEGSEERRAL 221
C C+ SGCLK+YC CF + +C + C+C++ C N E E R +
Sbjct: 153 CFCRNSGCLKRYCRCFHSGRMCLKECQCVEGCLNNEDHLEERNM 196
>gi|145490209|ref|XP_001431105.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398208|emb|CAK63707.1| unnamed protein product [Paramecium tetraurelia]
Length = 223
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 47/111 (42%), Gaps = 16/111 (14%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YC CF G +C C C C N E ++ R +AV ++ A
Sbjct: 97 LKMYCSCFHNGRFCGKSCRCEECENTEEFKMKRMQAVDYVKKK---------------AH 141
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
++ +E GC+C K+ C+KKYCECF C+ C C C N
Sbjct: 142 RNKKVPKEKIFETVEIWGCNCSKTRCVKKYCECFIRGKKCTVECNCDHCDN 192
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 26/41 (63%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
C C+K+ CLK YC CF C ++CRC +C+N E + +R
Sbjct: 89 CSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTEEFKMKR 129
>gi|222628471|gb|EEE60603.1| hypothetical protein OsJ_14005 [Oryza sativa Japonica Group]
Length = 97
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 30/47 (63%)
Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
++ ++ C C KS C+K YC CF A CS+NCRC +CKN E E+
Sbjct: 1 MSNDQHRSCRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYED 47
Score = 46.2 bits (108), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 34/68 (50%), Gaps = 6/68 (8%)
Query: 99 KDNLVK-YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
K N VK YC CF+A YC C C NC N +E +E V +NP AF PKI
Sbjct: 13 KSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAFDPKIVR-- 70
Query: 157 HGAQDARE 164
QDA E
Sbjct: 71 --VQDASE 76
>gi|145531914|ref|XP_001451718.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419384|emb|CAK84321.1| unnamed protein product [Paramecium tetraurelia]
Length = 242
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 46/108 (42%), Gaps = 24/108 (22%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCECFA G C C C NC N + + LE+
Sbjct: 108 YCECFARGKTCGKHCGCQNCQN---------KQMNKKLEKIQKK--------------LI 144
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
+ R Q +++ KGC C KS C K+C C Q C+ CRC+DCKN
Sbjct: 145 KQKRIKQRTSRYLKGCTCGKSKCQNKFCSCHQDGKYCNSGCRCVDCKN 192
>gi|77415370|gb|AAI06127.1| Lin54 protein [Mus musculus]
Length = 523
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 48/93 (51%), Gaps = 29/93 (31%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
+ T + +K CNC S CLKL YC+CFA G +C
Sbjct: 451 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 481
Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 148
+ CNC NC+NN+EHE RQ+A+ A L+RNP AF
Sbjct: 482 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 514
>gi|145477681|ref|XP_001424863.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391930|emb|CAK57465.1| unnamed protein product [Paramecium tetraurelia]
Length = 259
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 52/111 (46%), Gaps = 17/111 (15%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L YC CF C C C C N+ +H + V +E+ + K + H
Sbjct: 151 LQLYCSCFHNRRQCTQECKCSECFNDGQHS----DEVLKAIEQ----IKIKEQRASHHDL 202
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
D+ D R+ GC CKK+ C+K YCECF + CS +C+C +C+N
Sbjct: 203 DSF-DTRQVW-------GCKCKKTQCIKGYCECFIRHKKCSSHCQCSNCQN 245
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
K CHC K+ CL+ YC CF C++ C+C +C N
Sbjct: 141 KPCHCSKTNCLQLYCSCFHNRRQCTQECKCSECFN 175
>gi|24581949|ref|NP_608936.1| tombola [Drosophila melanogaster]
gi|7296988|gb|AAF52259.1| tombola [Drosophila melanogaster]
gi|294862322|gb|ADE06681.2| RT06796p1 [Drosophila melanogaster]
Length = 243
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 28/38 (73%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
C CK+S C+K YC+C+Q+ +C++ CRC+ C+N E E
Sbjct: 28 CCCKRSQCIKNYCDCYQSMAICTKFCRCVGCRNTEVRE 65
>gi|145481073|ref|XP_001426559.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124393634|emb|CAK59161.1| unnamed protein product [Paramecium tetraurelia]
Length = 254
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 53/145 (36%), Gaps = 49/145 (33%)
Query: 80 IWINIYSNTSPGAHLNTQF-------------KDNLVKY-CECFAAGIYCD-GCNCVNCH 124
+ +N P HL Q+ K +KY CECF G C C C C
Sbjct: 116 VIVNKLEKVQPSDHLQQQYQNIRNESITCRCKKTKCIKYYCECFQYGQECSFKCECTGCC 175
Query: 125 NN-VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
N V+ + + V T+E +GC C+K
Sbjct: 176 NKQVKQSSGKMQLVSPTIEF---------------------------------QGCKCQK 202
Query: 184 SGCLKKYCECFQANILCSENCRCLD 208
C KKYCEC + N C++ C+C+D
Sbjct: 203 QNCSKKYCECLKRNQRCTDLCKCID 227
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 20/33 (60%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
C CKK+ C+K YCECFQ CS C C C N
Sbjct: 144 CRCKKTKCIKYYCECFQYGQECSFKCECTGCCN 176
>gi|145531469|ref|XP_001451501.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419156|emb|CAK84104.1| unnamed protein product [Paramecium tetraurelia]
Length = 192
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 46/110 (41%), Gaps = 40/110 (36%)
Query: 105 YCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCECFA G C CNC+ C+N+ + ++ G
Sbjct: 85 YCECFAVGRSCSIDCNCLGCNNSNCSQKIKKSQCG------------------------- 119
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
GC+CKK+GC+KKYCEC+ C+ C C+DC N E
Sbjct: 120 --------------GCNCKKTGCVKKYCECYLKKQRCTFLCNCIDCCNQE 155
>gi|118376115|ref|XP_001021240.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89303007|gb|EAS00995.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 379
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 57/121 (47%), Gaps = 19/121 (15%)
Query: 104 KYCECFAAGIYC--DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
KYC C+A +C C C +C N Q++ +T ++ +PK+ SS
Sbjct: 257 KYCSCYANNNFCVPSICKCQSCMNTF------QQSYNSTSQK-----KPKLISSQKSLSQ 305
Query: 162 A--REDAREAQLAAKHNKG---CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
+ QL + C C+KS C K+YC+C+++ CS C+C +C N +G +
Sbjct: 306 VILYQIQNINQLKIQKENATISCTCQKSFCKKQYCDCYKSGKKCSTLCKCQNCFN-QGQK 364
Query: 217 E 217
E
Sbjct: 365 E 365
>gi|145528842|ref|XP_001450215.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417815|emb|CAK82818.1| unnamed protein product [Paramecium tetraurelia]
Length = 244
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 58/151 (38%), Gaps = 26/151 (17%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YCECFA G C C C NC N + + +G +R
Sbjct: 108 YCECFARGKTCGKHCGCQNCQN-----MQMNKKLGKIQKR------------------LV 144
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR--AL 221
+ R Q ++ KGC C KS C +C C Q C+ C+C+DCKN + +
Sbjct: 145 KQKRIKQRTPRYLKGCTCGKSKCQNNFCSCHQDGKYCNSECKCVDCKNVYNFCLKIFPEI 204
Query: 222 FHGDHNGIAFMQRAANAAIIGAVGSSGYGTL 252
H + N + IIG SS + L
Sbjct: 205 EHVESNEMESTNANVVLFIIGCENSSNFQNL 235
>gi|229594976|ref|XP_001032555.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|225566469|gb|EAR84892.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 622
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%)
Query: 156 PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
P+ + + ++E ++ C C+KS CLK YCECF +C C CL+CKN +
Sbjct: 306 PNNKLQSIDSSKEGEILRVSKTPCKCRKSLCLKLYCECFARGEICGHACVCLECKNSKNH 365
Query: 216 EERR 219
E R
Sbjct: 366 LELR 369
Score = 45.4 bits (106), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 46/117 (39%), Gaps = 42/117 (35%)
Query: 40 KQESPRSRPRANTDGKDG-------TPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
K++ P ++ ++ K+G TP C CR S CLKL
Sbjct: 302 KKQQPNNKLQSIDSSKEGEILRVSKTP-----CKCRKSLCLKL----------------- 339
Query: 93 HLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 148
YCECFA G C C C+ C N+ H R EA+ +NP+AF
Sbjct: 340 ------------YCECFARGEICGHACVCLECKNSKNHLELRNEAIKVIEAKNPSAF 384
>gi|297722381|ref|NP_001173554.1| Os03g0638300 [Oryza sativa Japonica Group]
gi|255674730|dbj|BAH92282.1| Os03g0638300, partial [Oryza sativa Japonica Group]
Length = 113
Score = 47.0 bits (110), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 19 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 59
Score = 45.4 bits (106), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 43/107 (40%), Gaps = 34/107 (31%)
Query: 47 RPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYC 106
R + D D K+C+C+ S+CLKL YC
Sbjct: 7 RQKCQNDNVDSC----KRCSCKKSKCLKL-----------------------------YC 33
Query: 107 ECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 152
ECFA+ +YC + C+C C N+ HE RNP AF PK+
Sbjct: 34 ECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKV 80
>gi|145512279|ref|XP_001442056.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409328|emb|CAK74659.1| unnamed protein product [Paramecium tetraurelia]
Length = 139
Score = 46.6 bits (109), Expect = 0.040, Method: Composition-based stats.
Identities = 43/165 (26%), Positives = 62/165 (37%), Gaps = 50/165 (30%)
Query: 48 PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCE 107
P+ G D K+ C C+ ++CLKL YC+
Sbjct: 6 PKLQIQGNDSEETKEG-CQCQKTKCLKL-----------------------------YCQ 35
Query: 108 CFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 166
CF G C C C C N++ + R +A+G + + + ++ K
Sbjct: 36 CFHDGKCCGQSCGCNGCKNSLNDYLERNKAIGKIIHKYKDDYKQK--------------- 80
Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
L+ +GC CKKS C YCECF C E C C C+N
Sbjct: 81 ----LSGDKGQGCCCKKSKCKLNYCECFIKGRSCGEQCHCKKCEN 121
>gi|145545532|ref|XP_001458450.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426270|emb|CAK91053.1| unnamed protein product [Paramecium tetraurelia]
Length = 220
Score = 45.8 bits (107), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 45/114 (39%), Gaps = 22/114 (19%)
Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGAT---LERNPNAFRPKIASSPH 157
L YC CF G C C C C N ++ + R++AV +RN KI +
Sbjct: 97 LQMYCSCFHNGRVCGKSCKCQECENAEDNIIKREKAVNYAKKKAQRNKKVPEGKIFETTE 156
Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
GC+C K+ C+K YCECF CS C C C N
Sbjct: 157 IW------------------GCNCSKTRCVKNYCECFIRGKKCSVECNCQHCDN 192
Score = 38.5 bits (88), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 26/42 (61%)
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
C C KS CL+ YC CF +C ++C+C +C+N E + +R
Sbjct: 88 SCSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDNIIKR 129
>gi|428180254|gb|EKX49122.1| hypothetical protein GUITHDRAFT_151625 [Guillardia theta CCMP2712]
Length = 289
Score = 45.8 bits (107), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 23/33 (69%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
C CK+S C+KKYCECFQ + C+ C C DC N
Sbjct: 158 CKCKRSRCMKKYCECFQNSSACTSRCTCNDCGN 190
>gi|218194439|gb|EEC76866.1| hypothetical protein OsI_15056 [Oryza sativa Indica Group]
Length = 177
Score = 45.8 bits (107), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 33/73 (45%), Gaps = 5/73 (6%)
Query: 93 HLNTQFKDNLVKYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 151
H + + YC CF+A YC C C NC N +E +E V +NP AF PK
Sbjct: 88 HFKKRARGKQESYCLCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAFDPK 147
Query: 152 IASSPHGAQDARE 164
I QDA E
Sbjct: 148 IVR----VQDASE 156
Score = 43.1 bits (100), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 179 CHCKKS--GCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
CH KK G + YC CF A CS+NCRC +CKN E E+
Sbjct: 87 CHFKKRARGKQESYCLCFSAYGYCSQNCRCTNCKNREYYED 127
>gi|145529794|ref|XP_001450680.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418302|emb|CAK83283.1| unnamed protein product [Paramecium tetraurelia]
Length = 229
Score = 45.4 bits (106), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 23/111 (20%)
Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L +YC+CFA G C C C CHN ++ R+EA+ +N + + P G
Sbjct: 66 LKQYCDCFANGNMCTTQCQCQGCHNTEDYLEEREEAINKLKMQNQSIEK----EVPVGIS 121
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
+ ++ LK+YC+CFQ N CSE+C+C +C N
Sbjct: 122 CKCKKSKC------------------LKRYCDCFQNNQKCSESCQCNNCSN 154
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
C C KS CLK+YC+CF +C+ C+C C N E E R
Sbjct: 58 CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYLEER 98
>gi|145484609|ref|XP_001428314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395399|emb|CAK60916.1| unnamed protein product [Paramecium tetraurelia]
Length = 264
Score = 45.1 bits (105), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
K CHC K+ CL+ YC CF C++ C+C DC N
Sbjct: 146 KPCHCSKTNCLQLYCSCFHNRRQCTQECKCCDCYN 180
>gi|145520831|ref|XP_001446271.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413748|emb|CAK78874.1| unnamed protein product [Paramecium tetraurelia]
Length = 229
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 1/56 (1%)
Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
D +++Q K K C C KS CLK+YC+CF +C+ C+C C N E +E R
Sbjct: 45 DKQQSQNIPKSLK-CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERG 99
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 49/111 (44%), Gaps = 23/111 (20%)
Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
L +YC+CFA G C C C CHN +++ R EA+ +N + + P G
Sbjct: 66 LKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERGEAINKLKLQNQSIEK----EVPVGIS 121
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
+ ++ K+YC+CFQ N C E+C+C +C N
Sbjct: 122 CKCKKSKCL------------------KRYCDCFQNNQKCHESCQCSNCSN 154
>gi|145490347|ref|XP_001431174.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398277|emb|CAK63776.1| unnamed protein product [Paramecium tetraurelia]
Length = 280
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 22/41 (53%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
K CHC K+ CL+ YC CF C CRC DC N EE
Sbjct: 149 KPCHCTKTHCLQLYCSCFHNRRRCMSECRCNDCYNDGQHEE 189
>gi|145545676|ref|XP_001458522.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426342|emb|CAK91125.1| unnamed protein product [Paramecium tetraurelia]
Length = 289
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
K CHC K+ CL+ YC CF C C+C DC N +G E L
Sbjct: 156 KPCHCTKTHCLQLYCSCFHNRRQCMSECKCNDCYN-DGQHEEDVL 199
>gi|145495603|ref|XP_001433794.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400914|emb|CAK66397.1| unnamed protein product [Paramecium tetraurelia]
Length = 253
Score = 43.1 bits (100), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 24/43 (55%), Gaps = 1/43 (2%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
CHC K+ CL+ YC CF C+ C+C DC N +G E L
Sbjct: 149 CHCAKTHCLQLYCSCFHNRRPCTNECKCNDCYN-DGKHEDEVL 190
>gi|145521839|ref|XP_001446769.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414258|emb|CAK79372.1| unnamed protein product [Paramecium tetraurelia]
Length = 151
Score = 43.1 bits (100), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 51/127 (40%), Gaps = 20/127 (15%)
Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
YC+CF G C + C+C C N++ R +A+ + +
Sbjct: 38 YCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAIVKIIHK-------------------Y 78
Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
+D + + + GC CKKS C YCECF C + C C C+N + ++ H
Sbjct: 79 KDESKLKFSGDKGSGCCCKKSKCQLNYCECFIKGRSCGDQCHCKKCQNGKKQNQKICANH 138
Query: 224 GDHNGIA 230
D I
Sbjct: 139 EDQIQIC 145
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 49/113 (43%), Gaps = 12/113 (10%)
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKN--FEGSEERRALFHGDHNGIAFMQRA 235
GC C+K+ CLK YC+CF C E C C CKN + E +A+ + + +
Sbjct: 26 GCQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAI-------VKIIHKY 78
Query: 236 ANAAIIGAVGSSGYGTLMTSNKRK---SEELLFGVAAKDQSVIRNPQSQQGNN 285
+ + + G G G +K + E + G + DQ + Q+ + N
Sbjct: 79 KDESKLKFSGDKGSGCCCKKSKCQLNYCECFIKGRSCGDQCHCKKCQNGKKQN 131
>gi|145530936|ref|XP_001451240.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418884|emb|CAK83843.1| unnamed protein product [Paramecium tetraurelia]
Length = 263
Score = 42.7 bits (99), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 21/35 (60%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
K CHC K+ CL+ YC CF N CS C+C C N
Sbjct: 145 KPCHCSKTKCLQLYCSCFHKNKACSSKCQCKGCFN 179
Score = 38.9 bits (89), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 46/209 (22%), Positives = 75/209 (35%), Gaps = 51/209 (24%)
Query: 9 PQWQARPQHVRMVNRVPHPVHKLPLPTLPPG---KQESPRSRPRANTDGKDGTPKKQ--K 63
P++Q+ + +R +R + + + P G K++ R++ R + D K
Sbjct: 86 PEYQSSQKQLRFSDRSQNHIEEENKPKQARGIKDKEQRQRTKRRVDYGVGDSFSNDSDLK 145
Query: 64 QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCVN 122
C+C ++CL+L YC CF C C C
Sbjct: 146 PCHCSKTKCLQL-----------------------------YCSCFHKNKACSSKCQCKG 176
Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
C N+ H+V E V A + +R ++ S + D R+ KG
Sbjct: 177 CFNDGNHKV---EMVKALQKVKLKEYR--VSQSDLDSFDTRQVLGCKCKKTHCKKG---- 227
Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKN 211
YCECF C+ C+C +C N
Sbjct: 228 -------YCECFIRGRKCTSQCKCCECNN 249
>gi|145492971|ref|XP_001432482.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124399594|emb|CAK65085.1| unnamed protein product [Paramecium tetraurelia]
Length = 253
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
K CHC K+ CL+ YC CF C+ C+C DC N +G E +
Sbjct: 124 KACHCSKTHCLQLYCSCFHNRRPCTNECKCNDCFN-DGKHEDEVI 167
>gi|145483579|ref|XP_001427812.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394895|emb|CAK60414.1| unnamed protein product [Paramecium tetraurelia]
Length = 261
Score = 42.0 bits (97), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
C C KS CLK+YC+CF +C+ C+C C N
Sbjct: 90 CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHN 122
>gi|167382874|ref|XP_001736307.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165901477|gb|EDR27560.1| hypothetical protein EDI_093510 [Entamoeba dispar SAW760]
Length = 176
Score = 42.0 bits (97), Expect = 0.86, Method: Composition-based stats.
Identities = 17/39 (43%), Positives = 22/39 (56%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
C C+ S CL+ C CFQ CS C C CKN + +E+
Sbjct: 15 CRCQTSACLESNCSCFQGRNFCSPLCTCNCCKNRKENEQ 53
>gi|345309828|ref|XP_001507154.2| PREDICTED: protein lin-54 homolog [Ornithorhynchus anatinus]
Length = 182
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 20/28 (71%)
Query: 196 ANILCSENCRCLDCKNFEGSEERRALFH 223
A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 49 AKIMCSSICKCIGCKNFEESPERKTLMH 76
>gi|414884761|tpg|DAA60775.1| TPA: hypothetical protein ZEAMMB73_412328 [Zea mays]
Length = 581
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 33/71 (46%), Gaps = 16/71 (22%)
Query: 278 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPL-ADILQPQDMKELCSLLV 336
PQS+ G V C P +R PL ADI+Q +D+KELC LLV
Sbjct: 481 PQSRAGQLVMKPLTECITKGP---------------RMREPLLADIVQAEDIKELCKLLV 525
Query: 337 MVSSEATKTLT 347
+VS EA K
Sbjct: 526 VVSREAAKAYV 536
>gi|145508477|ref|XP_001440188.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407394|emb|CAK72791.1| unnamed protein product [Paramecium tetraurelia]
Length = 235
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 32/58 (55%)
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 208
K+ + + + + +QL++ GC+C+K C K+YCEC + NI C+ C C +
Sbjct: 160 KLNCECYNCSNKQPKSTPSQLSSIEFNGCNCRKQNCSKRYCECQKRNIKCTSKCNCCE 217
>gi|407037160|gb|EKE38521.1| hypothetical protein ENU1_163380 [Entamoeba nuttalli P19]
Length = 176
Score = 41.2 bits (95), Expect = 1.6, Method: Composition-based stats.
Identities = 16/39 (41%), Positives = 22/39 (56%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
C C+ + CL+ C CFQ CS C C CKN + +E+
Sbjct: 15 CRCQTTACLESNCSCFQGGNFCSPLCTCNCCKNRKENEQ 53
>gi|67482063|ref|XP_656381.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56473579|gb|EAL50998.1| hypothetical protein EHI_153340 [Entamoeba histolytica HM-1:IMSS]
gi|449705185|gb|EMD45288.1| Hypothetical protein EHI5A_018850 [Entamoeba histolytica KU27]
Length = 176
Score = 40.8 bits (94), Expect = 1.7, Method: Composition-based stats.
Identities = 16/39 (41%), Positives = 22/39 (56%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
C C+ + CL+ C CFQ CS C C CKN + +E+
Sbjct: 15 CRCQTTACLESNCSCFQGGNFCSPLCTCNCCKNRKENEQ 53
>gi|344241563|gb|EGV97666.1| Solute carrier family 2, facilitated glucose transporter member 10
[Cricetulus griseus]
Length = 423
Score = 40.0 bits (92), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 24/39 (61%)
Query: 521 GYGSAKSAVSVTLEPSCTTTSVSSRQNNDLPMRVGLSNG 559
G G AVS+ PSC TTS +SRQ DLP +GL NG
Sbjct: 197 GIGLVSFAVSLDSGPSCLTTSNTSRQMGDLPQSLGLPNG 235
>gi|354476758|ref|XP_003500590.1| PREDICTED: solute carrier family 2, facilitated glucose transporter
member 10 [Cricetulus griseus]
Length = 544
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 24/39 (61%)
Query: 521 GYGSAKSAVSVTLEPSCTTTSVSSRQNNDLPMRVGLSNG 559
G G AVS+ PSC TTS +SRQ DLP +GL NG
Sbjct: 318 GIGLVSFAVSLDSGPSCLTTSNTSRQMGDLPQSLGLPNG 356
>gi|115615245|ref|XP_001199031.1| PREDICTED: uncharacterized protein LOC763146 [Strongylocentrotus
purpuratus]
Length = 1617
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 6/39 (15%)
Query: 179 CHCKKSGCL------KKYCECFQANILCSENCRCLDCKN 211
C CKK+ C K C C Q NI C++ CRC++CKN
Sbjct: 1560 CSCKKTSCEEPKAAGKGRCSCRQHNITCTDLCRCINCKN 1598
>gi|384246535|gb|EIE20025.1| hypothetical protein COCSUDRAFT_58250 [Coccomyxa subellipsoidea
C-169]
Length = 556
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 44/70 (62%), Gaps = 8/70 (11%)
Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
+RNP AF+ KI + + + + + ++ H +GCHCKKS C KKYCECFQA + C
Sbjct: 162 QRNPQAFQEKI--------EKQANGQGSGVSGLHKRGCHCKKSHCKKKYCECFQAGVNCG 213
Query: 202 ENCRCLDCKN 211
+C+C +C N
Sbjct: 214 VHCKCDECHN 223
>gi|390344245|ref|XP_003726079.1| PREDICTED: uncharacterized protein LOC764337 [Strongylocentrotus
purpuratus]
Length = 1742
Score = 39.3 bits (90), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 6/39 (15%)
Query: 179 CHCKKSGCL------KKYCECFQANILCSENCRCLDCKN 211
C CKK+ C K C C Q NI C++ CRC++CKN
Sbjct: 1685 CSCKKTSCEEPKAAGKGRCSCRQHNITCTDLCRCINCKN 1723
>gi|145520699|ref|XP_001446205.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413682|emb|CAK78808.1| unnamed protein product [Paramecium tetraurelia]
Length = 226
Score = 38.9 bits (89), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 23/35 (65%), Gaps = 1/35 (2%)
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLD-CKN 211
GC+C+K C K YCEC + I C+ CRC D CKN
Sbjct: 178 GCNCRKQDCSKGYCECQKRKIKCTSKCRCCDECKN 212
>gi|291227245|ref|XP_002733597.1| PREDICTED: hypothetical protein, partial [Saccoglossus kowalevskii]
Length = 1578
Score = 38.9 bits (89), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 19/34 (55%)
Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
A H C CKKS CL ++C C +A + C C C
Sbjct: 1527 ATHKNYCRCKKSKCLTRHCICVKAGVKCVVACMC 1560
>gi|226069328|dbj|BAH36888.1| Polycomb group protein [Physcomitrella patens subsp. patens]
gi|237637250|gb|ACR07925.1| CLF protein [Physcomitrella patens]
Length = 999
Score = 38.9 bits (89), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 7/90 (7%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR---ALFHGDHNGIAFMQ 233
+GCHC KS C + C CF A C + C +C + G E+ A+ + HN + M
Sbjct: 791 RGCHCAKSNCCSRQCPCFAAGRECDPDV-CRNC--WVGCGEKHVKPAVEYTCHN-MKLML 846
Query: 234 RAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
+ ++G +G+G + K E L
Sbjct: 847 KQQQRVLLGRSDVAGWGAFLKKTVAKHEYL 876
>gi|224010277|ref|XP_002294096.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970113|gb|EED88451.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 401
Score = 38.9 bits (89), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 18/29 (62%), Gaps = 5/29 (17%)
Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCL 207
CHC KS CLK YC+CFQ + RCL
Sbjct: 372 CHCSKSKCLKLYCDCFQ-----QKEVRCL 395
>gi|345481297|ref|XP_003424335.1| PREDICTED: hypothetical protein LOC100679675 [Nasonia vitripennis]
Length = 539
Score = 38.5 bits (88), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 25/47 (53%), Gaps = 3/47 (6%)
Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD--CKNFEGSEE 217
K+NKGC CK C K C C + I+C CRC D C N E +E
Sbjct: 104 TKNNKGCSCK-GNCSNKICGCVKKGIMCQVFCRCNDAQCMNQEPEKE 149
>gi|325191197|emb|CCA25983.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 511
Score = 38.5 bits (88), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 39/88 (44%), Gaps = 8/88 (9%)
Query: 156 PHGAQDAREDAREAQLAAKHNKGCHCKKSG-CLKKYCECFQANILCSENCRCLDCKN-FE 213
P G+ D REDA E NK C+ SG C C C + N +C+ CRC C+N F
Sbjct: 299 PLGSSDRREDALEK------NKVVSCRCSGGCRNGRCACVRENGMCTPACRCTSCQNPFS 352
Query: 214 GSEERRALFHGDHNGIAFMQRAANAAII 241
+ A N FMQ A ++
Sbjct: 353 IVKAAGANMDILKNDSCFMQNACKGRVM 380
>gi|168031358|ref|XP_001768188.1| polycomb group protein [Physcomitrella patens subsp. patens]
gi|162680626|gb|EDQ67061.1| polycomb group protein [Physcomitrella patens subsp. patens]
Length = 892
Score = 38.5 bits (88), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 7/90 (7%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR---ALFHGDHNGIAFMQ 233
+GCHC KS C + C CF A C + C +C + G E+ A+ + HN + M
Sbjct: 681 RGCHCAKSNCCSRQCPCFAAGRECDPDV-CRNC--WVGCGEKHVKPAVEYTCHN-MKLML 736
Query: 234 RAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
+ ++G +G+G + K E L
Sbjct: 737 KQQQRVLLGRSDVAGWGAFLKKTVAKHEYL 766
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.314 0.128 0.377
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,707,507,268
Number of Sequences: 23463169
Number of extensions: 363981260
Number of successful extensions: 819997
Number of sequences better than 100.0: 669
Number of HSP's better than 100.0 without gapping: 565
Number of HSP's successfully gapped in prelim test: 104
Number of HSP's that attempted gapping in prelim test: 816086
Number of HSP's gapped (non-prelim): 2268
length of query: 571
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 423
effective length of database: 8,886,646,355
effective search space: 3759051408165
effective search space used: 3759051408165
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 80 (35.4 bits)