BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy128
(446 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|193713718|ref|XP_001949510.1| PREDICTED: hypothetical protein LOC100161166 [Acyrthosiphon pisum]
Length = 590
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 49/94 (52%), Positives = 60/94 (63%), Gaps = 8/94 (8%)
Query: 250 VIKPVKSPSPPPVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGL 309
VI PV +PPP DE QVWICP C DDGS PMIGCDGCD WYHWVCVG+
Sbjct: 503 VIPPVNE-TPPPAFYFDEA----GNQVWICPMCTKPDDGS-PMIGCDGCDVWYHWVCVGI 556
Query: 310 VAEPETSDWFCPKCSKVDEGSRKEKKKRGRKKKT 343
P+ + WFCP+C + + +++ K KRGR +K
Sbjct: 557 QCPPDCAVWFCPRC--LAKRAQQPKGKRGRPRKN 588
>gi|270015055|gb|EFA11503.1| hypothetical protein TcasGA2_TC014217 [Tribolium castaneum]
Length = 865
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 44/76 (57%), Positives = 50/76 (65%), Gaps = 7/76 (9%)
Query: 249 IVIKPVKSPSPPPVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVG 308
IV P+ P+PP Y D +VWICP CG QDDGS PMIGCDGCDAWYHWVCVG
Sbjct: 770 IVDVPIVPPAPP-----SRYIDADGNEVWICPGCGSQDDGS-PMIGCDGCDAWYHWVCVG 823
Query: 309 LVAEPETS-DWFCPKC 323
+ P+ S DW+C C
Sbjct: 824 IQVPPDDSEDWYCRHC 839
>gi|328700739|ref|XP_001945181.2| PREDICTED: hypothetical protein LOC100167514 [Acyrthosiphon pisum]
Length = 906
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/75 (58%), Positives = 52/75 (69%), Gaps = 7/75 (9%)
Query: 269 DTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWFCPKCS--KV 326
D G+ QVWICP CG QDDGS PM+GCDGCDAWYHWVCVG+ P+ + W+CP C +
Sbjct: 827 DAGN--QVWICPTCGKQDDGS-PMVGCDGCDAWYHWVCVGIQCPPDCAVWYCPTCLLKRA 883
Query: 327 DEGSRK--EKKKRGR 339
+ G EK KRGR
Sbjct: 884 ERGVTAPVEKAKRGR 898
>gi|189233756|ref|XP_001814148.1| PREDICTED: similar to bip2 CG2009-PA [Tribolium castaneum]
Length = 872
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 37/54 (68%), Positives = 41/54 (75%), Gaps = 2/54 (3%)
Query: 271 GDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETS-DWFCPKC 323
D +VWICP CG QDDGS PMIGCDGCDAWYHWVCVG+ P+ S DW+C C
Sbjct: 794 ADGNEVWICPGCGSQDDGS-PMIGCDGCDAWYHWVCVGIQVPPDDSEDWYCRHC 846
>gi|357604267|gb|EHJ64117.1| hypothetical protein KGM_08958 [Danaus plexippus]
Length = 1167
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 40/80 (50%), Positives = 49/80 (61%), Gaps = 6/80 (7%)
Query: 268 YDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC--- 323
Y ++W+CPACG D+GS PMIGCDGCD WYHW+CVG+ +P T DWFC C
Sbjct: 1081 YVDEQGNKIWVCPACGRPDNGS-PMIGCDGCDGWYHWICVGITEDPGATEDWFCKSCVAK 1139
Query: 324 -SKVDEGSRKEKKKRGRKKK 342
+ + KKRGRK K
Sbjct: 1140 RAAMVLAGVTSGKKRGRKPK 1159
>gi|195133998|ref|XP_002011425.1| GI14094 [Drosophila mojavensis]
gi|193912048|gb|EDW10915.1| GI14094 [Drosophila mojavensis]
Length = 1094
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 35/58 (60%), Positives = 41/58 (70%), Gaps = 2/58 (3%)
Query: 267 EYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
Y D ++WICPACG DDGS MIGCDGCDAWYHW+CVG++ P + DWFC C
Sbjct: 1016 SYVDADGYRIWICPACGKVDDGS-AMIGCDGCDAWYHWICVGILVAPNDNEDWFCRVC 1072
>gi|353230188|emb|CCD76359.1| hypothetical protein Smp_127180 [Schistosoma mansoni]
Length = 1690
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 52/84 (61%), Gaps = 10/84 (11%)
Query: 242 KRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAW 301
+RES+ +IV++ A + Y D +Q+W+CP C ++DDG+L MIGCD C W
Sbjct: 1417 QRESASQIVVES---------AGSSYYFNNDGEQIWLCPICLLEDDGNL-MIGCDNCQDW 1466
Query: 302 YHWVCVGLVAEPETSDWFCPKCSK 325
YH C+GL PE WFCPKCS+
Sbjct: 1467 YHSTCLGLSKAPEVPQWFCPKCSQ 1490
>gi|256072183|ref|XP_002572416.1| hypothetical protein [Schistosoma mansoni]
Length = 1282
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 52/84 (61%), Gaps = 10/84 (11%)
Query: 242 KRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAW 301
+RES+ +IV++ A + Y D +Q+W+CP C ++DDG+L MIGCD C W
Sbjct: 926 QRESASQIVVE---------SAGSSYYFNNDGEQIWLCPICLLEDDGNL-MIGCDNCQDW 975
Query: 302 YHWVCVGLVAEPETSDWFCPKCSK 325
YH C+GL PE WFCPKCS+
Sbjct: 976 YHSTCLGLSKAPEVPQWFCPKCSQ 999
>gi|195172552|ref|XP_002027061.1| GL18176 [Drosophila persimilis]
gi|194112839|gb|EDW34882.1| GL18176 [Drosophila persimilis]
Length = 1497
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 39/78 (50%), Positives = 47/78 (60%), Gaps = 6/78 (7%)
Query: 251 IKPVKSPSPPP----VAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVC 306
++P SP PP + Y + ++WICPACG DDGS MIGCDGCDAWYHW C
Sbjct: 1402 VEPTHSPDCPPQIVELNRPSSYVDAEGNRIWICPACGKVDDGSA-MIGCDGCDAWYHWTC 1460
Query: 307 VGLVAEP-ETSDWFCPKC 323
VG+ P + DWFC C
Sbjct: 1461 VGITVAPKDNDDWFCRVC 1478
>gi|115711961|ref|XP_797041.2| PREDICTED: uncharacterized protein LOC592424 [Strongylocentrotus
purpuratus]
Length = 1047
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 32/52 (61%), Positives = 39/52 (75%)
Query: 272 DSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWFCPKC 323
D +++W CPAC DDG LPMIGCD CD WYHW CVG+ EP T+DW+C +C
Sbjct: 982 DGEKIWYCPACKKADDGVLPMIGCDTCDDWYHWECVGITEEPSTNDWYCKRC 1033
>gi|307200668|gb|EFN80771.1| Transcription initiation factor TFIID subunit 3 [Harpegnathos
saltator]
Length = 1052
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 36/51 (70%), Positives = 40/51 (78%), Gaps = 2/51 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
+QVWICPACG QDDGS PMIGCD CDAWYHWVCVG+ P + DW+C C
Sbjct: 979 RQVWICPACGNQDDGS-PMIGCDDCDAWYHWVCVGMQVPPADDEDWYCRFC 1028
>gi|195469351|ref|XP_002099601.1| GE14514 [Drosophila yakuba]
gi|194185702|gb|EDW99313.1| GE14514 [Drosophila yakuba]
Length = 1413
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 34/58 (58%), Positives = 41/58 (70%), Gaps = 2/58 (3%)
Query: 268 YDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKCS 324
Y + ++WICPACG DDGS MIGCDGCDAWYHW+CVG+ P + DWFC C+
Sbjct: 1339 YVDAEGNRIWICPACGKVDDGS-AMIGCDGCDAWYHWICVGITFAPKDNDDWFCRVCA 1395
>gi|24638625|ref|NP_651923.2| bip2 [Drosophila melanogaster]
gi|7304310|gb|AAF59342.1| bip2 [Drosophila melanogaster]
Length = 1406
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 34/57 (59%), Positives = 40/57 (70%), Gaps = 2/57 (3%)
Query: 268 YDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
Y + ++WICPACG DDGS MIGCDGCDAWYHW+CVG+ P + DWFC C
Sbjct: 1332 YVDAEGNRIWICPACGKVDDGSA-MIGCDGCDAWYHWICVGITFAPKDNDDWFCRVC 1387
>gi|198462037|ref|XP_001352319.2| GA15182 [Drosophila pseudoobscura pseudoobscura]
gi|198139921|gb|EAL29265.2| GA15182 [Drosophila pseudoobscura pseudoobscura]
Length = 1497
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 39/78 (50%), Positives = 47/78 (60%), Gaps = 6/78 (7%)
Query: 251 IKPVKSPSPPP----VAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVC 306
++P SP PP + Y + ++WICPACG DDGS MIGCDGCDAWYHW C
Sbjct: 1402 VEPNHSPDCPPQIVELNRPSSYVDAEGNRIWICPACGKVDDGSA-MIGCDGCDAWYHWTC 1460
Query: 307 VGLVAEP-ETSDWFCPKC 323
VG+ P + DWFC C
Sbjct: 1461 VGITVAPKDNDDWFCRVC 1478
>gi|194913567|ref|XP_001982727.1| GG16447 [Drosophila erecta]
gi|190647943|gb|EDV45246.1| GG16447 [Drosophila erecta]
Length = 1418
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 34/58 (58%), Positives = 40/58 (68%), Gaps = 2/58 (3%)
Query: 267 EYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
Y + ++WICPACG DDGS MIGCDGCDAWYHW+CVG+ P + DWFC C
Sbjct: 1343 SYVDAEGNRIWICPACGKVDDGS-AMIGCDGCDAWYHWICVGITFAPKDNDDWFCRVC 1399
>gi|347963346|ref|XP_001687763.2| AGAP000189-PA [Anopheles gambiae str. PEST]
gi|333467243|gb|EDO64354.2| AGAP000189-PA [Anopheles gambiae str. PEST]
Length = 1684
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 35/53 (66%), Positives = 40/53 (75%), Gaps = 2/53 (3%)
Query: 272 DSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETS-DWFCPKC 323
D VWICPACG DDG+ PMIGCDGCDAWYHWVCVG+ P+ + DW+C C
Sbjct: 1607 DGNTVWICPACGRVDDGT-PMIGCDGCDAWYHWVCVGIQVPPDDNEDWYCRVC 1658
>gi|350410088|ref|XP_003488940.1| PREDICTED: hypothetical protein LOC100743667 [Bombus impatiens]
Length = 1002
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 36/57 (63%), Positives = 41/57 (71%), Gaps = 3/57 (5%)
Query: 268 YDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
+D G K VW CPACG QDDGS PM+GCD CDAWYHWVCVG+ P + DW+C C
Sbjct: 924 FDQGGHK-VWFCPACGNQDDGS-PMVGCDDCDAWYHWVCVGMQVPPADNEDWYCRTC 978
>gi|6634096|emb|CAB64265.1| BIP2 protein [Drosophila melanogaster]
Length = 1406
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 33/57 (57%), Positives = 40/57 (70%), Gaps = 2/57 (3%)
Query: 268 YDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
Y + ++WICPACG D+GS MIGCDGCDAWYHW+CVG+ P + DWFC C
Sbjct: 1332 YVDAEGNRIWICPACGKVDEGSA-MIGCDGCDAWYHWICVGITFAPKDNDDWFCRVC 1387
>gi|195553685|ref|XP_002076719.1| GD15475 [Drosophila simulans]
gi|194202298|gb|EDX15874.1| GD15475 [Drosophila simulans]
Length = 683
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 34/58 (58%), Positives = 40/58 (68%), Gaps = 2/58 (3%)
Query: 267 EYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
Y + ++WICPACG DDGS MIGCDGCDAWYHW+CVG+ P + DWFC C
Sbjct: 608 SYVDAEGNRIWICPACGKVDDGS-AMIGCDGCDAWYHWICVGITFAPKDNDDWFCRIC 664
>gi|260816297|ref|XP_002602908.1| hypothetical protein BRAFLDRAFT_98089 [Branchiostoma floridae]
gi|229288221|gb|EEN58920.1| hypothetical protein BRAFLDRAFT_98089 [Branchiostoma floridae]
Length = 1052
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 32/53 (60%), Positives = 38/53 (71%), Gaps = 1/53 (1%)
Query: 272 DSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWFCPKCS 324
+ + +WICPAC DDGS PM+GCD CD WYHW CVG+ EP WFCP+CS
Sbjct: 983 EGETIWICPACSRPDDGS-PMVGCDNCDDWYHWPCVGITEEPTEDKWFCPRCS 1034
>gi|350589635|ref|XP_003357819.2| PREDICTED: transcription initiation factor TFIID subunit 3-like,
partial [Sus scrofa]
Length = 793
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 35/51 (68%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKC 323
Q+WICP C DDGS PMIGCDGCD WYHW CVG+ A PE WFCPKC
Sbjct: 727 NQIWICPGCNKPDDGS-PMIGCDGCDDWYHWPCVGITAAPPEEMQWFCPKC 776
>gi|321467967|gb|EFX78955.1| hypothetical protein DAPPUDRAFT_319993 [Daphnia pulex]
Length = 964
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 39/64 (60%), Positives = 45/64 (70%), Gaps = 3/64 (4%)
Query: 268 YDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKCSKV 326
YD +K +WICPACG QDDG+ PMIGCD CD WYHWVCVG+ P ET WFC +C
Sbjct: 890 YDEFGNK-IWICPACGRQDDGT-PMIGCDECDDWYHWVCVGIRVPPAETESWFCQRCIAK 947
Query: 327 DEGS 330
+GS
Sbjct: 948 KQGS 951
>gi|4584215|emb|CAB40628.1| Bip2 protein [Drosophila melanogaster]
gi|13374081|emb|CAC34474.1| TAFII155 protein [Drosophila melanogaster]
Length = 1408
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 33/57 (57%), Positives = 40/57 (70%), Gaps = 2/57 (3%)
Query: 268 YDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
Y + ++WICPACG D+GS MIGCDGCDAWYHW+CVG+ P + DWFC C
Sbjct: 1334 YVDTEGNRIWICPACGKVDEGSA-MIGCDGCDAWYHWICVGITFAPKDNDDWFCRVC 1389
>gi|157127577|ref|XP_001661098.1| hypothetical protein AaeL_AAEL010862 [Aedes aegypti]
gi|108872884|gb|EAT37109.1| AAEL010862-PA [Aedes aegypti]
Length = 372
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 39/77 (50%), Positives = 49/77 (63%), Gaps = 15/77 (19%)
Query: 271 GDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETS-DWFCPKCSKVDEG 329
D VWICPACG DDG+ PMIGCDGCDAWYHWVCVG+ P+++ DW+C C
Sbjct: 294 ADGNVVWICPACGRVDDGT-PMIGCDGCDAWYHWVCVGIQVPPDSNEDWYCRVC------ 346
Query: 330 SRKEKKKRGRKKKTHQD 346
G+K+++H D
Sbjct: 347 -------IGKKQESHGD 356
>gi|195149688|ref|XP_002015788.1| GL10830 [Drosophila persimilis]
gi|198456467|ref|XP_002138246.1| GA24504 [Drosophila pseudoobscura pseudoobscura]
gi|194109635|gb|EDW31678.1| GL10830 [Drosophila persimilis]
gi|198135625|gb|EDY68804.1| GA24504 [Drosophila pseudoobscura pseudoobscura]
Length = 80
Score = 90.5 bits (223), Expect = 2e-15, Method: Composition-based stats.
Identities = 43/64 (67%), Positives = 55/64 (85%), Gaps = 1/64 (1%)
Query: 360 ELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLRE-GLAPQCEALKNTFFECKRSLL 418
+L D++ACAG RADLK+CLLE+DC +VDRKTPRQCL + + P+C+ L+NTF+ECKRSLL
Sbjct: 9 KLADETACAGVRADLKMCLLESDCCRVDRKTPRQCLMDNNVPPECQVLRNTFYECKRSLL 68
Query: 419 DTRQ 422
D RQ
Sbjct: 69 DNRQ 72
>gi|195380361|ref|XP_002048939.1| GJ21319 [Drosophila virilis]
gi|194143736|gb|EDW60132.1| GJ21319 [Drosophila virilis]
Length = 80
Score = 90.1 bits (222), Expect = 2e-15, Method: Composition-based stats.
Identities = 42/64 (65%), Positives = 56/64 (87%), Gaps = 1/64 (1%)
Query: 360 ELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLRE-GLAPQCEALKNTFFECKRSLL 418
+L D++ACAG RADLK+CLLE+DC ++DRKTPRQCL++ + P+C+ L+NTF+ECKRSLL
Sbjct: 9 KLADETACAGVRADLKMCLLESDCCRLDRKTPRQCLQDNNVPPECQVLRNTFYECKRSLL 68
Query: 419 DTRQ 422
D RQ
Sbjct: 69 DNRQ 72
>gi|198415671|ref|XP_002126955.1| PREDICTED: similar to TAF3 RNA polymerase II, TATA box binding
protein (TBP)-associated factor [Ciona intestinalis]
Length = 850
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 272 DSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
D +VWICP C DDGS PMIGCD CD WYHW CVG++ EP E +WFCP C
Sbjct: 782 DGNRVWICPGCNKPDDGS-PMIGCDKCDDWYHWPCVGIIQEPPENEEWFCPNC 833
>gi|170057999|ref|XP_001864729.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877239|gb|EDS40622.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 417
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 35/53 (66%), Positives = 41/53 (77%), Gaps = 2/53 (3%)
Query: 272 DSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETS-DWFCPKC 323
D VWICPACG DDG+ PMIGCDGCDAWYHWVCVG+ P+++ DW+C C
Sbjct: 340 DGNVVWICPACGRVDDGT-PMIGCDGCDAWYHWVCVGIQVPPDSNEDWYCRVC 391
>gi|17861888|gb|AAL39421.1| GM10839p [Drosophila melanogaster]
Length = 363
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 34/58 (58%), Positives = 40/58 (68%), Gaps = 2/58 (3%)
Query: 267 EYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
Y + ++WICPACG DDGS MIGCDGCDAWYHW+CVG+ P + DWFC C
Sbjct: 288 SYVDAEGNRIWICPACGKVDDGSA-MIGCDGCDAWYHWICVGITFAPKDNDDWFCRVC 344
>gi|432089718|gb|ELK23535.1| Transcription initiation factor TFIID subunit 3 [Myotis davidii]
Length = 777
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 35/53 (66%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCDGCD WYHW CVGL+A PE WFC KC+
Sbjct: 711 NQIWICPGCNKPDDGS-PMIGCDGCDDWYHWPCVGLMAAPPEEMQWFCSKCAN 762
>gi|426240699|ref|XP_004014231.1| PREDICTED: transcription initiation factor TFIID subunit 3 [Ovis
aries]
Length = 930
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 35/53 (66%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKCS
Sbjct: 864 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCSN 915
>gi|345310087|ref|XP_001515948.2| PREDICTED: transcription initiation factor TFIID subunit 3-like
[Ornithorhynchus anatinus]
Length = 1013
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 947 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 998
>gi|74181644|dbj|BAE32543.1| unnamed protein product [Mus musculus]
Length = 932
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 865 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 916
>gi|351698946|gb|EHB01865.1| Transcription initiation factor TFIID subunit 3, partial
[Heterocephalus glaber]
Length = 791
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 725 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 776
>gi|262231748|ref|NP_082024.2| transcription initiation factor TFIID subunit 3 [Mus musculus]
gi|110287981|sp|Q5HZG4.2|TAF3_MOUSE RecName: Full=Transcription initiation factor TFIID subunit 3;
AltName: Full=140 kDa TATA box-binding
protein-associated factor; AltName: Full=TBP-associated
factor 3; AltName: Full=Transcription initiation factor
TFIID 140 kDa subunit; Short=TAF(II)140; Short=TAF140;
Short=TAFII-140; Short=TAFII140
gi|13374178|emb|CAC34476.1| TAFII140 protein [Mus musculus]
gi|148676053|gb|EDL08000.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Mus musculus]
gi|187951735|gb|AAI37618.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Mus musculus]
gi|187957546|gb|AAI37619.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Mus musculus]
Length = 932
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 865 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 916
>gi|57242908|gb|AAH89030.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Mus musculus]
Length = 948
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 881 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 932
>gi|405113028|ref|NP_001258271.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Rattus norvegicus]
gi|149021019|gb|EDL78626.1| rCG55816 [Rattus norvegicus]
Length = 930
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 863 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 914
>gi|73949156|ref|XP_544263.2| PREDICTED: transcription initiation factor TFIID subunit 3 [Canis
lupus familiaris]
Length = 932
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 866 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 917
>gi|355723106|gb|AES07785.1| TAF3 RNA polymerase II, TATA box binding protein -associated
factor, 140kDa [Mustela putorius furo]
Length = 933
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 868 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 919
>gi|417413279|gb|JAA52976.1| Putative transcription initiation factor tfiid subunit 3, partial
[Desmodus rotundus]
Length = 972
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 906 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 957
>gi|151301171|ref|NP_114129.1| transcription initiation factor TFIID subunit 3 [Homo sapiens]
gi|74747393|sp|Q5VWG9.1|TAF3_HUMAN RecName: Full=Transcription initiation factor TFIID subunit 3;
AltName: Full=140 kDa TATA box-binding
protein-associated factor; AltName: Full=TBP-associated
factor 3; AltName: Full=Transcription initiation factor
TFIID 140 kDa subunit; Short=TAF(II)140; Short=TAF140;
Short=TAFII-140; Short=TAFII140
gi|225000128|gb|AAI72366.1| TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 140kDa [synthetic construct]
Length = 929
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLV-AEPETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++ A PE WFCPKC+
Sbjct: 863 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMTAPPEEMQWFCPKCAN 914
>gi|402879598|ref|XP_003903421.1| PREDICTED: transcription initiation factor TFIID subunit 3 [Papio
anubis]
Length = 931
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLV-AEPETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++ A PE WFCPKC+
Sbjct: 865 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMTAPPEEMQWFCPKCAN 916
>gi|332833584|ref|XP_001157768.2| PREDICTED: transcription initiation factor TFIID subunit 3 [Pan
troglodytes]
Length = 931
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLV-AEPETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++ A PE WFCPKC+
Sbjct: 865 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMTAPPEEMQWFCPKCAN 916
>gi|410963187|ref|XP_003988147.1| PREDICTED: transcription initiation factor TFIID subunit 3 [Felis
catus]
Length = 1017
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/54 (62%), Positives = 38/54 (70%), Gaps = 2/54 (3%)
Query: 273 SKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 950 GNQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 1002
>gi|297300462|ref|XP_002808540.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor TFIID
subunit 3-like [Macaca mulatta]
Length = 1116
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 1050 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 1101
>gi|344277699|ref|XP_003410637.1| PREDICTED: transcription initiation factor TFIID subunit 3
[Loxodonta africana]
Length = 930
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 37/53 (69%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG+ A PE WFCPKC+
Sbjct: 864 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGITAAPPEEMQWFCPKCAN 915
>gi|126340354|ref|XP_001363162.1| PREDICTED: transcription initiation factor TFIID subunit 3
[Monodelphis domestica]
Length = 932
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 37/53 (69%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVGL+A PE WFCP C+
Sbjct: 866 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGLMAAPPEEMQWFCPMCAN 917
>gi|395827493|ref|XP_003786936.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
TFIID subunit 3, partial [Otolemur garnettii]
Length = 1004
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 938 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 989
>gi|193671822|ref|XP_001951331.1| PREDICTED: uncharacterized protein C2orf64 homolog [Acyrthosiphon
pisum]
Length = 79
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/67 (62%), Positives = 51/67 (76%), Gaps = 1/67 (1%)
Query: 357 DPSELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLRE-GLAPQCEALKNTFFECKR 415
D EL D CAG RADLK+CLLE+DC +++RK P+ CL+E P+C AL+NTFFECKR
Sbjct: 5 DEGELTDHRPCAGVRADLKMCLLESDCCKIERKLPKDCLKEKNCPPECLALQNTFFECKR 64
Query: 416 SLLDTRQ 422
SLLD RQ
Sbjct: 65 SLLDNRQ 71
>gi|432861263|ref|XP_004069581.1| PREDICTED: transcription initiation factor TFIID subunit 3-like
isoform 1 [Oryzias latipes]
Length = 933
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/53 (67%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 273 SKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCS 324
Q+WICP C DDGS PMIGCD CD WYHW CVG+VAE PE WFC KCS
Sbjct: 866 GNQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIVAEPPEDQPWFCVKCS 917
>gi|359071307|ref|XP_003586802.1| PREDICTED: transcription initiation factor TFIID subunit 3 [Bos
taurus]
gi|440906269|gb|ELR56551.1| Transcription initiation factor TFIID subunit 3 [Bos grunniens
mutus]
Length = 932
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/51 (66%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKC 323
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC
Sbjct: 866 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKC 915
>gi|195450727|ref|XP_002072606.1| GK13694 [Drosophila willistoni]
gi|194168691|gb|EDW83592.1| GK13694 [Drosophila willistoni]
Length = 1430
Score = 87.4 bits (215), Expect = 1e-14, Method: Composition-based stats.
Identities = 42/83 (50%), Positives = 50/83 (60%), Gaps = 6/83 (7%)
Query: 246 SPKIVIKPVKSPSPPPVAA----ADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAW 301
+P + P KS PPP Y + ++WICPACG DDGS MIGCDGCDAW
Sbjct: 1330 APTATVSPAKSECPPPQTVESNRPSSYVDAEGNRIWICPACGKVDDGS-AMIGCDGCDAW 1388
Query: 302 YHWVCVGL-VAEPETSDWFCPKC 323
YHW+CVG+ VA + DWFC C
Sbjct: 1389 YHWICVGIQVAPKDNDDWFCRVC 1411
>gi|348575371|ref|XP_003473463.1| PREDICTED: transcription initiation factor TFIID subunit 3 [Cavia
porcellus]
Length = 928
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/52 (65%), Positives = 38/52 (73%), Gaps = 2/52 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCS 324
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 862 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGVMAAPPEEMQWFCPKCA 912
>gi|312379722|gb|EFR25909.1| hypothetical protein AND_08343 [Anopheles darlingi]
Length = 164
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/64 (62%), Positives = 44/64 (68%), Gaps = 3/64 (4%)
Query: 261 PVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGL-VAEPETSDWF 319
P A A D D VWICPACG DDG+ PMIGCDGCDAWYHW+CVG+ VA E DW+
Sbjct: 77 PAAPAQMTDV-DGNTVWICPACGRVDDGT-PMIGCDGCDAWYHWICVGIEVAPDENEDWY 134
Query: 320 CPKC 323
C C
Sbjct: 135 CRVC 138
>gi|432861265|ref|XP_004069582.1| PREDICTED: transcription initiation factor TFIID subunit 3-like
isoform 2 [Oryzias latipes]
Length = 930
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/53 (67%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 273 SKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCS 324
Q+WICP C DDGS PMIGCD CD WYHW CVG+VAE PE WFC KCS
Sbjct: 863 GNQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIVAEPPEDQPWFCVKCS 914
>gi|74193687|dbj|BAE22792.1| unnamed protein product [Mus musculus]
Length = 554
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/54 (62%), Positives = 38/54 (70%), Gaps = 2/54 (3%)
Query: 273 SKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 486 GNQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 538
>gi|441626275|ref|XP_004089137.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor TFIID
subunit 3 [Nomascus leucogenys]
Length = 1057
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLV-AEPETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++ A PE WFCPKC+
Sbjct: 991 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMTAPPEEMQWFCPKCAN 1042
>gi|291401982|ref|XP_002717391.1| PREDICTED: RNA polymerase II transcription factor TAFII140
[Oryctolagus cuniculus]
Length = 930
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 33/53 (62%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
++WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 864 NKIWICPGCNKPDDGS-PMIGCDDCDGWYHWPCVGVMAAPPEEMQWFCPKCAN 915
>gi|118793122|ref|XP_320694.3| AGAP011820-PA [Anopheles gambiae str. PEST]
gi|116117227|gb|EAA00792.3| AGAP011820-PA [Anopheles gambiae str. PEST]
Length = 82
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/66 (63%), Positives = 53/66 (80%), Gaps = 3/66 (4%)
Query: 360 ELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLR--EGLAP-QCEALKNTFFECKRS 416
EL DKSACAG RADLK+CLL++DC + ++KTPR+CL +G P +C L+NTFFECKRS
Sbjct: 9 ELADKSACAGVRADLKMCLLQSDCCKKEKKTPRECLNRTDGSVPSECFVLRNTFFECKRS 68
Query: 417 LLDTRQ 422
+LD RQ
Sbjct: 69 MLDNRQ 74
>gi|194754419|ref|XP_001959492.1| GF12904 [Drosophila ananassae]
gi|190620790|gb|EDV36314.1| GF12904 [Drosophila ananassae]
Length = 80
Score = 86.7 bits (213), Expect = 2e-14, Method: Composition-based stats.
Identities = 39/63 (61%), Positives = 52/63 (82%), Gaps = 1/63 (1%)
Query: 361 LVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLR-EGLAPQCEALKNTFFECKRSLLD 419
L D++ACAG RADLK+CLLE+DC ++ + TPRQCL+ + P+C+ L+NTF+ECKRSLLD
Sbjct: 10 LADETACAGVRADLKMCLLESDCCRIAKNTPRQCLQANNVPPECQVLRNTFYECKRSLLD 69
Query: 420 TRQ 422
RQ
Sbjct: 70 NRQ 72
>gi|291225073|ref|XP_002732526.1| PREDICTED: LRRGT00199-like [Saccoglossus kowalevskii]
Length = 1109
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 29/51 (56%), Positives = 38/51 (74%), Gaps = 1/51 (1%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWFCPKCS 324
+++WICP C + DDGS PMIGCD CD WYHW CVG+ EP +W+C +C+
Sbjct: 1040 QKIWICPQCKLPDDGS-PMIGCDTCDDWYHWPCVGITEEPSADEWYCKRCT 1089
>gi|392928200|ref|NP_001257272.1| Protein TAF-3, isoform a [Caenorhabditis elegans]
gi|320202826|emb|CBZ01777.1| Protein TAF-3, isoform a [Caenorhabditis elegans]
Length = 1007
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 56/101 (55%), Gaps = 14/101 (13%)
Query: 245 SSPKIVIKPVKSPSPPPVAAADEYDTGD--SKQVWICPACGVQDDGSLPMIGCDGCDAWY 302
+S ++I+P+ ++ D+ G S+++WICP C V MIGCD C W+
Sbjct: 903 ASTPVIIRPL-------LSQEDDDSNGSESSEEIWICPVCSVAYTVGANMIGCDQCQDWF 955
Query: 303 HWVCVGLVAEPETSDWFCPKCSKVDEGSRKEKKKRGRKKKT 343
HW CVGL AEP S WFC +C+K ++ KK G++ T
Sbjct: 956 HWHCVGLTAEPTDSKWFCTRCTKGNKS-----KKHGKRSAT 991
>gi|341902012|gb|EGT57947.1| hypothetical protein CAEBREN_18774 [Caenorhabditis brenneri]
Length = 998
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 47/81 (58%), Gaps = 2/81 (2%)
Query: 260 PPVAAADEYDTGD--SKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSD 317
P ++ DE G S+++WICP C V M+GCD C W+HW CVG+ AEP S
Sbjct: 905 PLLSQEDEDSNGSDSSEEIWICPVCSVAYTVGANMVGCDMCQDWFHWHCVGITAEPTDSK 964
Query: 318 WFCPKCSKVDEGSRKEKKKRG 338
WFC +CSK ++ + K+ G
Sbjct: 965 WFCNRCSKGNKSKKHGKRSHG 985
>gi|354465004|ref|XP_003494970.1| PREDICTED: transcription initiation factor TFIID subunit 3
[Cricetulus griseus]
gi|344238969|gb|EGV95072.1| Transcription initiation factor TFIID subunit 3 [Cricetulus
griseus]
Length = 930
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 33/48 (68%), Positives = 35/48 (72%), Gaps = 2/48 (4%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFC 320
Q+WICP C DDGS PMIGCDGCD WYHW CVGL+A PE WFC
Sbjct: 864 NQIWICPGCNKPDDGS-PMIGCDGCDDWYHWPCVGLMAAPPEEMQWFC 910
>gi|348512811|ref|XP_003443936.1| PREDICTED: transcription initiation factor TFIID subunit 3-like
[Oreochromis niloticus]
Length = 943
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 33/49 (67%), Positives = 36/49 (73%), Gaps = 2/49 (4%)
Query: 273 SKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLV-AEPETSDWFC 320
Q+WICP C DDGS PMIGCDGCD WYHW+CVG+V A PE WFC
Sbjct: 876 GNQIWICPGCNKPDDGS-PMIGCDGCDDWYHWLCVGVVTAPPEDQQWFC 923
>gi|427786637|gb|JAA58770.1| Putative transcription initiation factor tfiid subunit 3
[Rhipicephalus pulchellus]
Length = 1137
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 33/58 (56%), Positives = 40/58 (68%), Gaps = 2/58 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKCSKVDEGS 330
++WICPAC DDGS PMIGCD CD WYHWVCVG+V P E W+C +C +G+
Sbjct: 1068 NKIWICPACARPDDGS-PMIGCDECDDWYHWVCVGIVVPPKEEESWYCNRCIAKRQGA 1124
>gi|449480695|ref|XP_004176569.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
TFIID subunit 3, partial [Taeniopygia guttata]
Length = 917
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 36/53 (67%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVGL A PE WFC KC+
Sbjct: 851 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGLTAAPPEEMQWFCSKCAN 902
>gi|449272885|gb|EMC82588.1| Transcription initiation factor TFIID subunit 3, partial [Columba
livia]
Length = 872
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 33/53 (62%), Positives = 36/53 (67%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGL-VAEPETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG+ A PE WFC KC+
Sbjct: 806 NQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGITTAPPEEVQWFCSKCAN 857
>gi|195064387|ref|XP_001996560.1| GH23929 [Drosophila grimshawi]
gi|193892106|gb|EDV90972.1| GH23929 [Drosophila grimshawi]
Length = 1591
Score = 84.7 bits (208), Expect = 8e-14, Method: Composition-based stats.
Identities = 39/75 (52%), Positives = 47/75 (62%), Gaps = 2/75 (2%)
Query: 257 PSPPPVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGL-VAEPET 315
P+ P + Y + ++WICPACG DDGS MIGCDGCDAWYHW CVG+ VA +
Sbjct: 1506 PTTPELNRPSSYVDAEGYRIWICPACGKVDDGS-AMIGCDGCDAWYHWTCVGIHVAPNDN 1564
Query: 316 SDWFCPKCSKVDEGS 330
DWFC C +GS
Sbjct: 1565 EDWFCRVCITKKKGS 1579
>gi|195124363|ref|XP_002006663.1| GI18462 [Drosophila mojavensis]
gi|193911731|gb|EDW10598.1| GI18462 [Drosophila mojavensis]
Length = 80
Score = 84.7 bits (208), Expect = 9e-14, Method: Composition-based stats.
Identities = 40/64 (62%), Positives = 56/64 (87%), Gaps = 1/64 (1%)
Query: 360 ELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLRE-GLAPQCEALKNTFFECKRSLL 418
+L D++ACAG RADLK+CLLE++C ++D+KTPRQCL++ + P+C+ L+NTF+ECKRSLL
Sbjct: 9 KLADETACAGVRADLKMCLLESECCRLDKKTPRQCLQDNSVPPECQVLRNTFYECKRSLL 68
Query: 419 DTRQ 422
D RQ
Sbjct: 69 DNRQ 72
>gi|195436236|ref|XP_002066075.1| GK22128 [Drosophila willistoni]
gi|194162160|gb|EDW77061.1| GK22128 [Drosophila willistoni]
Length = 80
Score = 84.3 bits (207), Expect = 1e-13, Method: Composition-based stats.
Identities = 40/64 (62%), Positives = 56/64 (87%), Gaps = 1/64 (1%)
Query: 360 ELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLRE-GLAPQCEALKNTFFECKRSLL 418
+L D++ACAG RADLK+CLLE++C ++D+KTPRQCL++ + P+C+ L+NTF+ECKRSLL
Sbjct: 9 KLADETACAGVRADLKMCLLESECCRLDKKTPRQCLQDNSVPPECQILRNTFYECKRSLL 68
Query: 419 DTRQ 422
D RQ
Sbjct: 69 DNRQ 72
>gi|195058006|ref|XP_001995367.1| GH23123 [Drosophila grimshawi]
gi|193899573|gb|EDV98439.1| GH23123 [Drosophila grimshawi]
Length = 80
Score = 84.3 bits (207), Expect = 1e-13, Method: Composition-based stats.
Identities = 39/64 (60%), Positives = 55/64 (85%), Gaps = 1/64 (1%)
Query: 360 ELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLRE-GLAPQCEALKNTFFECKRSLL 418
+L D ++CAG RADLK+CLLE++C ++D+KTPRQCL++ + P+C+ L+NTF+ECKRSLL
Sbjct: 9 KLADVTSCAGVRADLKMCLLESECCRLDKKTPRQCLQDNSVPPECQVLRNTFYECKRSLL 68
Query: 419 DTRQ 422
D RQ
Sbjct: 69 DNRQ 72
>gi|357614210|gb|EHJ68964.1| hypothetical protein KGM_12107 [Danaus plexippus]
Length = 81
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 42/70 (60%), Positives = 52/70 (74%), Gaps = 1/70 (1%)
Query: 354 FERDPSELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLREGLAP-QCEALKNTFFE 412
F D LVD S CAG RADLKLCLL +DC + +KTPR+CL++GL P +C L+ +FFE
Sbjct: 4 FGPDEVGLVDTSPCAGIRADLKLCLLNSDCCKEHKKTPRECLKDGLVPNECLQLRQSFFE 63
Query: 413 CKRSLLDTRQ 422
CKRSLLD R+
Sbjct: 64 CKRSLLDNRR 73
>gi|148227854|ref|NP_001089280.1| transcription initiation factor TFIID subunit 3 [Xenopus laevis]
gi|82178917|sp|Q5EAW9.1|TAF3_XENLA RecName: Full=Transcription initiation factor TFIID subunit 3;
AltName: Full=TBP-associated factor 3
gi|58702034|gb|AAH90215.1| MGC85078 protein [Xenopus laevis]
Length = 845
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 33/51 (64%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGL-VAEPETSDWFCPKC 323
Q+WICP C DDGS PMIGCD CD WYHW CVG+ A PE WFC KC
Sbjct: 779 NQIWICPGCNKPDDGS-PMIGCDQCDDWYHWPCVGINAAPPEDEQWFCTKC 828
>gi|301604546|ref|XP_002931902.1| PREDICTED: transcription initiation factor TFIID subunit 3 isoform
2 [Xenopus (Silurana) tropicalis]
Length = 930
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 33/51 (64%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGL-VAEPETSDWFCPKC 323
Q+WICP C DDGS PMIGCD CD WYHW CVG+ A PE WFC KC
Sbjct: 864 NQIWICPGCNKADDGS-PMIGCDLCDDWYHWPCVGINAAPPEDEQWFCTKC 913
>gi|301604544|ref|XP_002931901.1| PREDICTED: transcription initiation factor TFIID subunit 3 isoform
1 [Xenopus (Silurana) tropicalis]
Length = 928
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 33/51 (64%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGL-VAEPETSDWFCPKC 323
Q+WICP C DDGS PMIGCD CD WYHW CVG+ A PE WFC KC
Sbjct: 862 NQIWICPGCNKADDGS-PMIGCDLCDDWYHWPCVGINAAPPEDEQWFCTKC 911
>gi|308482241|ref|XP_003103324.1| hypothetical protein CRE_27660 [Caenorhabditis remanei]
gi|308260114|gb|EFP04067.1| hypothetical protein CRE_27660 [Caenorhabditis remanei]
Length = 990
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 45/73 (61%), Gaps = 3/73 (4%)
Query: 256 SPSPPPVAAADEYDTG---DSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE 312
+P P P+ + ++ D+ S+++W+CP C V M+GCD C W+HW CVG+ AE
Sbjct: 891 APPPRPLISQEDGDSDGSESSEEIWVCPVCSVAYTVGANMVGCDQCQDWFHWHCVGITAE 950
Query: 313 PETSDWFCPKCSK 325
P S WFC +CSK
Sbjct: 951 PTDSKWFCNRCSK 963
>gi|332374160|gb|AEE62221.1| unknown [Dendroctonus ponderosae]
Length = 83
Score = 83.6 bits (205), Expect = 2e-13, Method: Composition-based stats.
Identities = 44/73 (60%), Positives = 55/73 (75%), Gaps = 3/73 (4%)
Query: 353 RFERDPSELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQC--LREGLAP-QCEALKNT 409
R+E + L DKSACAG RADLK+CLLE+DC + +KTPR+C L +G P +C L+NT
Sbjct: 3 RYENENETLKDKSACAGVRADLKMCLLESDCCRSRQKTPRECLNLNDGTVPSECYVLRNT 62
Query: 410 FFECKRSLLDTRQ 422
FFECKRS+LD RQ
Sbjct: 63 FFECKRSMLDARQ 75
>gi|391345558|ref|XP_003747052.1| PREDICTED: uncharacterized protein LOC100897414 [Metaseiulus
occidentalis]
Length = 782
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 34/56 (60%), Positives = 40/56 (71%), Gaps = 4/56 (7%)
Query: 272 DSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSD--WFCPKCSK 325
D ++WICPAC DDGS PMIGCD CD WYHW CVG + EP + D WFC +C+K
Sbjct: 716 DGNKIWICPACAKPDDGS-PMIGCDQCDDWYHWECVG-IDEPPSEDVQWFCTRCTK 769
>gi|328793512|ref|XP_001121700.2| PREDICTED: transcription initiation factor TFIID subunit 3-like
[Apis mellifera]
Length = 131
Score = 83.2 bits (204), Expect = 2e-13, Method: Composition-based stats.
Identities = 37/64 (57%), Positives = 43/64 (67%), Gaps = 3/64 (4%)
Query: 261 PVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWF 319
P + D G QVWICPACG QDDGS PM+GCD CDAWYHWVCVG+ P + +W+
Sbjct: 46 PPTSTTTVDQG-GHQVWICPACGNQDDGS-PMVGCDDCDAWYHWVCVGMQVPPADNENWY 103
Query: 320 CPKC 323
C C
Sbjct: 104 CRYC 107
>gi|392928202|ref|NP_001257273.1| Protein TAF-3, isoform b [Caenorhabditis elegans]
gi|320202827|emb|CBZ01778.1| Protein TAF-3, isoform b [Caenorhabditis elegans]
Length = 348
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 56/101 (55%), Gaps = 14/101 (13%)
Query: 245 SSPKIVIKPVKSPSPPPVAAADEYDTGD--SKQVWICPACGVQDDGSLPMIGCDGCDAWY 302
+S ++I+P+ ++ D+ G S+++WICP C V MIGCD C W+
Sbjct: 244 ASTPVIIRPL-------LSQEDDDSNGSESSEEIWICPVCSVAYTVGANMIGCDQCQDWF 296
Query: 303 HWVCVGLVAEPETSDWFCPKCSKVDEGSRKEKKKRGRKKKT 343
HW CVGL AEP S WFC +C+K ++ KK G++ T
Sbjct: 297 HWHCVGLTAEPTDSKWFCTRCTKGNKS-----KKHGKRSAT 332
>gi|112363124|ref|NP_001036209.1| transcription initiation factor TFIID subunit 3 [Danio rerio]
Length = 898
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 37/53 (69%), Gaps = 2/53 (3%)
Query: 273 SKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCS 324
Q+WICP C DDGS PMIGCD CD WYHW CVGL+A PE WFC KC+
Sbjct: 831 GNQIWICPGCNKADDGS-PMIGCDECDDWYHWPCVGLLAAPPEDQSWFCIKCA 882
>gi|405961203|gb|EKC27039.1| Transcription initiation factor TFIID subunit 3 [Crassostrea gigas]
Length = 687
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 36/57 (63%), Positives = 39/57 (68%), Gaps = 3/57 (5%)
Query: 268 YDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
YD K VWICPAC + DDGS PMIGCD CD WYHW CVG+ EP E W+C KC
Sbjct: 605 YDESGQK-VWICPACKMPDDGS-PMIGCDICDDWYHWPCVGVKEEPAEEESWYCSKC 659
>gi|194768316|ref|XP_001966258.1| GF22835 [Drosophila ananassae]
gi|190618560|gb|EDV34084.1| GF22835 [Drosophila ananassae]
Length = 236
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 34/63 (53%), Positives = 41/63 (65%), Gaps = 2/63 (3%)
Query: 261 PVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWFC 320
P+ Y + ++WICPACG DDGS MIGCDGCDAWYHW CVG+ P+ DW+C
Sbjct: 157 PINRPSSYVDAEGNRIWICPACGKVDDGS-AMIGCDGCDAWYHWTCVGIEIAPK-DDWYC 214
Query: 321 PKC 323
C
Sbjct: 215 RVC 217
>gi|71895059|ref|NP_001026012.1| transcription initiation factor TFIID subunit 3 [Gallus gallus]
gi|82075449|sp|Q5F489.1|TAF3_CHICK RecName: Full=Transcription initiation factor TFIID subunit 3;
AltName: Full=TBP-associated factor 3
gi|60098429|emb|CAH65045.1| hypothetical protein RCJMB04_2c9 [Gallus gallus]
Length = 930
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 32/53 (60%), Positives = 35/53 (66%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+W CP C DDGS PMIGCD CD WYHW CVG+ A PE WFC KC+
Sbjct: 864 NQIWFCPGCNKPDDGS-PMIGCDDCDDWYHWPCVGITAAPPEEMQWFCSKCAN 915
>gi|395840595|ref|XP_003793140.1| PREDICTED: transcription initiation factor TFIID subunit 3-like
[Otolemur garnettii]
Length = 904
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 32/51 (62%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 275 QVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCS 324
Q+WICP C DDGS PMIGCD C WYHW CVG+ A PE WFC KC+
Sbjct: 834 QIWICPGCNKPDDGS-PMIGCDNCSDWYHWPCVGITAAPPEEVQWFCLKCA 883
>gi|157167842|ref|XP_001662427.1| PET191 polypeptide, putative [Aedes aegypti]
gi|108871285|gb|EAT35510.1| AAEL012327-PA [Aedes aegypti]
Length = 82
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 41/65 (63%), Positives = 52/65 (80%), Gaps = 3/65 (4%)
Query: 361 LVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCL--REGLAP-QCEALKNTFFECKRSL 417
L D+SACA RADLK+CLL++DC + +RK PR+CL ++G P +C AL+NTFFECKRSL
Sbjct: 10 LADESACARVRADLKMCLLQSDCCKKERKLPRECLSRQDGSVPDECFALRNTFFECKRSL 69
Query: 418 LDTRQ 422
LD RQ
Sbjct: 70 LDNRQ 74
>gi|26351163|dbj|BAC39218.1| unnamed protein product [Mus musculus]
Length = 191
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 34/54 (62%), Positives = 38/54 (70%), Gaps = 2/54 (3%)
Query: 273 SKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 123 GNQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 175
>gi|405953382|gb|EKC21056.1| Serine/threonine-protein kinase TBK1 [Crassostrea gigas]
Length = 1721
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 36/57 (63%), Positives = 39/57 (68%), Gaps = 3/57 (5%)
Query: 268 YDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
YD K VWICPAC + DDGS PMIGCD CD WYHW CVG+ EP E W+C KC
Sbjct: 1023 YDESGQK-VWICPACKMPDDGS-PMIGCDICDDWYHWPCVGVKEEPAEEESWYCSKC 1077
>gi|195402233|ref|XP_002059711.1| GJ14147 [Drosophila virilis]
gi|194155925|gb|EDW71109.1| GJ14147 [Drosophila virilis]
Length = 1576
Score = 81.3 bits (199), Expect = 1e-12, Method: Composition-based stats.
Identities = 39/76 (51%), Positives = 47/76 (61%), Gaps = 5/76 (6%)
Query: 252 KPVKSPSPPP---VAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVG 308
+P S + PP + Y D ++WICPACG DDGS MIGCDGCDAWYHW CVG
Sbjct: 1483 QPEGSQTLPPNLELNRPSSYVDADGYRIWICPACGKVDDGS-AMIGCDGCDAWYHWTCVG 1541
Query: 309 LVAEP-ETSDWFCPKC 323
++ P + DWFC C
Sbjct: 1542 ILVAPNDNEDWFCRVC 1557
>gi|45550888|ref|NP_652380.2| CG13018 [Drosophila melanogaster]
gi|45445552|gb|AAF58293.2| CG13018 [Drosophila melanogaster]
gi|66772563|gb|AAY55593.1| IP03340p [Drosophila melanogaster]
gi|220959972|gb|ACL92529.1| CG13018-PA [synthetic construct]
Length = 79
Score = 80.9 bits (198), Expect = 1e-12, Method: Composition-based stats.
Identities = 40/63 (63%), Positives = 52/63 (82%), Gaps = 2/63 (3%)
Query: 361 LVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLREGLAP-QCEALKNTFFECKRSLLD 419
L D++ACAG RADLK+CLLE+DC ++ KTPRQCL++ P +C+ L+NTF+ECKRSLLD
Sbjct: 10 LADETACAGVRADLKMCLLESDCCKMG-KTPRQCLQDNNVPAECQVLRNTFYECKRSLLD 68
Query: 420 TRQ 422
RQ
Sbjct: 69 NRQ 71
>gi|410908439|ref|XP_003967698.1| PREDICTED: transcription initiation factor TFIID subunit 3-like
[Takifugu rubripes]
Length = 930
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 33/53 (62%), Positives = 35/53 (66%), Gaps = 2/53 (3%)
Query: 273 SKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGL-VAEPETSDWFCPKCS 324
Q+WICP C DDGS PMIGCD CD WYHW CV + A PE WFC KCS
Sbjct: 863 GNQIWICPGCNKPDDGS-PMIGCDDCDDWYHWACVRIDTAPPEDQQWFCIKCS 914
>gi|269146714|gb|ACZ28303.1| cytochrome c oxidase assembly protein PET191 [Simulium nigrimanum]
Length = 81
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/69 (59%), Positives = 50/69 (72%), Gaps = 3/69 (4%)
Query: 357 DPSELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLR--EGLAPQ-CEALKNTFFEC 413
D L DKS C+G RADLK+CLLE+DC + D+K PR+CL +G P+ C L+ TFFEC
Sbjct: 5 DGEVLADKSTCSGVRADLKMCLLESDCCKRDKKLPRECLNTFDGSVPENCLVLRKTFFEC 64
Query: 414 KRSLLDTRQ 422
KRSLLD RQ
Sbjct: 65 KRSLLDNRQ 73
>gi|380795909|gb|AFE69830.1| transcription initiation factor TFIID subunit 3, partial [Macaca
mulatta]
Length = 131
Score = 80.5 bits (197), Expect = 1e-12, Method: Composition-based stats.
Identities = 39/81 (48%), Positives = 49/81 (60%), Gaps = 6/81 (7%)
Query: 245 SSPKIVIKPVKSPSPPPVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHW 304
+S K ++ V + + DE+ Q+WICP C DDGS PMIGCD CD WYHW
Sbjct: 40 ASAKAPVRSVVTETVSTYVIRDEW----GNQIWICPGCNKPDDGS-PMIGCDDCDDWYHW 94
Query: 305 VCVGLVAE-PETSDWFCPKCS 324
CVG++A PE WFCPKC+
Sbjct: 95 PCVGIMAAPPEEMQWFCPKCA 115
>gi|194883182|ref|XP_001975682.1| GG22444 [Drosophila erecta]
gi|195334244|ref|XP_002033794.1| GM20230 [Drosophila sechellia]
gi|195485857|ref|XP_002091262.1| GE12335 [Drosophila yakuba]
gi|190658869|gb|EDV56082.1| GG22444 [Drosophila erecta]
gi|194125764|gb|EDW47807.1| GM20230 [Drosophila sechellia]
gi|194177363|gb|EDW90974.1| GE12335 [Drosophila yakuba]
Length = 79
Score = 80.5 bits (197), Expect = 1e-12, Method: Composition-based stats.
Identities = 40/63 (63%), Positives = 52/63 (82%), Gaps = 2/63 (3%)
Query: 361 LVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLREGLAP-QCEALKNTFFECKRSLLD 419
L D++ACAG RADLK+CLLE+DC ++ KTPRQCL++ P +C+ L+NTF+ECKRSLLD
Sbjct: 10 LADETACAGVRADLKMCLLESDCCKMG-KTPRQCLQDNNVPSECQVLRNTFYECKRSLLD 68
Query: 420 TRQ 422
RQ
Sbjct: 69 NRQ 71
>gi|263359642|gb|ACY70478.1| hypothetical protein DVIR88_6g0015 [Drosophila virilis]
Length = 1576
Score = 80.1 bits (196), Expect = 2e-12, Method: Composition-based stats.
Identities = 35/57 (61%), Positives = 40/57 (70%), Gaps = 2/57 (3%)
Query: 268 YDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKC 323
Y D ++WICPACG DDGS MIGCDGCDAWYHW CVG++ P + DWFC C
Sbjct: 1502 YVDADGYRIWICPACGKVDDGS-AMIGCDGCDAWYHWTCVGILVAPNDNEDWFCRVC 1557
>gi|312073729|ref|XP_003139651.1| hypothetical protein LOAG_04066 [Loa loa]
Length = 1397
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 124/247 (50%), Gaps = 42/247 (17%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK-- 159
G+ +K KS+ KD++ K EK+ +K ++++ ++ K+KEK E+ EEK K
Sbjct: 1149 GQRRKRKSVDKDEERGKGAEKD-VKNTRERR----CLSESKEKEKHHSERTVVEEKVKGR 1203
Query: 160 -KEEKEKKEEKEFKMKEDKE-------KPVEKIRKEE-----------KDSEKEKKSKDK 200
KE K+K++ K+ DKE K +EK+ E+ K+S K + ++D
Sbjct: 1204 NKELKDKEKRKKHSGGRDKEHSIGKSIKEIEKLSGEDESFGSLSNVRTKNSGKNEVTRD- 1262
Query: 201 DKKLKKEKLKKKKKEK-ERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSP 259
L L +K ER + KE +L+ + D + S+ ++K + S
Sbjct: 1263 ---LNNSHLPYDSVQKPERVMYAKE---QLSKRCSADSDIFESGSSTADSLVKSLDKKS- 1315
Query: 260 PPVAAADEYDTGDSKQVWICPACGVQD-DGSLPMIGCDGCDAWYHWVCVG-LVAEPETSD 317
A D D+ D+ +WICP C V +G+ M+GCD CD W+HW CVG LVA P+ +
Sbjct: 1316 ---ATLDSDDSLDT--MWICPECSVAYVEGATDMVGCDACDNWFHWSCVGLLVAPPDDAP 1370
Query: 318 WFCPKCS 324
W+C C+
Sbjct: 1371 WYCQNCA 1377
>gi|308487030|ref|XP_003105711.1| hypothetical protein CRE_17917 [Caenorhabditis remanei]
gi|308255167|gb|EFO99119.1| hypothetical protein CRE_17917 [Caenorhabditis remanei]
Length = 570
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 35/55 (63%), Gaps = 2/55 (3%)
Query: 269 DTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWFCPKC 323
DTG +VW CP C V M+GCD C+ W+HW CVGL AEP S W+CP+C
Sbjct: 489 DTGG--KVWYCPVCSVVYANDKYMVGCDKCEHWFHWDCVGLTAEPSDSKWYCPRC 541
>gi|443723916|gb|ELU12135.1| hypothetical protein CAPTEDRAFT_175820 [Capitella teleta]
Length = 253
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 31/55 (56%), Positives = 38/55 (69%), Gaps = 2/55 (3%)
Query: 272 DSKQV-WICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWFCPKCSK 325
+S Q+ WICP C DDGS PMIGCD CD WYHW CVG+ EP W+C +C++
Sbjct: 163 ESGQITWICPTCKKADDGS-PMIGCDNCDDWYHWFCVGITREPSDEQWYCVRCTR 216
>gi|327271987|ref|XP_003220768.1| PREDICTED: transcription initiation factor TFIID subunit 3-like
isoform 1 [Anolis carolinensis]
Length = 937
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 30/52 (57%), Positives = 36/52 (69%), Gaps = 2/52 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGL-VAEPETSDWFCPKCS 324
++WICP C DDGS PMIGCD CD WYHW CVG+ A P+ WFC +C+
Sbjct: 871 NKIWICPGCEKPDDGS-PMIGCDDCDDWYHWPCVGIKAAPPQEVQWFCSRCA 921
>gi|327271989|ref|XP_003220769.1| PREDICTED: transcription initiation factor TFIID subunit 3-like
isoform 2 [Anolis carolinensis]
Length = 924
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 30/52 (57%), Positives = 36/52 (69%), Gaps = 2/52 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGL-VAEPETSDWFCPKCS 324
++WICP C DDGS PMIGCD CD WYHW CVG+ A P+ WFC +C+
Sbjct: 858 NKIWICPGCEKPDDGS-PMIGCDDCDDWYHWPCVGIKAAPPQEVQWFCSRCA 908
>gi|307189532|gb|EFN73909.1| Uncharacterized protein C2orf64-like protein [Camponotus
floridanus]
Length = 83
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 38/73 (52%), Positives = 53/73 (72%), Gaps = 3/73 (4%)
Query: 353 RFERDPSELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLR--EGLAP-QCEALKNT 409
RFE + L DKS CA RADLK+CLL++DC ++ + TPR+CLR +G P +C AL+N
Sbjct: 3 RFEEEGETLKDKSRCANIRADLKMCLLQSDCCKIYKHTPRECLRIQDGTVPDECFALRNI 62
Query: 410 FFECKRSLLDTRQ 422
FF+CK S++D R+
Sbjct: 63 FFDCKHSIIDGRR 75
>gi|392928204|ref|NP_001257274.1| Protein TAF-3, isoform c [Caenorhabditis elegans]
gi|320202828|emb|CBZ01779.1| Protein TAF-3, isoform c [Caenorhabditis elegans]
Length = 209
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 54/99 (54%), Gaps = 10/99 (10%)
Query: 245 SSPKIVIKPVKSPSPPPVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHW 304
+S ++I+P+ S D + S+++WICP C V MIGCD C W+HW
Sbjct: 105 ASTPVIIRPLLSQEDD-----DSNGSESSEEIWICPVCSVAYTVGANMIGCDQCQDWFHW 159
Query: 305 VCVGLVAEPETSDWFCPKCSKVDEGSRKEKKKRGRKKKT 343
CVGL AEP S WFC +C+K + + KK G++ T
Sbjct: 160 HCVGLTAEPTDSKWFCTRCTKGN-----KSKKHGKRSAT 193
>gi|431917649|gb|ELK16914.1| Transcription initiation factor TFIID subunit 3 [Pteropus alecto]
Length = 207
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/53 (64%), Positives = 37/53 (69%), Gaps = 2/53 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLV-AEPETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVGL+ A PE WFC KC+
Sbjct: 141 NQIWICPGCDKPDDGS-PMIGCDDCDDWYHWSCVGLMTAPPEEMQWFCNKCAN 192
>gi|393908774|gb|EJD75204.1| hypothetical protein LOAG_17614 [Loa loa]
Length = 413
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 238 EEKTKRESSPKIVIKPVKSPSPPPVAAADE----YDTGDS-KQVWICPACGVQD-DGSLP 291
E+ +KR S+ + + S + V + D+ D+ DS +WICP C V +G+
Sbjct: 300 EQLSKRCSADSDIFESGSSTADSLVKSLDKKSATLDSDDSLDTMWICPECSVAYVEGATD 359
Query: 292 MIGCDGCDAWYHWVCVG-LVAEPETSDWFCPKCS 324
M+GCD CD W+HW CVG LVA P+ + W+C C+
Sbjct: 360 MVGCDACDNWFHWSCVGLLVAPPDDAPWYCQNCA 393
>gi|194319914|pdb|2K16|A Chain A, Solution Structure Of The Free Taf3 Phd Domain
gi|194319916|pdb|2K17|A Chain A, Solution Structure Of The Taf3 Phd Domain In Complex With
A H3k4me3 Peptide
Length = 75
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/54 (62%), Positives = 38/54 (70%), Gaps = 2/54 (3%)
Query: 273 SKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
Q+WICP C DDGS PMIGCD CD WYHW CVG++A PE WFCPKC+
Sbjct: 15 GNQIWICPGCNKPDDGS-PMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPKCAN 67
>gi|449668642|ref|XP_004206835.1| PREDICTED: uncharacterized protein LOC100199050 [Hydra
magnipapillata]
Length = 525
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 28/53 (52%), Positives = 34/53 (64%), Gaps = 2/53 (3%)
Query: 273 SKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEP-ETSDWFCPKCS 324
S ++ CP C DDGS PMIGCD CD WYHW CV + P + + WFC KC+
Sbjct: 453 SADLYYCPGCDRPDDGS-PMIGCDSCDGWYHWSCVNITVAPADDAQWFCSKCA 504
>gi|313215495|emb|CBY16210.1| unnamed protein product [Oikopleura dioica]
Length = 438
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 32/63 (50%), Positives = 38/63 (60%), Gaps = 2/63 (3%)
Query: 263 AAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDG-CDAWYHWVCVGLVAEPETSDWFCP 321
A +Y T ++ W CP CG DDGS PMIGCDG C WYH CVG+ DW+CP
Sbjct: 366 ADGKQYYTSFEEKDWRCPTCGKLDDGS-PMIGCDGDCGDWYHNTCVGIKGPVPEGDWYCP 424
Query: 322 KCS 324
KC+
Sbjct: 425 KCT 427
>gi|402594618|gb|EJW88544.1| hypothetical protein WUBG_00544 [Wuchereria bancrofti]
Length = 1408
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 40/61 (65%), Gaps = 3/61 (4%)
Query: 268 YDTGDS-KQVWICPACGVQD-DGSLPMIGCDGCDAWYHWVCVGL-VAEPETSDWFCPKCS 324
D+ DS +WICP C V +G+ M+GCD CD W+HW CVGL VA P+ + W+C C+
Sbjct: 1329 LDSDDSLDNMWICPECSVAYVEGATDMVGCDACDNWFHWSCVGLLVAPPDDAPWYCQNCA 1388
Query: 325 K 325
+
Sbjct: 1389 R 1389
>gi|170589625|ref|XP_001899574.1| Bromodomain associated family protein [Brugia malayi]
gi|158593787|gb|EDP32382.1| Bromodomain associated family protein [Brugia malayi]
Length = 1391
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 40/61 (65%), Gaps = 3/61 (4%)
Query: 268 YDTGDS-KQVWICPACGVQD-DGSLPMIGCDGCDAWYHWVCVG-LVAEPETSDWFCPKCS 324
D+ DS +WICP C V +G+ M+GCD CD W+HW CVG LVA P+ + W+C C+
Sbjct: 1312 LDSDDSLDNMWICPECSVAYVEGATDMVGCDACDNWFHWSCVGLLVAPPDDAPWYCQNCA 1371
Query: 325 K 325
+
Sbjct: 1372 R 1372
>gi|313224411|emb|CBY20200.1| unnamed protein product [Oikopleura dioica]
Length = 403
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/63 (50%), Positives = 38/63 (60%), Gaps = 2/63 (3%)
Query: 263 AAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDG-CDAWYHWVCVGLVAEPETSDWFCP 321
A +Y T ++ W CP CG DDGS PMIGCDG C WYH CVG+ DW+CP
Sbjct: 331 ADGKQYYTSFEEKDWRCPTCGKLDDGS-PMIGCDGDCGDWYHNTCVGIKGPVPEGDWYCP 389
Query: 322 KCS 324
KC+
Sbjct: 390 KCT 392
>gi|170049119|ref|XP_001870888.1| PET191 polypeptide [Culex quinquefasciatus]
gi|170049123|ref|XP_001870890.1| PET191 polypeptide [Culex quinquefasciatus]
gi|167871023|gb|EDS34406.1| PET191 polypeptide [Culex quinquefasciatus]
gi|167871025|gb|EDS34408.1| PET191 polypeptide [Culex quinquefasciatus]
Length = 82
Score = 74.3 bits (181), Expect = 1e-10, Method: Composition-based stats.
Identities = 39/66 (59%), Positives = 51/66 (77%), Gaps = 3/66 (4%)
Query: 360 ELVDKSACAGSRADLKLCLLETDCVQVDRKTPRQCLR--EGLAP-QCEALKNTFFECKRS 416
+L D++ACA RADLK+CLL++DC + + K PR+CL +G P +C AL+NTFFECKRS
Sbjct: 9 QLADETACARVRADLKMCLLQSDCCKKEHKLPRECLSRTDGSVPDECHALRNTFFECKRS 68
Query: 417 LLDTRQ 422
LLD RQ
Sbjct: 69 LLDNRQ 74
>gi|268577303|ref|XP_002643633.1| Hypothetical protein CBG16376 [Caenorhabditis briggsae]
Length = 367
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 41/68 (60%), Gaps = 5/68 (7%)
Query: 272 DSKQVWICPACGVQ-DDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWFCPKCSKVDEGS 330
D +++WICP C V DG+ CD C W+HW CVG+ AEP S WFC +CSK ++
Sbjct: 292 DDEEIWICPVCSVAYTDGA----NCDQCQDWFHWHCVGITAEPTDSKWFCNRCSKGNKSK 347
Query: 331 RKEKKKRG 338
+ K+ G
Sbjct: 348 KHGKRAGG 355
>gi|340369448|ref|XP_003383260.1| PREDICTED: hypothetical protein LOC100636564 [Amphimedon
queenslandica]
Length = 339
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 275 QVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLV-AEPETSDWFCPKC 323
+V+ICP C DDG+ PMI CD CD W H CVGL A P+ WFCP C
Sbjct: 280 EVYICPICKRPDDGT-PMICCDNCDVWLHMHCVGLACAPPDNKKWFCPPC 328
>gi|47224439|emb|CAG08689.1| unnamed protein product [Tetraodon nigroviridis]
Length = 774
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 78/217 (35%), Gaps = 78/217 (35%)
Query: 165 KKEEK---EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
K+ K F +K+D E + + D K K E S+
Sbjct: 563 SKDGKMAWNFGLKDD-----------ELFAPLDFSFADGAVKAKPEA---------PSAL 602
Query: 222 EKEVIPKLTFKFGT-----------DMEEKTKRESSPKIVIKPV---KSPSP----PPV- 262
VIP+LT + G D + +T +P P + P P PPV
Sbjct: 603 PSPVIPRLTLRVGGQDKIVISKVLPDSDARTPTPRTPAAKTGPGNQPRMPQPPALLPPVR 662
Query: 263 -----------AAADEYDTGDS-----------------------KQVWICPACGVQDDG 288
A+ +G S Q+WICP C DDG
Sbjct: 663 SNRDRQCLRGEASGSARRSGLSASVFHGPRMAFLLPSSQIRDEWGNQIWICPGCNKPDDG 722
Query: 289 SLPMIGCDGCDAWYHWVCVG-LVAEPETSDWFCPKCS 324
S PMIGCD CD WYHW CV L A PE WFC KCS
Sbjct: 723 S-PMIGCDDCDDWYHWACVRILAAPPEDQQWFCIKCS 758
>gi|341881443|gb|EGT37378.1| hypothetical protein CAEBREN_19314 [Caenorhabditis brenneri]
Length = 678
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 28/44 (63%)
Query: 276 VWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWF 319
++ICP C V + + M+ CD C+ W+HW CVG+ +EP WF
Sbjct: 605 LYICPVCSVAYNHAANMVSCDKCEEWFHWHCVGITSEPTEEQWF 648
>gi|196006447|ref|XP_002113090.1| predicted protein [Trichoplax adhaerens]
gi|190585131|gb|EDV25200.1| predicted protein [Trichoplax adhaerens]
Length = 667
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 39/77 (50%), Gaps = 11/77 (14%)
Query: 257 PSPPPV---AAADEY------DTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCV 307
PS PPV A DE T S CP C + DDGS M+ CD C+ W+H+ CV
Sbjct: 576 PSLPPVYVKAIVDETISTANNATETSNSYDPCPVCHLLDDGS-SMVFCDECEKWFHFKCV 634
Query: 308 GLVAEPETSD-WFCPKC 323
GL P+ D WFC C
Sbjct: 635 GLEIAPKEEDAWFCNAC 651
>gi|326911084|ref|XP_003201892.1| PREDICTED: transcription initiation factor TFIID subunit 3-like
[Meleagris gallopavo]
Length = 911
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/31 (67%), Positives = 22/31 (70%), Gaps = 1/31 (3%)
Query: 274 KQVWICPACGVQDDGSLPMIGCDGCDAWYHW 304
Q+W CP C DDGS PMIGCD CD WYHW
Sbjct: 882 NQIWFCPGCNKPDDGS-PMIGCDDCDDWYHW 911
>gi|390340600|ref|XP_003725279.1| PREDICTED: uncharacterized protein LOC100893111 [Strongylocentrotus
purpuratus]
Length = 1497
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 72/115 (62%), Gaps = 7/115 (6%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE---KKEKEKGKKEEKEKKEEKE 164
KS KDK K++EK + K ++ +DKD+ K+ +DKE K++K++ +++EK+K E+
Sbjct: 1114 KSRHKDKDRDKEREKPRDKDRERHRDKDRDKDSHRDKEHERKRDKDRDRQKEKDKDRERR 1173
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
K E E KEK E+ + EKDS K+K ++KD+ +KE+ K+ KEK+R
Sbjct: 1174 KDREGE----SSKEKDRERRKDREKDSSKDKDRREKDRDGRKEREKESSKEKDRG 1224
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 72/110 (65%), Gaps = 4/110 (3%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
+DK +H+ KE E+ K+DK D+D+ K K+KD+E+++ +G+ +++ +E ++ +E+
Sbjct: 1142 RDKDSHRDKEHER---KRDK-DRDRQKEKDKDRERRKDREGESSKEKDRERRKDREKDSS 1197
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
K K+ +EK + ++ EK+S KEK +KD+ +EK K + +EK+ SH
Sbjct: 1198 KDKDRREKDRDGRKEREKESSKEKDRGEKDRSKHREKEKDRDREKKERSH 1247
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 66/113 (58%), Gaps = 10/113 (8%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
KD++ K +E E K+K ++ KD+ K+ KDK+++EK++ ++E+EK+ KE
Sbjct: 1168 KDRERRKDREGESSKEKDRERRKDREKDSSKDKDRREKDRDGRKEREKESSKE------- 1220
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKD-KDKKLKKEKLKKKKKEKERSSHEK 223
K+ EK K R++EKD ++EKK + +D K E+ KKE+ R EK
Sbjct: 1221 --KDRGEKDRSKHREKEKDRDREKKERSHRDNKESAERTDSGKKERHRDRDEK 1271
>gi|380420333|ref|NP_001154919.2| nipped-B-like protein B [Danio rerio]
gi|408407682|sp|F1QBY1.1|NIPLB_DANRE RecName: Full=Nipped-B-like protein B
Length = 2876
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 74/105 (70%), Gaps = 4/105 (3%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKE-KKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
+ KE+ K + KK KD +K + K++DKE +K++EK + +E EK EK++ +E E ++D
Sbjct: 746 RDKEQVKGRDKKRSKDLEKCREKDQDKELEKDREKNQDKELEKGREKDQDKELEKGREKD 805
Query: 177 KEKPVEKIRKEEKDSEKEK-KSKDKDKKLKK--EKLKKKKKEKER 218
++K +EK R++++D E EK + KD+DK+L+K EK + K +EK+R
Sbjct: 806 RDKEMEKAREKDQDKELEKGREKDQDKELEKGQEKDRDKVREKDR 850
Score = 45.1 bits (105), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 80/115 (69%), Gaps = 2/115 (1%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE-KKEKEKGKKEEKEKKEEKE 164
+DK ++KD++ ++ KE EK ++K K+ +K + K++DKE +K +EK + +E EK EK+
Sbjct: 770 QDKELEKDREKNQDKELEKGREKDQDKELEKGREKDRDKEMEKAREKDQDKELEKGREKD 829
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK-KSKDKDKKLKKEKLKKKKKEKER 218
+ +E E ++D++K EK R + +D +++K + KD+DK +K++ K ++K++E+
Sbjct: 830 QDKELEKGQEKDRDKVREKDRDKVRDKDRDKVREKDRDKVREKDRDKLREKDREK 884
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 49/131 (37%), Positives = 83/131 (63%), Gaps = 8/131 (6%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK---KEKEKG 152
TS E +DK +++K K EK + K+ + +DK+++K++EKD+EK KE EKG
Sbjct: 670 TSKSSRVEHNRDKEQEQEKVGDKGLEKGREKELEKGRDKERVKDQEKDQEKGRDKEVEKG 729
Query: 153 KKEE--KEKKEEKEKKEEKEFKMKEDK--EKPVEKIRKEEKDSEKEK-KSKDKDKKLKKE 207
+ +E K++ +E+EK +KE DK K +EK R++++D E EK + K++DK+L+K
Sbjct: 730 RYKERVKDRVKEQEKVRDKEQVKGRDKKRSKDLEKCREKDQDKELEKDREKNQDKELEKG 789
Query: 208 KLKKKKKEKER 218
+ K + KE E+
Sbjct: 790 REKDQDKELEK 800
Score = 41.2 bits (95), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 77/127 (60%), Gaps = 22/127 (17%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK---KEKEKGKKEEKEKKEEKEKKEEKE 170
K+ K ++KE++K + K+ KD K +EKD++K K++EK + +E EK EK++ +E E
Sbjct: 740 KEQEKVRDKEQVKGRDKKRSKDLEKCREKDQDKELEKDREKNQDKELEKGREKDQDKELE 799
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEK-KSKDKDKKLKK------------------EKLKK 211
++D++K +EK R++++D E EK + KD+DK+L+K +K +
Sbjct: 800 KGREKDRDKEMEKAREKDQDKELEKGREKDQDKELEKGQEKDRDKVREKDRDKVRDKDRD 859
Query: 212 KKKEKER 218
K +EK+R
Sbjct: 860 KVREKDR 866
>gi|333777884|dbj|BAK23967.1| nipped-B like a [Danio rerio]
Length = 2876
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 74/105 (70%), Gaps = 4/105 (3%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKE-KKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
+ KE+ K + KK KD +K + K++DKE +K++EK + +E EK EK++ +E E ++D
Sbjct: 746 RDKEQVKGRDKKRSKDLEKCREKDQDKELEKDREKNQDKELEKGREKDQDKELEKGREKD 805
Query: 177 KEKPVEKIRKEEKDSEKEK-KSKDKDKKLKK--EKLKKKKKEKER 218
++K +EK R++++D E EK + KD+DK+L+K EK + K +EK+R
Sbjct: 806 RDKEMEKAREKDQDKELEKGREKDQDKELEKGQEKDRDKVREKDR 850
Score = 45.1 bits (105), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 80/115 (69%), Gaps = 2/115 (1%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE-KKEKEKGKKEEKEKKEEKE 164
+DK ++KD++ ++ KE EK ++K K+ +K + K++DKE +K +EK + +E EK EK+
Sbjct: 770 QDKELEKDREKNQDKELEKGREKDQDKELEKGREKDRDKEMEKAREKDQDKELEKGREKD 829
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK-KSKDKDKKLKKEKLKKKKKEKER 218
+ +E E ++D++K EK R + +D +++K + KD+DK +K++ K ++K++E+
Sbjct: 830 QDKELEKGQEKDRDKVREKDRDKVRDKDRDKVREKDRDKVREKDRDKLREKDREK 884
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 49/131 (37%), Positives = 83/131 (63%), Gaps = 8/131 (6%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK---KEKEKG 152
TS E +DK +++K K EK + K+ + +DK+++K++EKD+EK KE EKG
Sbjct: 670 TSKSSRVEHNRDKEQEQEKVGDKGLEKGREKELEKGRDKERVKDQEKDQEKGRDKEVEKG 729
Query: 153 KKEE--KEKKEEKEKKEEKEFKMKEDK--EKPVEKIRKEEKDSEKEK-KSKDKDKKLKKE 207
+ +E K++ +E+EK +KE DK K +EK R++++D E EK + K++DK+L+K
Sbjct: 730 RYKERVKDRVKEQEKVRDKEQVKGRDKKRSKDLEKCREKDQDKELEKDREKNQDKELEKG 789
Query: 208 KLKKKKKEKER 218
+ K + KE E+
Sbjct: 790 REKDQDKELEK 800
Score = 41.2 bits (95), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 77/127 (60%), Gaps = 22/127 (17%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK---KEKEKGKKEEKEKKEEKEKKEEKE 170
K+ K ++KE++K + K+ KD K +EKD++K K++EK + +E EK EK++ +E E
Sbjct: 740 KEQEKVRDKEQVKGRDKKRSKDLEKCREKDQDKELEKDREKNQDKELEKGREKDQDKELE 799
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEK-KSKDKDKKLKK------------------EKLKK 211
++D++K +EK R++++D E EK + KD+DK+L+K +K +
Sbjct: 800 KGREKDRDKEMEKAREKDQDKELEKGREKDQDKELEKGQEKDRDKVREKDRDKVRDKDRD 859
Query: 212 KKKEKER 218
K +EK+R
Sbjct: 860 KVREKDR 866
>gi|195470186|ref|XP_002087389.1| GE16637 [Drosophila yakuba]
gi|194173490|gb|EDW87101.1| GE16637 [Drosophila yakuba]
Length = 4999
Score = 47.0 bits (110), Expect = 0.019, Method: Composition-based stats.
Identities = 24/75 (32%), Positives = 48/75 (64%)
Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
DKE++EKE +K+ +EK++ + EKE + KE +EK + + + EK+ +EK ++K+
Sbjct: 2006 DKEQREKEIREKDLREKEQRERDNREKELREKELREKEMREKEQREKELHREKDQREKEH 2065
Query: 203 KLKKEKLKKKKKEKE 217
+ K++ + + E+E
Sbjct: 2066 REKEQSRRAMEVEQE 2080
>gi|395539001|ref|XP_003771462.1| PREDICTED: transcription initiation factor TFIID subunit 3
[Sarcophilus harrisii]
Length = 798
Score = 46.2 bits (108), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 22/47 (46%), Gaps = 16/47 (34%)
Query: 295 CDG---------------CDAWYHWVCVGLVAE-PETSDWFCPKCSK 325
C G CD WYHW CVG++A PE WFCP C+
Sbjct: 737 CPGCNKPDDGSPMIGCDDCDDWYHWPCVGIMAAPPEEMQWFCPMCAN 783
>gi|321467016|gb|EFX78008.1| hypothetical protein DAPPUDRAFT_105663 [Daphnia pulex]
Length = 4511
Score = 46.2 bits (108), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 88/122 (72%), Gaps = 10/122 (8%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD---KEKKEKEKGKKEEKEKKEEKEKKE 167
+KD+ +++EK++I+K+ ++KD+ + + +EKD KEK+EK++ +KE +EK +++ E
Sbjct: 2421 EKDRLLKEEEEKQRIQKESEEKDRLQKETEEKDRLLKEKEEKQRMQKESEEKDRLQKEAE 2480
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEK-EKKSKDKDKKLK----KEKLKKKKKEKERSSHE 222
EK+ +KE++EK +I+KE ++ ++ +K++++KD+ LK K++++K+ +EK+R E
Sbjct: 2481 EKDRLLKEEEEK--RRIQKESEEKDRLQKETEEKDRLLKEEEEKQRMQKESEEKDRLQKE 2538
Query: 223 KE 224
E
Sbjct: 2539 AE 2540
Score = 45.1 bits (105), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 89/122 (72%), Gaps = 10/122 (8%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD---KEKKEKEKGKKEEKEKKEEKEKKE 167
+KD+ +++EK++I+K+ ++KD+ + + +EKD KE++EK++ +KE +EK +++ E
Sbjct: 2361 EKDRLLKEEEEKQRIQKESEEKDRLQKEAEEKDRLLKEEEEKQRIQKESEEKDRLQKEAE 2420
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEK-EKKSKDKDKKLK----KEKLKKKKKEKERSSHE 222
EK+ +KE++EK ++I+KE ++ ++ +K++++KD+ LK K++++K+ +EK+R E
Sbjct: 2421 EKDRLLKEEEEK--QRIQKESEEKDRLQKETEEKDRLLKEKEEKQRMQKESEEKDRLQKE 2478
Query: 223 KE 224
E
Sbjct: 2479 AE 2480
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 76/116 (65%), Gaps = 10/116 (8%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
++K H + E EK + +K+ ++KD++ +E +K++ +KE G+K+ +K + EEK+
Sbjct: 2191 EEKQHLQMEAEKDRLQKEAEEKDRLLKEEDEKQRIQKESGEKDRLQK-----ETEEKDRL 2245
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE----KLKKKKKEKERSSHEKE 224
+KE++EK +I EEKD +K++++KD+ LK+E +++K+ +EK+R E E
Sbjct: 2246 LKEEEEKQRIQIESEEKD-RLQKEAEEKDRLLKEEEEKRRIQKESEEKDRLQKEAE 2300
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 88/122 (72%), Gaps = 10/122 (8%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD---KEKKEKEKGKKEEKEKKEEKEKKE 167
+KD+ +++EK +I+K+ ++KD+ + + +EKD KE++EK++ +KE +EK +++ E
Sbjct: 2301 EKDRLLKEEEEKRRIQKESEEKDRLQKEAEEKDRLLKEEEEKQRIQKESEEKDRLQKEAE 2360
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEK-EKKSKDKDKKLK----KEKLKKKKKEKERSSHE 222
EK+ +KE++EK ++I+KE ++ ++ +K++++KD+ LK K++++K+ +EK+R E
Sbjct: 2361 EKDRLLKEEEEK--QRIQKESEEKDRLQKEAEEKDRLLKEEEEKQRIQKESEEKDRLQKE 2418
Query: 223 KE 224
E
Sbjct: 2419 AE 2420
Score = 42.4 bits (98), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 82/121 (67%), Gaps = 8/121 (6%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD---KEKKEKEKGKKEEKEKKEEKEKKE 167
+KD+ +K+EK++++K+ ++KD+ + + +EKD KE++EK + +KE +EK +++ E
Sbjct: 2451 EKDRLLKEKEEKQRMQKESEEKDRLQKEAEEKDRLLKEEEEKRRIQKESEEKDRLQKETE 2510
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK----KEKLKKKKKEKERSSHEK 223
EK+ +KE++EK + EEKD +K++++KD+ LK K++++K+ +EK R E
Sbjct: 2511 EKDRLLKEEEEKQRMQKESEEKD-RLQKEAEEKDRLLKEEEEKQRIQKELEEKNRLQKET 2569
Query: 224 E 224
E
Sbjct: 2570 E 2570
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 81/121 (66%), Gaps = 8/121 (6%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK---EKEKGKKEEKEKKEEKEKKE 167
+KD+ +++EK++I+K+ ++KD+ + + +EKD+ K EK++ +KE +EK +++ E
Sbjct: 2391 EKDRLLKEEEEKQRIQKESEEKDRLQKEAEEKDRLLKEEEEKQRIQKESEEKDRLQKETE 2450
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE----KLKKKKKEKERSSHEK 223
EK+ +KE +EK + EEKD +K++++KD+ LK+E +++K+ +EK+R E
Sbjct: 2451 EKDRLLKEKEEKQRMQKESEEKD-RLQKEAEEKDRLLKEEEEKRRIQKESEEKDRLQKET 2509
Query: 224 E 224
E
Sbjct: 2510 E 2510
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 87/125 (69%), Gaps = 10/125 (8%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK---EKEKGKKEEKEKKEEKE 164
+S +KD+ + +EK+++ K++++K + +I+++EKD+ +K EK++ KEE+EK+ ++
Sbjct: 2228 ESGEKDRLQKETEEKDRLLKEEEEKQRIQIESEEKDRLQKEAEEKDRLLKEEEEKRRIQK 2287
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEK-EKKSKDKDKKLK----KEKLKKKKKEKERS 219
+ EEK+ KE +EK +++ KEE++ + +K+S++KD+ K K++L K+++EK+R
Sbjct: 2288 ESEEKDRLQKEAEEK--DRLLKEEEEKRRIQKESEEKDRLQKEAEEKDRLLKEEEEKQRI 2345
Query: 220 SHEKE 224
E E
Sbjct: 2346 QKESE 2350
>gi|347441835|emb|CCD34756.1| hypothetical protein [Botryotinia fuckeliana]
Length = 983
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 30/55 (54%), Gaps = 5/55 (9%)
Query: 292 MIGC--DGCD-AWYHWVCVGLVA-EPETSDWFCPKCSKV-DEGSRKEKKKRGRKK 341
MI C D C+ W+H CV + A P T+ W+CP C + G + E RGRK+
Sbjct: 929 MIACENDNCEKEWFHLECVEMTAVPPRTTKWYCPDCRIILGIGEKGEVNARGRKR 983
>gi|442624888|ref|NP_001259802.1| split ends, isoform G [Drosophila melanogaster]
gi|440213048|gb|AGB92339.1| split ends, isoform G [Drosophila melanogaster]
Length = 5487
Score = 43.5 bits (101), Expect = 0.19, Method: Composition-based stats.
Identities = 26/91 (28%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEF 171
DK+ +K+ +EK ++K+++++D + + +DK+ +EKE +KE++EK+ +EK + E+E
Sbjct: 1924 DKEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQREREH 1983
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
+ KE + ++ ++ +E S K K
Sbjct: 1984 REKEQSRRAMDVEQEGRGGRMRELSSYQKSK 2014
>gi|6467825|gb|AAF13218.1|AF188205_1 Spen RNP motif protein long isoform [Drosophila melanogaster]
Length = 5533
Score = 43.5 bits (101), Expect = 0.19, Method: Composition-based stats.
Identities = 26/91 (28%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEF 171
DK+ +K+ +EK ++K+++++D + + +DK+ +EKE +KE++EK+ +EK + E+E
Sbjct: 1997 DKEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQREREH 2056
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
+ KE + ++ ++ +E S K K
Sbjct: 2057 REKEQSRRAMDVEQEGRGGRMRELSSYQKSK 2087
>gi|24580581|ref|NP_524718.2| split ends, isoform B [Drosophila melanogaster]
gi|10727420|gb|AAF51534.2| split ends, isoform B [Drosophila melanogaster]
Length = 5533
Score = 43.5 bits (101), Expect = 0.19, Method: Composition-based stats.
Identities = 26/91 (28%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEF 171
DK+ +K+ +EK ++K+++++D + + +DK+ +EKE +KE++EK+ +EK + E+E
Sbjct: 1997 DKEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQREREH 2056
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
+ KE + ++ ++ +E S K K
Sbjct: 2057 REKEQSRRAMDVEQEGRGGRMRELSSYQKSK 2087
>gi|24580579|ref|NP_722615.1| split ends, isoform A [Drosophila melanogaster]
gi|46397733|sp|Q8SX83.2|SPEN_DROME RecName: Full=Protein split ends
gi|10727421|gb|AAF51535.2| split ends, isoform A [Drosophila melanogaster]
Length = 5560
Score = 43.5 bits (101), Expect = 0.19, Method: Composition-based stats.
Identities = 26/91 (28%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEF 171
DK+ +K+ +EK ++K+++++D + + +DK+ +EKE +KE++EK+ +EK + E+E
Sbjct: 1997 DKEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQREREH 2056
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
+ KE + ++ ++ +E S K K
Sbjct: 2057 REKEQSRRAMDVEQEGRGGRMRELSSYQKSK 2087
>gi|24580583|ref|NP_722616.1| split ends, isoform C [Drosophila melanogaster]
gi|6715140|gb|AAF26299.1|AF184612_1 split ends [Drosophila melanogaster]
gi|22945598|gb|AAN10511.1| split ends, isoform C [Drosophila melanogaster]
Length = 5476
Score = 43.5 bits (101), Expect = 0.21, Method: Composition-based stats.
Identities = 26/91 (28%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEF 171
DK+ +K+ +EK ++K+++++D + + +DK+ +EKE +KE++EK+ +EK + E+E
Sbjct: 1940 DKEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQREREH 1999
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
+ KE + ++ ++ +E S K K
Sbjct: 2000 REKEQSRRAMDVEQEGRGGRMRELSSYQKSK 2030
>gi|442624890|ref|NP_001259803.1| split ends, isoform H [Drosophila melanogaster]
gi|440213049|gb|AGB92340.1| split ends, isoform H [Drosophila melanogaster]
Length = 5458
Score = 43.5 bits (101), Expect = 0.21, Method: Composition-based stats.
Identities = 26/91 (28%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEF 171
DK+ +K+ +EK ++K+++++D + + +DK+ +EKE +KE++EK+ +EK + E+E
Sbjct: 1922 DKEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQREREH 1981
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
+ KE + ++ ++ +E S K K
Sbjct: 1982 REKEQSRRAMDVEQEGRGGRMRELSSYQKSK 2012
>gi|6979936|gb|AAF34661.1|AF221715_1 split ends long isoform [Drosophila melanogaster]
Length = 5554
Score = 43.5 bits (101), Expect = 0.21, Method: Composition-based stats.
Identities = 26/91 (28%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEF 171
DK+ +K+ +EK ++K+++++D + + +DK+ +EKE +KE++EK+ +EK + E+E
Sbjct: 1991 DKEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQREREH 2050
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
+ KE + ++ ++ +E S K K
Sbjct: 2051 REKEQSRRAMDVEQEGRGGRMRELSSYQKSK 2081
>gi|386768866|ref|NP_001245816.1| split ends, isoform D [Drosophila melanogaster]
gi|442624893|ref|NP_001259804.1| split ends, isoform E [Drosophila melanogaster]
gi|383291252|gb|AFH03493.1| split ends, isoform D [Drosophila melanogaster]
gi|440213050|gb|AGB92341.1| split ends, isoform E [Drosophila melanogaster]
Length = 5505
Score = 43.5 bits (101), Expect = 0.21, Method: Composition-based stats.
Identities = 26/91 (28%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEF 171
DK+ +K+ +EK ++K+++++D + + +DK+ +EKE +KE++EK+ +EK + E+E
Sbjct: 1942 DKEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQREREH 2001
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
+ KE + ++ ++ +E S K K
Sbjct: 2002 REKEQSRRAMDVEQEGRGGRMRELSSYQKSK 2032
>gi|442624895|ref|NP_001259805.1| split ends, isoform F [Drosophila melanogaster]
gi|440213051|gb|AGB92342.1| split ends, isoform F [Drosophila melanogaster]
Length = 5510
Score = 43.1 bits (100), Expect = 0.25, Method: Composition-based stats.
Identities = 26/91 (28%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEF 171
DK+ +K+ +EK ++K+++++D + + +DK+ +EKE +KE++EK+ +EK + E+E
Sbjct: 1947 DKEQKEKEIREKDLREKEQRERDNREKELRDKDLREKEMREKEQREKELHREKDQREREH 2006
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
+ KE + ++ ++ +E S K K
Sbjct: 2007 REKEQSRRAMDVEQEGRGGRMRELSSYQKSK 2037
>gi|405973316|gb|EKC38036.1| Lysine-specific demethylase 5A [Crassostrea gigas]
Length = 1883
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 32/71 (45%), Gaps = 13/71 (18%)
Query: 255 KSPSPPPVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPE 314
K+P P D Y IC C + DG M+ CDGCD +H C+ + PE
Sbjct: 275 KNPGPGAYTTVDLY---------ICHMCN-RGDGEEYMLLCDGCDDAFHTYCL-IPPMPE 323
Query: 315 T--SDWFCPKC 323
DW CPKC
Sbjct: 324 VPKGDWRCPKC 334
>gi|328872521|gb|EGG20888.1| SNF2-related domain-containing protein [Dictyostelium fasciculatum]
Length = 2077
Score = 42.4 bits (98), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 90/133 (67%), Gaps = 4/133 (3%)
Query: 95 VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKK-DKKDKDKIKNKEKDKEKKEKEKGK 153
T+SE + ++ K +K+ H + EKE+++K++ ++ +K+K++ + +KE+ EKE+ +
Sbjct: 592 TTTSEKEEQEARENEAKLEKEKHDQLEKERLEKERLEQLEKEKLEQERLEKERLEKERLE 651
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
KE E+ E++ ++E+ ++++++ + +EK R E + EKEK + + ++L+KE+L++ +
Sbjct: 652 KERLEQLEKERLEKERLEQLEKERLEQLEKERIENERLEKEKLERLEKERLEKERLEQLE 711
Query: 214 KEK---ERSSHEK 223
KE+ ER ++EK
Sbjct: 712 KERLENERIANEK 724
>gi|348532937|ref|XP_003453962.1| PREDICTED: nipped-B-like protein [Oreochromis niloticus]
Length = 2680
Score = 42.0 bits (97), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 58/83 (69%), Gaps = 4/83 (4%)
Query: 140 KEKDKEK-KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE---K 195
KE++ EK KE+++ K+ EK++E+++ +EKE DKE+ +EK ++ ++D EK +
Sbjct: 671 KERNIEKEKERDRDKERHTEKEKERDRDKEKERDRDRDKERHIEKEKERDRDKEKHIEKE 730
Query: 196 KSKDKDKKLKKEKLKKKKKEKER 218
+ +DKD+ ++KEK + K K+++R
Sbjct: 731 RDRDKDRHIEKEKERDKAKDRDR 753
>gi|47227297|emb|CAF96846.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2196
Score = 41.6 bits (96), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 6/39 (15%)
Query: 292 MIGCDGCDAWYHWVCVGL------VAEPETSDWFCPKCS 324
MI CD C+ W+H CVG+ + E D+ CP C+
Sbjct: 192 MICCDRCEEWFHGDCVGITEARGRLMERNGEDYICPNCT 230
Score = 38.9 bits (89), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 48/66 (72%), Gaps = 1/66 (1%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK-KEKEKGKKEEKEKKEEKE 164
+D+S +KD+ + KEK++ + + +KD+D+ +NKEKD+++ KEK++ + KEK +++
Sbjct: 1886 QDRSKEKDRDQDRNKEKDRDQDRNKEKDRDQDRNKEKDRDRSKEKDRDQDRNKEKDRDRD 1945
Query: 165 KKEEKE 170
+ +EK+
Sbjct: 1946 RSKEKD 1951
>gi|390596707|gb|EIN06108.1| hypothetical protein PUNSTDRAFT_145461 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 991
Score = 41.2 bits (95), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 71/106 (66%), Gaps = 11/106 (10%)
Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV---- 181
++ +++K+K++ + +E+KEKE+ +E+KEK+ + ++++EKE + +E KEK
Sbjct: 664 RQAQEQKEKER-----QAQEQKEKERQAQEQKEKERQAQEQKEKERQAQEQKEKERQAQE 718
Query: 182 --EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
EK R+ ++ EKE++++++ +K ++ + +K+K+ + R EKE+
Sbjct: 719 QKEKERQAQEQKEKERRAQEQKEKERRAQEQKEKERRAREQREKEL 764
>gi|37725922|gb|AAO38039.1| reticulocyte binding-like protein 2b [Plasmodium reichenowi]
Length = 3427
Score = 40.4 bits (93), Expect = 1.8, Method: Composition-based stats.
Identities = 22/88 (25%), Positives = 56/88 (63%)
Query: 125 IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
+K+++ +K K +E ++++EK+ ++E+E K ++++K+ + K +E K + EK
Sbjct: 2767 LKRQEQEKQAQLQKEEELKRQEQEKQAQLQKEEELKRQEQEKQAQLQKEEELKRQEQEKQ 2826
Query: 185 RKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+ +K+ E +++ ++K +L+KE+ K+
Sbjct: 2827 AQLQKEEELKRQEQEKQAQLQKEEELKR 2854
>gi|189241962|ref|XP_968788.2| PREDICTED: similar to CG8677 CG8677-PA [Tribolium castaneum]
Length = 2314
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 24/47 (51%), Gaps = 4/47 (8%)
Query: 279 CPACGVQDDGSLPMIGCDGCDAWYHWVCVG--LVAEPETSDWFCPKC 323
C CG D ++ CD CD +H C+ L+ PE DWFCP C
Sbjct: 1460 CQKCGKSDHPEWILL-CDSCDNGWHCSCLRPPLLVIPE-GDWFCPPC 1504
>gi|270015458|gb|EFA11906.1| hypothetical protein TcasGA2_TC004063 [Tribolium castaneum]
Length = 2306
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 24/47 (51%), Gaps = 4/47 (8%)
Query: 279 CPACGVQDDGSLPMIGCDGCDAWYHWVCVG--LVAEPETSDWFCPKC 323
C CG D ++ CD CD +H C+ L+ PE DWFCP C
Sbjct: 1452 CQKCGKSDHPEWILL-CDSCDNGWHCSCLRPPLLVIPE-GDWFCPPC 1496
>gi|308457811|ref|XP_003091269.1| hypothetical protein CRE_26828 [Caenorhabditis remanei]
gi|308257606|gb|EFP01559.1| hypothetical protein CRE_26828 [Caenorhabditis remanei]
Length = 616
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 76/136 (55%), Gaps = 6/136 (4%)
Query: 116 THKKKEKEKI---KKKKDKKDKDKIKNKEKDKEKKEKEKGKKE--EKEKKEEKEKKEEKE 170
T KKKE ++ K +DK K+K ++ D+E+ KEK KE +KEK ++++ +EK
Sbjct: 131 TEKKKEILRLVNEKAARDKAAKEKADQEKADQERDAKEKADKEKADKEKADKEKSDKEKA 190
Query: 171 FKMKEDKEKPV-EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 229
K K DKEK EK KE+ D EK K K +K KEK K K KE+ ++E++ +
Sbjct: 191 DKEKSDKEKADKEKADKEKADKEKADKEKADKEKADKEKADKDKAAKEKVERDRELLKLV 250
Query: 230 TFKFGTDMEEKTKRES 245
+K D K K+E+
Sbjct: 251 NYKADKDRIAKEKKEA 266
>gi|198476752|ref|XP_002132434.1| GA25185 [Drosophila pseudoobscura pseudoobscura]
gi|198137836|gb|EDY69836.1| GA25185 [Drosophila pseudoobscura pseudoobscura]
Length = 4986
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 72/104 (69%), Gaps = 8/104 (7%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
+TH ++EKEK +K ++ KEK+++ +++E+ +++E++ KEEKE+K ++E + K
Sbjct: 1051 QTHHQEEKEKERKSHQEE-------KEKERKAQQQEEKERDERKAKEEKERKAQEEREQK 1103
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
E EK ++++++ + ++E++ ++K+ + +KE ++ +EKE+
Sbjct: 1104 ERDEKEQQRVQEQREQEKREREQREKEHR-EKEHRERDHREKEK 1146
>gi|194766417|ref|XP_001965321.1| GF20711 [Drosophila ananassae]
gi|190617931|gb|EDV33455.1| GF20711 [Drosophila ananassae]
Length = 5735
Score = 39.3 bits (90), Expect = 3.8, Method: Composition-based stats.
Identities = 23/77 (29%), Positives = 54/77 (70%), Gaps = 2/77 (2%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE-EKEFKMKEDKEKPVEKIRKE 187
K+ ++K++ + ++++KE++EKE +KE++EK+ + K++ EKE KE +EK + + KE
Sbjct: 2115 KELREKEQREKEQREKEQREKELREKEQREKELVRAKEQREKELHEKEQREKELLR-EKE 2173
Query: 188 EKDSEKEKKSKDKDKKL 204
+++ E+++ + ++L
Sbjct: 2174 QREREQQEARMARMREL 2190
Score = 38.1 bits (87), Expect = 9.7, Method: Composition-based stats.
Identities = 18/50 (36%), Positives = 35/50 (70%)
Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
E + EE +K EKE+++E+E K + ++EK +++RK+++ EKE+K+
Sbjct: 2016 EDDSSDSEEPAEKSEKEQRQEREKKERLEREKREKELRKQQEREEKERKA 2065
>gi|194853276|ref|XP_001968134.1| GG24702 [Drosophila erecta]
gi|190660001|gb|EDV57193.1| GG24702 [Drosophila erecta]
Length = 5592
Score = 38.9 bits (89), Expect = 5.0, Method: Composition-based stats.
Identities = 53/194 (27%), Positives = 106/194 (54%), Gaps = 23/194 (11%)
Query: 45 KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 104
K+ K + + ++ K K S K +S +++ S TP + + SSE
Sbjct: 1870 KLHSKHNNRSESDKKTKKSEKNASSSDKRKNSSTSQSSKSATPRIED----DSSE----- 1920
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD-KEKKEKEKGKKEEKEKKEEK 163
D++ K +K + + ++K +++KDK++KD K E++ KE+K ++ +++E K +E+
Sbjct: 1921 -ADETADKAEKNQRHEREKKERQEKDKREKDLRKQVEREEKERKALQEEREKEDRKAKEE 1979
Query: 164 EKKEEKEFKMKEDKEKPVEKIR---------KEEKDS---EKEKKSKDKDKKLKKEKLKK 211
EK+ E+E K +ED+EK + R KE+K+ EK+ + K++ ++ +EK +
Sbjct: 1980 EKEREREKKAQEDREKKEREERELREKELRDKEQKEKEIREKDLREKEQRERDNREKELR 2039
Query: 212 KKKEKERSSHEKEV 225
+K+ +E+ EKE+
Sbjct: 2040 EKEMREKEQREKEL 2053
>gi|111218572|ref|XP_001134472.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
gi|90970872|gb|EAS66936.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
Length = 1720
Score = 38.5 bits (88), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 5/66 (7%)
Query: 261 PVAAADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPET---SD 317
P A+ + D K C C + D + MI CD CD WYH CV ++E +
Sbjct: 1112 PDEQANSINIKDDKDRLYC-VCQKKYDKTKFMIACDRCDEWYHGDCV-YISEKDAKRIKS 1169
Query: 318 WFCPKC 323
+ C C
Sbjct: 1170 YVCANC 1175
>gi|307172721|gb|EFN64027.1| Myosin light chain kinase, smooth muscle [Camponotus floridanus]
Length = 7605
Score = 38.1 bits (87), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 63/107 (58%), Gaps = 9/107 (8%)
Query: 106 KDKSIKKDKKTHK-KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK---- 160
KD+ IKKD+K HK +E E +KK +D K+K EK ++ E K E +E K
Sbjct: 4113 KDERIKKDEKEHKITQETESLKKHED----GKVKQIEKQEKVNATESIKDEIQESKKIIE 4168
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
EK K E+ E + KE+ E+ + K+E+++ +++K DK KK++K+
Sbjct: 4169 TEKIKSEKNEQQNKENYEEINKMAEKQEQETVQKRKESDKSKKIEKD 4215
>gi|195155519|ref|XP_002018651.1| GL25911 [Drosophila persimilis]
gi|194114804|gb|EDW36847.1| GL25911 [Drosophila persimilis]
Length = 5525
Score = 38.1 bits (87), Expect = 8.7, Method: Composition-based stats.
Identities = 45/187 (24%), Positives = 101/187 (54%), Gaps = 5/187 (2%)
Query: 39 GTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSS 98
TP D P ++ S KE+ +EK KE+ + + +
Sbjct: 2104 ATPRIEDSTDTDDPGER-SEKEQRQEREKAHQKERHEREKRKQQEREEKEKERKAHHHQP 2162
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKK-KDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
ED + K+ ++ +TH ++EKEK +K +++K+K++ ++++KE++E++ +++E+
Sbjct: 2163 ED--KEKEKAHPERKTQTHHQEEKEKERKSHQEEKEKERKAQQQEEKEREERKAKEEKER 2220
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+ +EE+E+KE E K ++ ++ E+ ++E + EKE + K+ ++ +EK K++ +
Sbjct: 2221 KAQEEREQKERDE-KEQQRVQEQREQEKREREQREKEHREKEHRERDHREKEKEQSSRRS 2279
Query: 218 RSSHEKE 224
S ++E
Sbjct: 2280 ISDSDQE 2286
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.309 0.129 0.377
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,846,311,271
Number of Sequences: 23463169
Number of extensions: 411710828
Number of successful extensions: 16297421
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 87234
Number of HSP's successfully gapped in prelim test: 121859
Number of HSP's that attempted gapping in prelim test: 6671496
Number of HSP's gapped (non-prelim): 3145122
length of query: 446
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 300
effective length of database: 8,933,572,693
effective search space: 2680071807900
effective search space used: 2680071807900
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 78 (34.7 bits)