BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy8831
(88 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|328721860|ref|XP_001943307.2| PREDICTED: hypothetical protein LOC100166174 isoform 1
[Acyrthosiphon pisum]
gi|328721862|ref|XP_003247419.1| PREDICTED: hypothetical protein LOC100166174 isoform 2
[Acyrthosiphon pisum]
Length = 637
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/48 (75%), Positives = 40/48 (83%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+QTGNKRSGV+CANCST TTLWRRNNNGEPVCNACGL ++N P
Sbjct: 464 QQTGNKRSGVACANCSTNTTTLWRRNNNGEPVCNACGLYFKLHNVNRP 511
Score = 42.4 bits (98), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC+ T LWRR+ G +CNACGL +N P
Sbjct: 417 CVNCAANVTPLWRRDGTGHHLCNACGLYNRINGVNRP 453
>gi|345491204|ref|XP_001607832.2| PREDICTED: hypothetical protein LOC100124020 [Nasonia vitripennis]
Length = 737
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/46 (71%), Positives = 35/46 (76%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
TG +R GV CANCSTT TTLWRRNNNGEPVCNACGL +N P
Sbjct: 560 TGGRRLGVRCANCSTTTTTLWRRNNNGEPVCNACGLYFKLHGVNRP 605
Score = 35.4 bits (80), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 21/40 (52%), Gaps = 1/40 (2%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ +CNACG+ T I+ P
Sbjct: 508 GRECVNCGAHTTPLWRRDGTTY-LCNACGICSKTNGISRP 546
>gi|357609511|gb|EHJ66491.1| GATA transcription factor GATAc [Danaus plexippus]
Length = 633
Score = 75.1 bits (183), Expect = 6e-12, Method: Composition-based stats.
Identities = 34/46 (73%), Positives = 38/46 (82%), Gaps = 3/46 (6%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL---IMNTVDIN 54
GN+R GV+CANC T+ TTLWRRNNNGEPVCNACGL + N VDIN
Sbjct: 505 GNRRVGVTCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNIVDIN 550
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 17/37 (45%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC+ T LWRR+ G +CNACGL +N P
Sbjct: 452 CVNCAAATTPLWRRDGTGHYLCNACGLYTRINGVNRP 488
>gi|157125526|ref|XP_001654370.1| transcription factor GATA-4 (GATA binding factor-4) [Aedes aegypti]
gi|108873605|gb|EAT37830.1| AAEL010224-PA [Aedes aegypti]
Length = 719
Score = 74.3 bits (181), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 31/45 (68%), Positives = 36/45 (80%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
GN+R+GV CANCSTT TTLWRRNN GEPVCNACGL +++ P
Sbjct: 531 GNRRTGVVCANCSTTTTTLWRRNNRGEPVCNACGLYHKLHNVDRP 575
Score = 40.0 bits (92), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI 47
C NC ++ T LWRR+N G +CNAC L
Sbjct: 477 CVNCGSSDTPLWRRDNVGHTLCNACALY 504
>gi|56900908|gb|AAW31747.1| GATA transcription factor GATAc [Aedes aegypti]
Length = 719
Score = 74.3 bits (181), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 31/45 (68%), Positives = 36/45 (80%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
GN+R+GV CANCSTT TTLWRRNN GEPVCNACGL +++ P
Sbjct: 530 GNRRTGVVCANCSTTTTTLWRRNNRGEPVCNACGLYHKLHNVDRP 574
Score = 40.0 bits (92), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI 47
C NC ++ T LWRR+N G +CNAC L
Sbjct: 476 CVNCGSSDTPLWRRDNVGHTLCNACALY 503
>gi|170043916|ref|XP_001849613.1| GATA transcription factor GATAc [Culex quinquefasciatus]
gi|167867188|gb|EDS30571.1| GATA transcription factor GATAc [Culex quinquefasciatus]
Length = 578
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 36/45 (80%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
GN+R+GV+CANC TT TTLWRRNN GEPVCNACGL +++ P
Sbjct: 505 GNRRTGVTCANCQTTTTTLWRRNNRGEPVCNACGLYHKLHNVDRP 549
Score = 39.7 bits (91), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI 47
C NC ++ T LWRR+N G +CNAC L
Sbjct: 451 CVNCGSSDTPLWRRDNVGHTLCNACALY 478
>gi|383863207|ref|XP_003707073.1| PREDICTED: uncharacterized protein LOC100875315 [Megachile
rotundata]
Length = 703
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/46 (69%), Positives = 36/46 (78%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
TG +R+GV CANC TT TTLWRRNNNGEPVCNACGL ++N P
Sbjct: 492 TGVRRTGVQCANCRTTNTTLWRRNNNGEPVCNACGLYYKLHNVNRP 537
>gi|54262117|ref|NP_001005725.1| GATA transcription factor e [Strongylocentrotus purpuratus]
gi|52001519|gb|AAU21562.1| GATA transcription factor [Strongylocentrotus purpuratus]
Length = 567
Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 37/49 (75%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R Q+G++R G++CANC T+ TTLWRRN +GEPVCNACGL +N P
Sbjct: 305 RLQSGSRREGITCANCHTSTTTLWRRNKDGEPVCNACGLYFKLHGVNRP 353
Score = 42.7 bits (99), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 20/40 (50%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL N P
Sbjct: 259 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGYNRP 298
>gi|37781291|gb|AAP35028.1| GATA transcription factor alpha [Patiria miniata]
Length = 644
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R Q+G++R G+ CANC TT TTLWRRN GEPVCNACGL +N P
Sbjct: 403 RLQSGSRREGIVCANCHTTTTTLWRRNKEGEPVCNACGLYYKLHSVNRP 451
Score = 41.2 bits (95), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 18/31 (58%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
G C NC T LWRR+ G +CNACGL
Sbjct: 357 GRECVNCGAISTPLWRRDGTGHYLCNACGLY 387
>gi|347967441|ref|XP_001230593.3| AGAP002236-PA [Anopheles gambiae str. PEST]
gi|333466294|gb|EAU77738.3| AGAP002236-PA [Anopheles gambiae str. PEST]
Length = 747
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 31/45 (68%), Positives = 35/45 (77%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
GN+RSGV+CANC TT TTLWRRNN G+PVCNACGL +N P
Sbjct: 551 GNRRSGVTCANCQTTTTTLWRRNNQGDPVCNACGLYYKLHSVNRP 595
Score = 36.2 bits (82), Expect = 2.9, Method: Composition-based stats.
Identities = 14/27 (51%), Positives = 18/27 (66%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
C NC ++ T LWRR+ G +CNAC L
Sbjct: 496 CVNCGSSDTPLWRRDIVGHTLCNACAL 522
>gi|37781177|gb|AAP34298.1| GATA transcription factor [Patiria miniata]
Length = 676
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 32/58 (55%), Positives = 38/58 (65%), Gaps = 3/58 (5%)
Query: 2 ILTFGIRE---QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+L F R Q+G++R G+ CANC TT TTLWRRN GEPVCNACGL +N P
Sbjct: 426 LLRFKDRRWKPQSGSRREGIVCANCHTTTTTLWRRNKEGEPVCNACGLYYKLHSVNRP 483
Score = 41.6 bits (96), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL ++ P
Sbjct: 357 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMSRP 396
>gi|285157832|gb|ADC35039.1| GATA456a [Themiste lageniformis]
Length = 730
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+G++R G+ CANC T+ TTLWRRNN GEPVCNACGL +N P
Sbjct: 440 QSGSRRMGLQCANCGTSTTTLWRRNNEGEPVCNACGLYYKLHQVNRP 486
Score = 43.1 bits (100), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 387 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 426
>gi|165292347|dbj|BAF98879.1| GATA binding protein 3 isoform 6 [Carassius auratus langsdorfii]
Length = 242
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 36/49 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ Q+ +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 114 QAQSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 162
>gi|165292339|dbj|BAF98875.1| GATA binding protein 3 isoform 2 [Carassius auratus langsdorfii]
Length = 261
Score = 71.2 bits (173), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 36/49 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ Q+ +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 133 QAQSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 181
>gi|328786511|ref|XP_001121244.2| PREDICTED: hypothetical protein LOC725389 [Apis mellifera]
Length = 808
Score = 71.2 bits (173), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 35/45 (77%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G +R+GV CANC T+ TTLWRRNNNGEPVCNACGL ++N P
Sbjct: 596 GVRRTGVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVNRP 640
>gi|307198089|gb|EFN79142.1| GATA-binding factor A [Harpegnathos saltator]
Length = 728
Score = 71.2 bits (173), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 30/46 (65%), Positives = 36/46 (78%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
TG +R+GV CANC T+ TTLWRRNN+GEPVCNACGL ++N P
Sbjct: 555 TGVRRTGVQCANCKTSNTTLWRRNNSGEPVCNACGLYFKLHNMNRP 600
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 27/52 (51%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNG 71
CANC+ + T LWRR+ G +CNACGL N P P + I+ G
Sbjct: 505 CANCAASLTPLWRRDGTGHYLCNACGLYSKMNGHNRPPMRCPKPKQTITPTG 556
>gi|405974156|gb|EKC38824.1| Transcription factor GATA-4 [Crassostrea gigas]
Length = 443
Score = 70.9 bits (172), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 31/56 (55%), Positives = 37/56 (66%)
Query: 1 MILTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
M + + ++R G+SCANCSTT TTLWRRNN GEPVCNACGL +N P
Sbjct: 252 MYPDYNPLPYSASRRVGLSCANCSTTTTTLWRRNNEGEPVCNACGLYYKLHGVNRP 307
>gi|391331261|ref|XP_003740068.1| PREDICTED: GATA-binding factor 2-like [Metaseiulus occidentalis]
Length = 480
Score = 70.9 bits (172), Expect = 8e-11, Method: Composition-based stats.
Identities = 31/49 (63%), Positives = 37/49 (75%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R Q+ +R+G SCANC TT TTLWRRN+NGEPVCNACGL ++N P
Sbjct: 292 RLQSAARRAGTSCANCKTTTTTLWRRNHNGEPVCNACGLYYKLHNVNRP 340
Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 237 KNKTRSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 285
>gi|380017690|ref|XP_003692780.1| PREDICTED: uncharacterized protein LOC100866028 [Apis florea]
Length = 809
Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 35/45 (77%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G +R+GV CANC T+ TTLWRRNNNGEPVCNACGL ++N P
Sbjct: 596 GVRRTGVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVNRP 640
>gi|307186220|gb|EFN71904.1| Transcription factor GATA-4 [Camponotus floridanus]
Length = 713
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/46 (67%), Positives = 35/46 (76%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
TG +R+GV CANC T TTLWRRNNNGEPVCNACGL ++N P
Sbjct: 539 TGVRRTGVQCANCRTNNTTLWRRNNNGEPVCNACGLYFKLHNVNRP 584
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 19/42 (45%), Positives = 24/42 (57%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMP 61
CANC++ T LWRR+ G +CNACGL +N P P
Sbjct: 490 CANCASQMTPLWRRDGAGHYLCNACGLYGKMNGVNRPPMRCP 531
>gi|258645094|ref|NP_001158260.1| grain [Tribolium castaneum]
Length = 492
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ +R+G SCANC TT TTLWRRN NGEPVCNACGL ++N P
Sbjct: 316 QSAARRAGTSCANCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVNRP 362
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ G C NC T T LWRR+ G +CNACGL N P
Sbjct: 257 RNKSRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 305
>gi|322782909|gb|EFZ10627.1| hypothetical protein SINV_01972 [Solenopsis invicta]
Length = 726
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 34/43 (79%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+GV CANC T+ TTLWRRNNNGEPVCNACGL ++N P
Sbjct: 557 RRTGVQCANCRTSNTTLWRRNNNGEPVCNACGLYFKLHNVNRP 599
Score = 41.6 bits (96), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 20/28 (71%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI 47
C NC+ + T LWRR+ +G +CNACGL
Sbjct: 503 CVNCAASMTPLWRRDGSGHYLCNACGLY 530
>gi|332016552|gb|EGI57433.1| GATA-binding factor A [Acromyrmex echinatior]
Length = 704
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 34/43 (79%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+GV CANC T+ TTLWRRNNNGEPVCNACGL ++N P
Sbjct: 535 RRAGVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVNRP 577
Score = 43.1 bits (100), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 22/37 (59%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC+ + T LWRR+ G +CNACGL +N P
Sbjct: 482 CVNCAASMTPLWRRDGTGHYLCNACGLYNKMNGVNRP 518
>gi|270015717|gb|EFA12165.1| hypothetical protein TcasGA2_TC002315 [Tribolium castaneum]
Length = 409
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ +R+G SCANC TT TTLWRRN NGEPVCNACGL ++N P
Sbjct: 233 QSAARRAGTSCANCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVNRP 279
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ G C NC T T LWRR+ G +CNACGL N P
Sbjct: 174 RNKSRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 222
>gi|270008082|gb|EFA04530.1| grain [Tribolium castaneum]
Length = 537
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ Q G +R+GV CANC T TTLWRRNN GEPVCNACGL ++N P
Sbjct: 332 QAQPGPRRNGVQCANCKTGNTTLWRRNNQGEPVCNACGLYFKLHNVNRP 380
Score = 41.2 bits (95), Expect = 0.080, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 18/27 (66%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
C NC + T LWRR+ G +CNACGL
Sbjct: 287 CVNCGASVTPLWRRDGTGHYLCNACGL 313
>gi|291461544|dbj|BAI83406.1| GATA-C [Parasteatoda tepidariorum]
Length = 528
Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NGEPVCNACGL ++N P
Sbjct: 359 RRLSAARRAGTSCANCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVNRP 407
Score = 44.3 bits (103), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 305 RSKGRSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 353
>gi|157278177|ref|NP_001098188.1| transcription factor GATA-3 [Oryzias latipes]
gi|62005802|dbj|BAD91312.1| transcription factor GATA-3 [Oryzias latipes]
Length = 440
Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 300 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNINRP 348
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 246 RPKTRSCSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 294
>gi|348521214|ref|XP_003448121.1| PREDICTED: transcription factor GATA-3-like isoform 1 [Oreochromis
niloticus]
Length = 443
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 303 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNINRP 351
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 249 RPKTRSTSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 297
>gi|18858733|ref|NP_571286.1| transcription factor GATA-3 [Danio rerio]
gi|3183010|sp|Q91428.1|GATA3_DANRE RecName: Full=Transcription factor GATA-3; AltName:
Full=GATA-binding factor 3
gi|1245717|gb|AAA93491.1| transcription factor [Danio rerio]
gi|190336965|gb|AAI62389.1| GATA-binding protein 3 [Danio rerio]
gi|190336980|gb|AAI62401.1| GATA-binding protein 3 [Danio rerio]
Length = 438
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 298 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 346
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 244 RPKTRSSSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 292
>gi|195037647|ref|XP_001990272.1| GH19248 [Drosophila grimshawi]
gi|193894468|gb|EDV93334.1| GH19248 [Drosophila grimshawi]
Length = 696
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 521 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 567
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 24/50 (48%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 458 VRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 507
>gi|340719407|ref|XP_003398145.1| PREDICTED: hypothetical protein LOC100650330 [Bombus terrestris]
Length = 813
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/45 (66%), Positives = 35/45 (77%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G +R+GV CANC T+ TTLWRRNNNGEPVCNACGL ++N P
Sbjct: 589 GVRRTGVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVNRP 633
>gi|195390588|ref|XP_002053950.1| GJ23061 [Drosophila virilis]
gi|194152036|gb|EDW67470.1| GJ23061 [Drosophila virilis]
Length = 686
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 511 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 557
Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 24/50 (48%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 448 VRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 497
>gi|148222826|ref|NP_001084335.1| GATA-binding factor 3 [Xenopus laevis]
gi|83406000|gb|AAI10755.1| XGATA-3 protein [Xenopus laevis]
Length = 435
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 298 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 346
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 244 RPKTRSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 292
>gi|204306643|gb|ACH99859.1| GATA-3 [Cyprinus carpio]
Length = 442
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 302 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 350
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 248 RPKTRSCSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 296
>gi|120964|sp|P23773.1|GATA3_XENLA RecName: Full=GATA-binding factor 3; AltName: Full=Transcription
factor xGATA-3
gi|214173|gb|AAA49724.1| GATA binding factor-3 [Xenopus laevis]
Length = 435
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 298 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 346
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 244 RPKTRSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 292
>gi|52345846|ref|NP_001004967.1| GATA binding protein 3 [Xenopus (Silurana) tropicalis]
gi|49522444|gb|AAH75479.1| GATA binding protein 3 [Xenopus (Silurana) tropicalis]
Length = 437
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 299 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 347
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 245 RPKTRSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 293
>gi|165292335|dbj|BAF98873.1| GATA binding protein 3 [Carassius auratus langsdorfii]
Length = 441
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 301 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 349
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 RPKTRSCSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 295
>gi|194904144|ref|XP_001981009.1| GG17473 [Drosophila erecta]
gi|190652712|gb|EDV49967.1| GG17473 [Drosophila erecta]
Length = 487
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 312 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 358
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 19/43 (44%), Positives = 22/43 (51%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G C NC T T LWRR+ G +CNACGL N P
Sbjct: 256 REEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 298
>gi|126340357|ref|XP_001363328.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Monodelphis domestica]
Length = 444
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|395538986|ref|XP_003771455.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Sarcophilus harrisii]
Length = 444
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|242020471|ref|XP_002430677.1| endothelial transcription factor GATA-2, putative [Pediculus
humanus corporis]
gi|212515857|gb|EEB17939.1| endothelial transcription factor GATA-2, putative [Pediculus
humanus corporis]
Length = 399
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NGEPVCNACGL ++N P
Sbjct: 342 RRLSAARRAGTSCANCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVNRP 390
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 288 RNKSRSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 336
>gi|56605890|ref|NP_001008444.1| GATA-binding factor 3 [Gallus gallus]
gi|120961|sp|P23825.1|GATA3_CHICK RecName: Full=GATA-binding factor 3; Short=GATA-3; AltName:
Full=Transcription factor NF-E1c
gi|62968|emb|CAA40253.1| NF-E1 [Gallus gallus]
Length = 444
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|117276621|gb|ABK32791.1| GATA transcription factor 123 [Platynereis dumerilii]
Length = 523
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN+NG+PVCNACGL +N P
Sbjct: 334 RRLSAARRAGTSCANCGTTTTTLWRRNHNGDPVCNACGLYYKLHSVNRP 382
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 26/50 (52%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R ++ + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 279 VRTKSRSSSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGANRP 328
>gi|449480617|ref|XP_004176564.1| PREDICTED: LOW QUALITY PROTEIN: GATA-binding factor 3-like
[Taeniopygia guttata]
Length = 445
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 307 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 355
>gi|285157761|gb|ADC35036.1| GATA456a [Chaetopterus sp. MB-2010a]
Length = 597
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/48 (60%), Positives = 36/48 (75%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+++ ++R G+SCANC TT TTLWRRNN GEPVCNACGL +N P
Sbjct: 407 KKSASRRVGLSCANCQTTTTTLWRRNNEGEPVCNACGLYYKLHGVNRP 454
Score = 42.7 bits (99), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 334 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 373
>gi|24644977|ref|NP_731211.1| grain, isoform A [Drosophila melanogaster]
gi|3183017|sp|P91623.1|GATC_DROME RecName: Full=GATA-binding factor C; AltName: Full=Protein grain;
AltName: Full=Transcription factor GATA-C; AltName:
Full=dGATA-C
gi|1815600|dbj|BAA09102.1| GATA transcription factor [Drosophila melanogaster]
gi|7298992|gb|AAF54195.1| grain, isoform A [Drosophila melanogaster]
gi|17945890|gb|AAL48991.1| RE40104p [Drosophila melanogaster]
gi|220942548|gb|ACL83817.1| grn-PA [synthetic construct]
gi|220952714|gb|ACL88900.1| grn-PA [synthetic construct]
Length = 486
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 311 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 357
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 19/43 (44%), Positives = 22/43 (51%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G C NC T T LWRR+ G +CNACGL N P
Sbjct: 255 REEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 297
>gi|449272886|gb|EMC82589.1| GATA-binding factor 3 [Columba livia]
Length = 444
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|432089717|gb|ELK23534.1| Trans-acting T-cell-specific transcription factor GATA-3 [Myotis
davidii]
Length = 444
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|417401032|gb|JAA47421.1| Putative trans-acting t-cell-specific transcription factor gata-3
[Desmodus rotundus]
Length = 444
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|198452168|ref|XP_001358657.2| GA21945, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198131814|gb|EAL27798.2| GA21945, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 699
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 524 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 570
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 24/50 (48%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 461 VRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 510
>gi|195145358|ref|XP_002013663.1| GL23278 [Drosophila persimilis]
gi|194102606|gb|EDW24649.1| GL23278 [Drosophila persimilis]
Length = 705
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 530 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 576
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 24/50 (48%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 467 VRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 516
>gi|73949076|ref|XP_849153.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
isoform 1 [Canis lupus familiaris]
Length = 444
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|74185986|dbj|BAE34137.1| unnamed protein product [Mus musculus]
Length = 408
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 270 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 318
Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 211 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 264
>gi|117646668|emb|CAL37449.1| hypothetical protein [synthetic construct]
gi|208966350|dbj|BAG73189.1| GATA binding protein 3 [synthetic construct]
Length = 443
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 305 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYELHNINRP 353
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 5 FGIREQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 299
>gi|221378310|ref|NP_001138026.1| grain, isoform B [Drosophila melanogaster]
gi|220903030|gb|ACL83485.1| grain, isoform B [Drosophila melanogaster]
Length = 699
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 524 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 570
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 24/50 (48%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 461 VRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 510
>gi|195453771|ref|XP_002073935.1| GK12883 [Drosophila willistoni]
gi|194170020|gb|EDW84921.1| GK12883 [Drosophila willistoni]
Length = 491
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 317 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 363
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 19/43 (44%), Positives = 22/43 (51%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G C NC T T LWRR+ G +CNACGL N P
Sbjct: 261 REEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 303
>gi|195499000|ref|XP_002096762.1| GE24873 [Drosophila yakuba]
gi|194182863|gb|EDW96474.1| GE24873 [Drosophila yakuba]
Length = 707
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 532 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 578
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 24/50 (48%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 469 VRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 518
>gi|194746249|ref|XP_001955593.1| GF16170 [Drosophila ananassae]
gi|190628630|gb|EDV44154.1| GF16170 [Drosophila ananassae]
Length = 703
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 528 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 574
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 24/50 (48%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 465 VRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 514
>gi|195330796|ref|XP_002032089.1| GM26366 [Drosophila sechellia]
gi|194121032|gb|EDW43075.1| GM26366 [Drosophila sechellia]
Length = 703
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 528 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 574
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 24/50 (48%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 465 VRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 514
>gi|281339813|gb|EFB15397.1| hypothetical protein PANDA_008702 [Ailuropoda melanoleuca]
Length = 412
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 274 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 322
Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 215 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 268
>gi|390178038|ref|XP_003736551.1| GA21945, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859299|gb|EIM52624.1| GA21945, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 488
Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 313 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 359
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 19/43 (44%), Positives = 22/43 (51%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G C NC T T LWRR+ G +CNACGL N P
Sbjct: 257 REEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 299
>gi|301769219|ref|XP_002920028.1| PREDICTED: trans-acting T-cell-specific transcription factor
GATA-3-like [Ailuropoda melanoleuca]
Length = 411
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 273 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 321
Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 214 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 267
>gi|431917650|gb|ELK16915.1| Trans-acting T-cell-specific transcription factor GATA-3 [Pteropus
alecto]
Length = 409
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 271 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 319
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 212 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 265
>gi|241998888|ref|XP_002434087.1| endothelial transcription factor GATA-2, putative [Ixodes
scapularis]
gi|215495846|gb|EEC05487.1| endothelial transcription factor GATA-2, putative [Ixodes
scapularis]
Length = 385
Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NGEPVCNACGL ++N P
Sbjct: 222 RRLSAARRAGTSCANCKTTTTTLWRRNQNGEPVCNACGLYFKLHNVNRP 270
Score = 42.7 bits (99), Expect = 0.026, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 168 RTKGRSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 216
>gi|345310085|ref|XP_003428923.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3-like [Ornithorhynchus
anatinus]
Length = 440
Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 302 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 350
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 257 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 296
>gi|165292343|dbj|BAF98877.1| GATA binding protein 3 isoform 4 [Carassius auratus langsdorfii]
Length = 277
Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 149 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 197
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 104 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 143
>gi|195108873|ref|XP_001999017.1| GI24284 [Drosophila mojavensis]
gi|193915611|gb|EDW14478.1| GI24284 [Drosophila mojavensis]
Length = 378
Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 203 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 249
Score = 44.3 bits (103), Expect = 0.011, Method: Composition-based stats.
Identities = 19/43 (44%), Positives = 22/43 (51%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G C NC T T LWRR+ G +CNACGL N P
Sbjct: 147 REEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 189
>gi|442617963|ref|NP_001262366.1| grain, isoform C [Drosophila melanogaster]
gi|440217190|gb|AGB95748.1| grain, isoform C [Drosophila melanogaster]
Length = 712
Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 524 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 570
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 24/50 (48%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 461 VRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 510
>gi|426240701|ref|XP_004014232.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3 [Ovis aries]
Length = 421
Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 283 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 331
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 224 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 277
>gi|195572641|ref|XP_002104304.1| GD20889 [Drosophila simulans]
gi|194200231|gb|EDX13807.1| GD20889 [Drosophila simulans]
Length = 292
Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G SCANC TT TTLWRRN +GEPVCNACGL ++N P
Sbjct: 117 QSAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRP 163
>gi|327290258|ref|XP_003229840.1| PREDICTED: trans-acting T-cell-specific transcription factor
GATA-3-like, partial [Anolis carolinensis]
Length = 364
Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 226 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 274
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 167 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 220
>gi|157112345|ref|XP_001651800.1| gata-binding factor-c (transcription factor gata-c) [Aedes
aegypti]
gi|108878107|gb|EAT42332.1| AAEL006114-PA, partial [Aedes aegypti]
Length = 169
Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 34/47 (72%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ +R+G SCANC TT TTLWRRN GEPVCNACGL ++N P
Sbjct: 3 QSAARRAGTSCANCKTTTTTLWRRNQGGEPVCNACGLYYKLHNVNRP 49
>gi|326911086|ref|XP_003201893.1| PREDICTED: GATA-binding factor 3-like [Meleagris gallopavo]
Length = 408
Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 270 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 318
Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 211 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 264
>gi|53147411|dbj|BAD52301.1| transcription factor Gata-binding protein 2/3 [Eptatretus burgeri]
Length = 491
Score = 68.9 bits (167), Expect = 3e-10, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 343 RRLSAARRAGTSCANCQTTITTLWRRNANGDPVCNACGLYFKLHNVNRP 391
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 298 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 337
>gi|328697578|ref|XP_001945293.2| PREDICTED: hypothetical protein LOC100159936 [Acyrthosiphon pisum]
Length = 657
Score = 68.9 bits (167), Expect = 4e-10, Method: Composition-based stats.
Identities = 29/47 (61%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ +R+G SCANC T+ TTLWRRN NGEPVCNACGL ++N P
Sbjct: 469 QSAARRAGTSCANCKTSTTTLWRRNQNGEPVCNACGLYYKLHNVNRP 515
Score = 42.7 bits (99), Expect = 0.026, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 419 GRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 458
>gi|374671149|gb|AEZ56247.1| transcription factor GATA-3 [Gadus morhua]
Length = 440
Score = 68.9 bits (167), Expect = 4e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 304 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNINRP 352
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 250 RPKTRSCSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 298
>gi|410963189|ref|XP_003988148.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Felis catus]
Length = 492
Score = 68.6 bits (166), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 354 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 402
Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 5 FGIREQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 296 FGCKSRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 348
>gi|195452168|ref|XP_002073242.1| GK13254 [Drosophila willistoni]
gi|194169327|gb|EDW84228.1| GK13254 [Drosophila willistoni]
Length = 746
Score = 68.6 bits (166), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/44 (65%), Positives = 34/44 (77%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
N+R+GV+CANCST TTLWRRNN G PVCNACGL ++N P
Sbjct: 518 NRRNGVTCANCSTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRP 561
>gi|3702858|gb|AAC62961.1| GATA transcription factor [Strongylocentrotus purpuratus]
Length = 119
Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 37/49 (75%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R Q+G++R G++CANC T+ TTLWRRN +GEPVCNACGL +N P
Sbjct: 47 RLQSGSRREGITCANCHTSTTTLWRRNKDGEPVCNACGLYFKLHGVNRP 95
Score = 41.2 bits (95), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 20/40 (50%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL N P
Sbjct: 1 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGYNRP 40
>gi|344254252|gb|EGW10356.1| Endonuclease VIII-like 2 [Cricetulus griseus]
Length = 509
Score = 68.6 bits (166), Expect = 5e-10, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 74 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 122
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 29 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 68
>gi|284005110|ref|NP_001164701.1| gata4 transcription factor [Saccoglossus kowalevskii]
gi|283462246|gb|ADB22417.1| gata4 transcription factor [Saccoglossus kowalevskii]
Length = 516
Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+ CANC TT TTLWRRNN GEPVCNACGL +N P
Sbjct: 342 RRLSASRRVGLQCANCHTTQTTLWRRNNEGEPVCNACGLYFKLHSVNRP 390
Score = 42.7 bits (99), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 297 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGMNRP 336
>gi|117276623|gb|ABK32792.1| GATA transcription factor 456 [Platynereis dumerilii]
Length = 364
Score = 68.6 bits (166), Expect = 5e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRNN GEPVCNACGL +N P
Sbjct: 302 RRLSASRRVGLSCANCHTTTTTLWRRNNEGEPVCNACGLYYKLHGVNRP 350
Score = 42.4 bits (98), Expect = 0.032, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 257 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGLNRP 296
>gi|19386566|gb|AAL86577.1| GATA-3 [Raja eglanteria]
Length = 444
Score = 68.6 bits (166), Expect = 5e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 301 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 349
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 19/41 (46%), Positives = 22/41 (53%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G C NC T T LWRR+ G +CNACGL N P
Sbjct: 255 TGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 295
>gi|238863844|gb|ACR66216.1| transcription factor GATA456a, partial [Branchiostoma floridae]
Length = 424
Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+ CANC TT TTLWRRNN GEPVCNACGL ++N P
Sbjct: 336 RRLSASRRVGLQCANCRTTQTTLWRRNNEGEPVCNACGLYFKLHNVNRP 384
Score = 45.8 bits (107), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL +N P
Sbjct: 291 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 330
>gi|323652552|gb|ADX98527.1| GATA-binding protein 3 [Coturnix japonica]
Length = 281
Score = 68.2 bits (165), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 194 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 242
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 5 FGIREQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 136 FGCKSRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 188
>gi|335353907|dbj|BAK39710.1| GATA binding protein 3 [Tursiops truncatus]
Length = 251
Score = 68.2 bits (165), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 163 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 211
Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 104 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 157
>gi|347971495|ref|XP_562743.4| AGAP004228-PA [Anopheles gambiae str. PEST]
gi|333468694|gb|EAL40667.4| AGAP004228-PA [Anopheles gambiae str. PEST]
Length = 603
Score = 68.2 bits (165), Expect = 6e-10, Method: Composition-based stats.
Identities = 29/47 (61%), Positives = 34/47 (72%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ +R+G SCANC TT TTLWRRN GEPVCNACGL ++N P
Sbjct: 387 QSAARRAGTSCANCKTTTTTLWRRNQGGEPVCNACGLYYKLHNVNRP 433
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 5/78 (6%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP-----EKIMPP 62
R + G C NC T T LWRR+ G +CNACGL N P +++
Sbjct: 326 RTKARTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLVSS 385
Query: 63 IESMISRNGDALSVITST 80
++S R G + + +T
Sbjct: 386 LQSAARRAGTSCANCKTT 403
>gi|195036112|ref|XP_001989515.1| GH18758 [Drosophila grimshawi]
gi|193893711|gb|EDV92577.1| GH18758 [Drosophila grimshawi]
Length = 749
Score = 68.2 bits (165), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 29/44 (65%), Positives = 34/44 (77%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
N+R+GV+CANCST TTLWRRNN G PVCNACGL ++N P
Sbjct: 518 NRRNGVTCANCSTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRP 561
>gi|124001857|gb|ABM87875.1| GATA3 [Papio hamadryas]
Length = 136
Score = 68.2 bits (165), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 33/43 (76%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 4 RRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 46
>gi|349603634|gb|AEP99422.1| Trans-acting T-cell-specific transcription factor GATA-3-like
protein, partial [Equus caballus]
Length = 136
Score = 68.2 bits (165), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 33/43 (76%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 4 RRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 46
>gi|124054318|gb|ABM89372.1| GATA3 [Pongo pygmaeus]
Length = 136
Score = 68.2 bits (165), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 33/43 (76%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 4 RRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 46
>gi|285157785|gb|ADC35037.1| GATA123a [Chaetopterus sp. MB-2010a]
Length = 489
Score = 67.8 bits (164), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC T+ TTLWRRN NG+PVCNACGL ++N P
Sbjct: 310 RRLSAARRAGTSCANCGTSTTTLWRRNPNGDPVCNACGLYYKLHNVNRP 358
Score = 44.7 bits (104), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 265 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGANRP 304
>gi|242010040|ref|XP_002425784.1| GATA binding factor-1B, putative [Pediculus humanus corporis]
gi|212509717|gb|EEB13046.1| GATA binding factor-1B, putative [Pediculus humanus corporis]
Length = 907
Score = 67.8 bits (164), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 28/36 (77%), Positives = 30/36 (83%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
N+R+GV CANC TT TTLWRRNN GEPVCNACGL
Sbjct: 712 NRRTGVECANCRTTNTTLWRRNNTGEPVCNACGLYF 747
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC+T T LWRR+++G +CNACGL +N P
Sbjct: 646 ECVNCATCRTPLWRRDDDGHYLCNACGLYNKVNGVNRP 683
>gi|356582220|ref|NP_001239112.1| trans-acting T-cell-specific transcription factor GATA-3 [Ovis
aries]
gi|342671952|dbj|BAK57314.1| transcription factor GATA3 [Ovis aries]
Length = 444
Score = 67.8 bits (164), Expect = 7e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTLGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|440906268|gb|ELR56550.1| Trans-acting T-cell-specific transcription factor GATA-3 [Bos
grunniens mutus]
Length = 444
Score = 67.8 bits (164), Expect = 7e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|116003829|ref|NP_001070272.1| trans-acting T-cell-specific transcription factor GATA-3 [Bos
taurus]
gi|122132376|sp|Q08DV0.1|GATA3_BOVIN RecName: Full=Trans-acting T-cell-specific transcription factor
GATA-3; AltName: Full=GATA-binding factor 3
gi|115305262|gb|AAI23556.1| GATA binding protein 3 [Bos taurus]
gi|296481519|tpg|DAA23634.1| TPA: trans-acting T-cell-specific transcription factor GATA-3 [Bos
taurus]
Length = 443
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 305 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 353
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 5 FGIREQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 299
>gi|410918522|ref|XP_003972734.1| PREDICTED: transcription factor GATA-3-like isoform 1 [Takifugu
rubripes]
Length = 444
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 304 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNINRP 352
Score = 42.7 bits (99), Expect = 0.026, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 250 RPKSRSCSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 298
>gi|344277697|ref|XP_003410636.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3-like [Loxodonta africana]
Length = 444
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|410918524|ref|XP_003972735.1| PREDICTED: transcription factor GATA-3-like isoform 2 [Takifugu
rubripes]
Length = 451
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 311 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNINRP 359
Score = 42.7 bits (99), Expect = 0.027, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 257 RPKSRSCSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 305
>gi|327264202|ref|XP_003216904.1| PREDICTED: erythroid transcription factor-like [Anolis
carolinensis]
Length = 383
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G C+NC TT TTLWRRN NGEPVCNACGL ++N P
Sbjct: 240 SKRAGTQCSNCQTTTTTLWRRNVNGEPVCNACGLYFKLHNVNRP 283
Score = 42.7 bits (99), Expect = 0.027, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T T LWRR+ G +CNACGL N P
Sbjct: 193 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 229
>gi|291401984|ref|XP_002717407.1| PREDICTED: GATA binding protein 3 [Oryctolagus cuniculus]
Length = 444
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|238863840|gb|ACR66214.1| transcription factor GATA123 [Branchiostoma floridae]
Length = 473
Score = 67.8 bits (164), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL +N P
Sbjct: 318 RRLSAARRAGTQCANCKTTTTTLWRRNQNGDPVCNACGLYYKLHAVNRP 366
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 3/46 (6%)
Query: 14 KRS---GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
KRS G C NC T T LWRR+ G +CNACGL N P
Sbjct: 267 KRSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 312
>gi|348575369|ref|XP_003473462.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3-like [Cavia porcellus]
Length = 444
Score = 67.8 bits (164), Expect = 9e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|113205712|ref|NP_001038032.1| trans-acting T-cell-specific transcription factor GATA-3 [Sus
scrofa]
gi|91177057|gb|ABE26883.1| GATA binding protein 3 [Sus scrofa]
Length = 444
Score = 67.8 bits (164), Expect = 9e-10, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|313217859|emb|CBY41259.1| unnamed protein product [Oikopleura dioica]
Length = 185
Score = 67.8 bits (164), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 35/47 (74%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ +R+G SC+NC TT TTLWRRN+NG+PVCNACGL +N P
Sbjct: 6 QSAARRAGTSCSNCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVNRP 52
>gi|260809775|ref|XP_002599680.1| hypothetical protein BRAFLDRAFT_205741 [Branchiostoma floridae]
gi|229284961|gb|EEN55692.1| hypothetical protein BRAFLDRAFT_205741 [Branchiostoma floridae]
Length = 473
Score = 67.4 bits (163), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL +N P
Sbjct: 318 RRLSAARRAGTQCANCKTTTTTLWRRNQNGDPVCNACGLYYKLHAVNRP 366
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 22/42 (52%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R G C NC T T LWRR+ G +CNACGL N P
Sbjct: 271 RFGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 312
>gi|351708641|gb|EHB11560.1| Trans-acting T-cell-specific transcription factor GATA-3
[Heterocephalus glaber]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|195111292|ref|XP_002000213.1| GI22648 [Drosophila mojavensis]
gi|193916807|gb|EDW15674.1| GI22648 [Drosophila mojavensis]
Length = 750
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/44 (65%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
N+RSGV CANC T TTLWRRNN G PVCNACGL ++N P
Sbjct: 520 NRRSGVVCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRP 563
>gi|157169245|gb|ABV25957.1| GATA-binding transcription factor A1 [Capitella teleta]
Length = 372
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 36/49 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G +CANC TT TTLWRRN+NG+PVCNACGL ++N P
Sbjct: 315 RRLSAARRAGTNCANCGTTTTTLWRRNHNGDPVCNACGLYYKLHNVNRP 363
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 26/50 (52%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R ++ + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 260 VRTKSRSSSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMSGQNRP 309
>gi|6679951|ref|NP_032117.1| trans-acting T-cell-specific transcription factor GATA-3 [Mus
musculus]
gi|120963|sp|P23772.1|GATA3_MOUSE RecName: Full=Trans-acting T-cell-specific transcription factor
GATA-3; AltName: Full=GATA-binding factor 3
gi|51053|emb|CAA38917.1| GATA-3 factor [Mus musculus]
gi|38566046|gb|AAH62915.1| GATA binding protein 3 [Mus musculus]
gi|74140120|dbj|BAE33786.1| unnamed protein product [Mus musculus]
gi|89275361|gb|ABD66080.1| GATA binding protein 3 [Mus musculus]
gi|89275371|gb|ABD66081.1| GATA binding protein 3 [Mus musculus]
gi|148676049|gb|EDL07996.1| GATA binding protein 3 [Mus musculus]
Length = 443
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 305 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 353
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 251 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 299
>gi|426363955|ref|XP_004049093.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Gorilla gorilla gorilla]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|74207086|dbj|BAE33319.1| unnamed protein product [Mus musculus]
Length = 442
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 304 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 352
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 5 FGIREQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 246 FGCKSRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 298
>gi|403296353|ref|XP_003939076.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Saimiri boliviensis boliviensis]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|395827355|ref|XP_003786870.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Otolemur garnettii]
Length = 445
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 307 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 355
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 253 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 301
>gi|354465002|ref|XP_003494969.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Cricetulus griseus]
gi|344238968|gb|EGV95071.1| Trans-acting T-cell-specific transcription factor GATA-3
[Cricetulus griseus]
Length = 443
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 305 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 353
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 251 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 299
>gi|332217060|ref|XP_003257671.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Nomascus leucogenys]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|297686007|ref|XP_002820561.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Pongo abelii]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|194475182|gb|ACF74542.1| GATA binding protein 3 [Cervus elaphus]
Length = 191
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 114 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 162
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 5 FGIREQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 56 FGCKSRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 108
>gi|117646902|emb|CAL37566.1| hypothetical protein [synthetic construct]
Length = 443
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 305 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 353
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 5 FGIREQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 299
>gi|344281251|ref|XP_003412393.1| PREDICTED: transcription factor GATA-4-like [Loxodonta africana]
Length = 611
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|194767721|ref|XP_001965963.1| GF11880 [Drosophila ananassae]
gi|190619806|gb|EDV35330.1| GF11880 [Drosophila ananassae]
Length = 710
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
N+R+GV+CANC T TTLWRRNN G PVCNACGL ++N P
Sbjct: 502 NRRNGVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRP 545
>gi|50745252|ref|XP_420041.1| PREDICTED: transcription factor GATA-4 [Gallus gallus]
Length = 410
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 229 RRLSASRRVGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 277
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 184 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 223
>gi|443686031|gb|ELT89440.1| hypothetical protein CAPTEDRAFT_149492, partial [Capitella teleta]
Length = 347
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 36/49 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G +CANC TT TTLWRRN+NG+PVCNACGL ++N P
Sbjct: 262 RRLSAARRAGTNCANCGTTTTTLWRRNHNGDPVCNACGLYYKLHNVNRP 310
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 26/50 (52%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R ++ + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 207 VRTKSRSSSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 256
>gi|195157148|ref|XP_002019458.1| GL12213 [Drosophila persimilis]
gi|194116049|gb|EDW38092.1| GL12213 [Drosophila persimilis]
Length = 760
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
N+R+GV+CANC T TTLWRRNN G PVCNACGL +N P
Sbjct: 535 NRRNGVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHSLNRP 578
>gi|60827672|gb|AAX36808.1| GATA binding protein 3 [synthetic construct]
gi|61368371|gb|AAX43165.1| GATA binding protein 3 [synthetic construct]
Length = 445
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|355782626|gb|EHH64547.1| hypothetical protein EGM_17792 [Macaca fascicularis]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|326916751|ref|XP_003204668.1| PREDICTED: transcription factor GATA-4-like [Meleagris gallopavo]
Length = 410
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 229 RRLSASRRVGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 277
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 184 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 223
>gi|55633439|ref|XP_507651.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
isoform 3 [Pan troglodytes]
gi|397515219|ref|XP_003827854.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Pan paniscus]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|30584633|gb|AAP36569.1| Homo sapiens GATA binding protein 3 [synthetic construct]
gi|61372534|gb|AAX43861.1| GATA binding protein 3 [synthetic construct]
gi|61372538|gb|AAX43862.1| GATA binding protein 3 [synthetic construct]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 305 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 353
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 5 FGIREQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 299
>gi|18959228|ref|NP_579827.1| GATA binding protein 3 [Rattus norvegicus]
gi|12704536|gb|AAK00586.1| GATA-3 [Rattus norvegicus]
gi|149021023|gb|EDL78630.1| GATA binding protein 3 [Rattus norvegicus]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|402879607|ref|XP_003903424.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Papio anubis]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|383417433|gb|AFH31930.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Macaca mulatta]
gi|383417435|gb|AFH31931.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Macaca mulatta]
gi|383417437|gb|AFH31932.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Macaca mulatta]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|109088183|ref|XP_001108337.1| PREDICTED: trans-acting T-cell-specific transcription factor
GATA-3-like isoform 3 [Macaca mulatta]
gi|355562284|gb|EHH18878.1| hypothetical protein EGK_19445 [Macaca mulatta]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|183000|gb|AAA35870.1| Gata3 enhancer-binding protein [Homo sapiens]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 38.9 bits (89), Expect = 0.42, Method: Composition-based stats.
Identities = 18/49 (36%), Positives = 22/49 (44%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +C CGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCRRCGLYHKMNGQNRP 300
>gi|47218982|emb|CAG02020.1| unnamed protein product [Tetraodon nigroviridis]
Length = 449
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 314 RRLSAARRAGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRP 362
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 269 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 308
>gi|4503929|ref|NP_002042.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 2
[Homo sapiens]
gi|120962|sp|P23771.1|GATA3_HUMAN RecName: Full=Trans-acting T-cell-specific transcription factor
GATA-3; AltName: Full=GATA-binding factor 3
gi|31666|emb|CAA41102.1| hGATA3 transcription factor [Homo sapiens]
gi|13111766|gb|AAH03070.1| GATA binding protein 3 [Homo sapiens]
gi|39843077|gb|AAR32096.1| GATA binding protein 3 [Homo sapiens]
gi|119606773|gb|EAW86367.1| GATA binding protein 3, isoform CRA_a [Homo sapiens]
Length = 443
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 305 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 353
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 5 FGIREQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 299
>gi|31662|emb|CAA38877.1| GATA-3 [Homo sapiens]
Length = 443
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 305 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 353
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 5 FGIREQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 299
>gi|50541959|ref|NP_001002295.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Homo sapiens]
gi|31664|emb|CAA38916.1| hGATA-3 factor [Homo sapiens]
gi|13905020|gb|AAH06793.1| GATA binding protein 3 [Homo sapiens]
gi|61358245|gb|AAX41534.1| GATA binding protein 3 [synthetic construct]
gi|119606774|gb|EAW86368.1| GATA binding protein 3, isoform CRA_b [Homo sapiens]
Length = 444
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 306 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 354
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 252 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|198454788|ref|XP_001359721.2| GA10213 [Drosophila pseudoobscura pseudoobscura]
gi|198132954|gb|EAL28873.2| GA10213 [Drosophila pseudoobscura pseudoobscura]
Length = 759
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
N+R+GV+CANC T TTLWRRNN G PVCNACGL +N P
Sbjct: 538 NRRNGVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHSLNRP 581
>gi|1169844|sp|P43691.1|GATA4_CHICK RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|511480|gb|AAA57503.1| GATA-4 transcription factor, partial [Gallus gallus]
Length = 380
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 199 RRLSASRRVGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 247
Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 154 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 193
>gi|432858527|ref|XP_004068890.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Oryzias latipes]
Length = 456
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 313 RRLSAARRAGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRP 361
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 268 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 307
>gi|410920237|ref|XP_003973590.1| PREDICTED: GATA-binding factor 2-like [Takifugu rubripes]
Length = 457
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 314 RRLSAARRAGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRP 362
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 269 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 308
>gi|348510371|ref|XP_003442719.1| PREDICTED: GATA-binding factor 2-like [Oreochromis niloticus]
Length = 458
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 315 RRLSAARRAGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRP 363
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 270 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 309
>gi|312379466|gb|EFR25730.1| hypothetical protein AND_08684 [Anopheles darlingi]
Length = 566
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 28/50 (56%), Positives = 35/50 (70%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+++QT +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 172 LKKQTATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 221
>gi|432858529|ref|XP_004068891.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Oryzias latipes]
Length = 447
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 304 RRLSAARRAGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRP 352
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 259 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 298
>gi|335371115|gb|AEH57086.1| GATA123 [Bugula neritina]
Length = 281
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 27/52 (51%), Positives = 36/52 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
R Q+ +R+G CANC T TTLWRRN +G+PVCNACGL ++N P+ +
Sbjct: 122 RRQSATRRAGTCCANCGTFATTLWRRNQSGDPVCNACGLYYKLHNVNRPKTM 173
Score = 42.7 bits (99), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 22/40 (55%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
+ +T G C NC T T LWRR+ G +CNACGL
Sbjct: 68 KSKTKTGTEGRECVNCGATSTPLWRRDGTGHYLCNACGLY 107
>gi|28950682|gb|AAO47082.1| GATA-binding protein 3 [Marmota monax]
Length = 108
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 21 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 69
>gi|148235016|ref|NP_001084098.1| transcription factor GATA-4 [Xenopus laevis]
gi|2494683|sp|Q91677.1|GATA4_XENLA RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4; Short=xGATA-4
gi|1209878|gb|AAB05647.1| transcription factor xGATA-4 [Xenopus laevis]
Length = 392
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 226 RRLSASRRVGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 274
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 181 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 220
>gi|49256546|gb|AAH71107.1| Gata4 protein [Xenopus laevis]
Length = 392
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 226 RRLSASRRVGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 274
Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 181 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 220
>gi|157169241|gb|ABV25955.1| GATA-binding transcription factor B2 [Capitella teleta]
Length = 282
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC T+ TTLWRRNN GEPVCNACGL +N P
Sbjct: 121 SRRVGLSCANCHTSTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 164
Score = 39.7 bits (91), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 19/30 (63%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G CANC +T LWR N G +CNACG+
Sbjct: 71 GRECANCGSTYAPLWRWNGTGHLLCNACGV 100
>gi|425906025|gb|AFY10809.1| Gata456 [Isodiametra pulchra]
Length = 194
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R+ T ++R G+ CANC T TTLWRRNN+G+PVCNACGL +N P
Sbjct: 7 RKMTTSRRMGLRCANCHTDSTTLWRRNNDGDPVCNACGLYYKLHHVNRP 55
>gi|395507323|ref|XP_003757975.1| PREDICTED: transcription factor GATA-4 isoform 2 [Sarcophilus
harrisii]
Length = 452
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 271 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 319
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 226 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 265
>gi|395507321|ref|XP_003757974.1| PREDICTED: transcription factor GATA-4 isoform 1 [Sarcophilus
harrisii]
Length = 451
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 270 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 318
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 225 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 264
>gi|189442167|gb|AAI67280.1| GATA binding protein 4 [Xenopus (Silurana) tropicalis]
Length = 394
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 228 RRLSASRRVGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 276
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 183 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 222
>gi|40538758|ref|NP_571308.1| GATA-binding protein 2a [Danio rerio]
gi|31544961|gb|AAH53131.1| GATA-binding protein 2a [Danio rerio]
Length = 456
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 313 RRLSAARRAGTCCANCQTTTTTLWRRNGNGDPVCNACGLYYKLHNVNRP 361
Score = 44.3 bits (103), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 268 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 307
>gi|62858609|ref|NP_001016949.1| GATA binding protein 4 [Xenopus (Silurana) tropicalis]
gi|89266736|emb|CAJ83964.1| GATA binding protein 4 [Xenopus (Silurana) tropicalis]
Length = 394
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 228 RRLSASRRVGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 276
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 183 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 222
>gi|341575748|gb|AEK80449.1| gata4 [Bubalus bubalis]
Length = 215
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 32 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 80
>gi|194767717|ref|XP_001965961.1| GF11857 [Drosophila ananassae]
gi|190619804|gb|EDV35328.1| GF11857 [Drosophila ananassae]
Length = 1271
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + KR+G+SC+NC TT T+LWRRN NGEPVCNACGL ++ P
Sbjct: 807 RRMSAQKRAGLSCSNCHTTYTSLWRRNPNGEPVCNACGLYFKLHNVPRP 855
>gi|440913584|gb|ELR63022.1| Transcription factor GATA-4, partial [Bos grunniens mutus]
Length = 268
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 85 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 133
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 40 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 79
>gi|387538427|gb|AFJ79490.1| GATA binding protein 1/2/3, partial [Branchiostoma lanceolatum]
Length = 278
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL +N P
Sbjct: 162 RRLSAARRAGTQCANCKTTTTTLWRRNQNGDPVCNACGLYYKLHAVNRP 210
Score = 43.1 bits (100), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 3/46 (6%)
Query: 14 KRS---GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
KRS G C NC T T LWRR+ G +CNACGL N P
Sbjct: 111 KRSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 156
>gi|307775397|ref|NP_001182721.1| transcription factor GATA-3 [Oncorhynchus mykiss]
gi|307342716|emb|CAR95099.1| GATA binding protein 3 [Oncorhynchus mykiss]
Length = 441
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC T+ TTLWRRN NG+PVCNACGL +IN P
Sbjct: 304 RRLSAARRAGTSCANCQTSTTTLWRRNANGDPVCNACGLYYKLHNINRP 352
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 250 RPKTRSCSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 298
>gi|284520963|ref|NP_001165271.1| transcription factor GATA-3 [Salmo salar]
gi|180038038|gb|ACB87012.1| transcription factor GATA-3 [Salmo salar]
Length = 441
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC T+ TTLWRRN NG+PVCNACGL +IN P
Sbjct: 304 RRLSAARRAGTSCANCQTSTTTLWRRNANGDPVCNACGLYYKLHNINRP 352
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 250 RPKTRSCSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 298
>gi|15593990|emb|CAC69835.1| gata4 protein [Bos taurus]
Length = 266
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 83 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 131
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 38 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 77
>gi|443687408|gb|ELT90399.1| hypothetical protein CAPTEDRAFT_175842 [Capitella teleta]
Length = 341
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/35 (80%), Positives = 31/35 (88%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G++R G+ CANCSTT TTLWRRNN GEPVCNACGL
Sbjct: 54 GSRRLGLQCANCSTTTTTLWRRNNEGEPVCNACGL 88
>gi|47229585|emb|CAG06781.1| unnamed protein product [Tetraodon nigroviridis]
Length = 456
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 318 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNVNRP 366
Score = 39.3 bits (90), Expect = 0.31, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 3/51 (5%)
Query: 9 EQTGNKRS---GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G RS G C NC T T LWRR++ G +C CGL N P
Sbjct: 262 KNKGKTRSCTEGRECVNCGATSTPLWRRDSTGHYLCKRCGLYHKMNGQNRP 312
>gi|402887127|ref|XP_003906956.1| PREDICTED: endothelial transcription factor GATA-2 isoform 4 [Papio
anubis]
Length = 568
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 425 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 473
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 371 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 419
>gi|308097152|gb|ADO14130.1| GATA binding protein 4 [Notophthalmus viridescens]
Length = 184
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 16 RRLSASRRVGLSCANCHTTITTLWRRNAEGEPVCNACGLYMKLHGVPRP 64
>gi|301607037|ref|XP_002933110.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Xenopus (Silurana)
tropicalis]
Length = 451
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 308 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 356
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 254 RSKSRSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 302
>gi|444525948|gb|ELV14225.1| Transcription factor GATA-4 [Tupaia chinensis]
Length = 253
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 70 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 118
Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 25 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 64
>gi|328721865|ref|XP_001943353.2| PREDICTED: endothelial transcription factor GATA-2-like
[Acyrthosiphon pisum]
Length = 386
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/48 (60%), Positives = 33/48 (68%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
E N+R G+SC NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 215 ETASNRRLGLSCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHGVNRP 262
Score = 42.4 bits (98), Expect = 0.036, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 164 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 203
>gi|335371117|gb|AEH57087.1| GATA456 [Bugula neritina]
Length = 512
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 36/47 (76%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
++R+G+ CANC+T TTLWRRNN+GEPVCNACGL ++N P +
Sbjct: 310 SRRTGLVCANCNTGTTTLWRRNNDGEPVCNACGLYYKLHNVNRPPSM 356
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 24/42 (57%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMP 61
C NC T LWRR+ +G +CNACGL +N P ++ P
Sbjct: 253 CVNCGAMHTPLWRRDTDGHYLCNACGLYQKMNGLNRPLQVKP 294
>gi|338722454|ref|XP_001495866.2| PREDICTED: transcription factor GATA-4-like [Equus caballus]
Length = 315
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 132 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 180
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 87 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 126
>gi|321467569|gb|EFX78559.1| hypothetical protein DAPPUDRAFT_24785 [Daphnia pulex]
Length = 118
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 36/49 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G +CANC TT TTLWRRN+NGEPVCNACGL ++N P
Sbjct: 47 RRLSAARRAGTTCANCKTTMTTLWRRNHNGEPVCNACGLYYKLHNVNRP 95
Score = 42.4 bits (98), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 21/41 (51%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 1 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 41
>gi|317419025|emb|CBN81063.1| GATA-binding factor 2 [Dicentrarchus labrax]
Length = 345
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 201 RRLSAARRAGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRP 249
Score = 44.3 bits (103), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 156 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 195
>gi|307198085|gb|EFN79138.1| GATA-binding factor A [Harpegnathos saltator]
Length = 375
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/47 (59%), Positives = 31/47 (65%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
QT +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 220 QTATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 266
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 168 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLMS 217
>gi|38569869|gb|AAR24452.1| GATA transcription factor [Nematostella vectensis]
Length = 422
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +N P
Sbjct: 259 RRLSAARRAGTSCANCHTTQTTLWRRNQNGDPVCNACGLYWKLHAVNRP 307
Score = 43.1 bits (100), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 20/31 (64%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
G C NC T T LWRR+ +G +CNACGL
Sbjct: 214 GRECVNCGATSTPLWRRDGSGHYLCNACGLY 244
>gi|390473525|ref|XP_003734617.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-4
[Callithrix jacchus]
Length = 379
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 196 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 244
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 151 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 190
>gi|170060991|ref|XP_001866045.1| GATA-binding factor-C [Culex quinquefasciatus]
gi|167879282|gb|EDS42665.1| GATA-binding factor-C [Culex quinquefasciatus]
Length = 474
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 27/48 (56%), Positives = 36/48 (75%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+Q+ +R+G SCANC TT T+LWRRN +GEPVCNACGL +++ P
Sbjct: 304 QQSAARRAGTSCANCKTTTTSLWRRNQSGEPVCNACGLYYKLHNVDRP 351
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 27/75 (36%), Positives = 35/75 (46%), Gaps = 1/75 (1%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMI 67
+ Q ++ G C NC T T LWRR+ G +CNACGL N P I P + +
Sbjct: 244 QNQPKHREEGRECVNCGATSTPLWRRDATGHYLCNACGLYYKMNGQNRP-LIKPKRKPVT 302
Query: 68 SRNGDALSVITSTPN 82
S+ A TS N
Sbjct: 303 SQQSAARRAGTSCAN 317
>gi|193788522|dbj|BAG53416.1| unnamed protein product [Homo sapiens]
Length = 236
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 53 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 101
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 8 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 47
>gi|195389406|ref|XP_002053368.1| GJ23377 [Drosophila virilis]
gi|194151454|gb|EDW66888.1| GJ23377 [Drosophila virilis]
Length = 771
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/44 (65%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
N+RSGV CANC T TTLWRRNN G PVCNACGL ++N P
Sbjct: 551 NRRSGVICANCHTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRP 594
>gi|108744055|gb|ABG02399.1| GATA binding protein 4 [Bubalus bubalis]
Length = 172
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 29 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 77
>gi|156408197|ref|XP_001641743.1| predicted protein [Nematostella vectensis]
gi|156228883|gb|EDO49680.1| predicted protein [Nematostella vectensis]
Length = 423
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +N P
Sbjct: 260 RRLSAARRAGTSCANCHTTQTTLWRRNQNGDPVCNACGLYWKLHAVNRP 308
Score = 42.7 bits (99), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 20/31 (64%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
G C NC T T LWRR+ +G +CNACGL
Sbjct: 215 GRECVNCGATSTPLWRRDGSGHYLCNACGLY 245
>gi|157115508|ref|XP_001658239.1| gata-binding factor-c (transcription factor gata-c) [Aedes aegypti]
gi|108876905|gb|EAT41130.1| AAEL007205-PA [Aedes aegypti]
Length = 509
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 27/48 (56%), Positives = 36/48 (75%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+Q+ +R+G SCANC TT T+LWRRN +GEPVCNACGL +++ P
Sbjct: 294 QQSAARRAGTSCANCKTTTTSLWRRNQSGEPVCNACGLYYKLHNVDRP 341
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 27/75 (36%), Positives = 35/75 (46%), Gaps = 1/75 (1%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMI 67
+ Q ++ G C NC T T LWRR+ G +CNACGL N P I P + +
Sbjct: 234 QNQPKHREEGRECVNCGATSTPLWRRDATGHYLCNACGLYYKMNGQNRP-LIKPKRKPVT 292
Query: 68 SRNGDALSVITSTPN 82
S+ A TS N
Sbjct: 293 SQQSAARRAGTSCAN 307
>gi|351706178|gb|EHB09097.1| Endothelial transcription factor GATA-2 [Heterocephalus glaber]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKTRSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|12835796|dbj|BAB23365.1| unnamed protein product [Mus musculus]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|47551055|ref|NP_999704.1| GATA transcription factor [Strongylocentrotus purpuratus]
gi|3702856|gb|AAC62960.1| GATA transcription factor [Strongylocentrotus purpuratus]
Length = 431
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/43 (65%), Positives = 31/43 (72%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
KR+G SCANC T TTLWRRN NG+PVCNACGL +N P
Sbjct: 268 KRTGTSCANCQATATTLWRRNPNGDPVCNACGLYYKLHGVNRP 310
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 26/49 (53%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ + G C NC T T LWRR+ NG +CNACGL N P
Sbjct: 209 RNKSRSTTEGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNGQNRP 257
>gi|47523656|ref|NP_999458.1| transcription factor GATA-4 [Sus scrofa]
gi|22651761|gb|AAM63546.1| transcription factor GATA-4 [Sus scrofa]
Length = 442
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL I+ P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGISRP 253
>gi|161511589|gb|ABX71821.1| GATA1/2/3 [Paracentrotus lividus]
gi|270154815|gb|ACZ62636.1| GATA1/2/3 [Paracentrotus lividus]
Length = 430
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/43 (65%), Positives = 31/43 (72%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
KR+G SCANC T TTLWRRN NG+PVCNACGL +N P
Sbjct: 267 KRTGTSCANCQATATTLWRRNPNGDPVCNACGLYYKLHGVNRP 309
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 26/49 (53%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ + G C NC T T LWRR+ NG +CNACGL N P
Sbjct: 208 RNKSRSTTEGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNGQNRP 256
>gi|74145051|dbj|BAE22221.1| unnamed protein product [Mus musculus]
Length = 236
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 53 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 101
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 8 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 47
>gi|50540446|ref|NP_001002689.1| GATA-binding protein 2b [Danio rerio]
gi|49900268|gb|AAH76462.1| GATA-binding protein 2b [Danio rerio]
gi|182890930|gb|AAI65802.1| Gata2b protein [Danio rerio]
Length = 427
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R+G CANC T TTLWRRN NGEPVCNACGL ++N P
Sbjct: 306 RRLSASRRAGTCCANCQTGTTTLWRRNANGEPVCNACGLYYKLHNVNRP 354
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 22/41 (53%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
SG C NC T T LWRR+ G +CNACGL N P
Sbjct: 260 SGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|182998|gb|AAA35869.1| transcription factor GATA-2 [Homo sapiens]
Length = 474
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 331 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 379
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACG N P
Sbjct: 277 RSKTRSCSEGRECVNCGATATPLWRRDGTGHYLCNACGFYHKMKGQNRP 325
>gi|226530726|ref|NP_032116.4| endothelial transcription factor GATA-2 [Mus musculus]
gi|21264417|sp|O09100.2|GATA2_MOUSE RecName: Full=Endothelial transcription factor GATA-2; AltName:
Full=GATA-binding protein 2
gi|18181872|dbj|BAA19053.2| GATA-2 protein [Mus musculus]
gi|74191748|dbj|BAE32831.1| unnamed protein product [Mus musculus]
gi|74196104|dbj|BAE32970.1| unnamed protein product [Mus musculus]
gi|76827995|gb|AAI07010.1| GATA binding protein 2 [Mus musculus]
gi|148666827|gb|EDK99243.1| GATA binding protein 2 [Mus musculus]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.014, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|354471588|ref|XP_003498023.1| PREDICTED: transcription factor GATA-4-like [Cricetulus griseus]
Length = 238
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 53 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 101
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 8 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 47
>gi|126336247|ref|XP_001366872.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Monodelphis
domestica]
Length = 473
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 330 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 378
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 276 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 324
>gi|18858731|ref|NP_571309.1| GATA binding protein 1a [Danio rerio]
gi|1132419|gb|AAA86090.1| zg1 [Danio rerio]
Length = 418
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 27/44 (61%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G CANC T+ TTLWRRN +GEPVCNACGL ++N P
Sbjct: 285 SKRAGTQCANCHTSTTTLWRRNASGEPVCNACGLYFKLHNVNRP 328
Score = 41.2 bits (95), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T T LWR++ G +CNACGL N P
Sbjct: 238 CVNCGATATPLWRQDGTGHYLCNACGLYHKMNGQNRP 274
>gi|402877493|ref|XP_003902460.1| PREDICTED: transcription factor GATA-4 [Papio anubis]
Length = 443
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 260 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 308
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 215 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 254
>gi|395847143|ref|XP_003796243.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Otolemur garnettii]
gi|395847147|ref|XP_003796245.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Otolemur garnettii]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|327266039|ref|XP_003217814.1| PREDICTED: LOW QUALITY PROTEIN: endothelial transcription factor
GATA-2-like [Anolis carolinensis]
Length = 387
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 244 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 292
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 190 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 238
>gi|291393346|ref|XP_002713202.1| PREDICTED: GATA binding protein 2 isoform 2 [Oryctolagus cuniculus]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|301764531|ref|XP_002917681.1| PREDICTED: endothelial transcription factor GATA-2-like isoform 1
[Ailuropoda melanoleuca]
gi|281347007|gb|EFB22591.1| hypothetical protein PANDA_006028 [Ailuropoda melanoleuca]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|12836018|dbj|BAB23463.1| unnamed protein product [Mus musculus]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|15559217|ref|NP_254277.1| endothelial transcription factor GATA-2 [Rattus norvegicus]
gi|62286680|sp|Q924Y4.1|GATA2_RAT RecName: Full=Endothelial transcription factor GATA-2; AltName:
Full=GATA-binding protein 2
gi|14029161|gb|AAK51128.1| GATA2 [Rattus norvegicus]
gi|38197660|gb|AAH61745.1| GATA binding protein 2 [Rattus norvegicus]
gi|149036689|gb|EDL91307.1| GATA binding protein 2, isoform CRA_a [Rattus norvegicus]
gi|149036690|gb|EDL91308.1| GATA binding protein 2, isoform CRA_a [Rattus norvegicus]
gi|184185940|dbj|BAG30820.1| GATA-binding protein 2 [Rattus norvegicus]
gi|184185942|dbj|BAG30821.1| GATA-binding protein 2 [Rattus norvegicus]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|307186218|gb|EFN71902.1| Transcription factor BCFI [Camponotus floridanus]
Length = 830
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 27/52 (51%), Positives = 37/52 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
R + ++R+G+SC+NC TT T+LWRRN +GEPVCNACGL +N P +
Sbjct: 609 RRLSASRRAGLSCSNCETTMTSLWRRNASGEPVCNACGLYFKLHGVNRPSTM 660
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 29/64 (45%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALSV 76
G C NC T LWRR+ G +CNACGL +N P P S R G + S
Sbjct: 564 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASRRAGLSCSN 623
Query: 77 ITST 80
+T
Sbjct: 624 CETT 627
>gi|116284294|gb|AAI24484.1| GATA-binding protein 1 [Danio rerio]
gi|182889206|gb|AAI64788.1| Gata1 protein [Danio rerio]
Length = 418
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 27/44 (61%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G CANC T+ TTLWRRN +GEPVCNACGL ++N P
Sbjct: 285 SKRAGTQCANCHTSTTTLWRRNASGEPVCNACGLYFKLHNVNRP 328
Score = 43.1 bits (100), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T T LWRR+ G +CNACGL N P
Sbjct: 238 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 274
>gi|426341992|ref|XP_004036301.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Gorilla gorilla gorilla]
gi|426341996|ref|XP_004036303.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Gorilla gorilla gorilla]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|355689765|gb|AER98939.1| GATA binding protein 2 [Mustela putorius furo]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|149728223|ref|XP_001488214.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Equus
caballus]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|332244648|ref|XP_003271486.1| PREDICTED: transcription factor GATA-4 [Nomascus leucogenys]
Length = 442
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|297298911|ref|XP_001087008.2| PREDICTED: transcription factor GATA-4 isoform 1 [Macaca mulatta]
Length = 443
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 260 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 308
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 215 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 254
>gi|47523100|ref|NP_999044.1| endothelial transcription factor GATA-2 [Sus scrofa]
gi|29470193|gb|AAO73945.1| transcription factor GATA-2 [Sus scrofa]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|300797896|ref|NP_001179043.1| endothelial transcription factor GATA-2 [Bos taurus]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|354482833|ref|XP_003503600.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Cricetulus griseus]
gi|344253365|gb|EGW09469.1| Endothelial transcription factor GATA-2 [Cricetulus griseus]
Length = 480
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|300797798|ref|NP_001179806.1| transcription factor GATA-4 [Bos taurus]
gi|296484958|tpg|DAA27073.1| TPA: GATA binding protein 4 [Bos taurus]
Length = 442
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|358334312|dbj|GAA52740.1| transcription factor GATA-3 [Clonorchis sinensis]
Length = 1004
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R Q+ ++R+G C+NC T TTLWRRN NGEPVCNACGL +I P
Sbjct: 658 RLQSSSRRTGTICSNCRTVTTTLWRRNTNGEPVCNACGLYFKLHNIQRP 706
Score = 43.9 bits (102), Expect = 0.014, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 612 GRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNGQNRP 651
>gi|219518827|gb|AAI43480.1| GATA4 protein [Homo sapiens]
Length = 443
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 260 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 308
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 215 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 254
>gi|387541612|gb|AFJ71433.1| transcription factor GATA-4 [Macaca mulatta]
Length = 442
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|114326345|ref|NP_001041577.1| transcription factor GATA-4 [Canis lupus familiaris]
gi|122142221|sp|Q0Q0E4.1|GATA4_CANFA RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|110554951|gb|ABG75570.1| GATA binding protein 4 [Canis lupus familiaris]
Length = 442
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|33188461|ref|NP_002043.2| transcription factor GATA-4 [Homo sapiens]
gi|215274105|sp|P43694.2|GATA4_HUMAN RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|75517731|gb|AAI01581.1| GATA binding protein 4 [Homo sapiens]
gi|85397947|gb|AAI05109.1| GATA binding protein 4 [Homo sapiens]
gi|219518812|gb|AAI43435.1| GATA binding protein 4 [Homo sapiens]
Length = 442
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|332862667|ref|XP_528070.3| PREDICTED: transcription factor GATA-4 [Pan troglodytes]
Length = 408
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 225 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 273
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 180 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 219
>gi|291385798|ref|XP_002709484.1| PREDICTED: GATA-4 zinc-finger transcription factor [Oryctolagus
cuniculus]
Length = 443
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|339247095|ref|XP_003375181.1| GATA-binding factor 2 [Trichinella spiralis]
gi|316971560|gb|EFV55318.1| GATA-binding factor 2 [Trichinella spiralis]
Length = 215
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 42/63 (66%), Gaps = 2/63 (3%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMI 67
+ + +KR+G++CANC T TTLWRRN NG+PVCNACGL ++N P + E +
Sbjct: 48 KRSSASKRTGINCANCGTNTTTLWRRNQNGDPVCNACGLYYKLHNVNRPLSMKK--EGIQ 105
Query: 68 SRN 70
+RN
Sbjct: 106 TRN 108
Score = 43.1 bits (100), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 26/53 (49%), Gaps = 1/53 (1%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
G C NC T T LWRR+ G +CNACGL N P I P S S+
Sbjct: 3 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRPL-IKPKKRSSASK 54
>gi|46909569|ref|NP_032118.2| transcription factor GATA-4 [Mus musculus]
gi|5882286|gb|AAD55266.1|AF179424_1 transcription factor GATA-4 [Mus musculus]
gi|148704120|gb|EDL36067.1| GATA binding protein 4 [Mus musculus]
gi|223460984|gb|AAI37825.1| GATA binding protein 4 [Mus musculus]
Length = 441
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 258 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 306
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 213 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 252
>gi|17402599|dbj|BAB78731.1| transcription factor GATA-4 [Mus musculus]
Length = 442
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|297682265|ref|XP_002818845.1| PREDICTED: transcription factor GATA-4 isoform 1 [Pongo abelii]
Length = 443
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 260 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 308
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 215 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 254
>gi|427930996|pdb|4HC7|A Chain A, Crystal Structure Of The Full Dna Binding Domain Of
Gata3-Complex 2
gi|427930999|pdb|4HC7|B Chain B, Crystal Structure Of The Full Dna Binding Domain Of
Gata3-Complex 2
gi|427931003|pdb|4HC9|A Chain A, Dna Binding By Gata Transcription Factor-Complex 3
gi|427931007|pdb|4HCA|A Chain A, Dna Binding By Gata Transcription Factor-Complex 1
Length = 115
Score = 66.2 bits (160), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 50 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 98
Score = 42.0 bits (97), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 5 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 44
>gi|508484|gb|AAA58496.1| putative [Homo sapiens]
Length = 442
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|403307497|ref|XP_003944229.1| PREDICTED: transcription factor GATA-4 [Saimiri boliviensis
boliviensis]
Length = 442
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|395739374|ref|XP_003777251.1| PREDICTED: transcription factor GATA-4 [Pongo abelii]
Length = 442
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|410951840|ref|XP_003982601.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Felis
catus]
Length = 480
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|25282465|ref|NP_653331.1| transcription factor GATA-4 [Rattus norvegicus]
gi|1169847|sp|P46152.1|GATA4_RAT RecName: Full=Transcription factor GATA-4; AltName:
Full=DNA-binding protein GATA-GT2; AltName:
Full=GATA-binding factor 4
gi|437670|gb|AAA16159.1| DNA binding protein [Rattus norvegicus]
gi|149030263|gb|EDL85319.1| GATA binding protein 4 [Rattus norvegicus]
Length = 440
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 258 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 306
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 213 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 252
>gi|157278507|ref|NP_001098355.1| hematopietic transcription factor GATA-1 [Oryzias latipes]
gi|39748630|dbj|BAD04991.1| hematopietic transcription factor GATA-1 [Oryzias latipes]
Length = 417
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G CANC T+ TTLWRRN NGEPVCNACGL ++N P
Sbjct: 285 SKRAGTQCANCHTSTTTLWRRNANGEPVCNACGLYFKLHNVNRP 328
Score = 41.6 bits (96), Expect = 0.054, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T T LWRR+ G +CNACGL N P
Sbjct: 238 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 274
>gi|157169239|gb|ABV25954.1| GATA-binding transcription factor B1 [Capitella teleta]
Length = 750
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/35 (80%), Positives = 31/35 (88%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G++R G+ CANCSTT TTLWRRNN GEPVCNACGL
Sbjct: 463 GSRRLGLQCANCSTTTTTLWRRNNEGEPVCNACGL 497
Score = 42.7 bits (99), Expect = 0.027, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 401 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 440
>gi|20070352|ref|NP_116027.2| endothelial transcription factor GATA-2 isoform 1 [Homo sapiens]
gi|224611699|ref|NP_001139133.1| endothelial transcription factor GATA-2 isoform 1 [Homo sapiens]
gi|114589043|ref|XP_516728.2| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Pan
troglodytes]
gi|332261773|ref|XP_003279941.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Nomascus leucogenys]
gi|397518553|ref|XP_003829449.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Pan
paniscus]
gi|397518555|ref|XP_003829450.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Pan
paniscus]
gi|397518557|ref|XP_003829451.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3 [Pan
paniscus]
gi|229462971|sp|P23769.3|GATA2_HUMAN RecName: Full=Endothelial transcription factor GATA-2; AltName:
Full=GATA-binding protein 2
gi|15990463|gb|AAH15613.1| GATA binding protein 2 [Homo sapiens]
gi|30410997|gb|AAH51342.1| GATA binding protein 2 [Homo sapiens]
gi|119599719|gb|EAW79313.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|119599721|gb|EAW79315.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|119599722|gb|EAW79316.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|119599723|gb|EAW79317.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|208966348|dbj|BAG73188.1| GATA binding protein 2 [synthetic construct]
gi|410209098|gb|JAA01768.1| GATA binding protein 2 [Pan troglodytes]
gi|410267030|gb|JAA21481.1| GATA binding protein 2 [Pan troglodytes]
gi|410288416|gb|JAA22808.1| GATA binding protein 2 [Pan troglodytes]
Length = 480
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|449473953|ref|XP_002194063.2| PREDICTED: GATA-binding factor 2-like [Taeniopygia guttata]
Length = 347
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 204 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 252
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 150 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 198
>gi|440913274|gb|ELR62746.1| Endothelial transcription factor GATA-2, partial [Bos grunniens
mutus]
Length = 385
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 269 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 317
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 215 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 263
>gi|30584557|gb|AAP36531.1| Homo sapiens hypothetical protein MGC2306 [synthetic construct]
gi|61371794|gb|AAX43732.1| GATA binding protein 2 [synthetic construct]
gi|61371798|gb|AAX43733.1| GATA binding protein 2 [synthetic construct]
Length = 481
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|402887121|ref|XP_003906953.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Papio
anubis]
gi|402887123|ref|XP_003906954.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Papio
anubis]
Length = 480
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|355786384|gb|EHH66567.1| hypothetical protein EGM_03585 [Macaca fascicularis]
Length = 480
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats.
Identities = 21/49 (42%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRRN G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRNGTGHYLCNACGLYHKMNGQNRP 331
>gi|344276401|ref|XP_003409997.1| PREDICTED: endothelial transcription factor GATA-2-like [Loxodonta
africana]
Length = 404
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 261 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 309
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 22/51 (43%), Positives = 25/51 (49%), Gaps = 4/51 (7%)
Query: 10 QTGNK----RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ GNK G C NC T T LWRR+ G +CNACGL N P
Sbjct: 205 KQGNKGRSWSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 255
>gi|297263167|ref|XP_001097903.2| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Macaca mulatta]
Length = 479
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 336 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 384
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 20/41 (48%), Positives = 22/41 (53%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
SG C NC T T LWRR+ G +CNACGL N P
Sbjct: 290 SGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 330
>gi|296225992|ref|XP_002758738.1| PREDICTED: endothelial transcription factor GATA-2 [Callithrix
jacchus]
gi|403268240|ref|XP_003926186.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Saimiri boliviensis boliviensis]
gi|403268242|ref|XP_003926187.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Saimiri boliviensis boliviensis]
gi|403268244|ref|XP_003926188.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Saimiri boliviensis boliviensis]
Length = 480
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|133777354|gb|AAI07011.2| Gata2 protein [Mus musculus]
Length = 447
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 304 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 352
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 250 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 298
>gi|109098094|ref|XP_001097801.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Macaca mulatta]
Length = 480
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|182996|gb|AAA35868.1| GATA-binding protein [Homo sapiens]
Length = 480
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|12803465|gb|AAH02557.1| GATA binding protein 2 [Homo sapiens]
gi|30582181|gb|AAP35317.1| hypothetical protein MGC2306 [Homo sapiens]
gi|60655087|gb|AAX32107.1| GATA binding protein 2 [synthetic construct]
gi|123982652|gb|ABM83067.1| GATA binding protein 2 [synthetic construct]
gi|123997319|gb|ABM86261.1| GATA binding protein 2 [synthetic construct]
gi|189054715|dbj|BAG37347.1| unnamed protein product [Homo sapiens]
Length = 480
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|301607039|ref|XP_002933111.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Xenopus (Silurana)
tropicalis]
Length = 453
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 310 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 358
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 256 RSKSRSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 304
>gi|321467983|gb|EFX78970.1| hypothetical protein DAPPUDRAFT_305027 [Daphnia pulex]
Length = 321
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 37/46 (80%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+GN+R+G++C+NC+T+ TTLWRRN NGEPVCNACGL ++ P
Sbjct: 48 SGNRRAGLTCSNCNTSTTTLWRRNANGEPVCNACGLYFKLHNVCRP 93
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 23/40 (57%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G CANCS T LWRR+ N +CNACGL T N P
Sbjct: 4 GRECANCSAIATPLWRRDGNNHYLCNACGLYKLTNGTNRP 43
>gi|395842329|ref|XP_003793970.1| PREDICTED: transcription factor GATA-4 [Otolemur garnettii]
Length = 442
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 258 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 306
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 213 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 252
>gi|270008083|gb|EFA04531.1| pannier [Tribolium castaneum]
Length = 407
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 170 TATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 215
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 122 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 161
>gi|332016551|gb|EGI57432.1| Transcription factor GATA-4 [Acromyrmex echinatior]
Length = 833
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R G+SC+NC TT T+LWRRN GEPVCNACGL +N P
Sbjct: 612 RRLSATRRVGLSCSNCETTMTSLWRRNAQGEPVCNACGLYFKLHGVNRP 660
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 567 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 606
>gi|397467354|ref|XP_003805387.1| PREDICTED: transcription factor GATA-4 [Pan paniscus]
Length = 386
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 203 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 251
Score = 43.9 bits (102), Expect = 0.014, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 158 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 197
>gi|345314700|ref|XP_001514983.2| PREDICTED: transcription factor GATA-4-like, partial
[Ornithorhynchus anatinus]
Length = 315
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 67 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 115
Score = 43.1 bits (100), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 22 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 61
>gi|126303987|ref|XP_001381638.1| PREDICTED: transcription factor GATA-4 [Monodelphis domestica]
Length = 451
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 270 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 318
Score = 43.9 bits (102), Expect = 0.014, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 225 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 264
>gi|91094885|ref|XP_973051.1| PREDICTED: similar to AGAP002235-PA [Tribolium castaneum]
Length = 395
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 170 TATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 215
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 122 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 161
>gi|345324774|ref|XP_001506274.2| PREDICTED: GATA-binding factor 2-like [Ornithorhynchus anatinus]
Length = 250
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 107 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 155
Score = 43.5 bits (101), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 62 GRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 101
>gi|194709238|pdb|3DFV|D Chain D, Adjacent Gata Dna Binding
gi|194709239|pdb|3DFV|C Chain C, Adjacent Gata Dna Binding
gi|194709242|pdb|3DFX|A Chain A, Opposite Gata Dna Binding
gi|194709243|pdb|3DFX|B Chain B, Opposite Gata Dna Binding
Length = 63
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 33/43 (76%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+G SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 4 RRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 46
>gi|410919499|ref|XP_003973222.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Takifugu rubripes]
Length = 456
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 318 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 366
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 22/51 (43%), Positives = 26/51 (50%), Gaps = 3/51 (5%)
Query: 9 EQTGNKRS---GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G RS G C NC T T LWRR++ G +CNACGL N P
Sbjct: 262 KNKGKTRSCTEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRP 312
>gi|395733176|ref|XP_002813217.2| PREDICTED: endothelial transcription factor GATA-2 [Pongo abelii]
Length = 406
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 263 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 311
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 209 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 257
>gi|395516754|ref|XP_003762552.1| PREDICTED: endothelial transcription factor GATA-2 [Sarcophilus
harrisii]
Length = 402
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 259 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 307
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 205 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 253
>gi|301776575|ref|XP_002923708.1| PREDICTED: transcription factor GATA-4-like [Ailuropoda
melanoleuca]
Length = 353
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 170 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 218
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 125 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 164
>gi|119586025|gb|EAW65621.1| GATA binding protein 4, isoform CRA_b [Homo sapiens]
Length = 417
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 234 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 282
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 189 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 228
>gi|119586024|gb|EAW65620.1| GATA binding protein 4, isoform CRA_a [Homo sapiens]
Length = 416
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 233 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 281
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 188 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 227
>gi|194901188|ref|XP_001980134.1| GG16973 [Drosophila erecta]
gi|190651837|gb|EDV49092.1| GG16973 [Drosophila erecta]
Length = 535
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 216 TATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 261
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 168 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 207
>gi|18033511|gb|AAL57180.1| transcription factor GATA-2 [Rattus norvegicus]
Length = 480
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGSCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|351695298|gb|EHA98216.1| Transcription factor GATA-4 [Heterocephalus glaber]
Length = 410
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 227 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 275
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 182 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 221
>gi|3183530|sp|Q08369.3|GATA4_MOUSE RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|1815679|gb|AAB42015.1| GATA-4 [Mus musculus]
Length = 440
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 258 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 306
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 213 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 252
>gi|238863842|gb|ACR66215.1| transcription factor GATA123s [Branchiostoma floridae]
Length = 337
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL +N P
Sbjct: 182 RRLSAARRAGTQCANCKTTTTTLWRRNQNGDPVCNACGLYYKLHAVNRP 230
Score = 42.7 bits (99), Expect = 0.025, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 137 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 176
>gi|449276666|gb|EMC85098.1| GATA-binding factor 2, partial [Columba livia]
Length = 233
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 90 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 138
Score = 43.5 bits (101), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 45 GRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 84
>gi|51242135|ref|NP_001003797.1| GATA-binding factor 2 [Gallus gallus]
gi|326927940|ref|XP_003210145.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Meleagris
gallopavo]
gi|120958|sp|P23824.1|GATA2_CHICK RecName: Full=GATA-binding factor 2; Short=GATA-2; AltName:
Full=Transcription factor NF-E1b
gi|62966|emb|CAA40252.1| NF-E1 [Gallus gallus]
Length = 466
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 323 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 371
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 269 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 317
>gi|269785247|ref|NP_001161551.1| GATA2 transcription factor [Saccoglossus kowalevskii]
gi|268054087|gb|ACY92530.1| GATA2 transcription factor [Saccoglossus kowalevskii]
Length = 352
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC T TTLWRRN NG+PVCNACGL +N P
Sbjct: 183 RRLSAARRAGTSCANCQATQTTLWRRNANGDPVCNACGLYYKLHGVNRP 231
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 129 RTKTRSNSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 177
>gi|313224235|emb|CBY20024.1| unnamed protein product [Oikopleura dioica]
Length = 265
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/43 (62%), Positives = 33/43 (76%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+G SC+NC TT TTLWRRN+NG+PVCNACGL +N P
Sbjct: 90 RRAGTSCSNCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVNRP 132
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 19/44 (43%), Positives = 22/44 (50%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G C NC T T LWRR+ G +CNACGL N P
Sbjct: 35 DNSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRP 78
>gi|195501108|ref|XP_002097661.1| GE24361 [Drosophila yakuba]
gi|194183762|gb|EDW97373.1| GE24361 [Drosophila yakuba]
Length = 540
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 220 TATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 265
Score = 43.5 bits (101), Expect = 0.014, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 172 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 211
>gi|426222407|ref|XP_004005383.1| PREDICTED: transcription factor GATA-4 [Ovis aries]
Length = 353
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 170 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 218
Score = 40.8 bits (94), Expect = 0.097, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 20/40 (50%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL N P
Sbjct: 125 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 164
>gi|432103480|gb|ELK30584.1| Endothelial transcription factor GATA-2 [Myotis davidii]
Length = 355
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 212 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 260
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 158 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 206
>gi|410956538|ref|XP_003984899.1| PREDICTED: transcription factor GATA-4 [Felis catus]
Length = 441
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 258 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 306
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 213 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 252
>gi|120974910|gb|ABM46761.1| GATA4 [Gorilla gorilla]
Length = 176
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 53 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 101
Score = 43.9 bits (102), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 8 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 47
>gi|281347691|gb|EFB23275.1| hypothetical protein PANDA_012893 [Ailuropoda melanoleuca]
Length = 347
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 164 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 212
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 119 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 158
>gi|383851463|ref|XP_003701252.1| PREDICTED: GATA-binding factor C-like [Megachile rotundata]
Length = 292
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC T TTLWRRN GEPVCNACGL ++N P
Sbjct: 109 RRLSAARRAGTSCANCKTATTTLWRRNQAGEPVCNACGLYYKLHNVNRP 157
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 64 GRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRP 103
>gi|407025371|gb|AFS65552.1| Gata4/5/6, partial [Parastichopus parvimensis]
Length = 302
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +G++R G++CANC T+ TTLWRRN GEPVCNACG N P
Sbjct: 240 RRMSGSRREGITCANCHTSTTTLWRRNKEGEPVCNACGFYHKMNGQNRP 288
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 22/46 (47%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
TG G C NC T LWRR+ G +CNACGL N P
Sbjct: 189 TGPPEYGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGYNRP 234
>gi|119112909|ref|XP_307949.3| AGAP002235-PA [Anopheles gambiae str. PEST]
gi|15420800|gb|AAK97462.1|AF395080_1 zinc finger transcription factor pannier [Anopheles gambiae]
gi|116132794|gb|EAA03705.3| AGAP002235-PA [Anopheles gambiae str. PEST]
Length = 537
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 28/48 (58%), Positives = 32/48 (66%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
QT +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 166 SQTATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 213
Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 2/53 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S+
Sbjct: 117 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVSQ 167
>gi|256083783|ref|XP_002578117.1| transcription factor GATA-1 [Schistosoma mansoni]
Length = 735
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R Q+ ++R+G C+NC T TTLWRRN NGEPVCNACGL +I P
Sbjct: 245 RLQSSSRRTGTICSNCRTITTTLWRRNTNGEPVCNACGLYFKLHNIQRP 293
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 199 GRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNGQNRP 238
>gi|353228552|emb|CCD74723.1| putative gata binding factor [Schistosoma mansoni]
Length = 735
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R Q+ ++R+G C+NC T TTLWRRN NGEPVCNACGL +I P
Sbjct: 245 RLQSSSRRTGTICSNCRTITTTLWRRNTNGEPVCNACGLYFKLHNIQRP 293
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 199 GRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNGQNRP 238
>gi|285157858|gb|ADC35040.1| GATA456b [Themiste lageniformis]
Length = 496
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/50 (54%), Positives = 35/50 (70%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ + + N+R G+ C+NC TT TTLWRRNN GEPVCNACGL ++ P
Sbjct: 284 VNDHSANRRVGLQCSNCHTTTTTLWRRNNEGEPVCNACGLYFKLHGVSRP 333
Score = 35.0 bits (79), Expect = 6.0, Method: Composition-based stats.
Identities = 19/49 (38%), Positives = 21/49 (42%), Gaps = 9/49 (18%)
Query: 17 GVSCANCSTTCTTLW---------RRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LW RR+ G +CNACGL IN P
Sbjct: 227 GRECVNCGAISTPLWRRDGTGHYLRRDGTGHYLCNACGLYHKMNGINRP 275
>gi|426358838|ref|XP_004046698.1| PREDICTED: transcription factor GATA-4 [Gorilla gorilla gorilla]
Length = 382
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 199 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 247
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 154 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 193
>gi|194901184|ref|XP_001980132.1| GG16970 [Drosophila erecta]
gi|190651835|gb|EDV49090.1| GG16970 [Drosophila erecta]
Length = 749
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 32/43 (74%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+GV+CANC T TTLWRRNN G PVCNACGL ++N P
Sbjct: 540 RRNGVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRP 582
>gi|442619350|ref|NP_001262620.1| pannier, isoform C [Drosophila melanogaster]
gi|440217479|gb|AGB96000.1| pannier, isoform C [Drosophila melanogaster]
Length = 537
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 214 TATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 259
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 166 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 205
>gi|195328561|ref|XP_002030983.1| GM24279 [Drosophila sechellia]
gi|194119926|gb|EDW41969.1| GM24279 [Drosophila sechellia]
Length = 494
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 214 TATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 259
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 166 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 205
>gi|195570482|ref|XP_002103236.1| GD19066 [Drosophila simulans]
gi|194199163|gb|EDX12739.1| GD19066 [Drosophila simulans]
Length = 736
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 32/43 (74%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+GV+CANC T TTLWRRNN G PVCNACGL ++N P
Sbjct: 523 RRNGVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRP 565
>gi|120960|sp|P23770.1|GATA2_XENLA RecName: Full=GATA-binding factor 2; Short=GATA-2; AltName:
Full=Transcription factor xGATA-2
gi|214171|gb|AAA49723.1| GATA binding factor-2 [Xenopus laevis]
Length = 452
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC T+ TTLWRRN NG+PVCNACGL ++N P
Sbjct: 309 RRLSAARRAGTCCANCQTSTTTLWRRNANGDPVCNACGLYYKLHNVNRP 357
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 255 RSKSRSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 303
>gi|195328567|ref|XP_002030986.1| GM24276 [Drosophila sechellia]
gi|194119929|gb|EDW41972.1| GM24276 [Drosophila sechellia]
Length = 733
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 32/43 (74%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+GV+CANC T TTLWRRNN G PVCNACGL ++N P
Sbjct: 524 RRNGVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRP 566
>gi|17136408|ref|NP_476685.1| pannier, isoform A [Drosophila melanogaster]
gi|1709699|sp|P52168.1|PNR_DROME RecName: Full=GATA-binding factor A; AltName: Full=Protein pannier;
AltName: Full=Transcription factor GATA-A; AltName:
Full=dGATA-A
gi|545347|gb|AAB29874.1| C4 zinc finger DNA-binding protein/GATA transcription factor
[Drosophila melanogaster]
gi|23171432|gb|AAN13693.1| pannier, isoform A [Drosophila melanogaster]
gi|159884201|gb|ABX00779.1| RE33290p [Drosophila melanogaster]
Length = 540
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 217 TATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 262
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 166 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 215
>gi|195501114|ref|XP_002097664.1| GE24358 [Drosophila yakuba]
gi|194183765|gb|EDW97376.1| GE24358 [Drosophila yakuba]
Length = 738
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 32/43 (74%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+GV+CANC T TTLWRRNN G PVCNACGL ++N P
Sbjct: 529 RRNGVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRP 571
>gi|1132421|gb|AAA86091.1| zg2, partial [Danio rerio]
Length = 194
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 51 RRLSAARRAGTCCANCQTTTTTLWRRNGNGDPVCNACGLYYKLHNVNRP 99
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 22/42 (52%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G C NC T T LWRR+ G +CNACGL N P
Sbjct: 4 QRGAECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 45
>gi|313240837|emb|CBY33127.1| unnamed protein product [Oikopleura dioica]
Length = 249
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 33/43 (76%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+G SC+NC TT TTLWRRN+NG+PVCNACGL +N P
Sbjct: 74 RRAGTSCSNCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVNRP 116
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 23 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRP 62
>gi|19554935|gb|AAB29876.2| achaete and scute repressor/zinc finger protein [Drosophila
melanogaster]
Length = 540
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 217 TATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 262
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 166 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 215
>gi|162416003|gb|ABX89306.1| pannier protein, partial [Megaselia abdita]
Length = 206
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 32/49 (65%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ T +R G+SC NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 150 KRLTATRRLGLSCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 198
Score = 42.7 bits (99), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 105 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 144
>gi|338728062|ref|XP_003365610.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Equus caballus]
Length = 333
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R+ ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 169 RKDPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 217
>gi|12641859|dbj|BAB21550.1| GATA factor e [Drosophila melanogaster]
Length = 704
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 32/43 (74%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+GV+CANC T TTLWRRNN G PVCNACGL ++N P
Sbjct: 491 RRNGVTCANCQTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRP 533
>gi|320542891|ref|NP_001189231.1| GATAe, isoform B [Drosophila melanogaster]
gi|320542893|ref|NP_001189232.1| GATAe, isoform C [Drosophila melanogaster]
gi|318068787|gb|ADV37322.1| GATAe, isoform B [Drosophila melanogaster]
gi|318068788|gb|ADV37323.1| GATAe, isoform C [Drosophila melanogaster]
Length = 731
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 32/43 (74%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+GV+CANC T TTLWRRNN G PVCNACGL ++N P
Sbjct: 518 RRNGVTCANCQTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRP 560
>gi|195389402|ref|XP_002053366.1| GJ23378 [Drosophila virilis]
gi|194151452|gb|EDW66886.1| GJ23378 [Drosophila virilis]
Length = 520
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 207 TATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 252
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 159 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 198
>gi|24647330|ref|NP_732102.1| pannier, isoform B [Drosophila melanogaster]
gi|442619352|ref|NP_001262621.1| pannier, isoform D [Drosophila melanogaster]
gi|23171433|gb|AAF55264.3| pannier, isoform B [Drosophila melanogaster]
gi|218506027|gb|ACK77655.1| RE17974p [Drosophila melanogaster]
gi|440217480|gb|AGB96001.1| pannier, isoform D [Drosophila melanogaster]
Length = 488
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 165 TATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 210
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 114 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 163
>gi|1845570|dbj|BAA11334.1| GATA-4 transcription factor [Homo sapiens]
gi|122893044|gb|ABM67539.1| Homo sapiens GATA4 [Shuttle vector pUCAG.GATA4]
Length = 439
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 256 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 304
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 211 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 250
>gi|20152127|gb|AAM11423.1| SD02611p [Drosophila melanogaster]
Length = 486
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 165 TATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 210
Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 114 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 163
>gi|355779507|gb|EHH63983.1| GATA-binding factor 4 [Macaca fascicularis]
Length = 253
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 76 SRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 119
Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 28 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 67
>gi|24647326|ref|NP_650516.2| GATAe, isoform A [Drosophila melanogaster]
gi|23171431|gb|AAF55262.2| GATAe, isoform A [Drosophila melanogaster]
gi|28603696|gb|AAO47880.1| LD08432p [Drosophila melanogaster]
gi|37951320|dbj|BAD00020.1| transcription factor dGATAe [Drosophila melanogaster]
gi|220942528|gb|ACL83807.1| GATAe-PA [synthetic construct]
gi|220952748|gb|ACL88917.1| GATAe-PA [synthetic construct]
Length = 746
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 32/43 (74%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+GV+CANC T TTLWRRNN G PVCNACGL ++N P
Sbjct: 533 RRNGVTCANCQTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRP 575
>gi|443718381|gb|ELU09033.1| hypothetical protein CAPTEDRAFT_121668 [Capitella teleta]
Length = 131
Score = 65.1 bits (157), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC T+ TTLWRRNN GEPVCNACGL +N P
Sbjct: 56 SRRVGLSCANCHTSTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 99
Score = 38.1 bits (87), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 19/30 (63%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G CANC +T LWR N G +CNACG+
Sbjct: 6 GRECANCGSTYAPLWRWNGTGHLLCNACGV 35
>gi|371501187|emb|CBN81009.2| GATA-binding factor 3 [Dicentrarchus labrax]
Length = 460
Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G CANC T+ TTLWRRN NGEPVCNACGL ++N P
Sbjct: 292 SKRAGTLCANCHTSTTTLWRRNANGEPVCNACGLYFKLHNVNRP 335
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDAL 74
C NC T T LWRR+ G +CNACGL N P I P ++S+ L
Sbjct: 245 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP-LIRPKKRLIVSKRAGTL 298
>gi|327288072|ref|XP_003228752.1| PREDICTED: transcription factor GATA-4-like [Anolis carolinensis]
Length = 411
Score = 64.7 bits (156), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+ CANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 230 RRLSASRRIGLMCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 278
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 185 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 224
>gi|195111290|ref|XP_002000212.1| GI22649 [Drosophila mojavensis]
gi|193916806|gb|EDW15673.1| GI22649 [Drosophila mojavensis]
Length = 517
Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 210 TATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 255
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 159 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 208
>gi|383863205|ref|XP_003707072.1| PREDICTED: uncharacterized protein LOC100875205 [Megachile
rotundata]
Length = 941
Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats.
Identities = 27/52 (51%), Positives = 35/52 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
R + ++R G SC+NC TT T+LWRRN GEPVCNACGL +N P+ +
Sbjct: 673 RRLSASRRVGTSCSNCQTTMTSLWRRNTLGEPVCNACGLYFKLHGVNRPQTM 724
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 29/64 (45%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALSV 76
G C NC T LWRR+ G +CNACGL +N P P S R G + S
Sbjct: 628 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASRRVGTSCSN 687
Query: 77 ITST 80
+T
Sbjct: 688 CQTT 691
>gi|148234322|ref|NP_001084043.1| GATA-binding factor 2 [Xenopus laevis]
gi|80476387|gb|AAI08545.1| XGATA-2 protein [Xenopus laevis]
Length = 453
Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC T+ TTLWRRN NG+PVCNACGL ++N P
Sbjct: 310 RRLSAARRAGTCCANCQTSTTTLWRRNANGDPVCNACGLYYKLHNVNRP 358
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 256 RSKSRSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 304
>gi|224048927|ref|XP_002186842.1| PREDICTED: transcription factor GATA-4 [Taeniopygia guttata]
Length = 411
Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 229 RRLSASRRVGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 277
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 184 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 223
>gi|195036110|ref|XP_001989514.1| GH18759 [Drosophila grimshawi]
gi|193893710|gb|EDV92576.1| GH18759 [Drosophila grimshawi]
Length = 528
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 206 TATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 251
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 158 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 197
>gi|348540838|ref|XP_003457894.1| PREDICTED: GATA-binding factor 5-A-like [Oreochromis niloticus]
Length = 383
Score = 64.7 bits (156), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R QT ++R+G+ C NC T+ TTLWRRN GEPVCNACGL M + P
Sbjct: 229 RLQTTSRRAGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 277
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC + T LWRR+ G +CNACGL IN P
Sbjct: 183 GRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGINRP 222
>gi|328786704|ref|XP_001121210.2| PREDICTED: transcription factor GATA-5 [Apis mellifera]
Length = 444
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 233 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 278
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 185 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 224
>gi|328715574|ref|XP_003245664.1| PREDICTED: GATA-binding factor A-like, partial [Acyrthosiphon
pisum]
Length = 232
Score = 64.7 bits (156), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 35/55 (63%)
Query: 5 FGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
G E ++R GVSC NC T TTLWRR+N+G+PVCNACGL N P ++
Sbjct: 57 LGAFETVSSRRYGVSCTNCGTRMTTLWRRDNDGKPVCNACGLYYKLHGFNRPLRM 111
Score = 43.5 bits (101), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 32/73 (43%), Gaps = 13/73 (17%)
Query: 1 MILTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPE--- 57
M L FG G C NC T LWRR+ G +CNACGL +N P
Sbjct: 3 MDLQFG---------EGRECVNCGAISTPLWRRDGCGHYLCNACGLYHKMNGMNRPSGRP 53
Query: 58 -KIMPPIESMISR 69
K + E++ SR
Sbjct: 54 AKRLGAFETVSSR 66
>gi|293345|gb|AAA37662.1| GATA-binding transcription factor [Mus musculus]
Length = 439
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 257 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 305
Score = 42.7 bits (99), Expect = 0.024, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 212 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 251
>gi|42566590|gb|AAS21059.1| GATA factor, partial [Nematostella vectensis]
Length = 208
Score = 64.7 bits (156), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL +N P
Sbjct: 45 RRLSAARRAGTSCANCHTTQTTLWRRNQNGDPVCNACGLYWMLHAVNRP 93
Score = 40.8 bits (94), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI 47
C NC T T LWRR+ +G +CNACGL
Sbjct: 3 CVNCGATSTPLWRRDGSGHYLCNACGLY 30
>gi|324507780|gb|ADY43293.1| Transcription factor GATA-3 [Ascaris suum]
Length = 397
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 26/49 (53%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ Q+ KR+G+ C NC T TTLWRRN +G+PVCNACGL +I+ P
Sbjct: 274 KRQSAQKRTGIECVNCKTNNTTLWRRNAHGQPVCNACGLYHKLHNISRP 322
Score = 41.2 bits (95), Expect = 0.074, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 20/40 (50%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL N P
Sbjct: 229 GRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRP 268
>gi|68364574|ref|XP_693371.1| PREDICTED: GATA-binding factor 2 [Danio rerio]
Length = 372
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 27/44 (61%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR G CANC T+ TTLWRRN++GEPVCNACGL ++N P
Sbjct: 245 SKRIGTQCANCQTSTTTLWRRNSSGEPVCNACGLYFKLHNVNRP 288
Score = 42.4 bits (98), Expect = 0.038, Method: Composition-based stats.
Identities = 21/50 (42%), Positives = 26/50 (52%), Gaps = 1/50 (2%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
C NC T T LWRR+ G +CNACGL N P I P ++S+
Sbjct: 198 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP-LIRPKKRPVVSK 246
>gi|348510297|ref|XP_003442682.1| PREDICTED: GATA-binding factor 2-like [Oreochromis niloticus]
Length = 424
Score = 64.7 bits (156), Expect = 8e-09, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G CANC T+ TTLWRRN NGEPVCNACGL ++N P
Sbjct: 290 SKRAGTLCANCHTSTTTLWRRNANGEPVCNACGLYFKLHNVNRP 333
Score = 41.6 bits (96), Expect = 0.054, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDAL 74
C NC T T LWRR+ G +CNACGL N P I P ++S+ L
Sbjct: 243 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP-LIRPKKRLIVSKRAGTL 296
>gi|390179286|ref|XP_003736855.1| GA17817, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859788|gb|EIM52928.1| GA17817, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 496
Score = 64.7 bits (156), Expect = 8e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 169 TATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 214
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 118 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 167
>gi|198454786|ref|XP_001359720.2| GA17817, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132953|gb|EAL28872.2| GA17817, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 532
Score = 64.7 bits (156), Expect = 8e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 205 TATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 250
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 157 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 196
>gi|195157146|ref|XP_002019457.1| GL12215 [Drosophila persimilis]
gi|194116048|gb|EDW38091.1| GL12215 [Drosophila persimilis]
Length = 532
Score = 64.7 bits (156), Expect = 8e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 205 TATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 250
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 157 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 196
>gi|405957946|gb|EKC24122.1| GATA-binding factor A [Crassostrea gigas]
Length = 358
Score = 64.3 bits (155), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 27/42 (64%), Positives = 31/42 (73%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R G+SCANC+T+ TTLWRRN GEPVCNACGL +N P
Sbjct: 98 RMGLSCANCNTSTTTLWRRNGEGEPVCNACGLYYKLHQVNRP 139
Score = 42.0 bits (97), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 32/74 (43%), Gaps = 16/74 (21%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVDINAP--------------EKIMPPI 63
C NC T LWRR+ G +CNACGL+ MN P EK PPI
Sbjct: 18 CVNCGAISTPLWRRDGTGHYLCNACGLLHKMNPGQSRTPVKQPNREESPISEEEKAPPPI 77
Query: 64 ESMISRNGDALSVI 77
+ + D +S I
Sbjct: 78 KRIDDAKFDKISGI 91
>gi|340720269|ref|XP_003398563.1| PREDICTED: hypothetical protein LOC100651088 [Bombus terrestris]
Length = 440
Score = 64.3 bits (155), Expect = 8e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 229 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 274
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 181 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 220
>gi|57546103|gb|AAW51922.1| GATA binding protein 4 [Homo sapiens]
Length = 442
Score = 64.3 bits (155), Expect = 8e-09, Method: Composition-based stats.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT T LWRRN GEPVCNACGL M + P
Sbjct: 259 RRLSASRRVGLSCANCQTTTTALWRRNAEGEPVCNACGLYMKLHGVPRP 307
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 214 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 253
>gi|238863846|gb|ACR66217.1| transcription factor GATA456b, partial [Branchiostoma floridae]
Length = 380
Score = 64.3 bits (155), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 27/42 (64%), Positives = 31/42 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
R + ++R G+ CANC TT TTLWRRNN GEPVCNACGL
Sbjct: 336 RRLSASRRVGLQCANCRTTQTTLWRRNNEGEPVCNACGLYYK 377
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL +N P
Sbjct: 291 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 330
>gi|147905907|ref|NP_001085355.1| GATA binding protein 4 [Xenopus laevis]
gi|49256044|gb|AAH71122.1| MGC81427 protein [Xenopus laevis]
Length = 394
Score = 64.3 bits (155), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+ CANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 228 RRLSASRRVGLCCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 276
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 183 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 222
>gi|86575041|ref|NP_001033435.1| Protein ELT-1, isoform a [Caenorhabditis elegans]
gi|74834771|emb|CAJ30234.1| Protein ELT-1, isoform a [Caenorhabditis elegans]
Length = 488
Score = 64.3 bits (155), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 30/49 (61%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R+Q KR+G+ C NC T TTLWRRN G PVCNACGL + P
Sbjct: 332 RQQNAQKRTGIECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVERP 380
Score = 38.5 bits (88), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC T LWRR+ +G +CNACGL
Sbjct: 289 CVNCGVHNTPLWRRDGSGNYLCNACGLYF 317
>gi|7861539|dbj|BAA95683.1| transcription factor GATA-6 [Mus musculus]
Length = 470
Score = 64.3 bits (155), Expect = 8e-09, Method: Composition-based stats.
Identities = 27/36 (75%), Positives = 30/36 (83%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++R G+SCANC TT TTLWRRN GEPVCNACGL M
Sbjct: 431 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYM 466
Score = 40.0 bits (92), Expect = 0.16, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 384 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 420
>gi|392900108|ref|NP_001255403.1| Protein ELT-1, isoform c [Caenorhabditis elegans]
gi|379657041|emb|CCG28207.1| Protein ELT-1, isoform c [Caenorhabditis elegans]
Length = 487
Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 30/49 (61%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R+Q KR+G+ C NC T TTLWRRN G PVCNACGL + P
Sbjct: 331 RQQNAQKRTGIECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVERP 379
Score = 38.5 bits (88), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC T LWRR+ +G +CNACGL
Sbjct: 288 CVNCGVHNTPLWRRDGSGNYLCNACGLYF 316
>gi|322782943|gb|EFZ10661.1| hypothetical protein SINV_07216 [Solenopsis invicta]
Length = 407
Score = 64.3 bits (155), Expect = 9e-09, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 208 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 253
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 160 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 199
>gi|432105435|gb|ELK31650.1| Transcription factor GATA-6 [Myotis davidii]
Length = 270
Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 29/52 (55%), Positives = 34/52 (65%)
Query: 5 FGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
I + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 103 MAIMDLPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 154
>gi|47208918|emb|CAF91187.1| unnamed protein product [Tetraodon nigroviridis]
Length = 334
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/39 (69%), Positives = 31/39 (79%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL
Sbjct: 286 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGL 324
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 25/49 (51%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 232 RPKSRSCSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 280
>gi|170044815|ref|XP_001850028.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867946|gb|EDS31329.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 119
Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/40 (67%), Positives = 30/40 (75%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
Q+ +R+G SCANC TT TTLWRRN GEPVCNACGL
Sbjct: 8 QSAARRAGTSCANCKTTTTTLWRRNQGGEPVCNACGLYYK 47
>gi|391347263|ref|XP_003747884.1| PREDICTED: uncharacterized protein LOC100905208 [Metaseiulus
occidentalis]
Length = 594
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 31/44 (70%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
NKR G+ C+NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 371 NKRPGLVCSNCDTNNTTLWRRNNQGEPVCNACGLYYKLHGVNRP 414
Score = 39.3 bits (90), Expect = 0.34, Method: Composition-based stats.
Identities = 17/37 (45%), Positives = 19/37 (51%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T LWRR+ G +CNACGL N P
Sbjct: 272 CVNCGAVSTPLWRRDMAGHYLCNACGLYTKMNGSNRP 308
>gi|24647320|ref|NP_732098.1| serpent, isoform B [Drosophila melanogaster]
gi|23171429|gb|AAN13691.1| serpent, isoform B [Drosophila melanogaster]
Length = 1249
Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL + P
Sbjct: 776 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRP 824
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 27/55 (49%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNG 71
G C NC T LWRR+N G +CNACGL M +N P P S R G
Sbjct: 731 GRECVNCGAISTPLWRRDNTGHYLCNACGLYMKMNGMNRPLIKQPRRLSASKRAG 785
>gi|297275072|ref|XP_002800933.1| PREDICTED: transcription factor GATA-6-like [Macaca mulatta]
Length = 653
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 495 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 538
>gi|147903431|ref|NP_001083725.1| GATA-binding factor 6-B [Xenopus laevis]
gi|51950143|gb|AAH82349.1| Gata-6 protein [Xenopus laevis]
Length = 502
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/44 (61%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G++CANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 340 SRRIGLACANCHTTTTTLWRRNTEGEPVCNACGLYMKLHGVPRP 383
Score = 40.8 bits (94), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 293 CVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 338
>gi|213511622|ref|NP_001133239.1| GATA-binding factor 1 [Salmo salar]
gi|209147311|gb|ACI32884.1| GATA-binding factor 3 [Salmo salar]
Length = 424
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/44 (61%), Positives = 34/44 (77%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G CANC+T+ TTLWRRN +GEPVCNACGL ++N P
Sbjct: 290 SKRAGTQCANCNTSTTTLWRRNASGEPVCNACGLYFKLHNVNRP 333
Score = 40.4 bits (93), Expect = 0.13, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T T LWRR+ G +CNACGL N P
Sbjct: 243 CVNCGATNTPLWRRDGTGHYLCNACGLYHKMNGQNRP 279
>gi|157278509|ref|NP_001098356.1| transcription factor GATA-2 [Oryzias latipes]
gi|57157777|dbj|BAD83861.1| transcription factor GATA-2 [Oryzias latipes]
Length = 455
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN +G+PVCNACGL ++N P
Sbjct: 317 RRLSAARRAGTCCANCQTTTTTLWRRNASGDPVCNACGLYYKLHNVNRP 365
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 21/53 (39%), Positives = 28/53 (52%)
Query: 4 TFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+F + +T + G C NC T T LWRR++ G +CNACGL N P
Sbjct: 259 SFKNKGKTRSCTEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRP 311
>gi|332016555|gb|EGI57436.1| GATA-binding factor A [Acromyrmex echinatior]
Length = 407
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 210 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 255
Score = 43.1 bits (100), Expect = 0.022, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 162 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 201
>gi|268551963|ref|XP_002633964.1| C. briggsae CBR-ELT-1 protein [Caenorhabditis briggsae]
Length = 483
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 30/49 (61%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R+Q KR+G+ C NC T TTLWRRN G PVCNACGL + P
Sbjct: 327 RQQNAQKRTGIECVNCHTNNTTLWRRNGEGHPVCNACGLYYKLHKVERP 375
Score = 40.4 bits (93), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC T LWRR+ +G +CNACGL
Sbjct: 284 CVNCGVHATPLWRRDGSGNYLCNACGLYF 312
>gi|410052588|ref|XP_003954430.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6 [Pan
troglodytes]
Length = 472
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 314 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 357
Score = 39.7 bits (91), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 267 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 312
>gi|308478267|ref|XP_003101345.1| CRE-ELT-1 protein [Caenorhabditis remanei]
gi|308263246|gb|EFP07199.1| CRE-ELT-1 protein [Caenorhabditis remanei]
Length = 412
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 30/49 (61%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R+Q KR+G+ C NC T TTLWRRN G PVCNACGL + P
Sbjct: 259 RQQNAQKRTGIECVNCHTNTTTLWRRNGEGHPVCNACGLYYKLHKVERP 307
Score = 40.0 bits (92), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC T LWRR+ +G +CNACGL
Sbjct: 216 CVNCGVHATPLWRRDGSGNYLCNACGLYF 244
>gi|194767723|ref|XP_001965964.1| GF11891 [Drosophila ananassae]
gi|190619807|gb|EDV35331.1| GF11891 [Drosophila ananassae]
Length = 551
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 226 TATRRLGLRCTNCGTHTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 271
Score = 43.1 bits (100), Expect = 0.023, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 178 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 217
>gi|444724388|gb|ELW64993.1| Trans-acting T-cell-specific transcription factor GATA-3 [Tupaia
chinensis]
Length = 363
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/39 (69%), Positives = 31/39 (79%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL
Sbjct: 305 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGL 343
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 251 RPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 299
>gi|45361689|ref|NP_989422.1| GATA binding protein 6 [Xenopus (Silurana) tropicalis]
gi|32442468|gb|AAP82292.1| zinc-finger transcription factor Gata6 [Xenopus (Silurana)
tropicalis]
Length = 524
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/44 (61%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G++CANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 363 SRRIGLACANCHTTTTTLWRRNTEGEPVCNACGLYMKLHGVPRP 406
Score = 40.8 bits (94), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 316 CVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 361
>gi|3183008|sp|P70005.1|GAT6B_XENLA RecName: Full=GATA-binding factor 6-B; AltName: Full=Transcription
factor xGATA-6b
gi|1621281|emb|CAA70088.1| GATA-6 protein [Xenopus laevis]
Length = 391
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/44 (61%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G++CANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 229 SRRIGLACANCHTTTTTLWRRNTEGEPVCNACGLYMKLHGVPRP 272
Score = 40.8 bits (94), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 182 CVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 227
>gi|74192130|dbj|BAE34273.1| unnamed protein product [Mus musculus]
Length = 365
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/39 (69%), Positives = 31/39 (79%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +R+G SCANC TT TTLWRRN NG+PVCNACGL
Sbjct: 304 RRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGL 342
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 5 FGIREQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 246 FGCKSRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 298
>gi|78706774|ref|NP_001027190.1| serpent, isoform D [Drosophila melanogaster]
gi|442619343|ref|NP_001262618.1| serpent, isoform F [Drosophila melanogaster]
gi|42415419|gb|AAS15679.1| LD44281p [Drosophila melanogaster]
gi|71854562|gb|AAN13692.3| serpent, isoform D [Drosophila melanogaster]
gi|440217477|gb|AGB95998.1| serpent, isoform F [Drosophila melanogaster]
Length = 746
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL + P
Sbjct: 273 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRP 321
>gi|86575043|ref|NP_001033436.1| Protein ELT-1, isoform b [Caenorhabditis elegans]
gi|119299|sp|P28515.1|ELT1_CAEEL RecName: Full=Transcription factor elt-1
gi|6702|emb|CAA40967.1| elt-1 [Caenorhabditis elegans]
gi|74834772|emb|CAJ30235.1| Protein ELT-1, isoform b [Caenorhabditis elegans]
Length = 416
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 30/49 (61%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R+Q KR+G+ C NC T TTLWRRN G PVCNACGL + P
Sbjct: 260 RQQNAQKRTGIECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVERP 308
Score = 38.1 bits (87), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC T LWRR+ +G +CNACGL
Sbjct: 217 CVNCGVHNTPLWRRDGSGNYLCNACGLYF 245
>gi|386765869|ref|NP_001247128.1| serpent, isoform E [Drosophila melanogaster]
gi|383292736|gb|AFH06446.1| serpent, isoform E [Drosophila melanogaster]
Length = 731
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL + P
Sbjct: 258 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRP 306
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 27/55 (49%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNG 71
G C NC T LWRR+N G +CNACGL M +N P P S R G
Sbjct: 213 GRECVNCGAISTPLWRRDNTGHYLCNACGLYMKMNGMNRPLIKQPRRLSASKRAG 267
>gi|348507924|ref|XP_003441505.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Oreochromis
niloticus]
Length = 455
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN +G+PVCNACGL ++N P
Sbjct: 318 RRLSAARRAGTCCANCQTTTTTLWRRNAHGDPVCNACGLYYKLHNVNRP 366
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 22/51 (43%), Positives = 26/51 (50%), Gaps = 3/51 (5%)
Query: 9 EQTGNKRS---GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G RS G C NC T T LWRR++ G +CNACGL N P
Sbjct: 262 KNKGKTRSCTEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRP 312
>gi|449278190|gb|EMC86134.1| Transcription factor GATA-6, partial [Columba livia]
Length = 396
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 237 SRRVGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 280
Score = 39.3 bits (90), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 190 CVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRPLIKPQKRVP 235
>gi|340720273|ref|XP_003398565.1| PREDICTED: hypothetical protein LOC100651336 [Bombus terrestris]
Length = 985
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/52 (51%), Positives = 34/52 (65%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
R + ++R G SC+NC TT T+LWRRN GEPVCNACGL +N P +
Sbjct: 738 RRLSASRRVGTSCSNCQTTMTSLWRRNTLGEPVCNACGLYFKLHGVNRPHAM 789
Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 29/64 (45%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALSV 76
G C NC T LWRR+ G +CNACGL +N P P S R G + S
Sbjct: 693 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASRRVGTSCSN 752
Query: 77 ITST 80
+T
Sbjct: 753 CQTT 756
>gi|327269885|ref|XP_003219723.1| PREDICTED: transcription factor GATA-6-like [Anolis carolinensis]
Length = 504
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/43 (65%), Positives = 31/43 (72%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 344 RRLGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 386
Score = 38.1 bits (87), Expect = 0.74, Method: Composition-based stats.
Identities = 15/37 (40%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWR++ G +CNACGL ++ P
Sbjct: 296 CVNCGSFQTPLWRKDGTGNYLCNACGLYTKMNGLSRP 332
>gi|603162|gb|AAC13781.1| BmGATA beta isoform 3, partial [Bombyx mori]
Length = 440
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 29/56 (51%), Positives = 35/56 (62%)
Query: 1 MILTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++LT R+ G KR G C NC TT T+LWRRN GE VCNACGL ++N P
Sbjct: 234 ILLTTSRRKGMGAKRPGTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRP 289
>gi|345803446|ref|XP_547642.3| PREDICTED: transcription factor GATA-6 [Canis lupus familiaris]
Length = 627
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 465 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 508
Score = 39.7 bits (91), Expect = 0.24, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 418 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 454
>gi|313224233|emb|CBY20022.1| unnamed protein product [Oikopleura dioica]
Length = 483
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/43 (62%), Positives = 33/43 (76%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+G SC+NC TT TTLWRRN+NG+PVCNACGL +N P
Sbjct: 308 RRAGTSCSNCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVNRP 350
Score = 42.4 bits (98), Expect = 0.033, Method: Composition-based stats.
Identities = 19/44 (43%), Positives = 22/44 (50%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G C NC T T LWRR+ G +CNACGL N P
Sbjct: 253 DNSEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRP 296
>gi|339236855|ref|XP_003379982.1| zinc finger protein [Trichinella spiralis]
gi|316977267|gb|EFV60390.1| zinc finger protein [Trichinella spiralis]
Length = 419
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/48 (56%), Positives = 32/48 (66%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ T +KR G+ C NC T TTLWRRN NG+PVCNACGL +N P
Sbjct: 114 QATTSKRQGLVCVNCQTMQTTLWRRNQNGDPVCNACGLYFKLHRVNRP 161
>gi|242005616|ref|XP_002423660.1| transcription factor GATA-4, putative [Pediculus humanus corporis]
gi|212506820|gb|EEB10922.1| transcription factor GATA-4, putative [Pediculus humanus corporis]
Length = 231
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 31/49 (63%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ T KR G+ C NC T TTLWRRNN GEPVCNACGL IN P
Sbjct: 136 KRMTATKRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGINRP 184
Score = 42.7 bits (99), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 27/52 (51%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + M +
Sbjct: 91 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRMTA 140
>gi|326917513|ref|XP_003205043.1| PREDICTED: transcription factor GATA-6-like [Meleagris gallopavo]
Length = 402
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 243 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 286
Score = 39.7 bits (91), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 196 CVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRPLIKPQKRVP 241
>gi|348507926|ref|XP_003441506.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Oreochromis
niloticus]
Length = 452
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWRRN +G+PVCNACGL ++N P
Sbjct: 315 RRLSAARRAGTCCANCQTTTTTLWRRNAHGDPVCNACGLYYKLHNVNRP 363
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 22/51 (43%), Positives = 26/51 (50%), Gaps = 3/51 (5%)
Query: 9 EQTGNKRS---GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ G RS G C NC T T LWRR++ G +CNACGL N P
Sbjct: 259 KNKGKTRSCTEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRP 309
>gi|47227325|emb|CAF96874.1| unnamed protein product [Tetraodon nigroviridis]
Length = 190
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/44 (61%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G CANC T+ TTLWRRN +GEPVCNACGL ++N P
Sbjct: 53 SKRAGTLCANCHTSTTTLWRRNASGEPVCNACGLYFKLHNVNRP 96
Score = 42.7 bits (99), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDAL 74
C NC T T LWRR+ G +CNACGL N P I P ++S+ L
Sbjct: 6 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP-LIRPKKRLIVSKRAGTL 59
>gi|328786707|ref|XP_001121273.2| PREDICTED: hypothetical protein LOC725422 [Apis mellifera]
Length = 942
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/52 (51%), Positives = 34/52 (65%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
R + ++R G SC+NC TT T+LWRRN GEPVCNACGL +N P +
Sbjct: 660 RRLSASRRVGTSCSNCQTTMTSLWRRNTLGEPVCNACGLYFKLHGVNRPHTM 711
Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 29/64 (45%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALSV 76
G C NC T LWRR+ G +CNACGL +N P P S R G + S
Sbjct: 615 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASRRVGTSCSN 674
Query: 77 ITST 80
+T
Sbjct: 675 CQTT 678
>gi|195452164|ref|XP_002073240.1| GK13255 [Drosophila willistoni]
gi|194169325|gb|EDW84226.1| GK13255 [Drosophila willistoni]
Length = 550
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 210 TATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRP 255
Score = 42.7 bits (99), Expect = 0.028, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 162 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 201
>gi|297489684|ref|XP_002697773.1| PREDICTED: transcription factor GATA-6, partial [Bos taurus]
gi|296473832|tpg|DAA15947.1| TPA: GATA-6 [Bos taurus]
Length = 344
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 186 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 229
Score = 39.3 bits (90), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 139 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 184
>gi|54306071|ref|NP_571310.2| transcription factor GATA-5 [Danio rerio]
gi|30519592|emb|CAB43400.2| transcription factor gata5 [Danio rerio]
Length = 383
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 44/82 (53%), Gaps = 10/82 (12%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP----------E 57
R Q+ ++R+G+ C NC T+ TTLWRRN GEPVCNACGL M + P
Sbjct: 229 RLQSTSRRAGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKESIQTR 288
Query: 58 KIMPPIESMISRNGDALSVITS 79
K P + S +G +S TS
Sbjct: 289 KRKPKMPKTKSSSGSTVSGATS 310
Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC + T LWRR+ G +CNACGL IN P
Sbjct: 183 GRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRP 222
>gi|397489370|ref|XP_003815701.1| PREDICTED: transcription factor GATA-6 [Pan paniscus]
Length = 402
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 244 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 287
Score = 39.3 bits (90), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 197 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 242
>gi|307198088|gb|EFN79141.1| Transcription factor GATA-4 [Harpegnathos saltator]
Length = 785
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 34/52 (65%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
R + +R G++C+NC TT T+LWRRN GEPVCNACGL +N P +
Sbjct: 609 RRLSATRRMGLACSNCETTMTSLWRRNTMGEPVCNACGLYYKLHGVNRPSTM 660
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 27/59 (45%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALS 75
G C NC T LWRR+ G +CNACGL +N P P S R G A S
Sbjct: 564 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSATRRMGLACS 622
>gi|410977466|ref|XP_003995126.1| PREDICTED: transcription factor GATA-6-like [Felis catus]
Length = 495
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 337 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 380
>gi|94573490|gb|AAI16538.1| GATA-binding protein 5 [Danio rerio]
Length = 383
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R Q+ ++R+G+ C NC T+ TTLWRRN GEPVCNACGL M + P
Sbjct: 229 RLQSTSRRAGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 277
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC + T LWRR+ G +CNACGL IN P
Sbjct: 183 GRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRP 222
>gi|323319539|gb|ADX36140.1| GATA123b [Schmidtea polychroa]
Length = 239
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 36/49 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R+G C+NC+T+ TTLWRRN+NGEPVCNACGL ++ P
Sbjct: 38 RRLSASRRTGTICSNCNTSTTTLWRRNSNGEPVCNACGLYFKLHSVSRP 86
>gi|426253995|ref|XP_004020674.1| PREDICTED: transcription factor GATA-6 [Ovis aries]
Length = 409
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 36/55 (65%)
Query: 2 ILTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++ F + ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 240 LIGFPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 294
>gi|162316932|gb|ABX84143.1| GATA-binding protein 6 [Microtus levis]
Length = 248
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 97 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 140
Score = 39.3 bits (90), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 50 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 95
>gi|195157150|ref|XP_002019459.1| GL12212 [Drosophila persimilis]
gi|194116050|gb|EDW38093.1| GL12212 [Drosophila persimilis]
Length = 1126
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL + P
Sbjct: 821 RRLSASKRNGLSCSNCLTTYTSLWRRNPAGEPVCNACGLYFKLHSVTRP 869
>gi|355697729|gb|EHH28277.1| GATA-binding factor 4 [Macaca mulatta]
Length = 304
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 127 SRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 170
Score = 43.5 bits (101), Expect = 0.018, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 79 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 118
>gi|355701861|gb|EHH29214.1| GATA-binding factor 6 [Macaca mulatta]
Length = 398
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 240 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 283
Score = 39.3 bits (90), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 193 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 238
>gi|157125536|ref|XP_001654375.1| transcription factor GATA-4 (GATA binding factor-4) [Aedes aegypti]
gi|108873610|gb|EAT37835.1| AAEL010222-PA [Aedes aegypti]
Length = 1034
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 35/50 (70%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++E + +R G+ C+NC+T T+LWRRN GEPVCNACGL ++N P
Sbjct: 663 VKEPSSARRVGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRP 712
Score = 43.5 bits (101), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 615 GRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRP 654
>gi|350423466|ref|XP_003493491.1| PREDICTED: GATA-binding factor 5-A-like, partial [Bombus impatiens]
Length = 317
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/49 (55%), Positives = 31/49 (63%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P +
Sbjct: 100 TATRRLGLCCTNCGTLTTTLWRRNNEGEPVCNACGLYFKLHGVNRPRAM 148
Score = 42.7 bits (99), Expect = 0.031, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 52 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 91
>gi|341880193|gb|EGT36128.1| CBN-ELT-1 protein [Caenorhabditis brenneri]
Length = 393
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 30/49 (61%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R+Q KR+G+ C NC T TTLWRRN G PVCNACGL + P
Sbjct: 255 RQQNAQKRTGIECVNCHTNNTTLWRRNGEGHPVCNACGLYYKLHKVERP 303
Score = 38.5 bits (88), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC T LWRR+ +G +CNACGL
Sbjct: 212 CVNCGVHNTPLWRRDGSGNYLCNACGLYF 240
>gi|221127849|ref|XP_002159183.1| PREDICTED: uncharacterized protein LOC100208265 [Hydra
magnipapillata]
Length = 408
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G+ C+NC T+ TTLWRRN +GEPVCNACGL +N P
Sbjct: 271 RRLSQARRTGIVCSNCKTSQTTLWRRNGSGEPVCNACGLYYKLHKVNRP 319
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G R +G+ G C NC T T LWRR+ G +CNACGL N P
Sbjct: 216 GSRSNSGDFE-GRECMNCGATSTPLWRRDTRGHYLCNACGLYHKMNGANRP 265
>gi|119220894|gb|ABL61528.1| GATA transcription factor pannier [Drosophila quadrilineata]
Length = 344
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 59 TATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 104
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 57
>gi|56849575|gb|AAW31646.1| GATAa2 transcription factor [Aedes aegypti]
Length = 1034
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 35/50 (70%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++E + +R G+ C+NC+T T+LWRRN GEPVCNACGL ++N P
Sbjct: 663 VKEPSSARRVGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRP 712
Score = 43.5 bits (101), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 615 GRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRP 654
>gi|307170386|gb|EFN62698.1| GATA-binding factor A [Camponotus floridanus]
Length = 638
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 32/49 (65%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 202 KRLTATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 250
Score = 42.4 bits (98), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 157 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 196
>gi|341894024|gb|EGT49959.1| hypothetical protein CAEBREN_00985 [Caenorhabditis brenneri]
Length = 375
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 30/49 (61%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R+Q KR+G+ C NC T TTLWRRN G PVCNACGL + P
Sbjct: 255 RQQNAQKRTGIECVNCHTNNTTLWRRNGEGHPVCNACGLYYKLHKVERP 303
Score = 38.5 bits (88), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC T LWRR+ +G +CNACGL
Sbjct: 212 CVNCGVHNTPLWRRDGSGNYLCNACGLYF 240
>gi|344269962|ref|XP_003406816.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Loxodonta africana]
Length = 390
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 226 KRAPASRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 274
Score = 40.8 bits (94), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 2/50 (4%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
C NC + T LWRR+ G +CNACG ++ P++ PP + SR
Sbjct: 185 CVNCGSIQTPLWRRDGTGHYLCNACGXYSKMNGLSGPDQ--PPKRAPASR 232
>gi|170043914|ref|XP_001849612.1| GATA transcription factor GATAd [Culex quinquefasciatus]
gi|167867187|gb|EDS30570.1| GATA transcription factor GATAd [Culex quinquefasciatus]
Length = 520
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 32/49 (65%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 159 KRLTATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 207
Score = 42.7 bits (99), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 114 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 153
>gi|395511636|ref|XP_003760062.1| PREDICTED: transcription factor GATA-6 [Sarcophilus harrisii]
Length = 461
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/43 (65%), Positives = 31/43 (72%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 304 RRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 346
Score = 38.9 bits (89), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 256 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 301
>gi|359322060|ref|XP_541740.3| PREDICTED: LOW QUALITY PROTEIN: endothelial transcription factor
GATA-2 isoform 1 [Canis lupus familiaris]
Length = 480
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/49 (55%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R+G CANC TT TTLWR N NG+PVCNACGL ++N P
Sbjct: 337 RRLSAARRAGTCCANCQTTTTTLWRGNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|348576623|ref|XP_003474086.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Cavia porcellus]
Length = 418
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 260 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 303
Score = 41.6 bits (96), Expect = 0.064, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 23/40 (57%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G CANC + T LWRR+ G +CNACGL ++ P
Sbjct: 210 GRECANCGSFQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 249
>gi|345306897|ref|XP_001506818.2| PREDICTED: transcription factor GATA-6-like, partial
[Ornithorhynchus anatinus]
Length = 237
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/44 (61%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC T+ TTLWRRN GEPVCNACGL M + P
Sbjct: 78 SRRLGLSCANCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 121
>gi|157125522|ref|XP_001654368.1| GATA transcription factor (GATAd) [Aedes aegypti]
gi|108873603|gb|EAT37828.1| AAEL010221-PA [Aedes aegypti]
Length = 483
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 167 TATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 212
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 117 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 166
>gi|56900910|gb|AAW31748.1| GATA transcription factor GATAd [Aedes aegypti]
Length = 483
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 167 TATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 212
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 117 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 166
>gi|7381420|gb|AAF61481.1|AF184154_1 zinc finger DNA binding protein pannier [Ceratitis capitata]
Length = 324
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 156 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 201
Score = 42.4 bits (98), Expect = 0.032, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 108 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 147
>gi|383863209|ref|XP_003707074.1| PREDICTED: GATA-binding factor A-like [Megachile rotundata]
Length = 369
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 162 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 207
Score = 42.4 bits (98), Expect = 0.032, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 114 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 153
>gi|442619345|ref|NP_001262619.1| serpent, isoform G [Drosophila melanogaster]
gi|440217478|gb|AGB95999.1| serpent, isoform G [Drosophila melanogaster]
Length = 469
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL + P
Sbjct: 273 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRP 321
>gi|355689778|gb|AER98943.1| Transcription factor GATA-6 [Mustela putorius furo]
Length = 180
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 20 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 63
>gi|355754933|gb|EHH58800.1| GATA-binding factor 6 [Macaca fascicularis]
Length = 410
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 252 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 295
>gi|345490814|ref|XP_003426466.1| PREDICTED: hypothetical protein LOC100678632 [Nasonia
vitripennis]
Length = 262
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R+G+SC+NC TT T+LWRRN G+ VCNACGL IN P
Sbjct: 32 RRLSASRRAGLSCSNCKTTTTSLWRRNAQGDAVCNACGLYFKLHGINRP 80
>gi|390179289|ref|XP_003736856.1| GA26230, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859789|gb|EIM52929.1| GA26230, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 1295
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL + P
Sbjct: 803 RRLSASKRNGLSCSNCLTTYTSLWRRNPAGEPVCNACGLYFKLHSVTRP 851
Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 32/64 (50%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALSV 76
G C NC T LWRR+N G +CNACGL M +N P P S RNG + S
Sbjct: 758 GRECVNCGAIQTPLWRRDNTGHYLCNACGLYMKMNGMNRPLIKQPRRLSASKRNGLSCSN 817
Query: 77 ITST 80
+T
Sbjct: 818 CLTT 821
>gi|440903489|gb|ELR54140.1| Transcription factor GATA-6 [Bos grunniens mutus]
Length = 481
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 323 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 366
>gi|291045150|dbj|BAI82362.1| GATA transcription factor [Haemaphysalis longicornis]
Length = 319
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G++C+NC T T+LWRRNN GEPVCNACGL +N P
Sbjct: 60 RRLSASRRVGLTCSNCQTGTTSLWRRNNVGEPVCNACGLYFRLHGVNRP 108
Score = 42.0 bits (97), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 5/48 (10%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD---INAPEKI 59
G C NC + T LWRR+ G +CNACGL MN V I P ++
Sbjct: 15 GRECVNCGSISTPLWRRDGTGHYLCNACGLYNKMNGVHRPVIKTPRRL 62
>gi|405960917|gb|EKC26787.1| Trans-acting T-cell-specific transcription factor GATA-3
[Crassostrea gigas]
Length = 518
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 30/41 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
R + +R+G SCANC T TTLWRRN NG+PVCNACGL
Sbjct: 256 RRLSAARRAGTSCANCGTNTTTLWRRNGNGDPVCNACGLYY 296
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 202 RTKARSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 250
>gi|198454790|ref|XP_002137945.1| GA26230, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132955|gb|EDY68503.1| GA26230, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 1316
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL + P
Sbjct: 824 RRLSASKRNGLSCSNCLTTYTSLWRRNPAGEPVCNACGLYFKLHSVTRP 872
>gi|46909571|ref|NP_034388.2| transcription factor GATA-6 [Mus musculus]
gi|215274120|sp|Q61169.3|GATA6_MOUSE RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
gi|74210838|dbj|BAE25049.1| unnamed protein product [Mus musculus]
gi|148691038|gb|EDL22985.1| GATA binding protein 6 [Mus musculus]
Length = 589
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 431 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 474
Score = 39.3 bits (90), Expect = 0.32, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 384 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 420
>gi|225544221|gb|ACN91531.1| pnr [Ceratitis capitata]
Length = 539
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 159 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 204
Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 108 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 157
>gi|215274094|sp|P46153.2|GATA6_RAT RecName: Full=Transcription factor GATA-6; AltName:
Full=DNA-binding protein GATA-GT2; AltName:
Full=GATA-binding factor 6
Length = 587
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 431 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 474
Score = 39.3 bits (90), Expect = 0.32, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 384 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 420
>gi|62205319|gb|AAH93138.1| Gata5 protein, partial [Danio rerio]
Length = 285
Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R Q+ ++R+G+ C NC T+ TTLWRRN GEPVCNACGL M + P
Sbjct: 229 RLQSTSRRAGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 277
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC + T LWRR+ G +CNACGL IN P
Sbjct: 183 GRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRP 222
>gi|156538733|ref|XP_001607848.1| PREDICTED: GATA-binding factor 5-A-like [Nasonia vitripennis]
Length = 442
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 220 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 265
Score = 42.4 bits (98), Expect = 0.036, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 172 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKINGMNRP 211
>gi|18032000|gb|AAL32068.1| pannier-like protein [Calliphora vicina]
Length = 511
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 171 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 216
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 120 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 169
>gi|426385574|ref|XP_004059281.1| PREDICTED: transcription factor GATA-6 [Gorilla gorilla gorilla]
Length = 596
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 438 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 481
Score = 39.3 bits (90), Expect = 0.34, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 391 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 427
>gi|18858735|ref|NP_571311.1| transcription factor GATA-4 [Danio rerio]
gi|6524988|gb|AAF15275.1|AF191577_1 zinc finger transcription factor Gata4 [Danio rerio]
Length = 338
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 32/49 (65%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SC NC T TTLWRRN GEPVCNACGL M + P
Sbjct: 209 RRLSASRRVGLSCTNCQPTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 257
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 164 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 203
>gi|242020217|ref|XP_002430552.1| GATA-binding factor, putative [Pediculus humanus corporis]
gi|212515716|gb|EEB17814.1| GATA-binding factor, putative [Pediculus humanus corporis]
Length = 409
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ T NKR G+ C+NC+T T+LWRRN+ GEPVCNACGL +N P
Sbjct: 198 KRLTNNKRLGLQCSNCNTATTSLWRRNSLGEPVCNACGLYYKLHGVNRP 246
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 23/45 (51%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMP 61
G C NC T T LWRR+ G +CNACGL +N P P
Sbjct: 153 GRECVNCGTVSTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP 197
>gi|395823099|ref|XP_003784834.1| PREDICTED: transcription factor GATA-6 [Otolemur garnettii]
Length = 589
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 431 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 474
Score = 39.3 bits (90), Expect = 0.34, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 384 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 420
>gi|402902790|ref|XP_003914278.1| PREDICTED: transcription factor GATA-6 [Papio anubis]
Length = 595
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 437 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 480
Score = 39.3 bits (90), Expect = 0.35, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 390 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 426
>gi|387540446|gb|AFJ70850.1| transcription factor GATA-6 [Macaca mulatta]
Length = 595
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 437 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 480
Score = 39.3 bits (90), Expect = 0.35, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 390 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 426
>gi|194678210|ref|XP_001253597.2| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6 [Bos
taurus]
Length = 497
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 339 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 382
Score = 38.9 bits (89), Expect = 0.35, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 292 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 328
>gi|2494686|sp|Q91678.1|GAT6A_XENLA RecName: Full=GATA-binding factor 6-A; AltName: Full=Transcription
factor xGATA-6a
gi|1209880|gb|AAB05648.1| transcription factor xGATA-6 [Xenopus laevis]
Length = 391
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G++CANC T+ TTLWRRN GEPVCNACGL M + P
Sbjct: 229 SRRIGLACANCHTSTTTLWRRNTEGEPVCNACGLYMKLHGVPRP 272
Score = 40.4 bits (93), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 182 CVNCGSVQTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRVP 227
>gi|40288197|ref|NP_005248.2| transcription factor GATA-6 [Homo sapiens]
gi|215273987|sp|Q92908.2|GATA6_HUMAN RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
gi|119621542|gb|EAX01137.1| GATA binding protein 6 [Homo sapiens]
gi|167887577|gb|ACA05995.1| transcription factor GATA-6 [Homo sapiens]
Length = 595
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 437 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 480
Score = 38.9 bits (89), Expect = 0.35, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 390 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 426
>gi|45382241|ref|NP_990751.1| transcription factor GATA-6 [Gallus gallus]
gi|1169849|sp|P43693.1|GATA6_CHICK RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
gi|511484|gb|AAA57505.1| GATA-6 transcription factor [Gallus gallus]
Length = 387
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 228 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 271
Score = 39.3 bits (90), Expect = 0.29, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 181 CVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRP 217
>gi|225544205|gb|ACN91521.1| pnr [Bactrocera dorsalis]
gi|225544207|gb|ACN91522.1| pnr [Bactrocera dorsalis]
Length = 540
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 159 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 204
Score = 44.7 bits (104), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 108 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 157
>gi|148236237|ref|NP_001081452.1| GATA-binding factor 6-A [Xenopus laevis]
gi|49899014|gb|AAH76718.1| LOC397845 protein [Xenopus laevis]
Length = 502
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G++CANC T+ TTLWRRN GEPVCNACGL M + P
Sbjct: 340 SRRIGLACANCHTSTTTLWRRNTEGEPVCNACGLYMKLHGVPRP 383
Score = 40.4 bits (93), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 293 CVNCGSVQTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRVP 338
>gi|343488529|ref|NP_999493.2| transcription factor GATA-6 [Sus scrofa]
Length = 592
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 434 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 477
Score = 37.7 bits (86), Expect = 0.96, Method: Composition-based stats.
Identities = 15/37 (40%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CN CGL ++ P
Sbjct: 387 CVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNGLSGP 423
>gi|296222377|ref|XP_002807542.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Callithrix jacchus]
Length = 592
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 434 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 477
Score = 38.9 bits (89), Expect = 0.35, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 387 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 423
>gi|213623822|gb|AAI70266.1| GATA binding factor-1b [Xenopus laevis]
Length = 364
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/44 (56%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G C+NC T+ TTLWRRN +G+PVCNACGL ++N P
Sbjct: 227 SKRAGTQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVNRP 270
Score = 42.0 bits (97), Expect = 0.053, Method: Composition-based stats.
Identities = 22/50 (44%), Positives = 27/50 (54%), Gaps = 1/50 (2%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
C NC T T LWRR+ +G +CNACGL N P I P +IS+
Sbjct: 180 CVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRP-LIRPKKRLIISK 228
>gi|225544200|gb|ACN91518.1| pnr [Bactrocera cucurbitae]
Length = 517
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 159 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 204
Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 108 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 157
>gi|213626857|gb|AAI70262.1| GATA binding factor-1b [Xenopus laevis]
Length = 364
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/44 (56%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G C+NC T+ TTLWRRN +G+PVCNACGL ++N P
Sbjct: 227 SKRAGTQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVNRP 270
Score = 42.0 bits (97), Expect = 0.052, Method: Composition-based stats.
Identities = 22/50 (44%), Positives = 27/50 (54%), Gaps = 1/50 (2%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
C NC T T LWRR+ +G +CNACGL N P I P +IS+
Sbjct: 180 CVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRP-LIRPKKRLIISK 228
>gi|62898826|dbj|BAD97267.1| GATA binding protein 6 variant [Homo sapiens]
Length = 595
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 437 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 480
Score = 38.9 bits (89), Expect = 0.36, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 390 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 426
>gi|8648977|emb|CAB94842.1| GATA-4 zinc-finger transcription factor [Oryctolagus cuniculus]
Length = 116
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SCANC TT TTLWRRN G+PVCNACGL M + P
Sbjct: 49 RRLSASRRVGLSCANCQTTTTTLWRRNAEGKPVCNACGLYMKLHGVPRP 97
Score = 42.7 bits (99), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 21/41 (51%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 3 EGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 43
>gi|73949078|ref|XP_858071.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
isoform 2 [Canis lupus familiaris]
Length = 439
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/39 (69%), Positives = 29/39 (74%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
SCANC TT TTLWRRN NG+PVCNACGL +IN P
Sbjct: 311 TSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRP 349
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 5 FGIREQTGNKRS--GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
FG + + + S G C NC T T LWRR+ G +CNACGL N P
Sbjct: 247 FGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRP 300
>gi|297702367|ref|XP_002828153.1| PREDICTED: transcription factor GATA-6 [Pongo abelii]
Length = 595
Score = 62.8 bits (151), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 437 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 480
Score = 38.9 bits (89), Expect = 0.37, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 390 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 426
>gi|124001863|gb|ABM87879.1| GATA4 [Papio hamadryas]
Length = 97
Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 28/42 (66%), Positives = 32/42 (76%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
R + ++R G+SCANC TT TTLWRRN GEPVCNACGL M
Sbjct: 53 RRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMK 94
Score = 42.4 bits (98), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 8 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 47
>gi|213623646|gb|AAI70023.1| GATA binding factor-1 [Xenopus laevis]
gi|213625189|gb|AAI70025.1| GATA binding factor-1 [Xenopus laevis]
Length = 359
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/44 (56%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G C+NC T+ TTLWRRN +G+PVCNACGL ++N P
Sbjct: 225 SKRAGTQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVNRP 268
Score = 42.0 bits (97), Expect = 0.047, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T T LWRR+ +G +CNACGL N P
Sbjct: 178 CVNCGATVTPLWRRDMSGHYLCNACGLYHKMNGQNRP 214
>gi|260785648|ref|XP_002587872.1| hypothetical protein BRAFLDRAFT_87259 [Branchiostoma floridae]
gi|229273027|gb|EEN43883.1| hypothetical protein BRAFLDRAFT_87259 [Branchiostoma floridae]
Length = 974
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 26/39 (66%), Positives = 32/39 (82%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
++ + ++R G+ CANC TT TTLWRRNN GEPVCNACGL
Sbjct: 643 KKLSASRRVGLQCANCRTTQTTLWRRNNEGEPVCNACGL 681
>gi|148232497|ref|NP_001079109.1| GATA-binding factor 1-A [Xenopus laevis]
gi|120966|sp|P23767.1|GAT1A_XENLA RecName: Full=GATA-binding factor 1-A; AltName: Full=Transcription
factor xGATA-1A
gi|214167|gb|AAA49721.1| GATA binding factor-1 [Xenopus laevis]
Length = 359
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/44 (56%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G C+NC T+ TTLWRRN +G+PVCNACGL ++N P
Sbjct: 225 SKRAGTQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVNRP 268
Score = 42.0 bits (97), Expect = 0.047, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T T LWRR+ +G +CNACGL N P
Sbjct: 178 CVNCGATVTPLWRRDMSGHYLCNACGLYHKMNGQNRP 214
>gi|432865839|ref|XP_004070639.1| PREDICTED: GATA-binding factor 5-A-like [Oryzias latipes]
Length = 217
Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R Q+ ++R+G+ C NC T+ TTLWRRN GEPVCNACGL M + P
Sbjct: 63 RLQSTSRRAGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 111
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC + T LWRR+ G +CNACGL IN P
Sbjct: 17 GRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGINRP 56
>gi|148234737|ref|NP_001079244.1| GATA-binding factor 1-B [Xenopus laevis]
gi|120969|sp|P23768.1|GAT1B_XENLA RecName: Full=GATA-binding factor 1-B; AltName: Full=Transcription
factor xGATA-1B
gi|214169|gb|AAA49722.1| GATA binding factor-1 [Xenopus laevis]
Length = 364
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/44 (56%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G C+NC T+ TTLWRRN G+PVCNACGL ++N P
Sbjct: 227 SKRAGTQCSNCHTSTTTLWRRNAGGDPVCNACGLYYKLHNVNRP 270
Score = 42.0 bits (97), Expect = 0.047, Method: Composition-based stats.
Identities = 22/50 (44%), Positives = 27/50 (54%), Gaps = 1/50 (2%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
C NC T T LWRR+ +G +CNACGL N P I P +IS+
Sbjct: 180 CVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRP-LIRPKKRLIISK 228
>gi|301753709|ref|XP_002912716.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Ailuropoda melanoleuca]
Length = 478
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 320 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 363
>gi|301630601|ref|XP_002944405.1| PREDICTED: GATA-binding factor 1-B-like [Xenopus (Silurana)
tropicalis]
Length = 363
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/44 (56%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G C+NC T+ TTLWRRN +G+PVCNACGL ++N P
Sbjct: 227 SKRAGTQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVNRP 270
Score = 41.6 bits (96), Expect = 0.068, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T T LWRR+ +G +CNACGL N P
Sbjct: 180 CVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRP 216
>gi|332225943|ref|XP_003262148.1| PREDICTED: transcription factor GATA-6-like [Nomascus leucogenys]
Length = 476
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 318 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 361
Score = 38.9 bits (89), Expect = 0.40, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 271 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 307
>gi|181337117|ref|NP_032119.2| transcription factor GATA-5 [Mus musculus]
gi|341940731|sp|P97489.2|GATA5_MOUSE RecName: Full=Transcription factor GATA-5; AltName:
Full=GATA-binding factor 5
gi|74204896|dbj|BAE20945.1| unnamed protein product [Mus musculus]
gi|74209194|dbj|BAE24979.1| unnamed protein product [Mus musculus]
gi|148675383|gb|EDL07330.1| GATA binding protein 5 [Mus musculus]
Length = 404
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 26/46 (56%), Positives = 33/46 (71%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++RSG+ C+NC T TTLWRRN+ GEPVCNACGL M + P
Sbjct: 241 SSSRRSGLCCSNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVPRP 286
Score = 43.1 bits (100), Expect = 0.023, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 193 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 232
>gi|410953434|ref|XP_003983375.1| PREDICTED: transcription factor GATA-5 [Felis catus]
Length = 391
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 34/46 (73%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T ++R+G+ C+NC TT TTLWRRN +GEPVCNACGL M + P
Sbjct: 227 TSSRRAGLCCSNCHTTNTTLWRRNADGEPVCNACGLYMKLHGVPRP 272
Score = 42.4 bits (98), Expect = 0.037, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 179 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 218
>gi|1841756|gb|AAB47506.1| GATA-5 cardiac transcription factor [Mus musculus]
Length = 404
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 26/46 (56%), Positives = 33/46 (71%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++RSG+ C+NC T TTLWRRN+ GEPVCNACGL M + P
Sbjct: 241 SSSRRSGLCCSNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVPRP 286
Score = 43.1 bits (100), Expect = 0.023, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 193 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 232
>gi|225544236|gb|ACN91538.1| pnr [Rhagoletis juglandis]
Length = 556
Score = 62.4 bits (150), Expect = 4e-08, Method: Composition-based stats.
Identities = 27/46 (58%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
T +R G+ C NC T TTLWRRNN GEPVCNACGL +N P
Sbjct: 159 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 204
Score = 42.7 bits (99), Expect = 0.028, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P ++ P + ++S
Sbjct: 108 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LIKPSKRLVS 157
>gi|1289502|gb|AAC52841.1| transcription factor GATA-6, partial [Mus musculus]
Length = 205
Score = 62.4 bits (150), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 48 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 91
Score = 39.3 bits (90), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 1 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNALSRPLIKPQKRVP 46
>gi|410920497|ref|XP_003973720.1| PREDICTED: GATA-binding factor 2-like [Takifugu rubripes]
Length = 421
Score = 62.4 bits (150), Expect = 4e-08, Method: Composition-based stats.
Identities = 27/44 (61%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G CANC T+ TTLWRRN +GEPVCNACGL ++N P
Sbjct: 286 SKRAGTLCANCHTSTTTLWRRNASGEPVCNACGLYFKLHNVNRP 329
Score = 41.6 bits (96), Expect = 0.065, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDAL 74
C NC T T LWRR+ G +CNACGL N P I P ++S+ L
Sbjct: 239 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP-LIRPKKRLIVSKRAGTL 292
>gi|342671958|dbj|BAK57317.1| transcription factor GATA6 [Ovis aries]
Length = 217
Score = 62.4 bits (150), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 59 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 102
Score = 38.5 bits (88), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 12 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 57
>gi|170043918|ref|XP_001849614.1| GATAa2 transcription factor [Culex quinquefasciatus]
gi|167867189|gb|EDS30572.1| GATAa2 transcription factor [Culex quinquefasciatus]
Length = 496
Score = 62.4 bits (150), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R G+ C+NC+TT T+LWRRN GEPVCNACGL ++N P
Sbjct: 199 RRLSSARRVGLQCSNCNTTNTSLWRRNQVGEPVCNACGLYYKLHNVNRP 247
Score = 43.1 bits (100), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 154 GRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRP 193
>gi|1699253|gb|AAB37426.1| zinc finger transcription factor [Mus sp.]
Length = 444
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 286 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 329
Score = 38.9 bits (89), Expect = 0.45, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 239 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 275
>gi|392900106|ref|NP_001255402.1| Protein ELT-1, isoform d [Caenorhabditis elegans]
gi|379657042|emb|CCG28208.1| Protein ELT-1, isoform d [Caenorhabditis elegans]
Length = 487
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 29/49 (59%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ Q KR+G+ C NC T TTLWRRN G PVCNACGL + P
Sbjct: 331 KRQNAQKRTGIECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVERP 379
Score = 38.5 bits (88), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC T LWRR+ +G +CNACGL
Sbjct: 289 CVNCGVHNTPLWRRDGSGNYLCNACGLYF 317
>gi|111494048|gb|AAI05655.1| Gata5 protein [Mus musculus]
Length = 271
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ + ++RSG+ C+NC T TTLWRRN+ GEPVCNACGL M + P
Sbjct: 105 KRLSSSRRSGLCCSNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVPRP 153
Score = 43.1 bits (100), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 60 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 99
>gi|449494011|ref|XP_002194999.2| PREDICTED: transcription factor GATA-6 [Taeniopygia guttata]
Length = 390
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 231 SRRMGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 274
Score = 38.9 bits (89), Expect = 0.44, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 184 CVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRP 220
>gi|5882288|gb|AAD55267.1|AF179425_1 transcription factor GATA-6 [Mus musculus]
Length = 443
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 285 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 328
Score = 38.5 bits (88), Expect = 0.46, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 238 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 274
>gi|1078990|pir||S53812 BmGATA beta isoform 3 - silkworm (fragment)
gi|603166|gb|AAA67887.1| BmGATA beta isoform 3, partial [Bombyx mori]
Length = 241
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 34/56 (60%)
Query: 1 MILTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++LT R+ KR G C NC TT T+LWRRN GE VCNACGL ++N P
Sbjct: 36 ILLTTSRRKGMAAKRPGTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRP 91
>gi|312084746|ref|XP_003144400.1| hypothetical protein LOAG_08822 [Loa loa]
Length = 149
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 33/47 (70%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ KR+G+ C NC T TTLWRRN+ G+PVCNACGL +I+ P
Sbjct: 30 QSAQKRTGIECVNCKTNNTTLWRRNSLGQPVCNACGLYHKLHNISRP 76
>gi|11875205|ref|NP_062058.1| transcription factor GATA-6 [Rattus norvegicus]
gi|1235614|gb|AAA92577.1| DNA binding protein [Rattus norvegicus]
Length = 441
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 285 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 328
Score = 38.5 bits (88), Expect = 0.46, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 238 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 274
>gi|444707682|gb|ELW48920.1| Transcription factor GATA-5 [Tupaia chinensis]
Length = 242
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 25/44 (56%), Positives = 33/44 (75%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R+G+ C NC T+CTTLWRR+ +GEPVCNACGL M + P
Sbjct: 74 SRRAGLCCTNCHTSCTTLWRRSADGEPVCNACGLYMKLHGVPRP 117
Score = 42.4 bits (98), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 24 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKVNGVNRP 63
>gi|403265451|ref|XP_003924952.1| PREDICTED: transcription factor GATA-6-like [Saimiri boliviensis
boliviensis]
Length = 452
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 294 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 337
Score = 37.4 bits (85), Expect = 1.2, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR G +CNACGL ++ P
Sbjct: 247 CVNCGSIQTPLWRRGGPGHYLCNACGLYSKMNGLSRP 283
>gi|74101554|gb|AAZ99709.1| GATA-6, partial [Bos taurus]
Length = 179
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/43 (65%), Positives = 31/43 (72%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 48 RRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 90
Score = 35.8 bits (81), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 4/44 (9%)
Query: 22 NCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 2 NCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 45
>gi|111493951|gb|AAI05654.1| Gata5 protein [Mus musculus]
Length = 350
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 26/46 (56%), Positives = 33/46 (71%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++RSG+ C+NC T TTLWRRN+ GEPVCNACGL M + P
Sbjct: 187 SSSRRSGLCCSNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVPRP 232
Score = 42.7 bits (99), Expect = 0.028, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 139 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 178
>gi|126321769|ref|XP_001363044.1| PREDICTED: transcription factor GATA-6 [Monodelphis domestica]
Length = 590
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 28/43 (65%), Positives = 31/43 (72%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 433 RRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 475
Score = 38.9 bits (89), Expect = 0.44, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 385 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 421
>gi|477758|pir||B48099 transcription factor GATA-2, retinoic acid-inducible - mouse
(fragment)
Length = 118
Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 32/43 (74%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R+G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 53 RRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 95
Score = 41.6 bits (96), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 21/41 (51%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL N P
Sbjct: 1 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 41
>gi|157169243|gb|ABV25956.1| GATA-binding transcription factor B3 [Capitella teleta]
Length = 372
Score = 62.0 bits (149), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 32/45 (71%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+ CANC T+ TTLWRRN +GEPVCNACGL ++ P
Sbjct: 208 ASRRMGLQCANCQTSTTTLWRRNADGEPVCNACGLYFKLHGVHRP 252
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESM 66
G C NC T LWRR+ G +CNACGL +N P +M P + M
Sbjct: 159 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGLNRP--MMKPQKRM 206
>gi|37595582|gb|AAQ94635.1| Gata6 protein [Danio rerio]
gi|111185536|gb|AAH67710.2| Gata6 protein [Danio rerio]
Length = 501
Score = 62.0 bits (149), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ + ++R G+SCANC T+ TTLWRRN GEPVCNACGL + P
Sbjct: 339 KRMSSSRRIGLSCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVPRP 387
Score = 40.0 bits (92), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 297 CVNCGSMSTPLWRRDGTGHFLCNACGLYSKMNGLSRP 333
>gi|195501117|ref|XP_002097665.1| GE24357 [Drosophila yakuba]
gi|194183766|gb|EDW97377.1| GE24357 [Drosophila yakuba]
Length = 1272
Score = 62.0 bits (149), Expect = 5e-08, Method: Composition-based stats.
Identities = 27/49 (55%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +KR+G+SC+NC+TT T+LWRRN GEPVCNACGL + P
Sbjct: 795 RRLSASKRAGLSCSNCNTTHTSLWRRNPAGEPVCNACGLYFKLHSVPRP 843
>gi|324507577|gb|ADY43212.1| Transcription factor elt-2 [Ascaris suum]
Length = 502
Score = 61.6 bits (148), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 32/48 (66%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ + +R G+ CANC T TTLWRRN GEPVCNACGL +++ P
Sbjct: 284 QSSSQRRQGLVCANCHGTNTTLWRRNAEGEPVCNACGLYFKLHNVHRP 331
>gi|15824330|gb|AAL09304.1|AF295687_1 transcription factor GATA-6 [Sus scrofa]
Length = 446
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 288 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 331
Score = 37.0 bits (84), Expect = 1.4, Method: Composition-based stats.
Identities = 15/37 (40%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CN CGL ++ P
Sbjct: 241 CVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNGLSGP 277
>gi|215274026|sp|Q95JA5.2|GATA6_PIG RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
Length = 451
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 293 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 336
Score = 37.0 bits (84), Expect = 1.4, Method: Composition-based stats.
Identities = 15/37 (40%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CN CGL ++ P
Sbjct: 246 CVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNGLSGP 282
>gi|159884077|gb|ABX00717.1| GH11649p [Drosophila melanogaster]
Length = 1009
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 26/39 (66%), Positives = 32/39 (82%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL
Sbjct: 813 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGL 851
>gi|18858739|ref|NP_571632.1| transcription factor GATA-6 [Danio rerio]
gi|6524990|gb|AAF15276.1|AF191578_1 zinc finger transcription factor Gata6 [Danio rerio]
Length = 383
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ + ++R G+SCANC T+ TTLWRRN GEPVCNACGL + P
Sbjct: 226 KRMSSSRRIGLSCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVPRP 274
Score = 40.0 bits (92), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 2/56 (3%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALS 75
C NC + T LWRR+ G +CNACGL ++ P ++ P + M S LS
Sbjct: 184 CVNCGSMSTPLWRRDGTGHFLCNACGLYSKMNGLSRP--LIKPQKRMSSSRRIGLS 237
>gi|1655915|gb|AAC50941.1| hGATA-6 [Homo sapiens]
gi|2506076|dbj|BAA22621.1| GATA-6 [Homo sapiens]
Length = 449
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 291 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 334
Score = 38.5 bits (88), Expect = 0.56, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 244 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 280
>gi|1877210|emb|CAA64997.1| GATA-6 DNA binding protein [Homo sapiens]
Length = 449
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 291 SRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 334
Score = 38.5 bits (88), Expect = 0.56, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 244 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRP 280
>gi|393910073|gb|EJD75720.1| hypothetical protein LOAG_17191 [Loa loa]
Length = 395
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 26/49 (53%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ Q+ KR+G+ C NC T TTLWRRN+ G+PVCNACGL +I+ P
Sbjct: 274 KRQSAQKRTGIECVNCKTNNTTLWRRNSLGQPVCNACGLYHKLHNISRP 322
Score = 40.8 bits (94), Expect = 0.11, Method: Composition-based stats.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 1/50 (2%)
Query: 8 REQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R +T N S G C NC T LWRR+ G +CNACGL N P
Sbjct: 219 RSKTRNFTSEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRP 268
>gi|323319543|gb|ADX36142.1| GATA456b [Schmidtea polychroa]
Length = 363
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%), Gaps = 1/49 (2%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R+QT KR+G+ C+NC T TTLWRRN+ G+PVCNACGL + P
Sbjct: 220 RKQTA-KRTGLQCSNCKTENTTLWRRNSEGQPVCNACGLYYRLHKTHRP 267
>gi|33416800|gb|AAH55963.1| Gata-5b protein [Xenopus laevis]
Length = 388
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ + ++R+G+ C NC T+ TTLWRRN+ GEPVCNACGL M + P
Sbjct: 226 KRLSSSRRAGLCCTNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVPRP 274
Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 181 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 220
>gi|7649393|emb|CAB88873.1| GATA transcription factor [Aedes aegypti]
Length = 868
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 25/50 (50%), Positives = 35/50 (70%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++E + +R G+ C+NC+T T+LWRRN GEPVCNACGL ++N P
Sbjct: 664 VKEPSSARRVGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRP 713
Score = 42.0 bits (97), Expect = 0.053, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 616 GRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRP 655
>gi|347967445|ref|XP_307946.4| AGAP002238-PA [Anopheles gambiae str. PEST]
gi|333466292|gb|EAA03724.4| AGAP002238-PA [Anopheles gambiae str. PEST]
Length = 542
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R G+ C+NC+TT T+LWRRN GEPVCNACGL ++N P
Sbjct: 191 RRLSSARRVGLQCSNCNTTNTSLWRRNQVGEPVCNACGLYYKLHNVNRP 239
Score = 42.7 bits (99), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 146 GRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRP 185
>gi|441492|emb|CAA53807.1| a box binding factor (ABF), also called dGATA-b [Drosophila
melanogaster]
Length = 779
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 26/39 (66%), Positives = 32/39 (82%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL
Sbjct: 307 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGL 345
>gi|66730290|ref|NP_001019487.1| transcription factor GATA-5 [Rattus norvegicus]
gi|55250430|gb|AAH85855.1| GATA binding protein 5 [Rattus norvegicus]
gi|149034029|gb|EDL88812.1| similar to Transcription factor GATA-5 (GATA binding factor-5)
[Rattus norvegicus]
Length = 404
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 26/46 (56%), Positives = 32/46 (69%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++RSG+ C+NC T TTLWRRN GEPVCNACGL M + P
Sbjct: 241 SSSRRSGLCCSNCHTATTTLWRRNAEGEPVCNACGLYMKLHGVPRP 286
Score = 43.1 bits (100), Expect = 0.024, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 193 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 232
>gi|12641861|dbj|BAB21551.1| dGATAe [Drosophila melanogaster]
Length = 571
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 28/36 (77%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+R+GV+CANC T TTLWRRNN G PVCNACGL
Sbjct: 533 RRNGVTCANCQTNSTTLWRRNNEGNPVCNACGLYYK 568
>gi|148230811|ref|NP_001079831.1| GATA-binding factor 5-B [Xenopus laevis]
gi|1169807|sp|P43696.1|GAT5B_XENLA RecName: Full=GATA-binding factor 5-B; AltName: Full=Transcription
factor xGATA-5B
gi|311052|gb|AAA63687.1| transcription factor xGATA-5b [Xenopus laevis]
Length = 388
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ + ++R+G+ C NC T+ TTLWRRN+ GEPVCNACGL M + P
Sbjct: 226 KRLSSSRRAGLCCTNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVPRP 274
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 181 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 220
>gi|122892555|gb|ABM67327.1| GATA4 [Hylobates klossii]
Length = 42
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/39 (69%), Positives = 31/39 (79%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+ ++R G+SCANC TT TTLWRRN GEPVCNACGL M
Sbjct: 1 SASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMK 39
>gi|355564543|gb|EHH21043.1| hypothetical protein EGK_04020 [Macaca mulatta]
Length = 480
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 26/40 (65%), Positives = 29/40 (72%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G CANC TT TTLWRRN NG+PVCNACGL ++N P
Sbjct: 346 GTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRP 385
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + + G C NC T T LWRR+ G +CNACGL N P
Sbjct: 283 RSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 331
>gi|332029519|gb|EGI69408.1| GATA-binding factor C [Acromyrmex echinatior]
Length = 67
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 26/45 (57%), Positives = 33/45 (73%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDIN 54
Q+ +R+G SCANC T TTLWRRN +GEPVCNACGL +++
Sbjct: 20 QSAARRAGTSCANCKTATTTLWRRNQSGEPVCNACGLYYKLHNVS 64
>gi|57547700|gb|AAW52540.1| GATA transcription factor pannier-like protein [Schistocerca
americana]
Length = 88
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 32/49 (65%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ T +R G+ C NC T TTLWRRNN+GEPVCNACGL +N P
Sbjct: 38 KRLTATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRP 86
>gi|195389408|ref|XP_002053369.1| GJ23376 [Drosophila virilis]
gi|194151455|gb|EDW66889.1| GJ23376 [Drosophila virilis]
Length = 1236
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 25/49 (51%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R+G+SC+NC T+ T+LWRRN GEPVCNACGL ++ P
Sbjct: 700 RRLSASRRAGMSCSNCQTSYTSLWRRNPGGEPVCNACGLYYKLHNVVRP 748
>gi|1556394|emb|CAA68943.1| GATA factor [Drosophila melanogaster]
Length = 949
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 26/39 (66%), Positives = 32/39 (82%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL
Sbjct: 477 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGL 515
>gi|17862982|gb|AAL39968.1| SD07261p [Drosophila melanogaster]
gi|220942384|gb|ACL83735.1| srp-PD [synthetic construct]
gi|220952630|gb|ACL88858.1| srp-PD [synthetic construct]
Length = 950
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 26/39 (66%), Positives = 32/39 (82%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL
Sbjct: 477 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGL 515
>gi|354481949|ref|XP_003503163.1| PREDICTED: transcription factor GATA-5 [Cricetulus griseus]
gi|344254951|gb|EGW11055.1| Transcription factor GATA-5 [Cricetulus griseus]
Length = 406
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 26/46 (56%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +RSG+ C+NC T TTLWRRN GEPVCNACGL M + P
Sbjct: 242 SSTRRSGLCCSNCHTATTTLWRRNAEGEPVCNACGLYMKLHGVPRP 287
Score = 43.1 bits (100), Expect = 0.024, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 194 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 233
>gi|45551908|ref|NP_732100.2| serpent, isoform A [Drosophila melanogaster]
gi|73920793|sp|P52172.2|SRP_DROME RecName: Full=Box A-binding factor; Short=ABF; AltName:
Full=GATA-binding factor B; AltName: Full=Protein
serpent; AltName: Full=Transcription factor GATA-B;
AltName: Full=dGATA-B
gi|45446505|gb|AAF55261.2| serpent, isoform A [Drosophila melanogaster]
Length = 1264
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 26/39 (66%), Positives = 32/39 (82%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL
Sbjct: 791 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGL 829
>gi|391332619|ref|XP_003740730.1| PREDICTED: GATA-binding factor 5-A-like [Metaseiulus occidentalis]
Length = 158
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R+G++C+NC TT T+LWRRNN GEPVCNACGL + P
Sbjct: 61 RRLSASRRAGLTCSNCQTTNTSLWRRNNVGEPVCNACGLYFRLHGVPRP 109
Score = 39.7 bits (91), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 18/32 (56%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
G C NC T LWRR+ G +CNACGL
Sbjct: 15 EGRECVNCGAISTPLWRRDGTGHYLCNACGLY 46
>gi|195570486|ref|XP_002103238.1| GD19064 [Drosophila simulans]
gi|194199165|gb|EDX12741.1| GD19064 [Drosophila simulans]
Length = 696
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 26/39 (66%), Positives = 32/39 (82%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL
Sbjct: 569 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGL 607
>gi|195111294|ref|XP_002000214.1| GI22647 [Drosophila mojavensis]
gi|193916808|gb|EDW15675.1| GI22647 [Drosophila mojavensis]
Length = 1005
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 24/39 (61%), Positives = 32/39 (82%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + ++R+G+SC+NC T+ T+LWRRN GEPVCNACGL
Sbjct: 485 RRLSASRRAGMSCSNCQTSYTSLWRRNPGGEPVCNACGL 523
>gi|194901182|ref|XP_001980131.1| GG16969 [Drosophila erecta]
gi|190651834|gb|EDV49089.1| GG16969 [Drosophila erecta]
Length = 1244
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 26/39 (66%), Positives = 32/39 (82%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL
Sbjct: 781 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGL 819
>gi|348501136|ref|XP_003438126.1| PREDICTED: transcription factor GATA-6-like [Oreochromis niloticus]
Length = 491
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 31/44 (70%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC T+ TTLWRRN GEPVCNACGL + P
Sbjct: 338 SRRIGLSCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVPRP 381
Score = 43.5 bits (101), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 27/50 (54%), Gaps = 1/50 (2%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
C NC + T LWRR+ G +CNACGL ++ P I PP + SR
Sbjct: 291 CVNCGSISTPLWRRDGTGHFLCNACGLYSKMNGLSRP-LIKPPKRTSTSR 339
>gi|303387041|gb|ADM15548.1| GATA-1, partial [Carassius auratus langsdorfii]
Length = 214
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 25/37 (67%), Positives = 29/37 (78%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+KR+G CANC T+ TTLWRRN +GEPVCNACGL
Sbjct: 178 SKRAGTQCANCHTSTTTLWRRNASGEPVCNACGLYFK 214
Score = 42.7 bits (99), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T T LWRR+ G +CNACGL N P
Sbjct: 131 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 167
>gi|443718380|gb|ELU09032.1| hypothetical protein CAPTEDRAFT_121659, partial [Capitella
teleta]
Length = 144
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++R G+ CANC T+ TTLWRRN +GEPVCNACGL ++ P
Sbjct: 49 KRMCASRRMGLQCANCQTSTTTLWRRNADGEPVCNACGLYFKLHGVHRP 97
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 2/53 (3%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T LWRR+ G +CNACGL +N P +M P + M +
Sbjct: 3 EGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGLNRP--MMKPQKRMCA 53
>gi|256088981|ref|XP_002580598.1| GATA binding factor-1b (transcription factor xgata-1b) [Schistosoma
mansoni]
gi|360042778|emb|CCD78188.1| putative gata binding factor [Schistosoma mansoni]
Length = 919
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 31/41 (75%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
R+ N+R G+ C+NC TT TTLWRRN +GEPVCNACGL
Sbjct: 517 RKSQTNRRIGLICSNCETTKTTLWRRNLDGEPVCNACGLYQ 557
>gi|195328569|ref|XP_002030987.1| GM24275 [Drosophila sechellia]
gi|194119930|gb|EDW41973.1| GM24275 [Drosophila sechellia]
Length = 1238
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 26/39 (66%), Positives = 32/39 (82%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +KR+G+SC+NC TT T+LWRRN GEPVCNACGL
Sbjct: 779 RRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGL 817
>gi|347967443|ref|XP_003436067.1| AGAP002238-PB [Anopheles gambiae str. PEST]
gi|333466293|gb|EGK96194.1| AGAP002238-PB [Anopheles gambiae str. PEST]
Length = 1157
Score = 60.8 bits (146), Expect = 9e-08, Method: Composition-based stats.
Identities = 26/49 (53%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R G+ C+NC+TT T+LWRRN GEPVCNACGL ++N P
Sbjct: 806 RRLSSARRVGLQCSNCNTTNTSLWRRNQVGEPVCNACGLYYKLHNVNRP 854
Score = 41.6 bits (96), Expect = 0.069, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 761 GRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRP 800
>gi|357619365|gb|EHJ71973.1| hypothetical protein KGM_02427 [Danaus plexippus]
Length = 355
Score = 60.8 bits (146), Expect = 9e-08, Method: Composition-based stats.
Identities = 27/43 (62%), Positives = 29/43 (67%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R G SC NC + TTLWRRNN GEPVCNACGL IN P
Sbjct: 130 RRHGQSCTNCGSRNTTLWRRNNEGEPVCNACGLYYKLHGINRP 172
Score = 43.1 bits (100), Expect = 0.021, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 22/40 (55%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR++ G +CNACGL +N P
Sbjct: 79 GRECVNCGANNTPLWRRDSTGHYLCNACGLYHKINGVNRP 118
>gi|410923801|ref|XP_003975370.1| PREDICTED: GATA-binding factor 6-A-like [Takifugu rubripes]
Length = 494
Score = 60.8 bits (146), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 31/44 (70%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC T+ TTLWRRN GEPVCNACGL + P
Sbjct: 342 SRRIGLSCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVPRP 385
Score = 40.4 bits (93), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 295 CVNCGSISTPLWRRDGTGHFLCNACGLYSKMNGLSRP 331
>gi|189237326|ref|XP_973133.2| PREDICTED: similar to GATA transcription factor [Tribolium
castaneum]
Length = 368
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G++C NC T+ T+LWRRN GEPVCNACGL +N P
Sbjct: 164 RRLSASRRVGLTCTNCHTSTTSLWRRNTVGEPVCNACGLYFKLHGVNRP 212
Score = 41.6 bits (96), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 22/45 (48%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMP 61
G C NC T LWRR+ G +CNACGL +N P P
Sbjct: 119 GRECVNCGAIDTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP 163
>gi|260782140|ref|XP_002586149.1| hypothetical protein BRAFLDRAFT_255109 [Branchiostoma floridae]
gi|229271242|gb|EEN42160.1| hypothetical protein BRAFLDRAFT_255109 [Branchiostoma floridae]
Length = 107
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 34/47 (72%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDIN 54
R + ++R G+ CANC TT TTLWRRNN GEPVCNACGL +++
Sbjct: 48 RRLSASRRVGLQCANCRTTQTTLWRRNNEGEPVCNACGLYYKLHNVS 94
Score = 43.1 bits (100), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T T LWRR+ G +CNACGL +N P
Sbjct: 3 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 42
>gi|33439247|gb|AAQ18782.1| GATA transcription factor [Haemonchus contortus]
Length = 417
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 30/47 (63%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q +R G+ C+NC T TTLWRRN GEPVCNACGL ++ P
Sbjct: 233 QNSQRRQGLVCSNCGGTNTTLWRRNAEGEPVCNACGLYYKLHNVQRP 279
>gi|33439249|gb|AAQ18783.1| GATA transcription factor [Haemonchus contortus]
Length = 417
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 30/47 (63%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q +R G+ C+NC T TTLWRRN GEPVCNACGL ++ P
Sbjct: 233 QNSQRRQGLVCSNCGGTNTTLWRRNAEGEPVCNACGLYYKLHNVQRP 279
>gi|358339242|dbj|GAA47341.1| GATA-binding protein 4/5/6 [Clonorchis sinensis]
Length = 725
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 30/45 (66%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
N+R G C NC+T+ TTLWRRN G+PVCNACGL +N P
Sbjct: 581 ANRRPGQICTNCNTSATTLWRRNAEGDPVCNACGLYYKLHKVNRP 625
>gi|58332620|ref|NP_001011384.1| GATA binding protein 5 [Xenopus (Silurana) tropicalis]
gi|56789762|gb|AAH88567.1| GATA binding protein 5 [Xenopus (Silurana) tropicalis]
Length = 389
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 33/46 (71%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++R+G+ C NC T+ TTLWRRN+ GEPVCNACGL M + P
Sbjct: 229 SSSRRAGLCCTNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVPRP 274
Score = 43.1 bits (100), Expect = 0.020, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 181 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 220
>gi|147906903|ref|NP_001081962.1| GATA-binding factor 5-A [Xenopus laevis]
gi|1169806|sp|P43695.1|GAT5A_XENLA RecName: Full=GATA-binding factor 5-A; AltName: Full=Transcription
factor xGATA-5A
gi|311050|gb|AAA63686.1| transcription factor xGATA-5a [Xenopus laevis]
gi|54648481|gb|AAH84985.1| GATA-5a protein [Xenopus laevis]
Length = 390
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 33/46 (71%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++R+G+ C NC T+ TTLWRRN+ GEPVCNACGL M + P
Sbjct: 228 SSSRRAGLCCTNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVPRP 273
Score = 42.0 bits (97), Expect = 0.044, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 180 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGMNRP 219
>gi|321467981|gb|EFX78968.1| hypothetical protein DAPPUDRAFT_52915 [Daphnia pulex]
Length = 108
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R G+ CANC+TT T+LWRRNN GE VCNACGL +N P
Sbjct: 48 RRLSTTRRLGLRCANCATTTTSLWRRNNQGETVCNACGLYFKLHGVNRP 96
Score = 38.9 bits (89), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 19/31 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
G C NC + T LWRR+ G +CNACGL
Sbjct: 2 GRECVNCGSISTPLWRRDGTGHYLCNACGLY 32
>gi|47206170|emb|CAF89866.1| unnamed protein product [Tetraodon nigroviridis]
Length = 306
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 25/41 (60%), Positives = 30/41 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
R ++R+G+ C NC T+ TTLWRRN GEPVCNACGL M
Sbjct: 240 RRPAPSRRAGLCCTNCGTSTTTLWRRNAEGEPVCNACGLYM 280
Score = 42.0 bits (97), Expect = 0.052, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC + T LWRR+ G +CNACGL N P
Sbjct: 195 GRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGSNRP 234
>gi|193683287|ref|XP_001943242.1| PREDICTED: hypothetical protein LOC100168211 [Acyrthosiphon pisum]
Length = 888
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 25/49 (51%), Positives = 30/49 (61%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R Q ++R G+ C NC T T+LWRRN GEPVCNACGL + P
Sbjct: 711 RMQNASRRQGLQCTNCQTATTSLWRRNQVGEPVCNACGLYFKLHGVKRP 759
>gi|301630516|ref|XP_002944363.1| PREDICTED: GATA-binding factor 5-B-like, partial [Xenopus
(Silurana) tropicalis]
Length = 217
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 33/46 (71%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++R+G+ C NC T+ TTLWRRN+ GEPVCNACGL M + P
Sbjct: 57 SSSRRAGLCCTNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVPRP 102
Score = 43.5 bits (101), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 9 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 48
>gi|195036114|ref|XP_001989516.1| GH18757 [Drosophila grimshawi]
gi|193893712|gb|EDV92578.1| GH18757 [Drosophila grimshawi]
Length = 978
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 25/49 (51%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R+G+SC+NC T+ T+LWRRN GEPVCNACGL ++ P
Sbjct: 497 RRLSASRRAGMSCSNCQTSYTSLWRRNPGGEPVCNACGLYFKLHNVVRP 545
>gi|402589624|gb|EJW83555.1| hypothetical protein WUBG_05536, partial [Wuchereria bancrofti]
Length = 119
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 32/46 (69%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ KR+G+ C NC T TTLWRRN+ G+PVCNACGL +I+ P
Sbjct: 1 SAQKRTGIECVNCKTNNTTLWRRNSLGQPVCNACGLYHKLHNISRP 46
>gi|195450286|ref|XP_002072447.1| GK22843 [Drosophila willistoni]
gi|194168532|gb|EDW83433.1| GK22843 [Drosophila willistoni]
Length = 1047
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/49 (53%), Positives = 35/49 (71%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++R+G+SC+NC TT T+LWRRN +GEPVCNACGL ++ P
Sbjct: 710 RRLNASRRAGLSCSNCLTTYTSLWRRNPSGEPVCNACGLYFKLHNVARP 758
>gi|380861645|gb|AFF18488.1| GATA456 [Schmidtea mediterranea]
Length = 529
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%), Gaps = 1/40 (2%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+Q G KRS + C+NC T T+LWRRNN GEPVCNACGL
Sbjct: 320 QQYG-KRSDLECSNCKITKTSLWRRNNEGEPVCNACGLYY 358
>gi|323319541|gb|ADX36141.1| GATA456a [Schmidtea polychroa]
Length = 528
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%), Gaps = 1/40 (2%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+Q G KRS + C+NC T T+LWRRNN GEPVCNACGL
Sbjct: 320 QQYG-KRSDLECSNCKITKTSLWRRNNEGEPVCNACGLYY 358
>gi|432917241|ref|XP_004079469.1| PREDICTED: transcription factor GATA-6-like [Oryzias latipes]
Length = 501
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 25/39 (64%), Positives = 31/39 (79%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
+ + ++R G+SCANC T+ TTLWRRN GEPVCNACGL
Sbjct: 346 KRTSTSRRIGLSCANCQTSTTTLWRRNAEGEPVCNACGL 384
Score = 39.3 bits (90), Expect = 0.32, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC + T LWRR+ G +CNACGL ++ P
Sbjct: 304 CVNCGSISTPLWRRDGTGHFLCNACGLYSKMNGLSRP 340
>gi|47223008|emb|CAG07095.1| unnamed protein product [Tetraodon nigroviridis]
Length = 225
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 31/44 (70%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC T+ TTLWRRN GEPVCNACGL + P
Sbjct: 131 SRRIGLSCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVPRP 174
Score = 39.7 bits (91), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 26/50 (52%), Gaps = 1/50 (2%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
C NC + T LWRR+ G +CNACGL ++ P I P + SR
Sbjct: 84 CVNCGSISTPLWRRDGTGHFLCNACGLYSKMNGLSRPL-IKPQKRTSTSR 132
>gi|358335481|dbj|GAA54062.1| endothelial transcription factor GATA-2 [Clonorchis sinensis]
Length = 412
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 3/78 (3%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMI 67
R + ++SG CANC T TTLWRRN +G+ VCNACGL IN P + I I
Sbjct: 110 RRMSATRKSGTICANCRTAHTTLWRRNQHGDSVCNACGLYYKLHHINRPLSMKKEI---I 166
Query: 68 SRNGDALSVITSTPNIHH 85
L+ ++HH
Sbjct: 167 QTRNRRLTQGKKRKDMHH 184
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%)
Query: 3 LTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
L +R ++ N G C NC T T LWRR+ G +CNACGL
Sbjct: 51 LASKLRNKSRNLTDGRECVNCGATQTPLWRRDETGHYLCNACGLY 95
>gi|360042733|emb|CCD78143.1| putative gata binding factor [Schistosoma mansoni]
Length = 565
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 31/49 (63%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + N++ G CANC T+ TTLWRRN G+ VCNACGL IN P
Sbjct: 331 RRISANRKLGTFCANCRTSHTTLWRRNQQGDSVCNACGLYYKLHHINRP 379
Score = 43.9 bits (102), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%)
Query: 3 LTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
L +R +T G C NC T T LWRR+ G +CNACGL N P
Sbjct: 272 LANKLRNKTKIITDGRECVNCGATQTPLWRRDEAGHYLCNACGLYHKMNGTNRP 325
>gi|256073972|ref|XP_002573301.1| endothelial transcription factor GATA-2 [Schistosoma mansoni]
Length = 565
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 31/49 (63%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + N++ G CANC T+ TTLWRRN G+ VCNACGL IN P
Sbjct: 331 RRISANRKLGTFCANCRTSHTTLWRRNQQGDSVCNACGLYYKLHHINRP 379
Score = 43.9 bits (102), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%)
Query: 3 LTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
L +R +T G C NC T T LWRR+ G +CNACGL N P
Sbjct: 272 LANKLRNKTKIITDGRECVNCGATQTPLWRRDEAGHYLCNACGLYHKMNGTNRP 325
>gi|339236827|ref|XP_003379968.1| erythroid transcription factor [Trichinella spiralis]
gi|316977299|gb|EFV60416.1| erythroid transcription factor [Trichinella spiralis]
Length = 321
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/42 (59%), Positives = 29/42 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+ T +KR G+ C NC T TTLWRRN NG+PVCNACGL
Sbjct: 238 KTATTSKRQGLVCVNCQTMQTTLWRRNQNGDPVCNACGLYFK 279
>gi|256085819|ref|XP_002579109.1| GATA binding factor-1b (transcription factor xgata-1b) [Schistosoma
mansoni]
gi|360044426|emb|CCD81974.1| putative gata zinc finger domain-containing protein [Schistosoma
mansoni]
Length = 227
Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 27/35 (77%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+RSG C NC+T+ TTLWRRN GEPVCNACGL
Sbjct: 128 RRSGQFCTNCNTSATTLWRRNTEGEPVCNACGLYY 162
>gi|157125534|ref|XP_001654374.1| transcription factor GATA-4 (GATA binding factor-4) [Aedes aegypti]
gi|108873609|gb|EAT37834.1| AAEL010222-PB [Aedes aegypti]
Length = 1003
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 25/49 (51%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R G+ C+NC+T T+LWRRN GEPVCNACGL ++N P
Sbjct: 633 RRLSSARRVGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRP 681
>gi|41059166|gb|AAR99325.1| GATA transcription factor [Aedes aegypti]
Length = 1003
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 25/49 (51%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + +R G+ C+NC+T T+LWRRN GEPVCNACGL ++N P
Sbjct: 633 RRLSSARRVGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRP 681
>gi|111493996|gb|AAI05656.1| Gata5 protein [Mus musculus]
Length = 320
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 32/46 (69%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++RSG+ C+NC T TTLWRRN+ G PVCNACGL M + P
Sbjct: 157 SSSRRSGLCCSNCHTATTTLWRRNSEGGPVCNACGLYMKLHGVPRP 202
Score = 42.4 bits (98), Expect = 0.034, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 109 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 148
>gi|357609507|gb|EHJ66487.1| BmGATA beta isoform 2 - silkworm [Danaus plexippus]
Length = 267
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 30/52 (57%), Positives = 33/52 (63%), Gaps = 1/52 (1%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
R G KR GV C+NC TT TTLWRRN GE VCNACGL IN P+ +
Sbjct: 101 RPSMGTKRQGV-CSNCETTITTLWRRNPLGENVCNACGLYFKLHGINRPKNM 151
Score = 43.1 bits (100), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 27/55 (49%), Gaps = 2/55 (3%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRN 70
G C NC T LWRR+ G +CNACGL +N P K PP + R+
Sbjct: 7 EGRECVNCGAIDTPLWRRDGTGHYLCNACGLYTKMNGMNRPLK--PPRRLVRQRH 59
>gi|112984186|ref|NP_001037446.1| transcription factor BCFI [Bombyx mori]
gi|483331|gb|AAA65734.1| transcription factor BCFI [Bombyx mori]
Length = 509
Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 29/49 (59%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R KR G C NC TT T+LWRRN GE VCNACGL ++N P
Sbjct: 310 RRLMAAKRPGTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRP 358
Score = 40.8 bits (94), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 21/42 (50%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G C NC T LW R+ G +CNACGL +N P K
Sbjct: 266 GRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPLK 307
>gi|1730200|sp|P52167.1|GATB_BOMMO RecName: Full=Transcription factor BCFI; AltName: Full=BmGATA-beta;
Short=GATA-beta
Length = 508
Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 29/49 (59%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R KR G C NC TT T+LWRRN GE VCNACGL ++N P
Sbjct: 310 RRLMAAKRPGTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRP 358
Score = 40.8 bits (94), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 21/42 (50%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G C NC T LW R+ G +CNACGL +N P K
Sbjct: 266 GRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPLK 307
>gi|603164|gb|AAA67886.1| BmGATA beta isoform 1, partial [Bombyx mori]
Length = 313
Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 29/49 (59%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R KR G C NC TT T+LWRRN GE VCNACGL ++N P
Sbjct: 115 RRLMAAKRPGTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRP 163
Score = 40.4 bits (93), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 21/43 (48%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G C NC T LW R+ G +CNACGL +N P K
Sbjct: 70 EGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPLK 112
>gi|1078989|pir||S53811 BmGATA beta isoform 2 - silkworm (fragment)
gi|603165|gb|AAA67885.1| BmGATA beta isoform 2, partial [Bombyx mori]
Length = 327
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 25/43 (58%), Positives = 28/43 (65%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
KR G C NC TT T+LWRRN GE VCNACGL ++N P
Sbjct: 135 KRPGTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRP 177
Score = 40.4 bits (93), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 21/43 (48%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G C NC T LW R+ G +CNACGL +N P K
Sbjct: 70 EGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPLK 112
>gi|114319150|gb|ABI63575.1| GATA4 [Danio rerio]
Length = 352
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SC NC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 212 RRLSASRRVGLSCTNCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 260
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 167 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 206
>gi|410916493|ref|XP_003971721.1| PREDICTED: transcription factor GATA-4-like [Takifugu rubripes]
Length = 386
Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SC NC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 228 RRLSASRRVGLSCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 276
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 183 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 222
>gi|270008081|gb|EFA04529.1| serpent [Tribolium castaneum]
Length = 528
Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 23/40 (57%), Positives = 28/40 (70%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G++C NC T+ T+LWRRN GEPVCNACGL +N P
Sbjct: 333 GLTCTNCHTSTTSLWRRNTVGEPVCNACGLYFKLHGVNRP 372
>gi|196011732|ref|XP_002115729.1| hypothetical protein TRIADDRAFT_59720 [Trichoplax adhaerens]
gi|190581505|gb|EDV21581.1| hypothetical protein TRIADDRAFT_59720 [Trichoplax adhaerens]
Length = 468
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/43 (58%), Positives = 30/43 (69%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R + CANC+T TTLWRRN G+PVCNACGL +IN P
Sbjct: 340 RRMDLICANCNTDRTTLWRRNKYGQPVCNACGLYFKLHEINRP 382
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 10 QTGNKRSGV-SCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ GN G +CANC T LWRR+ +G+ +CNACGL +N P
Sbjct: 281 KNGNATKGTRACANCGVISTPLWRRDGSGQYLCNACGLYHKINGVNRP 328
>gi|146760638|gb|ABQ44507.1| GATA-binding protein 4 [Oryzias latipes]
Length = 218
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/42 (64%), Positives = 31/42 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
R + ++R G+SC NC TT TTLWRRN GEPVCNACGL M
Sbjct: 174 RRLSASRRVGLSCTNCHTTTTTLWRRNAEGEPVCNACGLYMK 215
>gi|348524793|ref|XP_003449907.1| PREDICTED: transcription factor GATA-4 [Oreochromis niloticus]
Length = 389
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SC NC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 228 RRLSASRRVGLSCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 276
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 183 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 222
>gi|432946176|ref|XP_004083805.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-4
[Oryzias latipes]
Length = 388
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SC NC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 228 RRLSASRRVGLSCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 276
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 183 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRP 222
>gi|156147115|gb|ABU53701.1| GATA4 [Oreochromis niloticus]
Length = 392
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 33/49 (67%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+SC NC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 228 RRLSASRRVGLSCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 276
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 183 GRECVNCGAMPTPLWRRDGTGHYLCNACGLYHKMNGINRP 222
>gi|268578523|ref|XP_002644244.1| C. briggsae CBR-ELT-2 protein [Caenorhabditis briggsae]
Length = 460
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 31/46 (67%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++R G+ C+NC+ T TTLWRRN G+PVCNACGL I P
Sbjct: 237 SASRRQGLVCSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHIARP 282
>gi|335353905|dbj|BAK39709.1| GATA binding protein 2 [Tursiops truncatus]
Length = 83
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/39 (66%), Positives = 30/39 (76%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R + +R+G CANC TT TTLWRRN NG+PVCNACGL
Sbjct: 45 RRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGL 83
Score = 38.9 bits (89), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T T LWRR+ G +CNACGL N P
Sbjct: 3 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRP 39
>gi|312066897|ref|XP_003136488.1| GATA zinc finger family protein [Loa loa]
gi|307768347|gb|EFO27581.1| GATA zinc finger family protein [Loa loa]
Length = 564
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 29/48 (60%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +R G+ CANC T TTLWRR+ +G PVCNACGL + P
Sbjct: 352 QHNSQRRQGLVCANCRGTNTTLWRRDADGHPVCNACGLYFKLHQVQRP 399
>gi|45382623|ref|NP_990795.1| erythroid transcription factor [Gallus gallus]
gi|120955|sp|P17678.1|GATA1_CHICK RecName: Full=Erythroid transcription factor; AltName: Full=Eryf1;
AltName: Full=GATA-binding factor 1; Short=GATA-1;
AltName: Full=NF-E1 DNA-binding protein; Short=NF-E1a
gi|212629|gb|AAA49055.1| Eryf1 protein [Gallus gallus]
Length = 304
Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats.
Identities = 24/44 (54%), Positives = 31/44 (70%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+KR+G C+NC T+ TTLWRR+ G+PVCNACGL +N P
Sbjct: 157 SKRAGTVCSNCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVNRP 200
Score = 41.2 bits (95), Expect = 0.071, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T T LWRR+ G +CNACGL N P
Sbjct: 110 CVNCGATATPLWRRDGTGHYLCNACGLYHRLNGQNRP 146
>gi|149234449|ref|XP_001523104.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146453213|gb|EDK47469.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1093
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/33 (66%), Positives = 26/33 (78%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SGVSC+NC T T LWRRN G+P+CNACGL +
Sbjct: 768 SGVSCSNCHTKTTPLWRRNPQGQPLCNACGLFL 800
>gi|170587074|ref|XP_001898304.1| GATA zinc finger family protein [Brugia malayi]
gi|158594699|gb|EDP33283.1| GATA zinc finger family protein [Brugia malayi]
Length = 567
Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R G+ CANC T TTLWRR+ +G PVCNACGL + P
Sbjct: 353 QRRQGLVCANCRGTNTTLWRRDADGHPVCNACGLYYKLHQVQRP 396
>gi|408489445|pdb|3GAT|A Chain A, Solution Nmr Structure Of The C-Terminal Domain Of
Chicken Gata-1 Bound To Dna, 34 Structures
gi|425684924|pdb|2GAT|A Chain A, Solution Structure Of The C-Terminal Domain Of Chicken
Gata- 1 Bound To Dna, Nmr, Regularized Mean Structure
Length = 66
Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 30/43 (69%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
KR+G C+NC T+ TTLWRR+ G+PVCNACGL +N P
Sbjct: 1 KRAGTVCSNCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVNRP 43
>gi|25146629|ref|NP_509755.2| Protein ELT-2 [Caenorhabditis elegans]
gi|1706638|sp|Q10655.1|ELT2_CAEEL RecName: Full=Transcription factor elt-2
gi|847811|gb|AAC36130.1| GATA-factor [Caenorhabditis elegans]
gi|22265747|emb|CAA90029.2| Protein ELT-2 [Caenorhabditis elegans]
Length = 433
Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 42/71 (59%), Gaps = 1/71 (1%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNG 71
++R G+ C+NC+ T TTLWRRN G+PVCNACGL I P M ++ +R
Sbjct: 229 ASRRQGLVCSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHIPRPTS-MKKEGALQTRKR 287
Query: 72 DALSVITSTPN 82
+ S +STP+
Sbjct: 288 KSKSGDSSTPS 298
>gi|358341608|dbj|GAA49239.1| GATA-binding factor A [Clonorchis sinensis]
Length = 492
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/33 (69%), Positives = 27/33 (81%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
+R G+ C NC TT TTLWRRN +G+PVCNACGL
Sbjct: 121 RRIGLVCTNCETTQTTLWRRNADGQPVCNACGL 153
>gi|313234089|emb|CBY19666.1| unnamed protein product [Oikopleura dioica]
Length = 553
Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 29/45 (64%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPE 57
NKR G C NC T+ TTLWRR+ GEPVCNACGL + P+
Sbjct: 374 NKRIGTICVNCKTSQTTLWRRSLKGEPVCNACGLYEKLHGVPRPK 418
Score = 42.7 bits (99), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 20/29 (68%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
CANC+ T T LWRR+ G +CNACGL
Sbjct: 327 CANCAATTTPLWRRDKCGNYLCNACGLYY 355
>gi|442928|pdb|1GAT|A Chain A, Solution Structure Of The Specific Dna Complex Of The
Zinc Containing Dna Binding Domain Of The Erythroid
Transcription Factor Gata-1 By Multidimensional Nmr
gi|442930|pdb|1GAU|A Chain A, Solution Structure Of The Specific Dna Complex Of The
Zinc Containing Dna Binding Domain Of The Erythroid
Transcription Factor Gata-1 By Multidimensional Nmr
Length = 60
Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 30/43 (69%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
KR+G C+NC T+ TTLWRR+ G+PVCNACGL +N P
Sbjct: 1 KRAGTVCSNCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVNRP 43
>gi|332023110|gb|EGI63371.1| GATA-binding factor A [Acromyrmex echinatior]
Length = 595
Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/50 (52%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 9 EQTGN--KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+Q+GN K+ +SC NC T TT+WRRN GE VCNACGL +N P
Sbjct: 412 QQSGNVAKKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRP 461
>gi|407025369|gb|AFS65551.1| Gata1/2/3, partial [Parastichopus parvimensis]
Length = 261
Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/36 (66%), Positives = 28/36 (77%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNA 43
+ + +KR+G SCANC T TTLWRRN NGEPVCNA
Sbjct: 226 KRRLSSKRTGTSCANCQATTTTLWRRNPNGEPVCNA 261
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 26/49 (53%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R ++ + G C NC T T LWRR+ NG +CNACGL N P
Sbjct: 173 RAKSRSSTEGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNGQNRP 221
>gi|431896291|gb|ELK05707.1| Transcription factor GATA-6 [Pteropus alecto]
Length = 291
Score = 56.2 bits (134), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 32/44 (72%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++R G+SCANC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 132 SRRLGLSCANCRTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 175
Score = 38.9 bits (89), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI--MNTVD--INAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 85 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVP 130
>gi|402589823|gb|EJW83754.1| GATA zinc finger family protein [Wuchereria bancrofti]
Length = 553
Score = 56.2 bits (134), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 28/43 (65%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R G+ CANC T TTLWRR+ +G PVCNACGL + P
Sbjct: 344 RRQGLICANCRGTNTTLWRRDADGHPVCNACGLYYKLHQVQRP 386
>gi|341903414|gb|EGT59349.1| hypothetical protein CAEBREN_06149 [Caenorhabditis brenneri]
Length = 425
Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 29/43 (67%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R G+ C+NC+ T TTLWRRN G+PVCNACGL I P
Sbjct: 229 RRQGLVCSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHIPRP 271
>gi|307172112|gb|EFN63680.1| GATA-binding factor A [Camponotus floridanus]
Length = 595
Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/50 (52%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 9 EQTGN--KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+Q+GN K+ +SC NC T TT+WRRN GE VCNACGL +N P
Sbjct: 411 QQSGNIAKKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRP 460
>gi|70997557|ref|XP_753523.1| siderophore transcription factor SreA [Aspergillus fumigatus Af293]
gi|66851159|gb|EAL91485.1| siderophore transcription factor SreA [Aspergillus fumigatus Af293]
gi|159126746|gb|EDP51862.1| siderophore transcription factor SreA [Aspergillus fumigatus A1163]
Length = 546
Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 42/88 (47%), Gaps = 17/88 (19%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI-----------MNTVDINAPEKIMP----- 61
VSC NC TT T LWRR+ NG P+CNACGL M I ++++P
Sbjct: 251 VSCQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPTTMKKSIIKRRKRVVPALRDQ 310
Query: 62 -PIESMISRNGDALSVITSTPNIHHSDD 88
P + S NG + S S + HS D
Sbjct: 311 SPSGATYSSNGSSASPEASPAVLAHSHD 338
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + ++ P K
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTK 141
>gi|241954936|ref|XP_002420189.1| nitrogen regulatory GATA-factor, putative; transcriptional
activator with GATA-1-type Zn finger DNA-binding motif,
putative [Candida dubliniensis CD36]
gi|223643530|emb|CAX42412.1| nitrogen regulatory GATA-factor, putative [Candida dubliniensis
CD36]
Length = 753
Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 25/33 (75%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+GVSC NC T T LWRRN G+P+CNACGL +
Sbjct: 508 AGVSCTNCGTKTTPLWRRNPQGQPLCNACGLFL 540
>gi|378726228|gb|EHY52687.1| GATA-binding protein [Exophiala dermatitidis NIH/UT8656]
Length = 487
Score = 55.1 bits (131), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 11/74 (14%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI-----------MNTVDINAPEKIMPPIESM 66
V+C NC TT T LWRR+ +G P+CNACGL M I ++++P +
Sbjct: 268 VACKNCGTTVTPLWRRDEHGHPICNACGLYHKLHGSHRPVQMKKSTIKRRKRVVPAYPDV 327
Query: 67 ISRNGDALSVITST 80
+ +G+A TST
Sbjct: 328 LRSDGNASQKTTST 341
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 24/40 (60%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C+NC TT T LWRR+ G +CNACGL T + P
Sbjct: 97 GQVCSNCGTTKTPLWRRSPTGTTICNACGLYQKTRNAPRP 136
>gi|353242644|emb|CCA74270.1| hypothetical protein PIIN_08223 [Piriformospora indica DSM 11827]
Length = 1358
Score = 55.1 bits (131), Expect = 5e-06, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 32/56 (57%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALS 75
C NCSTT T LWRRN G+P+CNACGL + P + + +RNG L+
Sbjct: 1177 CTNCSTTTTPLWRRNPEGQPLCNACGLFFKLHGVTRPLSLKTDVIKKRNRNGATLT 1232
>gi|300709080|ref|XP_002996708.1| hypothetical protein NCER_100170 [Nosema ceranae BRL01]
gi|239606029|gb|EEQ83037.1| hypothetical protein NCER_100170 [Nosema ceranae BRL01]
Length = 165
Score = 55.1 bits (131), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 37/76 (48%), Gaps = 2/76 (2%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESM 66
I E+ GNKR C NC TT T LWRR ++G +CNACGL + N P +
Sbjct: 4 IAEEIGNKRG--YCKNCGTTTTPLWRRGDDGSYLCNACGLYLKIHKKNRPHEFKTDSFKH 61
Query: 67 ISRNGDALSVITSTPN 82
+RN + I N
Sbjct: 62 RNRNKKEVPFIPQLRN 77
>gi|31559064|gb|AAP50501.1| nitrogen regulatory GATA-factor [Candida albicans]
Length = 755
Score = 55.1 bits (131), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 25/33 (75%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+GVSC NC T T LWRRN G+P+CNACGL +
Sbjct: 501 TGVSCTNCGTKTTPLWRRNPQGQPLCNACGLFL 533
>gi|384499461|gb|EIE89952.1| hypothetical protein RO3G_14663 [Rhizopus delemar RA 99-880]
Length = 374
Score = 55.1 bits (131), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 41/74 (55%), Gaps = 8/74 (10%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGD--ALSVI 77
CANCST T LWRR+ +G+P+CNACGL ++ P + S I R A ++
Sbjct: 181 CANCSTETTPLWRRDESGQPICNACGLYYKLHHVHRPVTMK---RSTIKRRKRVTAANLT 237
Query: 78 TSTPNIHH---SDD 88
T P ++H SDD
Sbjct: 238 THKPMVYHPTKSDD 251
Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 26/41 (63%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ SC+NC TT T LWRR+ GE +CNACGL + + P
Sbjct: 83 TATSCSNCGTTTTPLWRRSPLGETICNACGLYYKARNTSRP 123
>gi|68478842|ref|XP_716583.1| hypothetical protein CaO19.1275 [Candida albicans SC5314]
gi|68478951|ref|XP_716530.1| hypothetical protein CaO19.8862 [Candida albicans SC5314]
gi|46438200|gb|EAK97535.1| hypothetical protein CaO19.8862 [Candida albicans SC5314]
gi|46438254|gb|EAK97588.1| hypothetical protein CaO19.1275 [Candida albicans SC5314]
Length = 688
Score = 55.1 bits (131), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 25/33 (75%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+GVSC NC T T LWRRN G+P+CNACGL +
Sbjct: 434 TGVSCTNCGTKTTPLWRRNPQGQPLCNACGLFL 466
>gi|238881297|gb|EEQ44935.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 582
Score = 55.1 bits (131), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 25/33 (75%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+GVSC NC T T LWRRN G+P+CNACGL +
Sbjct: 328 TGVSCTNCGTKTTPLWRRNPQGQPLCNACGLFL 360
>gi|448515041|ref|XP_003867232.1| Gat1 zinc finger transcriptional regulator [Candida orthopsilosis
Co 90-125]
gi|380351571|emb|CCG21794.1| Gat1 zinc finger transcriptional regulator [Candida orthopsilosis]
Length = 761
Score = 55.1 bits (131), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 28/46 (60%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G SG SC NC T T LWRRN G+P+CNACGL + + P
Sbjct: 600 SGASASGPSCTNCHTKTTPLWRRNPEGQPLCNACGLFLKLHGVTRP 645
>gi|431894591|gb|ELK04391.1| Transcription factor GATA-5 [Pteropus alecto]
Length = 243
Score = 54.7 bits (130), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++R+G+ C NC T TTLWRR+ G+PVCNACGL + P
Sbjct: 80 SSSRRAGLCCTNCHTATTTLWRRDAEGQPVCNACGLYAKLHGVPRP 125
Score = 42.7 bits (99), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 32 GRECVNCGALSTPLWRRDTTGHYLCNACGLYHKMNGVNRP 71
>gi|354547100|emb|CCE43833.1| hypothetical protein CPAR2_500590 [Candida parapsilosis]
Length = 769
Score = 54.7 bits (130), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 27/41 (65%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
SG +C+NC T T LWRRN G+P+CNACGL + + P
Sbjct: 595 SGQTCSNCQTKTTPLWRRNPEGQPLCNACGLFLKLHGVTRP 635
>gi|154286086|ref|XP_001543838.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150407479|gb|EDN03020.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 630
Score = 54.7 bits (130), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 27/46 (58%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPI 63
++C NC TT T LWRR+ NG P+CNACGL P + PI
Sbjct: 289 IACQNCQTTVTPLWRRDENGHPICNACGLYHKLHGAYRPPTMKKPI 334
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G SC+NC T T LWRR+ G +CNACGL + + + P
Sbjct: 124 TGHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 164
>gi|358372181|dbj|GAA88786.1| siderophore transcription factor SreA [Aspergillus kawachii IFO
4308]
Length = 568
Score = 54.3 bits (129), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 23/32 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
VSC NC TT T LWRR+ NG P+CNACGL
Sbjct: 264 VSCQNCGTTVTPLWRRDENGHPICNACGLYYK 295
Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + ++ P K
Sbjct: 107 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTK 148
>gi|344305512|gb|EGW35744.1| hypothetical protein SPAPADRAFT_58947 [Spathaspora passalidarum
NRRL Y-27907]
Length = 355
Score = 54.3 bits (129), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 20/33 (60%), Positives = 25/33 (75%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
GVSC+NC T T LWRR+ G+P+CNACGL +
Sbjct: 272 GVSCSNCHTRTTPLWRRDPEGQPLCNACGLFLK 304
>gi|308495029|ref|XP_003109703.1| CRE-ELT-2 protein [Caenorhabditis remanei]
gi|308245893|gb|EFO89845.1| CRE-ELT-2 protein [Caenorhabditis remanei]
Length = 426
Score = 54.3 bits (129), Expect = 9e-06, Method: Composition-based stats.
Identities = 23/49 (46%), Positives = 32/49 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
+ ++R G+ C+NC+ T TTLWRRN G+PVCNACGL + P +
Sbjct: 225 SASRRQGLICSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHVARPTSM 273
>gi|145228733|ref|XP_001388675.1| GATA factor SREP [Aspergillus niger CBS 513.88]
gi|134054767|emb|CAK43607.1| unnamed protein product [Aspergillus niger]
gi|350637901|gb|EHA26257.1| hypothetical protein ASPNIDRAFT_52040 [Aspergillus niger ATCC 1015]
Length = 569
Score = 54.3 bits (129), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 23/32 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
VSC NC TT T LWRR+ NG P+CNACGL
Sbjct: 265 VSCQNCGTTVTPLWRRDENGHPICNACGLYYK 296
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + ++ P K
Sbjct: 108 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTK 149
>gi|169775323|ref|XP_001822129.1| GATA factor SREP [Aspergillus oryzae RIB40]
gi|83769992|dbj|BAE60127.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 566
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 23/32 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
VSC NC TT T LWRR+ NG P+CNACGL
Sbjct: 264 VSCQNCGTTVTPLWRRDENGHPICNACGLYYK 295
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + ++ P K
Sbjct: 106 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTK 147
>gi|391873006|gb|EIT82081.1| GATA transcription factor [Aspergillus oryzae 3.042]
Length = 566
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 23/32 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
VSC NC TT T LWRR+ NG P+CNACGL
Sbjct: 264 VSCQNCGTTVTPLWRRDENGHPICNACGLYYK 295
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + ++ P K
Sbjct: 106 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTK 147
>gi|238496045|ref|XP_002379258.1| siderophore transcription factor SreA [Aspergillus flavus NRRL3357]
gi|220694138|gb|EED50482.1| siderophore transcription factor SreA [Aspergillus flavus NRRL3357]
Length = 559
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 23/32 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
VSC NC TT T LWRR+ NG P+CNACGL
Sbjct: 257 VSCQNCGTTVTPLWRRDENGHPICNACGLYYK 288
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + ++ P K
Sbjct: 99 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTK 140
>gi|119479079|ref|XP_001259568.1| siderophore transcription factor SreA [Neosartorya fischeri NRRL
181]
gi|119407722|gb|EAW17671.1| siderophore transcription factor SreA [Neosartorya fischeri NRRL
181]
Length = 546
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 23/32 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
VSC NC TT T LWRR+ NG P+CNACGL
Sbjct: 251 VSCQNCGTTVTPLWRRDENGHPICNACGLYYK 282
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + ++ P K
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTK 141
>gi|115401494|ref|XP_001216335.1| GATA factor SREP [Aspergillus terreus NIH2624]
gi|114190276|gb|EAU31976.1| GATA factor SREP [Aspergillus terreus NIH2624]
Length = 536
Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 23/32 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
VSC NC TT T LWRR+ NG P+CNACGL
Sbjct: 245 VSCQNCGTTVTPLWRRDENGHPICNACGLYYK 276
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 27/42 (64%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + +++ P K
Sbjct: 93 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVSRPTK 134
>gi|389646733|ref|XP_003720998.1| nitrogen regulatory protein NUT1 [Magnaporthe oryzae 70-15]
gi|2494690|sp|Q01168.1|NUT1_MAGO7 RecName: Full=Nitrogen regulatory protein NUT1
gi|1401346|gb|AAB03415.1| nitrogen regulatory protein [Magnaporthe grisea]
gi|86196459|gb|EAQ71097.1| hypothetical protein MGCH7_ch7g504 [Magnaporthe oryzae 70-15]
gi|351638390|gb|EHA46255.1| nitrogen regulatory protein NUT1 [Magnaporthe oryzae 70-15]
gi|440466924|gb|ELQ36165.1| nitrogen regulatory protein NUT1 [Magnaporthe oryzae Y34]
gi|440482142|gb|ELQ62657.1| nitrogen regulatory protein NUT1 [Magnaporthe oryzae P131]
Length = 956
Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 29/43 (67%), Gaps = 3/43 (6%)
Query: 9 EQTGNKRSG---VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+Q GN + G +C NC+T T LWRRN G+P+CNACGL +
Sbjct: 649 QQQGNNQGGDAPTTCTNCATQTTPLWRRNPEGQPLCNACGLFL 691
>gi|344231972|gb|EGV63851.1| hypothetical protein CANTEDRAFT_113880 [Candida tenuis ATCC 10573]
Length = 569
Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 33/61 (54%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALSVI 77
VSC NC T T LWRRN G+P+CNACGL + + P + + RN ++ I
Sbjct: 382 VSCTNCHTKTTPLWRRNPEGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRQRNSNSSKKI 441
Query: 78 T 78
T
Sbjct: 442 T 442
>gi|348517021|ref|XP_003446034.1| PREDICTED: transcription factor GATA-3-like [Oreochromis niloticus]
Length = 170
Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 27/43 (62%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R G C NC T TTLWRRN+ GE VCNACGL +N P
Sbjct: 79 QRKGTQCVNCLTERTTLWRRNSAGEAVCNACGLYYRLHRVNRP 121
>gi|301349225|gb|ADK74261.1| Gata 4-like protein [Salmo salar]
Length = 181
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 32/49 (65%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++R G+ C NC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 110 RRLSASRRVGLLCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 158
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL IN P
Sbjct: 65 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKVNGINRP 104
>gi|384500433|gb|EIE90924.1| hypothetical protein RO3G_15635 [Rhizopus delemar RA 99-880]
Length = 379
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 30/41 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+E ++ S + C+NCSTT T LWRR++ G P+CNACGL +
Sbjct: 233 KESKDDEGSQLVCSNCSTTTTPLWRRDDEGAPLCNACGLYL 273
Score = 42.7 bits (99), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 28/56 (50%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMP 61
G R + + + C NC T T LWRR+ E +CNACGL + P+ + P
Sbjct: 173 GKRPSSSHLENKPICTNCGATSTPLWRRSAEDELLCNACGLYQKLHNAPRPKTLKP 228
>gi|380022377|ref|XP_003695026.1| PREDICTED: uncharacterized protein LOC100864759 [Apis florea]
Length = 588
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 9 EQTGN--KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +GN K+ +SC NC T TT+WRRN GE VCNACGL +N P
Sbjct: 404 QHSGNITKKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRP 453
>gi|328785124|ref|XP_001120276.2| PREDICTED: hypothetical protein LOC724420 [Apis mellifera]
Length = 588
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 9 EQTGN--KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +GN K+ +SC NC T TT+WRRN GE VCNACGL +N P
Sbjct: 404 QHSGNIAKKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRP 453
>gi|350398203|ref|XP_003485118.1| PREDICTED: hypothetical protein LOC100747914 [Bombus impatiens]
Length = 588
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 9 EQTGN--KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +GN K+ +SC NC T TT+WRRN GE VCNACGL +N P
Sbjct: 404 QHSGNMAKKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRP 453
>gi|340724648|ref|XP_003400693.1| PREDICTED: hypothetical protein LOC100646437 [Bombus terrestris]
Length = 588
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 9 EQTGN--KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +GN K+ +SC NC T TT+WRRN GE VCNACGL +N P
Sbjct: 404 QHSGNMAKKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRP 453
>gi|312384257|gb|EFR29024.1| hypothetical protein AND_02351 [Anopheles darlingi]
Length = 806
Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats.
Identities = 31/45 (68%), Positives = 37/45 (82%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
GN+RSGV+CANC+TT TTLWRRNN G+PVCNACGL ++N P
Sbjct: 645 GNRRSGVTCANCNTTTTTLWRRNNQGDPVCNACGLYFKLHNVNRP 689
Score = 36.6 bits (83), Expect = 2.0, Method: Composition-based stats.
Identities = 14/27 (51%), Positives = 18/27 (66%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
C NC ++ T LWRR+ G +CNAC L
Sbjct: 589 CVNCGSSDTPLWRRDVVGHTLCNACAL 615
>gi|62815898|emb|CAG27837.1| GATA-type transcription factor [Ogataea angusta]
gi|190360942|gb|ACE76877.1| GATA factor [Ogataea angusta]
Length = 565
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G ++ ++C NC T T LWRRN G+P+CNACGL +
Sbjct: 386 GTEQGSIACTNCHTKTTPLWRRNPEGQPLCNACGLFL 422
>gi|193083007|ref|NP_001122335.1| GATAa protein [Ciona intestinalis]
gi|70569756|dbj|BAE06471.1| transcription factor protein [Ciona intestinalis]
Length = 641
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 27/40 (67%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
C+NCSTT TTLWRR+ +G PVCNACGL + P +
Sbjct: 358 CSNCSTTVTTLWRRSPDGNPVCNACGLYQKLHGVPRPRTM 397
>gi|320581748|gb|EFW95967.1| GATA-type transcription factor [Ogataea parapolymorpha DL-1]
Length = 565
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G ++ ++C NC T T LWRRN G+P+CNACGL +
Sbjct: 386 GTEQGSIACTNCHTKTTPLWRRNPEGQPLCNACGLFL 422
>gi|367022150|ref|XP_003660360.1| hypothetical protein MYCTH_2298578 [Myceliophthora thermophila ATCC
42464]
gi|347007627|gb|AEO55115.1| hypothetical protein MYCTH_2298578 [Myceliophthora thermophila ATCC
42464]
Length = 553
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 39/73 (53%), Gaps = 8/73 (10%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI---MPPIESMISRNG 71
+SG C+NC TT T LWRR+ GE +CNACGL + T NAP I PP + NG
Sbjct: 93 QSGQVCSNCGTTFTPLWRRSPQGEIICNACGLYLKTR--NAPRPINLKRPPA---VPGNG 147
Query: 72 DALSVITSTPNIH 84
S + +P
Sbjct: 148 SRQSPVKLSPKAQ 160
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 29/55 (52%)
Query: 2 ILTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
I ++ Q N ++C NC TT T LWRR+ G +CNACGL ++ P
Sbjct: 238 IAALQLQNQNQNPTVVIACQNCGTTVTPLWRRDEAGHTICNACGLYYKLHGVHRP 292
>gi|190345835|gb|EDK37787.2| hypothetical protein PGUG_01885 [Meyerozyma guilliermondii ATCC
6260]
Length = 568
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 20/31 (64%), Positives = 23/31 (74%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
VSC NC T T LWRRN G+P+CNACGL +
Sbjct: 416 VSCTNCHTRTTPLWRRNPQGQPLCNACGLFL 446
>gi|345486907|ref|XP_003425585.1| PREDICTED: hypothetical protein LOC100678172 [Nasonia vitripennis]
Length = 647
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 27/43 (62%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
K+ +SC NC T TT+WRRN GE VCNACGL +N P
Sbjct: 460 KKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRP 502
>gi|150951390|ref|XP_001387707.2| activator of transcription of nitrogen-regulated genes
[Scheffersomyces stipitis CBS 6054]
gi|149388554|gb|EAZ63684.2| activator of transcription of nitrogen-regulated genes
[Scheffersomyces stipitis CBS 6054]
Length = 820
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 23/32 (71%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
GVSC NC T T LWRRN G P+CNACGL +
Sbjct: 590 GVSCTNCHTQTTPLWRRNPQGLPLCNACGLFL 621
>gi|67515789|ref|XP_657780.1| hypothetical protein AN0176.2 [Aspergillus nidulans FGSC A4]
gi|4585213|gb|AAD25328.1|AF095898_1 siderophore biosynthesis repressor SREA [Emericella nidulans]
gi|40746893|gb|EAA66049.1| hypothetical protein AN0176.2 [Aspergillus nidulans FGSC A4]
gi|259489617|tpe|CBF90036.1| TPA: Putative uncharacterized proteinSiderophore biosynthesis
repressor SREA ; [Source:UniProtKB/TrEMBL;Acc:Q9Y754]
[Aspergillus nidulans FGSC A4]
Length = 549
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 23/32 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
V+C NC TT T LWRR+ NG P+CNACGL
Sbjct: 249 VACQNCGTTVTPLWRRDENGHPICNACGLYYK 280
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + ++ P K
Sbjct: 103 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTK 144
>gi|242762467|ref|XP_002340383.1| siderophore transcription factor SreA [Talaromyces stipitatus ATCC
10500]
gi|218723579|gb|EED22996.1| siderophore transcription factor SreA [Talaromyces stipitatus ATCC
10500]
Length = 493
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 23/32 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
V+C NC TT T LWRR+ NG P+CNACGL
Sbjct: 254 VACQNCGTTVTPLWRRDENGHPICNACGLYYK 285
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 34/66 (51%), Gaps = 10/66 (15%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALSV 76
G SC+NC T T LWRR+ G +CNACGL + +++ P K RN + +
Sbjct: 103 GHSCSNCGTKSTPLWRRSPTGATICNACGLYLKARNVHRPTK----------RNRNRPQI 152
Query: 77 ITSTPN 82
T PN
Sbjct: 153 TTPGPN 158
>gi|383865617|ref|XP_003708269.1| PREDICTED: uncharacterized protein LOC100877935 [Megachile
rotundata]
Length = 586
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 27/43 (62%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
K+ +SC NC T TT+WRRN GE VCNACGL +N P
Sbjct: 409 KKVDMSCTNCGTMTTTIWRRNMKGEIVCNACGLYYKLHGVNRP 451
>gi|294657093|ref|XP_459410.2| DEHA2E01826p [Debaryomyces hansenii CBS767]
gi|199432441|emb|CAG87621.2| DEHA2E01826p [Debaryomyces hansenii CBS767]
Length = 636
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 25/31 (80%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+SC NC+TT T LWRR+ G+P+CNACGL +
Sbjct: 455 ISCTNCNTTATPLWRRDPKGKPLCNACGLFL 485
>gi|322789591|gb|EFZ14817.1| hypothetical protein SINV_01115 [Solenopsis invicta]
Length = 547
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 27/43 (62%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
K+ +SC NC T TT+WRRN GE VCNACGL +N P
Sbjct: 373 KKIDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRP 415
>gi|307197634|gb|EFN78813.1| GATA-binding factor A [Harpegnathos saltator]
Length = 594
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 9 EQTGN--KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+Q+ N K+ ++C+NC T TT+WRRN GE VCNACGL IN P
Sbjct: 409 QQSSNVTKKVEMTCSNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGINRP 458
>gi|384500731|gb|EIE91222.1| hypothetical protein RO3G_15933 [Rhizopus delemar RA 99-880]
Length = 260
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 33/54 (61%), Gaps = 1/54 (1%)
Query: 8 REQTGNKRSGVS-CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM 60
++Q + GV+ CANC TT T LWRR+ NG +CNACGL ++ P +M
Sbjct: 22 KKQEDHHPQGVTVCANCETTTTPLWRRDTNGRTICNACGLYYKLHLVHRPATMM 75
>gi|400600701|gb|EJP68369.1| Zinc finger, GATA-type [Beauveria bassiana ARSEF 2860]
Length = 838
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 29/45 (64%), Gaps = 3/45 (6%)
Query: 7 IREQTGNKRSG---VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++ TGN+ G +C NC T T LWRRN G+P+CNACGL +
Sbjct: 578 LQAATGNQAEGGNPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 622
>gi|225558357|gb|EEH06641.1| siderophore transcription factor SreA [Ajellomyces capsulatus
G186AR]
Length = 630
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
++C NC TT T LWRR+ NG P+CNACGL
Sbjct: 289 IACQNCQTTVTPLWRRDENGHPICNACGLY 318
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G SC+NC T T LWRR+ G +CNACGL + + + P
Sbjct: 124 TGHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 164
>gi|212529674|ref|XP_002144994.1| siderophore transcription factor SreA [Talaromyces marneffei ATCC
18224]
gi|210074392|gb|EEA28479.1| siderophore transcription factor SreA [Talaromyces marneffei ATCC
18224]
Length = 485
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 23/32 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
V+C NC TT T LWRR+ NG P+CNACGL
Sbjct: 248 VACQNCGTTVTPLWRRDENGHPICNACGLYYK 279
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 27/42 (64%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + +++ P K
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGATICNACGLYLKARNVHRPTK 141
>gi|342319729|gb|EGU11676.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
Length = 3850
Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 32/53 (60%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRN 70
+ C NC TT T LWRR++ G+P+CNACGL N ++ P + + +RN
Sbjct: 3732 IKCLNCGTTNTPLWRRDSEGKPLCNACGLFRNLHGVDRPANLNTGVIKKRNRN 3784
>gi|240274786|gb|EER38301.1| siderophore biosynthesis repressor SREA [Ajellomyces capsulatus
H143]
Length = 630
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
++C NC TT T LWRR+ NG P+CNACGL
Sbjct: 289 IACQNCQTTVTPLWRRDENGHPICNACGLY 318
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G SC+NC T T LWRR+ G +CNACGL + + + P
Sbjct: 124 TGHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 164
>gi|164685140|gb|ABY66603.1| siderophore uptake regulator [Ajellomyces capsulatus]
Length = 630
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
++C NC TT T LWRR+ NG P+CNACGL
Sbjct: 289 IACQNCQTTVTPLWRRDENGHPICNACGLY 318
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G SC+NC T T LWRR+ G +CNACGL + + + P
Sbjct: 124 TGHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 164
>gi|255953715|ref|XP_002567610.1| GATA transcription factor sreP-Penicillium chrysogenum [Penicillium
chrysogenum Wisconsin 54-1255]
gi|2494691|sp|Q92259.1|SREP_PENCH RecName: Full=GATA factor SREP
gi|1517916|gb|AAC49628.1| SreP [Penicillium chrysogenum]
gi|211589321|emb|CAP95461.1| GATA transcription factor sreP-Penicillium chrysogenum [Penicillium
chrysogenum Wisconsin 54-1255]
Length = 532
Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 40/88 (45%), Gaps = 17/88 (19%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI-----------MNTVDINAPEKIMP----- 61
V+C NC TT T LWRR+ G P+CNACGL M I ++++P
Sbjct: 236 VACQNCGTTVTPLWRRDEQGHPICNACGLYYKLHGCYRPTNMKKSIIKRRKRVVPALRDQ 295
Query: 62 -PIESMISRNGDALSVITSTPNIHHSDD 88
P +S NG + S S + H D
Sbjct: 296 SPTAGTLSSNGSSTSPEASPAALAHGHD 323
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + ++ P K
Sbjct: 91 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTK 132
>gi|325094138|gb|EGC47448.1| siderophore transcription factor SreA [Ajellomyces capsulatus H88]
Length = 541
Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
++C NC TT T LWRR+ NG P+CNACGL
Sbjct: 200 IACQNCQTTVTPLWRRDENGHPICNACGLY 229
Score = 42.4 bits (98), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+K+ ANC T T LWRR+ G +CNACGL + + + P
Sbjct: 32 DKQETTGKANCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 75
>gi|270012803|gb|EFA09251.1| GATAd [Tribolium castaneum]
Length = 745
Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 27/43 (62%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R +SC NC T TT+WRRN GE VCNACGL I+ P
Sbjct: 574 RRVDLSCTNCGTQTTTIWRRNMKGEMVCNACGLYYKLHGIDRP 616
>gi|119172876|ref|XP_001238983.1| hypothetical protein CIMG_10005 [Coccidioides immitis RS]
Length = 752
Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
G + T + +CANC T T LWRRN G+P+CNACGL +
Sbjct: 511 GPAKATEQPNAPTTCANCFTQTTPLWRRNPEGQPLCNACGLFLK 554
>gi|340960405|gb|EGS21586.1| putative transcription factor [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 447
Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 38/81 (46%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIES 65
++ Q N ++C NC TT T LWRR+ G +CNACGL ++ P + I
Sbjct: 205 ALQLQGANATVVIACQNCGTTVTPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKSIIK 264
Query: 66 MISRNGDALSVITSTPNIHHS 86
RN V S P +HS
Sbjct: 265 RRKRNRQTSPVSVSDPGQYHS 285
Score = 46.2 bits (108), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 2/74 (2%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRN 70
+ + +G C+NC TT T LWRR+ G +CNACGL + + P + P ++ +
Sbjct: 58 SSQEYTGQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNAARPINLKRPPS--VTAS 115
Query: 71 GDALSVITSTPNIH 84
G S + +P H
Sbjct: 116 GARQSPVRLSPKGH 129
>gi|261201874|ref|XP_002628151.1| siderophore transcription factor SreA [Ajellomyces dermatitidis
SLH14081]
gi|239590248|gb|EEQ72829.1| siderophore transcription factor SreA [Ajellomyces dermatitidis
SLH14081]
Length = 633
Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
++C NC TT T LWRR+ NG P+CNACGL
Sbjct: 287 IACQNCQTTVTPLWRRDENGHPICNACGLY 316
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 37/71 (52%), Gaps = 4/71 (5%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP----EKIMPPIESMISRNG 71
+G SC+NC T T LWRR+ G +CNACGL + + + P ++ P S ++N
Sbjct: 124 TGHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRPTHRSRSLLNPYGSNPAQNA 183
Query: 72 DALSVITSTPN 82
D TS N
Sbjct: 184 DKSRSSTSPTN 194
>gi|327353473|gb|EGE82330.1| siderophore transcription factor SreA [Ajellomyces dermatitidis
ATCC 18188]
Length = 634
Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
++C NC TT T LWRR+ NG P+CNACGL
Sbjct: 288 IACQNCQTTVTPLWRRDENGHPICNACGLY 317
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP----EKIMPPIESMISRNG 71
+G SC+NC T T LWRR+ G +CNACGL + + + P ++ P S ++N
Sbjct: 125 TGHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRPTHRSRSLLNPYGSNPAQNA 184
Query: 72 DALSVITS 79
D TS
Sbjct: 185 DKSRSSTS 192
>gi|239611963|gb|EEQ88950.1| siderophore transcription factor SreA [Ajellomyces dermatitidis
ER-3]
Length = 634
Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
++C NC TT T LWRR+ NG P+CNACGL
Sbjct: 288 IACQNCQTTVTPLWRRDENGHPICNACGLY 317
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP----EKIMPPIESMISRNG 71
+G SC+NC T T LWRR+ G +CNACGL + + + P ++ P S ++N
Sbjct: 125 TGHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRPTHRSRSLLNPYGSNPAQNA 184
Query: 72 DALSVITS 79
D TS
Sbjct: 185 DKSRSSTS 192
>gi|349604427|gb|AEP99980.1| Zinc finger transcription factor Trps1-like protein, partial [Equus
caballus]
Length = 547
Score = 52.4 bits (124), Expect = 4e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 158 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 188
>gi|392869191|gb|EAS27676.2| hypothetical protein CIMG_10005 [Coccidioides immitis RS]
Length = 762
Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
G + T + +CANC T T LWRRN G+P+CNACGL +
Sbjct: 521 GPAKATEQPNAPTTCANCFTQTTPLWRRNPEGQPLCNACGLFLK 564
>gi|303324011|ref|XP_003071993.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
gi|240111703|gb|EER29848.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
Length = 760
Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
G + T + +CANC T T LWRRN G+P+CNACGL +
Sbjct: 520 GPAKATEQPNAPTTCANCFTQTTPLWRRNPEGQPLCNACGLFLK 563
>gi|22478075|gb|AAH37058.1| Trps1 protein [Mus musculus]
gi|148697296|gb|EDL29243.1| trichorhinophalangeal syndrome I (human), isoform CRA_c [Mus
musculus]
Length = 1092
Score = 52.4 bits (124), Expect = 4e-05, Method: Composition-based stats.
Identities = 26/40 (65%), Positives = 29/40 (72%), Gaps = 4/40 (10%)
Query: 11 TGNKR----SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
T N+R SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 694 TPNRRRRRGSGVFCANCLTTKTSLWRKNANGGYVCNACGL 733
>gi|320036931|gb|EFW18869.1| GATA transcriptional activator AreA [Coccidioides posadasii str.
Silveira]
Length = 760
Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
G + T + +CANC T T LWRRN G+P+CNACGL +
Sbjct: 520 GPAKATEQPNAPTTCANCFTQTTPLWRRNPEGQPLCNACGLFLK 563
>gi|189240277|ref|XP_001812551.1| PREDICTED: similar to GATAd CG5034-PA [Tribolium castaneum]
Length = 451
Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 27/43 (62%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R +SC NC T TT+WRRN GE VCNACGL I+ P
Sbjct: 280 RRVDLSCTNCGTQTTTIWRRNMKGEMVCNACGLYYKLHGIDRP 322
>gi|349804083|gb|AEQ17514.1| putative gata binding protein 6 [Hymenochirus curtipes]
Length = 164
Score = 52.0 bits (123), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/43 (60%), Positives = 30/43 (69%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R G++C NC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 60 RRIGLACTNCHTTTTTLWRRNTEGEPVCNACGLYMKLHGVPRP 102
Score = 37.7 bits (86), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 4/47 (8%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLI--MN--TVDINAPEKIMP 61
C NC + T LWRR+ G +CNACGL MN + + P+K +P
Sbjct: 11 ECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGPSRPLIKPQKRVP 57
>gi|171684885|ref|XP_001907384.1| hypothetical protein [Podospora anserina S mat+]
gi|170942403|emb|CAP68055.1| unnamed protein product [Podospora anserina S mat+]
Length = 670
Score = 52.0 bits (123), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 19/29 (65%), Positives = 23/29 (79%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC+TT T LWRRN G+P+CNACGL +
Sbjct: 388 CTNCATTTTPLWRRNPEGQPLCNACGLFL 416
>gi|431901719|gb|ELK08596.1| Zinc finger transcription factor Trps1 [Pteropus alecto]
Length = 427
Score = 52.0 bits (123), Expect = 4e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 38 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 68
>gi|348588267|ref|XP_003479888.1| PREDICTED: zinc finger transcription factor Trps1-like [Cavia
porcellus]
Length = 1294
Score = 52.0 bits (123), Expect = 4e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|281344786|gb|EFB20370.1| hypothetical protein PANDA_007849 [Ailuropoda melanoleuca]
Length = 687
Score = 52.0 bits (123), Expect = 4e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 298 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 328
>gi|448123846|ref|XP_004204769.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
gi|358249402|emb|CCE72468.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
Length = 697
Score = 52.0 bits (123), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 23/31 (74%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+SC NC T T LWRRN G+P+CNACGL +
Sbjct: 528 ISCTNCGTRTTPLWRRNPQGQPLCNACGLFL 558
>gi|301780742|ref|XP_002925788.1| PREDICTED: transcription factor GATA-5-like, partial [Ailuropoda
melanoleuca]
gi|281352342|gb|EFB27926.1| hypothetical protein PANDA_015338 [Ailuropoda melanoleuca]
Length = 292
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/36 (63%), Positives = 31/36 (86%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
+ ++R+G+ C+NC TT TTLWRRN +G+PVCNACGL
Sbjct: 129 SSSRRAGLCCSNCRTTTTTLWRRNADGDPVCNACGL 164
Score = 43.1 bits (100), Expect = 0.019, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 81 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 120
>gi|426235710|ref|XP_004011823.1| PREDICTED: zinc finger transcription factor Trps1 isoform 1 [Ovis
aries]
Length = 1293
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 904 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 934
>gi|74210804|dbj|BAE25041.1| unnamed protein product [Mus musculus]
Length = 1240
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 851 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 881
>gi|7021933|dbj|BAA91441.1| unnamed protein product [Homo sapiens]
Length = 785
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 398 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 428
>gi|395818033|ref|XP_003782443.1| PREDICTED: zinc finger transcription factor Trps1 [Otolemur
garnettii]
Length = 1294
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|300798549|ref|NP_001179842.1| zinc finger transcription factor Trps1 [Bos taurus]
gi|296480517|tpg|DAA22632.1| TPA: trichorhinophalangeal syndrome I [Bos taurus]
Length = 1293
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 904 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 934
>gi|52789279|gb|AAH83110.1| Trps1 protein, partial [Mus musculus]
Length = 1013
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 624 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 654
>gi|426235712|ref|XP_004011824.1| PREDICTED: zinc finger transcription factor Trps1 isoform 2 [Ovis
aries]
Length = 1284
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 895 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 925
>gi|20140645|sp|Q925H1.1|TRPS1_MOUSE RecName: Full=Zinc finger transcription factor Trps1
gi|13785801|gb|AAK39508.1|AF346836_1 atypical GATA protein TRPS1 [Mus musculus]
Length = 1281
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 892 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 922
>gi|198386343|ref|NP_001128309.1| zinc finger transcription factor Trps1 [Rattus norvegicus]
gi|149066418|gb|EDM16291.1| trichorhinophalangeal syndrome I (predicted) [Rattus norvegicus]
Length = 1281
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 892 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 922
>gi|397530026|gb|AFO53621.1| nitrogen regulatory protein AreA [Beauveria bassiana]
Length = 455
Score = 52.0 bits (123), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 29/45 (64%), Gaps = 3/45 (6%)
Query: 7 IREQTGNKRSG---VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++ TGN+ G +C NC T T LWRRN G+P+CNACGL +
Sbjct: 195 LQAATGNQAEGGNPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 239
>gi|426360550|ref|XP_004047503.1| PREDICTED: zinc finger transcription factor Trps1 isoform 2
[Gorilla gorilla gorilla]
Length = 1294
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|6684534|gb|AAF23614.1|AF183810_1 zinc finger transcription factor TRPS1 [Homo sapiens]
Length = 1281
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 892 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 922
>gi|71534019|gb|AAH99962.1| Trps1 protein [Mus musculus]
Length = 1281
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 892 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 922
>gi|403283521|ref|XP_003933167.1| PREDICTED: zinc finger transcription factor Trps1 [Saimiri
boliviensis boliviensis]
Length = 1285
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 896 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 926
>gi|380798109|gb|AFE70930.1| zinc finger transcription factor Trps1, partial [Macaca mulatta]
Length = 1281
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 892 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 922
>gi|355779903|gb|EHH64379.1| Tricho-rhino-phalangeal syndrome type I protein [Macaca
fascicularis]
Length = 1294
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|355698175|gb|EHH28723.1| Tricho-rhino-phalangeal syndrome type I protein [Macaca mulatta]
gi|383413901|gb|AFH30164.1| zinc finger transcription factor Trps1 [Macaca mulatta]
Length = 1294
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|350596310|ref|XP_003484256.1| PREDICTED: zinc finger transcription factor Trps1-like, partial [Sus
scrofa]
Length = 1631
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 1158 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 1188
>gi|168480127|ref|NP_114389.2| zinc finger transcription factor Trps1 [Mus musculus]
gi|148697295|gb|EDL29242.1| trichorhinophalangeal syndrome I (human), isoform CRA_b [Mus
musculus]
Length = 1281
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 892 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 922
>gi|258573903|ref|XP_002541133.1| predicted protein [Uncinocarpus reesii 1704]
gi|237901399|gb|EEP75800.1| predicted protein [Uncinocarpus reesii 1704]
Length = 573
Score = 52.0 bits (123), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 23/31 (74%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NCSTT T LWRR++ G P+CNACGL
Sbjct: 260 TACQNCSTTVTPLWRRDDQGHPICNACGLYF 290
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 24/40 (60%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G SC NC T T LWRR NG +CNACGL + + + P
Sbjct: 99 GHSCVNCGTKRTPLWRRAPNGSTICNACGLYLKARNADRP 138
>gi|426360548|ref|XP_004047502.1| PREDICTED: zinc finger transcription factor Trps1 isoform 1
[Gorilla gorilla gorilla]
Length = 1285
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 896 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 926
>gi|20140909|sp|Q9UHF7.2|TRPS1_HUMAN RecName: Full=Zinc finger transcription factor Trps1; AltName:
Full=Tricho-rhino-phalangeal syndrome type I protein;
AltName: Full=Zinc finger protein GC79
gi|10644122|gb|AAG21134.1| zinc finger protein GC79 [Homo sapiens]
Length = 1281
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 892 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 922
>gi|332831062|ref|XP_001137364.2| PREDICTED: zinc finger transcription factor Trps1 isoform 1 [Pan
troglodytes]
gi|410216932|gb|JAA05685.1| trichorhinophalangeal syndrome I [Pan troglodytes]
gi|410264746|gb|JAA20339.1| trichorhinophalangeal syndrome I [Pan troglodytes]
gi|410339009|gb|JAA38451.1| trichorhinophalangeal syndrome I [Pan troglodytes]
Length = 1294
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|297683502|ref|XP_002819416.1| PREDICTED: zinc finger transcription factor Trps1 isoform 2 [Pongo
abelii]
Length = 1294
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|90652851|ref|NP_054831.2| zinc finger transcription factor Trps1 [Homo sapiens]
gi|119612358|gb|EAW91952.1| trichorhinophalangeal syndrome I [Homo sapiens]
Length = 1294
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|441647973|ref|XP_004090846.1| PREDICTED: zinc finger transcription factor Trps1 isoform 3
[Nomascus leucogenys]
Length = 1281
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 892 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 922
>gi|410042135|ref|XP_003951383.1| PREDICTED: zinc finger transcription factor Trps1 [Pan troglodytes]
Length = 1281
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 892 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 922
>gi|410987671|ref|XP_004000119.1| PREDICTED: zinc finger transcription factor Trps1 [Felis catus]
Length = 1294
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|410042133|ref|XP_003951382.1| PREDICTED: zinc finger transcription factor Trps1 [Pan troglodytes]
Length = 1285
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 896 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 926
>gi|397505690|ref|XP_003823385.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger transcription factor
Trps1 [Pan paniscus]
Length = 1294
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|390476034|ref|XP_002759325.2| PREDICTED: zinc finger transcription factor Trps1 [Callithrix
jacchus]
Length = 1294
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|354471417|ref|XP_003497939.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger transcription factor
Trps1-like [Cricetulus griseus]
Length = 1252
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 863 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 893
>gi|332214140|ref|XP_003256186.1| PREDICTED: zinc finger transcription factor Trps1 isoform 1
[Nomascus leucogenys]
Length = 1294
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|297299992|ref|XP_001092834.2| PREDICTED: zinc finger transcription factor Trps1 [Macaca mulatta]
Length = 1285
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 896 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 926
>gi|194384368|dbj|BAG64957.1| unnamed protein product [Homo sapiens]
Length = 1285
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 896 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 926
>gi|29437200|gb|AAH49857.1| Trps1 protein [Mus musculus]
gi|148697294|gb|EDL29241.1| trichorhinophalangeal syndrome I (human), isoform CRA_a [Mus
musculus]
Length = 1035
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 646 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 676
>gi|441647970|ref|XP_004090845.1| PREDICTED: zinc finger transcription factor Trps1 isoform 2
[Nomascus leucogenys]
Length = 1285
Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 896 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 926
>gi|260941173|ref|XP_002614753.1| hypothetical protein CLUG_05531 [Clavispora lusitaniae ATCC 42720]
gi|238851939|gb|EEQ41403.1| hypothetical protein CLUG_05531 [Clavispora lusitaniae ATCC 42720]
Length = 603
Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats.
Identities = 20/31 (64%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
V C NC T T LWRRN GEP+CNACGL +
Sbjct: 386 VQCTNCHTRTTPLWRRNPQGEPLCNACGLFL 416
>gi|301767638|ref|XP_002919239.1| PREDICTED: zinc finger transcription factor Trps1-like [Ailuropoda
melanoleuca]
Length = 1294
Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|240280318|gb|EER43822.1| nitrogen regulatory protein areA [Ajellomyces capsulatus H143]
gi|325096614|gb|EGC49924.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 951
Score = 51.6 bits (122), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 27/39 (69%), Gaps = 2/39 (5%)
Query: 13 NKRSGV--SCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
N+ SGV +C NC T T LWRRN G+P+CNACGL +
Sbjct: 713 NEPSGVPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLK 751
>gi|432882419|ref|XP_004074021.1| PREDICTED: zinc finger transcription factor Trps1-like [Oryzias
latipes]
Length = 1521
Score = 51.6 bits (122), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 38/71 (53%), Gaps = 8/71 (11%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP--IESMISR---- 69
SGV CANC TT T+LWR+N NG VCNACGL P I+ E +I R
Sbjct: 959 SGVFCANCLTTKTSLWRKNANGGYVCNACGLYQKLHSTPRPLNIIKQNNGEQIIRRRTRK 1018
Query: 70 --NGDALSVIT 78
N D+LS T
Sbjct: 1019 RLNADSLSSET 1029
>gi|242809077|ref|XP_002485294.1| GATA transcriptional activator AreA [Talaromyces stipitatus ATCC
10500]
gi|218715919|gb|EED15341.1| GATA transcriptional activator AreA [Talaromyces stipitatus ATCC
10500]
Length = 863
Score = 51.6 bits (122), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 27/39 (69%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+ G++ + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 620 KPGDQGAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 658
>gi|149721616|ref|XP_001496092.1| PREDICTED: zinc finger transcription factor Trps1 [Equus caballus]
Length = 1294
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|225561125|gb|EEH09406.1| GATA transcriptional activator AreA [Ajellomyces capsulatus G186AR]
Length = 951
Score = 51.6 bits (122), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 27/39 (69%), Gaps = 2/39 (5%)
Query: 13 NKRSGV--SCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
N+ SGV +C NC T T LWRRN G+P+CNACGL +
Sbjct: 713 NEPSGVPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLK 751
>gi|359321027|ref|XP_539139.4| PREDICTED: zinc finger transcription factor Trps1 isoform 1 [Canis
lupus familiaris]
Length = 1294
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|13785803|gb|AAK39509.1|AF346837_1 atypical GATA protein TRPS1 [Xenopus laevis]
Length = 1018
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 629 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 659
>gi|327269364|ref|XP_003219464.1| PREDICTED: zinc finger transcription factor Trps1-like [Anolis
carolinensis]
Length = 1294
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|121713620|ref|XP_001274421.1| siderophore transcription factor SreA [Aspergillus clavatus NRRL 1]
gi|119402574|gb|EAW12995.1| siderophore transcription factor SreA [Aspergillus clavatus NRRL 1]
Length = 561
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 20/29 (68%), Positives = 23/29 (79%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGL 46
V+C NC TT T LWRR+ NG P+CNACGL
Sbjct: 259 VACQNCGTTVTPLWRRDENGHPICNACGL 287
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + ++ P K
Sbjct: 110 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTK 151
>gi|452847941|gb|EME49873.1| nitrogen response factor-like protein [Dothistroma septosporum
NZE10]
Length = 938
Score = 51.6 bits (122), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 6/49 (12%)
Query: 7 IREQTGNKRSG------VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+ ++G+ R G SC NC T T LWRRN G+P+CNACGL +
Sbjct: 674 VPSRSGSPRPGDNSGVPTSCTNCFTQTTPLWRRNPEGQPLCNACGLFLK 722
>gi|345779065|ref|XP_003431823.1| PREDICTED: zinc finger transcription factor Trps1 [Canis lupus
familiaris]
Length = 1285
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 896 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 926
>gi|344273361|ref|XP_003408491.1| PREDICTED: zinc finger transcription factor Trps1 [Loxodonta
africana]
Length = 1294
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|313219632|emb|CBY30553.1| unnamed protein product [Oikopleura dioica]
Length = 368
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 26/39 (66%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R NKR G C NC TT T+LWR++N + VCNACGL
Sbjct: 235 RRSVPNKREGTICDNCKTTETSLWRKSNEQKAVCNACGL 273
Score = 43.5 bits (101), Expect = 0.016, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 20/27 (74%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
CANC T T LWRR++ G +CNACGL
Sbjct: 193 CANCKATTTPLWRRDDAGNYLCNACGL 219
>gi|224046642|ref|XP_002200422.1| PREDICTED: zinc finger transcription factor Trps1 [Taeniopygia
guttata]
Length = 1294
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|126002036|ref|XP_001352245.1| GA18611 [Drosophila pseudoobscura pseudoobscura]
gi|54640243|gb|EAL29293.1| GA18611 [Drosophila pseudoobscura pseudoobscura]
Length = 803
Score = 51.6 bits (122), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 27/42 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +SC+NC T TT+WRR+ GE VCNACGL IN P
Sbjct: 651 QKDMSCSNCGTLTTTIWRRSARGEMVCNACGLYFKLHGINRP 692
>gi|344241451|gb|EGV97554.1| Zinc finger transcription factor Trps1 [Cricetulus griseus]
Length = 395
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 6 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 36
>gi|313232905|emb|CBY09588.1| unnamed protein product [Oikopleura dioica]
Length = 368
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 26/39 (66%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R NKR G C NC TT T+LWR++N + VCNACGL
Sbjct: 235 RRSVPNKREGTICDNCKTTETSLWRKSNEQKAVCNACGL 273
Score = 43.5 bits (101), Expect = 0.017, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 20/27 (74%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
CANC T T LWRR++ G +CNACGL
Sbjct: 193 CANCKATTTPLWRRDDAGNYLCNACGL 219
>gi|363731050|ref|XP_418402.3| PREDICTED: zinc finger transcription factor Trps1 [Gallus gallus]
Length = 1294
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|326918020|ref|XP_003205291.1| PREDICTED: zinc finger transcription factor Trps1-like [Meleagris
gallopavo]
Length = 1294
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 905 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 935
>gi|149410576|ref|XP_001507190.1| PREDICTED: zinc finger transcription factor Trps1 [Ornithorhynchus
anatinus]
Length = 1295
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 906 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 936
>gi|126322235|ref|XP_001369943.1| PREDICTED: zinc finger transcription factor Trps1 [Monodelphis
domestica]
Length = 1295
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 906 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 936
>gi|12082800|gb|AAG48616.1|AF312694_1 nitrogen response factor NRF1 [Passalora fulva]
Length = 918
Score = 51.6 bits (122), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/36 (61%), Positives = 25/36 (69%), Gaps = 2/36 (5%)
Query: 16 SGV--SCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
SGV SC NC T T LWRRN G+P+CNACGL +
Sbjct: 667 SGVPTSCTNCFTQTTPLWRRNPEGQPLCNACGLFLK 702
>gi|384489627|gb|EIE80849.1| hypothetical protein RO3G_05554 [Rhizopus delemar RA 99-880]
Length = 221
Score = 51.6 bits (122), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 27/37 (72%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
CANCST T LWRR+ +G+P+CNACGL +++ P
Sbjct: 174 CANCSTKTTPLWRRDESGQPICNACGLYFKLHNVHRP 210
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SC+NC TT T LWRR+ GE +CNACGL
Sbjct: 88 SCSNCGTTTTPLWRRSPLGETICNACGLYY 117
>gi|295666031|ref|XP_002793566.1| nitrogen regulatory protein areA [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226277860|gb|EEH33426.1| nitrogen regulatory protein areA [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 992
Score = 51.6 bits (122), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 27/43 (62%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G +QT + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 749 GGAKQTEQSGAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 791
>gi|410928426|ref|XP_003977601.1| PREDICTED: zinc finger transcription factor Trps1-like [Takifugu
rubripes]
Length = 1451
Score = 51.6 bits (122), Expect = 7e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 912 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 942
>gi|295789013|ref|NP_001171405.1| zinc finger transcription factor Trps1 [Danio rerio]
gi|290759652|gb|ADD54453.1| transcription factor Trps1 [Danio rerio]
Length = 1243
Score = 51.6 bits (122), Expect = 7e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 839 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 869
>gi|213402573|ref|XP_002172059.1| iron-sensing transcription factor 1 [Schizosaccharomyces japonicus
yFS275]
gi|212000106|gb|EEB05766.1| iron-sensing transcription factor 1 [Schizosaccharomyces japonicus
yFS275]
Length = 519
Score = 51.6 bits (122), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 26/41 (63%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G C NC TT T LWRR+ +G P+CNACGL ++ P
Sbjct: 176 AGTYCQNCGTTTTPLWRRDESGNPICNACGLYYKIHGVHRP 216
Score = 40.8 bits (94), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 20/37 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
CANC + T WRR + +CNACGL T D P
Sbjct: 12 CANCKSQITAQWRRGPDNSILCNACGLYFKTRDAPRP 48
>gi|23094291|emb|CAD45643.1| GATAa protein [Ciona intestinalis]
Length = 431
Score = 51.6 bits (122), Expect = 7e-05, Method: Composition-based stats.
Identities = 21/27 (77%), Positives = 24/27 (88%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
C+NCSTT TTLWRR+ +G PVCNACGL
Sbjct: 148 CSNCSTTVTTLWRRSPDGNPVCNACGL 174
>gi|395512307|ref|XP_003760382.1| PREDICTED: zinc finger transcription factor Trps1 [Sarcophilus
harrisii]
Length = 1293
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 904 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 934
>gi|384491758|gb|EIE82954.1| hypothetical protein RO3G_07659 [Rhizopus delemar RA 99-880]
Length = 383
Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 25/41 (60%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
S CANC TT T LWRR NGE +CNACGL + + P
Sbjct: 5 SATECANCGTTTTPLWRRGPNGETICNACGLYLKARNTLRP 45
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 25/42 (59%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + CANC TT T LWRR+ G +CNACGL ++ P
Sbjct: 92 RQALICANCRTTTTPLWRRDEAGNTICNACGLYYKLHHVHRP 133
>gi|258569170|ref|XP_002585329.1| nitrogen regulatory protein areA [Uncinocarpus reesii 1704]
gi|237906775|gb|EEP81176.1| nitrogen regulatory protein areA [Uncinocarpus reesii 1704]
Length = 1107
Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 23/31 (74%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+CANC T T LWRRN G+P+CNACGL +
Sbjct: 876 TCANCFTQTTPLWRRNPEGQPLCNACGLFLK 906
>gi|147903038|ref|NP_001083884.1| zinc finger transcription factor Trps1 [Xenopus laevis]
gi|20140637|sp|Q90ZS6.1|TRPS1_XENLA RecName: Full=Zinc finger transcription factor Trps1
gi|13785805|gb|AAK39510.1|AF346838_1 atypical GATA protein TRPS1 [Xenopus laevis]
Length = 1271
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 882 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 912
>gi|225684362|gb|EEH22646.1| GATA factor SREP [Paracoccidioides brasiliensis Pb03]
Length = 448
Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 44/88 (50%), Gaps = 18/88 (20%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI-----------MNTVDINAPEKIMP----- 61
++C NC TT T LWRR+ +G P+CNACGL M I ++++P
Sbjct: 288 IACQNCQTTVTPLWRRDEHGHPICNACGLYHKLHGSYRPPTMKKSIIKRRKRVVPAMREQ 347
Query: 62 -PIESMISRNGDALSVITSTPNIHHSDD 88
P + S NG A S TS N+ H+ D
Sbjct: 348 SPAGATQSSNGSA-SPETSPANLAHNYD 374
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G SC+NC T T LWRR+ G +CNACGL + + + P
Sbjct: 122 TGHSCSNCGTKQTPLWRRSPTGATICNACGLYLKARNADRP 162
>gi|213626352|gb|AAI70461.1| Unknown (protein for MGC:197188) [Xenopus laevis]
gi|213627665|gb|AAI70459.1| Unknown (protein for MGC:197186) [Xenopus laevis]
Length = 1271
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 882 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 912
>gi|407920283|gb|EKG13498.1| Zinc finger GATA-type protein [Macrophomina phaseolina MS6]
Length = 588
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 21/44 (47%), Positives = 30/44 (68%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
+G +C+NC TT T LWRR+ GEP+CNACGL + + + P +
Sbjct: 131 NGQTCSNCGTTRTPLWRRSPTGEPICNACGLYLKARNQSRPSNL 174
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 19/38 (50%), Positives = 24/38 (63%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+CANC TT T LWRR+ G +CNACGL ++ P
Sbjct: 298 ACANCRTTVTPLWRRDEAGHTICNACGLYYKLHGVHRP 335
>gi|301621431|ref|XP_002940055.1| PREDICTED: zinc finger transcription factor Trps1-like [Xenopus
(Silurana) tropicalis]
Length = 1268
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 879 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 909
>gi|402878984|ref|XP_003903137.1| PREDICTED: zinc finger transcription factor Trps1-like [Papio
anubis]
Length = 407
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 18 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 48
>gi|348513424|ref|XP_003444242.1| PREDICTED: zinc finger transcription factor Trps1 [Oreochromis
niloticus]
Length = 1508
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 934 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 964
>gi|226293998|gb|EEH49418.1| GATA factor SREP [Paracoccidioides brasiliensis Pb18]
Length = 583
Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 44/88 (50%), Gaps = 18/88 (20%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI-----------MNTVDINAPEKIMP----- 61
++C NC TT T LWRR+ +G P+CNACGL M I ++++P
Sbjct: 288 IACQNCQTTVTPLWRRDEHGHPICNACGLYHKLHGSYRPPTMKKSIIKRRKRVVPAMREQ 347
Query: 62 -PIESMISRNGDALSVITSTPNIHHSDD 88
P + S NG A S TS N+ H+ D
Sbjct: 348 SPAGATQSSNGSA-SPETSPANLAHNYD 374
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G SC+NC T T LWRR+ G +CNACGL + + + P
Sbjct: 122 TGHSCSNCGTKQTPLWRRSPTGATICNACGLYLKARNADRP 162
>gi|225683771|gb|EEH22055.1| nitrogen regulatory protein areA [Paracoccidioides brasiliensis
Pb03]
Length = 995
Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 27/43 (62%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G +QT + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 752 GGAKQTEQSGAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 794
>gi|320590477|gb|EFX02920.1| siderophore transcription factor [Grosmannia clavigera kw1407]
Length = 597
Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 26/39 (66%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NCSTT T LWRR+ NG +CNACGL ++ P
Sbjct: 281 IACQNCSTTITPLWRRDENGRTICNACGLYYKLHGVHRP 319
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 25/47 (53%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
SG C+NC T T LWRR+ G +CNACGL + P + P
Sbjct: 107 SGQVCSNCGITRTPLWRRSPQGAIICNACGLYQKARNTARPSYLKRP 153
>gi|226293134|gb|EEH48554.1| nitrogen regulatory protein areA [Paracoccidioides brasiliensis
Pb18]
Length = 993
Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 27/43 (62%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G +QT + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 750 GGAKQTEQSGAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 792
>gi|429862999|gb|ELA37584.1| siderophore regulation protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 592
Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 31/55 (56%)
Query: 2 ILTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
I I+ Q N ++C NCSTT T LWRR+ +G +CNACGL ++ P
Sbjct: 271 IAALQIQGQNPNTTVVIACQNCSTTVTPLWRRDESGHTICNACGLYYKLHGVHRP 325
>gi|254573912|ref|XP_002494065.1| Transcriptional activator of genes involved in nitrogen catabolite
repression [Komagataella pastoris GS115]
gi|238033864|emb|CAY71886.1| Transcriptional activator of genes involved in nitrogen catabolite
repression [Komagataella pastoris GS115]
gi|328354116|emb|CCA40513.1| Nitrogen catabolic enzyme regulatory protein [Komagataella pastoris
CBS 7435]
Length = 615
Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 25/31 (80%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+SC+NC T T LWRR+ +G+P+CNACGL +
Sbjct: 484 ISCSNCKTQTTPLWRRDPSGKPLCNACGLFL 514
>gi|403160140|ref|XP_003320695.2| hypothetical protein PGTG_02717 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375169428|gb|EFP76276.2| hypothetical protein PGTG_02717 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1276
Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 19/32 (59%), Positives = 25/32 (78%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+ C NC TT T LWRR++ G+P+CNACGL +N
Sbjct: 1169 IQCLNCHTTETPLWRRDSEGKPLCNACGLFVN 1200
>gi|351697322|gb|EHB00241.1| Zinc finger transcription factor Trps1 [Heterocephalus glaber]
Length = 465
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 76 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 106
>gi|444323739|ref|XP_004182510.1| hypothetical protein TBLA_0I03360 [Tetrapisispora blattae CBS 6284]
gi|387515557|emb|CCH62991.1| hypothetical protein TBLA_0I03360 [Tetrapisispora blattae CBS 6284]
Length = 1061
Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 28/44 (63%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
N++ + C NC TT T LWRR+N+G +CNACGL I+ P
Sbjct: 416 NRKQQIVCGNCKTTKTPLWRRDNDGNTLCNACGLFQKLHGISRP 459
>gi|350582965|ref|XP_003481402.1| PREDICTED: zinc finger transcription factor Trps1-like [Sus scrofa]
Length = 602
Score = 51.2 bits (121), Expect = 8e-05, Method: Composition-based stats.
Identities = 23/31 (74%), Positives = 25/31 (80%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 213 SGVFCANCLTTKTSLWRKNANGGYVCNACGL 243
>gi|47229404|emb|CAF99392.1| unnamed protein product [Tetraodon nigroviridis]
Length = 287
Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 25/34 (73%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
SGV CANC TT T+LWR+N NG VCNACGL
Sbjct: 6 SGVFCANCLTTKTSLWRKNANGGYVCNACGLYQK 39
>gi|384500898|gb|EIE91389.1| global nitrogen regulator protein [Rhizopus delemar RA 99-880]
Length = 272
Score = 51.2 bits (121), Expect = 9e-05, Method: Composition-based stats.
Identities = 21/34 (61%), Positives = 25/34 (73%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
+K + C+NC TT T LWRRN GEP+CNACGL
Sbjct: 150 HKINSTQCSNCHTTNTPLWRRNPEGEPLCNACGL 183
>gi|384495933|gb|EIE86424.1| global nitrogen regulator protein [Rhizopus delemar RA 99-880]
Length = 617
Score = 51.2 bits (121), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 20/33 (60%), Positives = 22/33 (66%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
S C NC TT T LWRRN G P+CNACGL +
Sbjct: 463 STTKCTNCGTTTTPLWRRNPEGHPLCNACGLFL 495
>gi|443712897|gb|ELU05981.1| hypothetical protein CAPTEDRAFT_130016, partial [Capitella
teleta]
Length = 79
Score = 50.8 bits (120), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
R +CANC T TT+WRRN +GEPVCNACGL + + +PP
Sbjct: 36 RETHNCANCGTQQTTMWRRNTDGEPVCNACGLYFKLHKV----RCLPP 79
Score = 35.0 bits (79), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 14/21 (66%), Positives = 16/21 (76%)
Query: 28 TTLWRRNNNGEPVCNACGLIM 48
TT WRRN GEP+CNACG +
Sbjct: 1 TTTWRRNLTGEPLCNACGCYL 21
>gi|2494687|sp|Q01582.1|AREA_PENCH RecName: Full=Nitrogen regulatory protein areA; Short=Nitrogen
regulator nre
gi|437210|gb|AAA83400.1| NRE [Penicillium chrysogenum]
Length = 725
Score = 50.8 bits (120), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
G++ + N +C NC T T LWRRN G+P+CNACGL +
Sbjct: 511 GLKNGSTNAGPEPACTNCFTQTTPLWRRNPEGQPLCNACGLFLK 554
>gi|189206315|ref|XP_001939492.1| GATA factor SREP [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187975585|gb|EDU42211.1| GATA factor SREP [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 443
Score = 50.8 bits (120), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 28/41 (68%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G C+NC TT T LWRR+ +GE VCNACGL M + + P
Sbjct: 75 TGQVCSNCKTTQTPLWRRSPSGETVCNACGLYMKARNQSRP 115
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+C NC TT T LWRR++ G +CNACGL
Sbjct: 209 ACQNCGTTITPLWRRDDAGHIICNACGLYYK 239
>gi|2133281|pir||S51493 major nitrogen regulation protein - Penicillium chrysogenum
(strain Q176)
Length = 862
Score = 50.8 bits (120), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
G++ + N +C NC T T LWRRN G+P+CNACGL +
Sbjct: 648 GLKNGSTNAGPEPACTNCFTQTTPLWRRNPEGQPLCNACGLFLK 691
>gi|425771811|gb|EKV10244.1| GATA factor SREP [Penicillium digitatum Pd1]
gi|425777158|gb|EKV15342.1| GATA factor SREP [Penicillium digitatum PHI26]
Length = 528
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 19/32 (59%), Positives = 22/32 (68%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
V+C NC TT T LWRR+ G P+CNACGL
Sbjct: 229 VACQNCGTTVTPLWRRDEQGHPICNACGLYYK 260
Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEK 58
G SC+NC T T LWRR+ G +CNACGL + ++ P K
Sbjct: 84 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTK 125
>gi|323319537|gb|ADX36139.1| GATA123a [Schmidtea polychroa]
Length = 101
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 29/49 (59%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + ++ G +C NC T TTLWRRN G+ VCNACGL I+ P
Sbjct: 50 RRLSSARKVGTTCINCGTNHTTLWRRNQQGDSVCNACGLYYKLHHISRP 98
Score = 42.0 bits (97), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 26/52 (50%), Gaps = 2/52 (3%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
G C NC T T LWRR+ G +CNACGL N P ++ P + S
Sbjct: 5 GRECVNCGATQTPLWRRDGTGHYLCNACGLYHKMNGTNRP--LIKPKRRLSS 54
>gi|195146794|ref|XP_002014369.1| GL19159 [Drosophila persimilis]
gi|194106322|gb|EDW28365.1| GL19159 [Drosophila persimilis]
Length = 803
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 27/42 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +SC+NC T TT+WRR+ GE VCNACGL IN P
Sbjct: 651 QKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGINRP 692
>gi|330921388|ref|XP_003299406.1| hypothetical protein PTT_10383 [Pyrenophora teres f. teres 0-1]
gi|311326936|gb|EFQ92491.1| hypothetical protein PTT_10383 [Pyrenophora teres f. teres 0-1]
Length = 465
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 28/41 (68%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G C+NC TT T LWRR+ +GE VCNACGL M + + P
Sbjct: 78 TGQVCSNCKTTQTPLWRRSPSGETVCNACGLYMKARNQSRP 118
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+C NC TT T LWRR++ G +CNACGL
Sbjct: 231 ACQNCGTTITPLWRRDDAGHIICNACGLYYK 261
>gi|448121454|ref|XP_004204211.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
gi|358349750|emb|CCE73029.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
Length = 697
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++C NC T T LWRRN G+P+CNACGL +
Sbjct: 528 IACTNCGTRTTPLWRRNPQGQPLCNACGLFL 558
>gi|346325730|gb|EGX95327.1| Nitrogen regulatory protein areA [Cordyceps militaris CM01]
Length = 857
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 7 IREQTGNKRSG---VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++ GN+ G +C NC T T LWRRN G+P+CNACGL +
Sbjct: 598 LQAAAGNQTDGGNPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 642
>gi|50551201|ref|XP_503074.1| YALI0D20482p [Yarrowia lipolytica]
gi|49648942|emb|CAG81266.1| YALI0D20482p [Yarrowia lipolytica CLIB122]
Length = 238
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 20/31 (64%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SC NC T T LWRRN GEP+CNACGL +
Sbjct: 85 TSCTNCHTQTTPLWRRNPEGEPLCNACGLFL 115
>gi|146420517|ref|XP_001486214.1| hypothetical protein PGUG_01885 [Meyerozyma guilliermondii ATCC
6260]
Length = 568
Score = 50.8 bits (120), Expect = 1e-04, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 34/66 (51%), Gaps = 1/66 (1%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRN-GDALSV 76
VSC NC T T LWRRN G+P+CNACGL + + P + + R G
Sbjct: 416 VSCTNCHTRTTPLWRRNPQGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRQRGLGSTTKK 475
Query: 77 ITSTPN 82
TSTP+
Sbjct: 476 PTSTPS 481
>gi|194859800|ref|XP_001969453.1| GG10108 [Drosophila erecta]
gi|190661320|gb|EDV58512.1| GG10108 [Drosophila erecta]
Length = 838
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 27/42 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +SC+NC T TT+WRR+ GE VCNACGL IN P
Sbjct: 685 QKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGINRP 726
>gi|406602734|emb|CCH45692.1| Zinc finger protein [Wickerhamomyces ciferrii]
Length = 487
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 29/43 (67%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
++C NC TT T LWRR++NG+ +CNACGL ++ P K+
Sbjct: 177 AIACTNCGTTVTPLWRRDDNGDTICNACGLYYKLHGLHRPIKM 219
Score = 43.9 bits (102), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 21/29 (72%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C+NCSTT T LWRR +G +CNACGL
Sbjct: 31 CSNCSTTKTPLWRRAPDGSLICNACGLYY 59
>gi|156049633|ref|XP_001590783.1| hypothetical protein SS1G_08523 [Sclerotinia sclerotiorum 1980]
gi|154692922|gb|EDN92660.1| hypothetical protein SS1G_08523 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 749
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM-PPIESMISRNGDA 73
++G C+NC TT T LWRR+ G +CNACGL + + P + PP + I GD+
Sbjct: 111 QTGQICSNCGTTRTPLWRRSPQGATICNACGLYQKARNASRPTNLKRPPTTAPIGHQGDS 170
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
QT N V+C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 265 QTQNTTVVVACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 311
>gi|302419453|ref|XP_003007557.1| nitrogen regulatory protein areA [Verticillium albo-atrum VaMs.102]
gi|261353208|gb|EEY15636.1| nitrogen regulatory protein areA [Verticillium albo-atrum VaMs.102]
Length = 983
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 25/37 (67%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G+ + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 738 GDSSAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 774
>gi|303312759|ref|XP_003066391.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
gi|240106053|gb|EER24246.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
gi|320032224|gb|EFW14179.1| GATA factor SREP [Coccidioides posadasii str. Silveira]
Length = 584
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 26/38 (68%), Gaps = 2/38 (5%)
Query: 13 NKRSGV--SCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
N+ GV +C NC TT T LWRR++ G P+CNACGL
Sbjct: 259 NQEGGVPTACQNCGTTVTPLWRRDDQGHPICNACGLYF 296
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 24/40 (60%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G SC NC T T LWRR NG +CNACGL + + + P
Sbjct: 106 GHSCVNCGTKRTPLWRRAPNGSTICNACGLYLKARNADRP 145
>gi|429854610|gb|ELA29612.1| nitrogen response regulator [Colletotrichum gloeosporioides Nara
gc5]
Length = 912
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 7 IREQTGNKRSG---VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++ GN+ G +C NC T T LWRRN G+P+CNACGL +
Sbjct: 649 LQAAAGNQGEGNTPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 693
>gi|346976335|gb|EGY19787.1| nitrogen regulatory protein areA [Verticillium dahliae VdLs.17]
Length = 982
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 25/37 (67%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G+ + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 699 GDSSAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 735
>gi|25136582|gb|AAN65464.1| major nitrogen regulatory protein [Glomerella lindemuthiana]
Length = 971
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 690 GESSTPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 726
>gi|452987950|gb|EME87705.1| GLN3p transcription factor [Pseudocercospora fijiensis CIRAD86]
Length = 876
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 25/40 (62%), Gaps = 2/40 (5%)
Query: 12 GNKRSGV--SCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
G SGV +C NC T T LWRRN G P+CNACGL +
Sbjct: 620 GGDNSGVPTTCTNCFTQTTPLWRRNPEGHPLCNACGLFLK 659
>gi|388855580|emb|CCF50803.1| related to transcription factor ScGATA-6 (N-terminal fragment),
partial [Ustilago hordei]
Length = 1352
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 23/29 (79%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C+NC TT T LWRR+ G+P+CNACGL +
Sbjct: 1204 CSNCHTTKTPLWRRDPEGQPLCNACGLFL 1232
>gi|194761760|ref|XP_001963094.1| GF15767 [Drosophila ananassae]
gi|190616791|gb|EDV32315.1| GF15767 [Drosophila ananassae]
Length = 810
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +SC+NC T TT+WRR+ GE VCNACGL +N P
Sbjct: 662 QKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 703
>gi|295670209|ref|XP_002795652.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226284737|gb|EEH40303.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 650
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 33/88 (37%), Positives = 44/88 (50%), Gaps = 18/88 (20%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGL-----------IMNTVDINAPEKIMP----- 61
++C NC TT T LWRR+ +G P+CNACGL M I ++++P
Sbjct: 352 IACQNCQTTVTPLWRRDEHGHPICNACGLYHKLHGSYRPPTMKKSIIKRRKRVVPAMREQ 411
Query: 62 -PIESMISRNGDALSVITSTPNIHHSDD 88
P + S NG A S TS NI H+ D
Sbjct: 412 SPAGATQSSNGSA-SPETSPANIAHNYD 438
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G SC+NC T T LWRR+ G +CNACGL + + + P
Sbjct: 186 TGHSCSNCGTKQTPLWRRSPTGATICNACGLYLKARNADRP 226
>gi|440492336|gb|ELQ74911.1| GATA-4/5/6 transcription factor [Trachipleistophora hominis]
Length = 463
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 20/43 (46%), Positives = 24/43 (55%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM 60
+ C NC TT T+ WRR NG P+CN+CGL EK M
Sbjct: 189 IKCGNCGTTETSFWRRGINGHPLCNSCGLYFRDNTYKEKEKEM 231
Score = 40.8 bits (94), Expect = 0.10, Method: Composition-based stats.
Identities = 16/28 (57%), Positives = 21/28 (75%), Gaps = 1/28 (3%)
Query: 20 CANCSTTCTTLWRRNNNGEP-VCNACGL 46
C+NC TT T WRR+ +G+ +CNACGL
Sbjct: 133 CSNCVTTTTPSWRRSTDGKKLLCNACGL 160
>gi|429965956|gb|ELA47953.1| hypothetical protein VCUG_00536 [Vavraia culicis 'floridensis']
Length = 463
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 20/43 (46%), Positives = 24/43 (55%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM 60
+ C NC TT T+ WRR NG P+CN+CGL EK M
Sbjct: 189 IKCGNCGTTETSFWRRGINGHPLCNSCGLYFRDNTYKEKEKEM 231
Score = 40.8 bits (94), Expect = 0.10, Method: Composition-based stats.
Identities = 16/28 (57%), Positives = 21/28 (75%), Gaps = 1/28 (3%)
Query: 20 CANCSTTCTTLWRRNNNGEP-VCNACGL 46
C+NC TT T WRR+ +G+ +CNACGL
Sbjct: 133 CSNCVTTTTPSWRRSTDGKKLLCNACGL 160
>gi|24583358|ref|NP_609383.2| GATAd, isoform A [Drosophila melanogaster]
gi|442627216|ref|NP_001260326.1| GATAd, isoform B [Drosophila melanogaster]
gi|442627218|ref|NP_001260327.1| GATAd, isoform C [Drosophila melanogaster]
gi|22946136|gb|AAF52916.2| GATAd, isoform A [Drosophila melanogaster]
gi|379699074|gb|AFD10759.1| FI19405p1 [Drosophila melanogaster]
gi|440213644|gb|AGB92861.1| GATAd, isoform B [Drosophila melanogaster]
gi|440213645|gb|AGB92862.1| GATAd, isoform C [Drosophila melanogaster]
Length = 842
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
+ +SC+NC T TT+WRR+ GE VCNACGL +N P +
Sbjct: 687 QKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRPHSM 731
>gi|195578139|ref|XP_002078923.1| GD23678 [Drosophila simulans]
gi|194190932|gb|EDX04508.1| GD23678 [Drosophila simulans]
Length = 844
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
+ +SC+NC T TT+WRR+ GE VCNACGL +N P +
Sbjct: 691 QKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRPHSM 735
>gi|115386818|ref|XP_001209950.1| nitrogen regulatory protein areA [Aspergillus terreus NIH2624]
gi|114190948|gb|EAU32648.1| nitrogen regulatory protein areA [Aspergillus terreus NIH2624]
Length = 817
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 20/43 (46%), Positives = 26/43 (60%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G + G+ +C NC T T LWRRN G+P+CNACGL +
Sbjct: 602 GTKNADGSSSGPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 644
>gi|384499201|gb|EIE89692.1| hypothetical protein RO3G_14403 [Rhizopus delemar RA 99-880]
Length = 281
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 24/36 (66%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
N+ SG C+NC T T LWRRN G P+CNACGL
Sbjct: 168 NRPSGTVCSNCGTNKTPLWRRNAEGSPLCNACGLYY 203
Score = 40.4 bits (93), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 21/29 (72%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C+NC + T LWRR+ N + +CNACGL +
Sbjct: 124 CSNCGASSTPLWRRSANAQILCNACGLYV 152
>gi|195339757|ref|XP_002036483.1| GM18170 [Drosophila sechellia]
gi|194130363|gb|EDW52406.1| GM18170 [Drosophila sechellia]
Length = 844
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
+ +SC+NC T TT+WRR+ GE VCNACGL +N P +
Sbjct: 691 QKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRPHSM 735
>gi|195473609|ref|XP_002089085.1| GE18923 [Drosophila yakuba]
gi|194175186|gb|EDW88797.1| GE18923 [Drosophila yakuba]
Length = 838
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +SC+NC T TT+WRR+ GE VCNACGL +N P
Sbjct: 685 QKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 726
>gi|195397608|ref|XP_002057420.1| GJ18119 [Drosophila virilis]
gi|194141074|gb|EDW57493.1| GJ18119 [Drosophila virilis]
Length = 860
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +SC+NC T TT+WRR+ GE VCNACGL +N P
Sbjct: 693 QKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 734
>gi|12641865|dbj|BAB21556.1| GATA factor d [Drosophila melanogaster]
Length = 352
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +SC+NC T TT+WRR+ GE VCNACGL +N P
Sbjct: 197 QKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 238
>gi|195119257|ref|XP_002004148.1| GI18290 [Drosophila mojavensis]
gi|193914723|gb|EDW13590.1| GI18290 [Drosophila mojavensis]
Length = 469
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +SC+NC T TT+WRR+ GE VCNACGL +N P
Sbjct: 403 QKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 444
>gi|451993853|gb|EMD86325.1| hypothetical protein COCHEDRAFT_109473 [Cochliobolus heterostrophus
C5]
Length = 520
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 27/41 (65%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G C+NC TT T LWRR+ GE VCNACGL M + + P
Sbjct: 127 TGQVCSNCKTTQTPLWRRSPAGETVCNACGLYMKARNQSRP 167
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+C NC TT T LWRR++ G +CNACGL
Sbjct: 286 ACQNCGTTITPLWRRDDAGHIICNACGLYYK 316
>gi|336463275|gb|EGO51515.1| Nitrogen catabolic enzyme regulatory protein [Neurospora
tetrasperma FGSC 2508]
gi|350297522|gb|EGZ78499.1| nitrogen catabolic enzyme regulatory protein [Neurospora
tetrasperma FGSC 2509]
Length = 1033
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN +G+P+CNACGL +
Sbjct: 738 TTCTNCFTQTTPLWRRNPDGQPLCNACGLFL 768
>gi|119467139|ref|XP_001257376.1| GATA transcriptional activator AreA [Neosartorya fischeri NRRL 181]
gi|119405528|gb|EAW15479.1| GATA transcriptional activator AreA [Neosartorya fischeri NRRL 181]
Length = 882
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 27/38 (71%), Gaps = 1/38 (2%)
Query: 12 GNKRSG-VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G++ +G +C NC T T LWRRN G+P+CNACGL +
Sbjct: 661 GDQSNGPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 698
>gi|296424766|ref|XP_002841917.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295638169|emb|CAZ86108.1| unnamed protein product [Tuber melanosporum]
Length = 658
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 32/52 (61%), Gaps = 3/52 (5%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
V+C NC TT T LWRR+ +G +CNACGL ++ PE + +S+I R
Sbjct: 351 VACQNCGTTITPLWRRDESGHTICNACGLYHKLHGVHRPEAMK---KSVIKR 399
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 1/52 (1%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM-PPIESMI 67
G C+NC TT T LWRR +G+ +CNACGL + + + P + PP S I
Sbjct: 201 GQVCSNCRTTRTPLWRRAPDGQTICNACGLYLKARNQSRPTNLKRPPHGSTI 252
>gi|380489847|emb|CCF36425.1| GATA zinc finger [Colletotrichum higginsianum]
Length = 591
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 31/55 (56%)
Query: 2 ILTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
I I+ Q N+ ++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 271 IAALQIQGQNPNQTVVIACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 325
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 28/48 (58%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
++G C+NC TT T LWRR+ G +CNACGL + + P + P
Sbjct: 117 QTGQVCSNCGTTRTPLWRRSPQGTTICNACGLYQKARNASRPTNLKKP 164
>gi|336264503|ref|XP_003347028.1| hypothetical protein SMAC_05227 [Sordaria macrospora k-hell]
gi|380093121|emb|CCC09358.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 936
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN +G+P+CNACGL +
Sbjct: 681 TTCTNCFTQTTPLWRRNPDGQPLCNACGLFL 711
>gi|121704154|ref|XP_001270341.1| GATA transcriptional activator AreA [Aspergillus clavatus NRRL 1]
gi|119398485|gb|EAW08915.1| GATA transcriptional activator AreA [Aspergillus clavatus NRRL 1]
Length = 879
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 10 QTGNKRSG-VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+ G + +G +C NC T T LWRRN G+P+CNACGL +
Sbjct: 658 KNGEQNTGPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 697
>gi|344234670|gb|EGV66538.1| hypothetical protein CANTEDRAFT_91703 [Candida tenuis ATCC 10573]
Length = 499
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 27/40 (67%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G SC+NC T T LWRR+ NG +CNACGL + + + N P
Sbjct: 89 GQSCSNCGTKTTPLWRRSTNGTLICNACGLYLRSNNTNRP 128
Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++C NC+TT T LWRR++ G +CNACGL
Sbjct: 195 AIACFNCNTTITPLWRRDDGGNTICNACGLYY 226
>gi|408389698|gb|EKJ69132.1| hypothetical protein FPSE_10693 [Fusarium pseudograminearum CS3096]
Length = 949
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 3/45 (6%)
Query: 7 IREQTGNKRSG---VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++ GN G +C NC T T LWRRN G+P+CNACGL +
Sbjct: 655 LQAAAGNGSDGNAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 699
>gi|255729218|ref|XP_002549534.1| hypothetical protein CTRG_03831 [Candida tropicalis MYA-3404]
gi|240132603|gb|EER32160.1| hypothetical protein CTRG_03831 [Candida tropicalis MYA-3404]
Length = 589
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 21/29 (72%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC T T LWRRN G+P+CNACGL +
Sbjct: 474 CTNCGTKTTPLWRRNPQGQPLCNACGLFL 502
>gi|195050653|ref|XP_001992938.1| GH13551 [Drosophila grimshawi]
gi|193899997|gb|EDV98863.1| GH13551 [Drosophila grimshawi]
Length = 911
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +SC+NC T TT+WRR+ GE VCNACGL +N P
Sbjct: 742 QKDMSCSNCGTLTTTIWRRSARGEMVCNACGLYYKLHGVNRP 783
>gi|46128513|ref|XP_388810.1| AREA_GIBFU Nitrogen regulatory protein areA [Gibberella zeae PH-1]
Length = 964
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 3/45 (6%)
Query: 7 IREQTGNKRSG---VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++ GN G +C NC T T LWRRN G+P+CNACGL +
Sbjct: 670 LQAAAGNGSDGNAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 714
>gi|63054447|ref|NP_588278.2| transcription factor Gaf1 [Schizosaccharomyces pombe 972h-]
gi|12644402|sp|Q10280.2|GAF1_SCHPO RecName: Full=Transcription factor gaf1; Short=Gaf-1
gi|157310519|emb|CAB40003.2| transcription factor Gaf1 [Schizosaccharomyces pombe]
Length = 855
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 22/29 (75%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC T T LWRR+ +G+P+CNACGL M
Sbjct: 635 CTNCQTRTTPLWRRSPDGQPLCNACGLFM 663
>gi|159122664|gb|EDP47785.1| GATA transcriptional activator AreA [Aspergillus fumigatus A1163]
Length = 881
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 27/38 (71%), Gaps = 1/38 (2%)
Query: 12 GNKRSG-VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G++ +G +C NC T T LWRRN G+P+CNACGL +
Sbjct: 662 GDQNNGPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 699
>gi|70984747|ref|XP_747880.1| GATA transcriptional activator AreA [Aspergillus fumigatus Af293]
gi|66845507|gb|EAL85842.1| GATA transcriptional activator AreA [Aspergillus fumigatus Af293]
Length = 881
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 27/38 (71%), Gaps = 1/38 (2%)
Query: 12 GNKRSG-VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G++ +G +C NC T T LWRRN G+P+CNACGL +
Sbjct: 662 GDQNNGPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 699
>gi|426197491|gb|EKV47418.1| hypothetical protein AGABI2DRAFT_221510 [Agaricus bisporus var.
bisporus H97]
Length = 894
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 28/48 (58%), Gaps = 10/48 (20%)
Query: 11 TGNKRSGVS----------CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+G +RSG S C NC TT T LWRR+ G+P+CNACGL
Sbjct: 744 SGAQRSGKSGEDGDPLPTLCTNCQTTNTPLWRRDPEGQPLCNACGLFY 791
>gi|310791188|gb|EFQ26717.1| GATA zinc finger [Glomerella graminicola M1.001]
Length = 956
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 671 GEGNAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 707
>gi|5902804|sp|O13508.2|AREA_PENRO RecName: Full=Nitrogen regulatory protein areA; AltName:
Full=Nitrogen regulator nmc
gi|3702623|emb|CAA04815.1| nitrogen regulator [Penicillium roqueforti]
Length = 860
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 658 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 688
>gi|384499134|gb|EIE89625.1| global nitrogen regulator protein [Rhizopus delemar RA 99-880]
Length = 512
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 23/38 (60%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
Q + C+NC TT T LWRRN G P+CNACGL
Sbjct: 245 QQPQDNNKTQCSNCQTTTTPLWRRNPQGLPLCNACGLF 282
>gi|321251473|ref|XP_003192077.1| transcriptional activator [Cryptococcus gattii WM276]
gi|317458545|gb|ADV20290.1| Transcriptional activator, putative [Cryptococcus gattii WM276]
Length = 1290
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 24/34 (70%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
S + C NC TT T LWRR+ +G+P+CNACGL
Sbjct: 1178 ESPIMCTNCQTTNTPLWRRDPDGQPLCNACGLFY 1211
>gi|119192532|ref|XP_001246872.1| hypothetical protein CIMG_00643 [Coccidioides immitis RS]
gi|392863886|gb|EAS35338.2| SreP protein [Coccidioides immitis RS]
Length = 584
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC TT T LWRR++ G P+CNACGL
Sbjct: 266 TACQNCGTTVTPLWRRDDQGHPICNACGLYF 296
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 24/40 (60%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G SC NC T T LWRR NG +CNACGL + + + P
Sbjct: 106 GHSCVNCGTKRTPLWRRAPNGSTICNACGLYLKARNADRP 145
>gi|425771786|gb|EKV10222.1| GATA transcriptional activator AreA [Penicillium digitatum Pd1]
gi|425777069|gb|EKV15259.1| GATA transcriptional activator AreA [Penicillium digitatum PHI26]
Length = 873
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 671 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFLK 702
>gi|378730997|gb|EHY57456.1| GATA-binding protein [Exophiala dermatitidis NIH/UT8656]
Length = 943
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 694 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFLK 725
>gi|451856759|gb|EMD70050.1| hypothetical protein COCSADRAFT_216437 [Cochliobolus sativus
ND90Pr]
Length = 507
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 27/41 (65%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G C+NC TT T LWRR+ GE VCNACGL M + + P
Sbjct: 115 TGQICSNCKTTQTPLWRRSPAGETVCNACGLYMKARNQSRP 155
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 31/58 (53%), Gaps = 11/58 (18%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLI-----------MNTVDINAPEKIMPPIES 65
+C NC TT T LWRR++ G +CNACGL M +I ++I+P E+
Sbjct: 273 ACQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTHRPVAMKKQEIKRRKRIVPAAET 330
>gi|392570339|gb|EIW63512.1| hypothetical protein TRAVEDRAFT_62981 [Trametes versicolor FP-101664
SS1]
Length = 2626
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 19/28 (67%), Positives = 22/28 (78%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGL 46
SC NC TT T LWRR+ G+P+CNACGL
Sbjct: 993 SCTNCQTTITPLWRRDPEGQPLCNACGL 1020
>gi|238878432|gb|EEQ42070.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 470
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
Q+G + + C NC T T LWR+ NNG+ +CNACGL
Sbjct: 244 QSGQPKKVLQCTNCQTRTTPLWRKANNGDLLCNACGLFY 282
>gi|358385091|gb|EHK22688.1| hypothetical protein TRIVIDRAFT_113059, partial [Trichoderma virens
Gv29-8]
Length = 950
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 677 GESSTPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 713
>gi|2494688|sp|Q92269.1|NRFA_PENUR RecName: Full=Nitrogen regulatory protein nrfA
gi|1654230|gb|AAB17740.1| nitrogen regulatory factor NRFA [Penicillium urticae]
Length = 865
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 663 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 693
>gi|443924595|gb|ELU43592.1| Opi1 domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 1201
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 37/70 (52%), Gaps = 3/70 (4%)
Query: 1 MILTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM 60
+ T G+ ++ +SC NC T+ T LWRR++ G +CNACGL + PE +
Sbjct: 106 QVRTPGLSDRPSGAPGALSCTNCGTSTTPLWRRDDAGNNICNACGLYHKLHGTHRPEAMK 165
Query: 61 PPIESMISRN 70
+S+I R
Sbjct: 166 ---KSVIKRR 172
>gi|449548468|gb|EMD39435.1| URBS1-like transcription factor [Ceriporiopsis subvermispora B]
Length = 728
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 28/39 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++CANC T+ T LWRR+ +G+ VCNACGL + ++ P
Sbjct: 96 LACANCGTSSTPLWRRDADGKSVCNACGLYLKNRNMARP 134
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 23/32 (71%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+SCANC T+ T LWRR++ G +CNACGL
Sbjct: 297 LSCANCGTSTTPLWRRDDVGNNICNACGLYFK 328
>gi|384484811|gb|EIE76991.1| hypothetical protein RO3G_01695 [Rhizopus delemar RA 99-880]
Length = 326
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 19/27 (70%), Positives = 21/27 (77%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
CANC T T LWR+N GEP+CNACGL
Sbjct: 230 CANCHQTQTPLWRKNERGEPLCNACGL 256
>gi|37547258|gb|AAP30890.1| AreA protein [Phaeosphaeria nodorum]
Length = 851
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 24/39 (61%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+ G + +C NC T T LWRRN G P+CNACGL +
Sbjct: 621 KQGEGSAPTTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 659
>gi|261189233|ref|XP_002621028.1| GATA transcriptional activator AreA [Ajellomyces dermatitidis
SLH14081]
gi|239591813|gb|EEQ74394.1| GATA transcriptional activator AreA [Ajellomyces dermatitidis
SLH14081]
Length = 970
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 739 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 769
>gi|169608630|ref|XP_001797734.1| hypothetical protein SNOG_07400 [Phaeosphaeria nodorum SN15]
gi|111063746|gb|EAT84866.1| hypothetical protein SNOG_07400 [Phaeosphaeria nodorum SN15]
Length = 854
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 24/39 (61%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+ G + +C NC T T LWRRN G P+CNACGL +
Sbjct: 624 KQGEGSAPTTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 662
>gi|195457472|ref|XP_002075579.1| GK18561 [Drosophila willistoni]
gi|194171664|gb|EDW86565.1| GK18561 [Drosophila willistoni]
Length = 858
Score = 49.7 bits (117), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ +SC+NC T TT+WRR+ GE VCNACGL +N P
Sbjct: 694 QKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 735
>gi|384494818|gb|EIE85309.1| hypothetical protein RO3G_10019 [Rhizopus delemar RA 99-880]
Length = 91
Score = 49.7 bits (117), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 40/82 (48%), Gaps = 18/82 (21%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPI-------ESMIS 68
S C+NC TT T LWRR NG+ +CNACGL + A + PP +S+
Sbjct: 15 SATVCSNCGTTTTPLWRRAPNGDIICNACGLYL-----KARHTLRPPYRGHAQGTDSVTG 69
Query: 69 RNGDAL------SVITSTPNIH 84
R L S+ T +PNIH
Sbjct: 70 REDQRLAKAARPSIKTRSPNIH 91
>gi|340374204|ref|XP_003385628.1| PREDICTED: hypothetical protein LOC100636783 [Amphimedon
queenslandica]
Length = 611
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 25/40 (62%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
C+NC TT T LWRR NG+PVCN CGL + P+ +
Sbjct: 349 CSNCGTTKTVLWRRLENGDPVCNPCGLYYKLNGVARPKSL 388
Score = 41.2 bits (95), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC ++ T LWRRN+ G +CNACGL
Sbjct: 295 CNNCGSSNTPLWRRNSEGHYLCNACGLYY 323
>gi|327354121|gb|EGE82978.1| GATA transcriptional activator AreA [Ajellomyces dermatitidis ATCC
18188]
Length = 976
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 745 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 775
>gi|239614730|gb|EEQ91717.1| GATA transcriptional activator AreA [Ajellomyces dermatitidis ER-3]
Length = 970
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 739 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 769
>gi|50552360|ref|XP_503590.1| YALI0E05555p [Yarrowia lipolytica]
gi|49649459|emb|CAG79171.1| YALI0E05555p [Yarrowia lipolytica CLIB122]
Length = 805
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI-MPPIESMISRNG 71
C+NC TT T LWRR NG +CNACGL + +I+ P + PP + I G
Sbjct: 228 CSNCGTTRTPLWRRAPNGATICNACGLYLKARNISRPTHLEKPPATATIYLPG 280
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 34/57 (59%), Gaps = 3/57 (5%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDAL 74
++C NC TT T LWRR+++G +CNACGL ++ P ++ ++MI R L
Sbjct: 395 IACQNCGTTITPLWRRDDSGSTICNACGLYYRLHGVHRPVRMK---KNMIKRRKRVL 448
>gi|3182920|sp|P78688.1|AREA_GIBFU RecName: Full=Nitrogen regulatory protein areA
gi|1834451|emb|CAA71897.1| AREA [Fusarium fujikuroi]
Length = 971
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 3/45 (6%)
Query: 7 IREQTGNKRSG---VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++ GN G +C NC T T LWRRN G+P+CNACGL +
Sbjct: 678 LQAAAGNGNDGNAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 722
>gi|396463571|ref|XP_003836396.1| hypothetical protein LEMA_P039320.1 [Leptosphaeria maculans JN3]
gi|312212949|emb|CBX93031.1| hypothetical protein LEMA_P039320.1 [Leptosphaeria maculans JN3]
Length = 925
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 24/39 (61%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+ G + +C NC T T LWRRN G P+CNACGL +
Sbjct: 690 KPGEGSTPTTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 728
>gi|302921056|ref|XP_003053207.1| hypothetical protein NECHADRAFT_59326 [Nectria haematococca mpVI
77-13-4]
gi|256734147|gb|EEU47494.1| hypothetical protein NECHADRAFT_59326 [Nectria haematococca mpVI
77-13-4]
Length = 897
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 617 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 647
>gi|409080576|gb|EKM80936.1| hypothetical protein AGABI1DRAFT_119477 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 680
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 28/48 (58%), Gaps = 10/48 (20%)
Query: 11 TGNKRSGVS----------CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+G +RSG S C NC TT T LWRR+ G+P+CNACGL
Sbjct: 530 SGAQRSGKSGEDGDPLPTLCTNCQTTNTPLWRRDPEGQPLCNACGLFY 577
>gi|406607700|emb|CCH40972.1| Nitrogen regulatory protein areA [Wickerhamomyces ciferrii]
Length = 618
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 22/29 (75%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC+T T LWRRN G+P+CNACGL +
Sbjct: 403 CTNCNTQTTPLWRRNPEGQPLCNACGLFL 431
>gi|402075114|gb|EJT70585.1| nitrogen regulatory protein NUT1 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1004
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 693 GESAAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 729
>gi|405117554|gb|AFR92329.1| transcriptional activator [Cryptococcus neoformans var. grubii H99]
Length = 1277
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 22/29 (75%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC TT T LWRR+ +G+P+CNACGL
Sbjct: 1173 CTNCQTTNTPLWRRDPDGQPLCNACGLFY 1201
>gi|392927094|ref|NP_001257105.1| Protein ELT-4, isoform a [Caenorhabditis elegans]
gi|22265829|emb|CAD44111.1| Protein ELT-4, isoform a [Caenorhabditis elegans]
Length = 72
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 26/42 (61%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R + C+NC+ T TTLWRR G+PVCNACGL + P
Sbjct: 11 RKRLVCSNCNGTNTTLWRRKAEGDPVCNACGLYFKLHHVTRP 52
>gi|134106329|ref|XP_778175.1| hypothetical protein CNBA1750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260878|gb|EAL23528.1| hypothetical protein CNBA1750 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1290
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 22/29 (75%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC TT T LWRR+ +G+P+CNACGL
Sbjct: 1186 CTNCQTTNTPLWRRDPDGQPLCNACGLFY 1214
>gi|89078359|gb|ABD60578.1| GATA transcription factor FNR1 [Fusarium oxysporum f. sp.
lycopersici]
Length = 906
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 627 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 657
>gi|58258269|ref|XP_566547.1| transcriptional activator [Cryptococcus neoformans var. neoformans
JEC21]
gi|57222684|gb|AAW40728.1| transcriptional activator, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1290
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 22/29 (75%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C NC TT T LWRR+ +G+P+CNACGL
Sbjct: 1186 CTNCQTTNTPLWRRDPDGQPLCNACGLFY 1214
>gi|342877030|gb|EGU78556.1| hypothetical protein FOXB_10937 [Fusarium oxysporum Fo5176]
Length = 962
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 683 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 713
>gi|50556296|ref|XP_505556.1| YALI0F17886p [Yarrowia lipolytica]
gi|49651426|emb|CAG78365.1| YALI0F17886p [Yarrowia lipolytica CLIB122]
Length = 697
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 25/34 (73%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+ + C+NC+TT T LWRR+ GE +CNACGL +
Sbjct: 531 KEALQCSNCNTTTTPLWRRSPEGESLCNACGLFL 564
>gi|238503764|ref|XP_002383114.1| GATA transcriptional activator AreA [Aspergillus flavus NRRL3357]
gi|220690585|gb|EED46934.1| GATA transcriptional activator AreA [Aspergillus flavus NRRL3357]
Length = 583
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 379 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 409
>gi|320590499|gb|EFX02942.1| gata transcriptional activator [Grosmannia clavigera kw1407]
Length = 1048
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 26/42 (61%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++ Q + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 746 LQAQGQGDSAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 787
>gi|330928054|ref|XP_003302108.1| hypothetical protein PTT_13807 [Pyrenophora teres f. teres 0-1]
gi|311322720|gb|EFQ89799.1| hypothetical protein PTT_13807 [Pyrenophora teres f. teres 0-1]
Length = 889
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 24/39 (61%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+ G + +C NC T T LWRRN G P+CNACGL +
Sbjct: 661 KQGEGGAPTTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 699
>gi|169621604|ref|XP_001804212.1| hypothetical protein SNOG_14013 [Phaeosphaeria nodorum SN15]
gi|111057518|gb|EAT78638.1| hypothetical protein SNOG_14013 [Phaeosphaeria nodorum SN15]
Length = 511
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 26/41 (63%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
SG C+NC TT T LWRR+ GE +CNACGL + + P
Sbjct: 109 SGQVCSNCKTTQTPLWRRSPTGETICNACGLYFKARNQHRP 149
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+C NC TT T LWRR++ G +CNACGL + + P
Sbjct: 272 ACQNCGTTITPLWRRDDAGHIICNACGLYYKLHNKHRP 309
>gi|322701406|gb|EFY93156.1| nitrogen response regulator [Metarhizium acridum CQMa 102]
Length = 961
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 678 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 708
>gi|189200320|ref|XP_001936497.1| nitrogen regulatory protein areA [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187983596|gb|EDU49084.1| nitrogen regulatory protein areA [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 912
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 24/39 (61%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+ G + +C NC T T LWRRN G P+CNACGL +
Sbjct: 649 KQGEGGAPTTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 687
>gi|67903360|ref|XP_681936.1| AREA_EMENI Nitrogen regulatory protein areA [Aspergillus nidulans
FGSC A4]
gi|1351972|sp|P17429.2|AREA_EMENI RecName: Full=Nitrogen regulatory protein areA
gi|1154625|emb|CAA36731.1| areA [Emericella nidulans]
gi|40741511|gb|EAA60701.1| AREA_EMENI Nitrogen regulatory protein areA [Aspergillus nidulans
FGSC A4]
gi|259483114|tpe|CBF78221.1| TPA: Nitrogen regulatory protein areA
[Source:UniProtKB/Swiss-Prot;Acc:P17429] [Aspergillus
nidulans FGSC A4]
Length = 876
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 1/71 (1%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
+ G + +C NC T T LWRRN G+P+CNACGL + + P + + +R
Sbjct: 663 KNGEQNGPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRNR 722
Query: 70 N-GDALSVITS 79
N ++L+V +S
Sbjct: 723 NSANSLAVGSS 733
>gi|19113848|ref|NP_592936.1| iron-sensing transcription factor Fep1 [Schizosaccharomyces pombe
972h-]
gi|1346083|sp|Q10134.1|FEP1_SCHPO RecName: Full=Iron-sensing transcription factor 1; AltName:
Full=Transcription factor gaf2; Short=Gaf-2
gi|1167862|emb|CAA93113.1| iron-sensing transcription factor Fep1 [Schizosaccharomyces pombe]
gi|20302469|emb|CAD30004.1| iron-sensing transcription factor [Schizosaccharomyces pombe]
gi|21064951|gb|AAM29187.1| transcription factor Fep1 [Schizosaccharomyces pombe]
Length = 564
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 25/37 (67%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC+TT T LWRR+ +G P+CNACGL ++ P
Sbjct: 172 CQNCATTNTPLWRRDESGNPICNACGLYYKIHGVHRP 208
Score = 42.0 bits (97), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 21/32 (65%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G SC+NC T T+LWRR + +CNACGL
Sbjct: 9 GQSCSNCHKTTTSLWRRGPDNSLLCNACGLYQ 40
>gi|452979166|gb|EME78929.1| hypothetical protein MYCFIDRAFT_216796 [Pseudocercospora fijiensis
CIRAD86]
Length = 434
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 11/61 (18%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI-----------MNTVDINAPEKIMPPIESM 66
V+C NC TT T LWRR+ G +CNACGL M +I ++++P +S
Sbjct: 262 VACQNCGTTITPLWRRDEQGHTICNACGLYHKLHGSHRPVQMKKAEIKRRKRVVPAHQSA 321
Query: 67 I 67
I
Sbjct: 322 I 322
Score = 42.4 bits (98), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 20/30 (66%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
C+NC TT T LWRR+ G +CNACGL
Sbjct: 118 CSNCGTTKTPLWRRSPAGAVICNACGLYYK 147
>gi|451996461|gb|EMD88928.1| hypothetical protein COCHEDRAFT_1110693 [Cochliobolus
heterostrophus C5]
Length = 824
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 24/40 (60%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+ G + +C NC T T LWRRN G P+CNACGL +
Sbjct: 596 KQGEGGAPTTCTNCFTQTTPLWRRNPEGHPLCNACGLFLK 635
>gi|353238152|emb|CCA70107.1| related to SRE siderophore regulation protein (GATA factor)
[Piriformospora indica DSM 11827]
Length = 414
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 28/42 (66%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
+SCANC T+ T LWRR++ G +CNACGL ++ P+ +
Sbjct: 235 MSCANCGTSATPLWRRDDMGSTICNACGLYYKLHGVHRPDSM 276
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 17/28 (60%), Positives = 21/28 (75%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLI 47
C+NC T T LWRRN+ G+ +CNACGL
Sbjct: 40 CSNCGTNTTPLWRRNSGGDTICNACGLY 67
>gi|340975689|gb|EGS22804.1| putative sequence-specific DNA binding protein [Chaetomium
thermophilum var. thermophilum DSM 1495]
Length = 928
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 644 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 674
>gi|3218331|emb|CAA07052.1| nitrogen response regulator [Metarhizium anisopliae]
Length = 944
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 661 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 691
>gi|242220478|ref|XP_002476005.1| hypothetical transcription factor [Postia placenta Mad-698-R]
gi|220724794|gb|EED78814.1| hypothetical transcription factor [Postia placenta Mad-698-R]
Length = 1062
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 21/30 (70%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
C NC TT T LWRR+ G+P+CNACGL
Sbjct: 931 CTNCQTTNTPLWRRDPEGQPLCNACGLFYK 960
>gi|449546862|gb|EMD37831.1| hypothetical protein CERSUDRAFT_114478 [Ceriporiopsis subvermispora
B]
Length = 1137
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 21/30 (70%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
C NC TT T LWRR+ G+P+CNACGL
Sbjct: 1004 CTNCQTTNTPLWRRDPEGQPLCNACGLFFK 1033
>gi|322709648|gb|EFZ01224.1| nitrogen response regulator [Metarhizium anisopliae ARSEF 23]
Length = 1063
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 780 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 810
>gi|358393386|gb|EHK42787.1| hypothetical protein TRIATDRAFT_224741 [Trichoderma atroviride IMI
206040]
Length = 913
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 641 TTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 671
>gi|336384681|gb|EGO25829.1| hypothetical protein SERLADRAFT_448749 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1944
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 28/47 (59%), Gaps = 4/47 (8%)
Query: 4 TFGIREQTGNKRSG----VSCANCSTTCTTLWRRNNNGEPVCNACGL 46
T G QTG G +C NC TT T LWRR+ G+P+CNACGL
Sbjct: 693 TDGSGTQTGKGDDGDQTPTACTNCQTTNTPLWRRDPEGQPLCNACGL 739
>gi|406867034|gb|EKD20073.1| GATA transcription factor [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 560
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 33/59 (55%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDAL 74
+G C+NC T+ T LWRR+ G +CNACGL + + + P + P +M + G +
Sbjct: 114 AGQVCSNCGTSRTPLWRRSPQGATICNACGLYLKARNASRPPNLKRPPTTMAASPGQEI 172
Score = 43.5 bits (101), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 21/28 (75%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACG 45
++C NC TT T LWRR+ +G +CNACG
Sbjct: 276 LACQNCGTTITPLWRRDESGHTICNACG 303
>gi|451850876|gb|EMD64177.1| hypothetical protein COCSADRAFT_89695 [Cochliobolus sativus ND90Pr]
Length = 843
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 24/40 (60%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+ G + +C NC T T LWRRN G P+CNACGL +
Sbjct: 615 KQGEGGAPTTCTNCFTQTTPLWRRNPEGHPLCNACGLFLK 654
>gi|336470856|gb|EGO59017.1| hypothetical protein NEUTE1DRAFT_59972 [Neurospora tetrasperma FGSC
2508]
gi|350291924|gb|EGZ73119.1| siderophore regulation protein [Neurospora tetrasperma FGSC 2509]
Length = 587
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 34/64 (53%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALS 75
+G C+NC TT T LWRR+ G +CNACGL + + P I P M S A +
Sbjct: 114 AGQVCSNCGTTHTPLWRRSPQGAIICNACGLYLKARNAARPANIRRPPSVMASNVRQAAA 173
Query: 76 VITS 79
++S
Sbjct: 174 KLSS 177
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ N ++C NC TT T LWRR+ G +CNACGL ++ P
Sbjct: 272 QSQNTTVVIACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRP 318
>gi|71020337|ref|XP_760399.1| hypothetical protein UM04252.1 [Ustilago maydis 521]
gi|46100068|gb|EAK85301.1| hypothetical protein UM04252.1 [Ustilago maydis 521]
Length = 3893
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 18/29 (62%), Positives = 23/29 (79%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C+NC TT T LWRR+ G+P+CNACGL +
Sbjct: 1597 CSNCHTTKTPLWRRDPEGQPLCNACGLFL 1625
>gi|396465058|ref|XP_003837137.1| hypothetical protein LEMA_P033710.1 [Leptosphaeria maculans JN3]
gi|312213695|emb|CBX93697.1| hypothetical protein LEMA_P033710.1 [Leptosphaeria maculans JN3]
Length = 511
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 27/41 (65%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+G C+NC TT T LWRR+ GE +CNACGL M + + P
Sbjct: 108 AGQICSNCRTTQTPLWRRSPTGETLCNACGLYMKARNQSRP 148
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+C NC TT T LWRR++ G +CNACGL
Sbjct: 270 ACQNCGTTITPLWRRDDAGHIICNACGLYYK 300
>gi|392577630|gb|EIW70759.1| hypothetical protein TREMEDRAFT_73557 [Tremella mesenterica DSM 1558]
Length = 1334
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 25/38 (65%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+G+ + C NC TT T LWRR+ G+P+CNACGL
Sbjct: 1166 SGSGETPTVCTNCQTTNTPLWRRDPEGQPLCNACGLFY 1203
>gi|384488088|gb|EIE80268.1| hypothetical protein RO3G_04973 [Rhizopus delemar RA 99-880]
Length = 336
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 19/27 (70%), Positives = 21/27 (77%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
CANC T T LWR+N GEP+CNACGL
Sbjct: 236 CANCHQTQTPLWRKNERGEPLCNACGL 262
Score = 34.3 bits (77), Expect = 9.0, Method: Composition-based stats.
Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 1/55 (1%)
Query: 3 LTFGIREQTGNKRSGVSCANCSTTCTTLWRRN-NNGEPVCNACGLIMNTVDINAP 56
++ I + + + C NC T T LWRR + +CNACGL + + P
Sbjct: 135 MSIKITKPSNRPSRQLECFNCKVTKTPLWRRTPDRKHSLCNACGLYYKQYNHHRP 189
>gi|367045274|ref|XP_003653017.1| hypothetical protein THITE_2114956 [Thielavia terrestris NRRL 8126]
gi|347000279|gb|AEO66681.1| hypothetical protein THITE_2114956 [Thielavia terrestris NRRL 8126]
Length = 564
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALS 75
+G C+NC TT T LWRR+ GE +CNACGL + + P + P ++ NG S
Sbjct: 103 TGQICSNCGTTQTPLWRRSPQGETICNACGLYLKARNAARPINLKRPPS--VTTNGVRQS 160
Query: 76 VITSTPNIH 84
++ +P
Sbjct: 161 LVKLSPKAQ 169
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 24/39 (61%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ G +CNACGL ++ P
Sbjct: 262 IACQNCGTTVTPLWRRDEAGHTICNACGLYYKLHGVHRP 300
>gi|345570715|gb|EGX53536.1| hypothetical protein AOL_s00006g402 [Arthrobotrys oligospora ATCC
24927]
Length = 984
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 3/51 (5%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
+C NC TT T LWRR+ +G +CNACGL ++ PE + +S+I R
Sbjct: 332 ACQNCGTTITPLWRRDESGHTICNACGLYHKLHGVHRPETMK---KSVIKR 379
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 28/50 (56%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESM 66
G C+NC TT T LWRR NG +CNACGL + + P + P S+
Sbjct: 149 GQVCSNCGTTRTPLWRRAPNGLTICNACGLYLKARNAARPSTLKRPPNSV 198
>gi|453088528|gb|EMF16568.1| hypothetical protein SEPMUDRAFT_34698 [Mycosphaerella populorum
SO2202]
Length = 957
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 21/32 (65%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+C NC T T LWRRN G P+CNACGL +
Sbjct: 712 TTCTNCFTQTTPLWRRNPEGHPLCNACGLFLK 743
>gi|384489884|gb|EIE81106.1| hypothetical protein RO3G_05811 [Rhizopus delemar RA 99-880]
Length = 234
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 25/41 (60%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
S + C NC T T LWRR+++G +CNACGL ++ P
Sbjct: 15 STIRCYNCDATTTPLWRRDDDGNTICNACGLYYKLHHVHRP 55
>gi|449298668|gb|EMC94683.1| hypothetical protein BAUCODRAFT_35909 [Baudoinia compniacensis UAMH
10762]
Length = 936
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 21/32 (65%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+C NC T T LWRRN G P+CNACGL +
Sbjct: 675 TTCTNCFTQTTPLWRRNPEGHPLCNACGLFLK 706
>gi|710607|gb|AAC35593.1| gene homologous to GATA-binding transcription factor family
[Schizosaccharomyces pombe]
Length = 290
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 23/30 (76%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRR+ +G+P+CNACGL M
Sbjct: 69 TCTNCQTRTTPLWRRSPDGQPLCNACGLFM 98
>gi|407924861|gb|EKG17886.1| Zinc finger GATA-type protein [Macrophomina phaseolina MS6]
Length = 923
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 21/31 (67%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G P+CNACGL +
Sbjct: 688 TTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 718
>gi|113206467|gb|ABI34463.1| AreA [Talaromyces marneffei]
Length = 883
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 33/62 (53%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
+ G+ + +C NC T T LWRRN G+P+CNACGL + + P + + +R
Sbjct: 638 KPGDSGAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRNR 697
Query: 70 NG 71
G
Sbjct: 698 GG 699
>gi|268554364|ref|XP_002635169.1| C. briggsae CBR-END-3 protein [Caenorhabditis briggsae]
Length = 253
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 23/38 (60%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
SC+NC T T LWRRN GEP CN+C L T + P
Sbjct: 193 SCSNCGTRETKLWRRNEQGEPECNSCNLYERTKGVKRP 230
>gi|440467530|gb|ELQ36746.1| GATA factor SREP [Magnaporthe oryzae Y34]
gi|440488667|gb|ELQ68382.1| GATA factor SREP [Magnaporthe oryzae P131]
Length = 539
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 28/48 (58%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPI 63
SG C+NC TT T LWRR+ G +CNACGL + + P + P+
Sbjct: 110 SGQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARPTNLKRPL 157
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 29/50 (58%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R Q+ N ++C NC TT T LWRR+ G +CNACGL ++ P
Sbjct: 254 LRIQSQNTTVVIACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRP 303
>gi|388580872|gb|EIM21184.1| iron transporter biosynthesis regulating transcription factor,
partial [Wallemia sebi CBS 633.66]
Length = 113
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 4/49 (8%)
Query: 13 NKRSGVS--CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
NK SG S CANC TT T LWRR ++G+P CNACGL NAP +
Sbjct: 51 NKTSGSSLQCANCGTTTTPLWRRTDDGKPQCNACGLYQKLH--NAPRPV 97
Score = 37.7 bits (86), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 39/76 (51%), Gaps = 7/76 (9%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPI--ESMISR-NGDALSV 76
C+NC T+ T LWRR N + +CNACGL + P + +S I++ +G +L
Sbjct: 1 CSNCGTSNTPLWRRGLNDQTLCNACGLYEKNRNTPRPTTLQSTTINQSDINKTSGSSLQC 60
Query: 77 ----ITSTPNIHHSDD 88
T+TP +DD
Sbjct: 61 ANCGTTTTPLWRRTDD 76
>gi|170097681|ref|XP_001880060.1| siderophore biosynthesis regulatory protein [Laccaria bicolor
S238N-H82]
gi|164645463|gb|EDR09711.1| siderophore biosynthesis regulatory protein [Laccaria bicolor
S238N-H82]
Length = 928
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 27/42 (64%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
R++T + + C NC TT T LWR+++ G+ VCNACGL
Sbjct: 586 RQETVDVMTAAQCYNCHTTATPLWRKDDEGKTVCNACGLYYK 627
Score = 39.7 bits (91), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 18/30 (60%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI 47
C+NC T T LWRR N E CNACGL
Sbjct: 531 AECSNCGATHTPLWRRGLNDELNCNACGLY 560
>gi|389634415|ref|XP_003714860.1| GATA factor SREP [Magnaporthe oryzae 70-15]
gi|351647193|gb|EHA55053.1| GATA factor SREP [Magnaporthe oryzae 70-15]
Length = 539
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 28/48 (58%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPI 63
SG C+NC TT T LWRR+ G +CNACGL + + P + P+
Sbjct: 110 SGQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARPTNLKRPL 157
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 29/50 (58%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+R Q+ N ++C NC TT T LWRR+ G +CNACGL ++ P
Sbjct: 254 LRIQSQNTTVVIACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRP 303
>gi|3182912|sp|O13412.1|AREA_ASPNG RecName: Full=Nitrogen regulatory protein areA
gi|2462907|emb|CAA57524.1| areA [Aspergillus niger]
gi|28412451|emb|CAA68196.1| areA [Aspergillus niger]
Length = 882
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 21/40 (52%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 10 QTGNKRSG-VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+ G + SG +C NC T T LWRRN G+P+CNACGL +
Sbjct: 665 KNGEQSSGPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 704
>gi|212537481|ref|XP_002148896.1| GATA transcriptional activator AreA [Talaromyces marneffei ATCC
18224]
gi|210068638|gb|EEA22729.1| GATA transcriptional activator AreA [Talaromyces marneffei ATCC
18224]
Length = 854
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 33/62 (53%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
+ G+ + +C NC T T LWRRN G+P+CNACGL + + P + + +R
Sbjct: 609 KPGDSGAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRNR 668
Query: 70 NG 71
G
Sbjct: 669 GG 670
>gi|350637200|gb|EHA25558.1| hypothetical protein ASPNIDRAFT_120062 [Aspergillus niger ATCC
1015]
Length = 880
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 21/40 (52%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 10 QTGNKRSG-VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+ G + SG +C NC T T LWRRN G+P+CNACGL +
Sbjct: 661 KNGEQSSGPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 700
>gi|254566821|ref|XP_002490521.1| Transcriptional activator of genes regulated by nitrogen catabolite
repression (NCR) [Komagataella pastoris GS115]
gi|238030317|emb|CAY68240.1| Transcriptional activator of genes regulated by nitrogen catabolite
repression (NCR) [Komagataella pastoris GS115]
gi|249692033|emb|CAM59350.2| GATA-type transcription factor Fep1 [Komagataella pastoris]
gi|328350911|emb|CCA37311.1| Zinc finger protein 287 [Komagataella pastoris CBS 7435]
Length = 337
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 36/67 (53%), Gaps = 3/67 (4%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
E +G + ++C NC TT T LWRR+N G +CNACGL + P + +MI
Sbjct: 153 ESSGLQSVAIACTNCGTTVTPLWRRDNIGNTICNACGLYYRLHGSHRPVNLK---RTMIK 209
Query: 69 RNGDALS 75
R A++
Sbjct: 210 RRKRAIA 216
Score = 43.5 bits (101), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM-PPIESMISR 69
C+NC T T LWRR +G VCNACGL + + P + PP +SR
Sbjct: 44 CSNCGTAKTPLWRRAADGSLVCNACGLYYRAKNSHRPVNLKRPPKVVTVSR 94
>gi|296413330|ref|XP_002836367.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295630184|emb|CAZ80558.1| unnamed protein product [Tuber melanosporum]
Length = 930
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 21/31 (67%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G P+CNACGL +
Sbjct: 697 TTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 727
>gi|336371932|gb|EGO00272.1| hypothetical protein SERLA73DRAFT_107315 [Serpula lacrymans var.
lacrymans S7.3]
Length = 2013
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 28/47 (59%), Gaps = 4/47 (8%)
Query: 4 TFGIREQTGNKRSG----VSCANCSTTCTTLWRRNNNGEPVCNACGL 46
T G QTG G +C NC TT T LWRR+ G+P+CNACGL
Sbjct: 723 TDGSGTQTGKGDDGDQTPTACTNCQTTNTPLWRRDPEGQPLCNACGL 769
>gi|164423316|ref|XP_963796.2| hypothetical protein NCU09068 [Neurospora crassa OR74A]
gi|157070042|gb|EAA34560.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 308
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN +G+P+CNACGL +
Sbjct: 13 TTCTNCFTQTTPLWRRNPDGQPLCNACGLFL 43
>gi|217337439|gb|ACK43087.1| major nitrogen regulator element [Tuber borchii]
Length = 931
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 21/31 (67%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G P+CNACGL +
Sbjct: 698 TTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 728
>gi|358371133|dbj|GAA87742.1| regulator of nitrogen metabolite repression [Aspergillus kawachii
IFO 4308]
Length = 881
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 21/40 (52%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 10 QTGNKRSG-VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+ G + SG +C NC T T LWRRN G+P+CNACGL +
Sbjct: 663 KNGEQSSGPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 702
>gi|154292043|ref|XP_001546599.1| hypothetical protein BC1G_14396 [Botryotinia fuckeliana B05.10]
gi|347838109|emb|CCD52681.1| bir1, transcription factor Zn, GATA [Botryotinia fuckeliana]
Length = 604
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM-PPIESMISRNGD 72
C+NC TT T LWRR+ G +CNACGL + + P + PP + I GD
Sbjct: 119 CSNCGTTRTPLWRRSPQGATICNACGLYQKARNASRPTNLKRPPTTAPIGYPGD 172
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
V+C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 276 VACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 314
>gi|398397269|ref|XP_003852092.1| hypothetical protein MYCGRDRAFT_109392 [Zymoseptoria tritici
IPO323]
gi|339471973|gb|EGP87068.1| hypothetical protein MYCGRDRAFT_109392 [Zymoseptoria tritici
IPO323]
Length = 932
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 21/31 (67%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G P+CNACGL +
Sbjct: 689 TTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 719
>gi|321263187|ref|XP_003196312.1| zinc finger protein regulator of siderophore biosynthesis
[Cryptococcus gattii WM276]
gi|317462787|gb|ADV24525.1| Zinc finger protein regulator of siderophore biosynthesis, putative
[Cryptococcus gattii WM276]
Length = 1015
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G+SC NC T+ T LWRR+ G P CNACGL
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGL 334
>gi|116193523|ref|XP_001222574.1| hypothetical protein CHGG_06479 [Chaetomium globosum CBS 148.51]
gi|88182392|gb|EAQ89860.1| hypothetical protein CHGG_06479 [Chaetomium globosum CBS 148.51]
Length = 981
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 19/37 (51%), Positives = 24/37 (64%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
N + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 697 ANANTPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 733
>gi|401888062|gb|EJT52030.1| iron regulator 1 [Trichosporon asahii var. asahii CBS 2479]
Length = 977
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G+SC NC T+ T LWRR+ G P CNACGL
Sbjct: 347 GMSCRNCGTSTTPLWRRDEEGRPQCNACGL 376
>gi|346323174|gb|EGX92772.1| GATA factor SREP [Cordyceps militaris CM01]
Length = 533
Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ NG +CNACGL ++ P
Sbjct: 240 IACQNCGTTITPLWRRDGNGHVICNACGLYYRLHGVHRP 278
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 36/73 (49%), Gaps = 12/73 (16%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNT------VDINAPEKIM---PPIESMISRN 70
C+NC TT T LWRR+ G +CNACGL + ++ P K++ PP S
Sbjct: 87 CSNCGTTETPLWRRSPQGATICNACGLYLRARNSARPTNLKKPPKVVSAEPPRASTPKAG 146
Query: 71 GDALSVITSTPNI 83
G S STP +
Sbjct: 147 G---STQDSTPKV 156
>gi|345571186|gb|EGX54001.1| hypothetical protein AOL_s00004g660 [Arthrobotrys oligospora ATCC
24927]
Length = 820
Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 21/31 (67%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G P+CNACGL +
Sbjct: 584 TTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 614
>gi|347827511|emb|CCD43208.1| similar to transcription factor Zn, GATA [Botryotinia fuckeliana]
Length = 989
Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 21/31 (67%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G P+CNACGL +
Sbjct: 701 TTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 731
>gi|156055378|ref|XP_001593613.1| hypothetical protein SS1G_05040 [Sclerotinia sclerotiorum 1980]
gi|154702825|gb|EDO02564.1| hypothetical protein SS1G_05040 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 901
Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 21/31 (67%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G P+CNACGL +
Sbjct: 609 TTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 639
>gi|85105503|ref|XP_961978.1| hypothetical protein NCU07728 [Neurospora crassa OR74A]
gi|3552028|gb|AAC64946.1| siderophore regulation protein [Neurospora crassa]
gi|28923567|gb|EAA32742.1| predicted protein [Neurospora crassa OR74A]
Length = 587
Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 29/53 (54%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
+G C+NC TT T LWRR+ G +CNACGL + + P I P M S
Sbjct: 114 AGQVCSNCGTTHTPLWRRSPQGAIICNACGLYLKARNAARPANIRRPPSVMAS 166
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ N ++C NC TT T LWRR+ G +CNACGL ++ P
Sbjct: 272 QSQNTTVVIACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRP 318
>gi|340519562|gb|EGR49800.1| nitrogen regulatory-like protein [Trichoderma reesei QM6a]
Length = 944
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 19/37 (51%), Positives = 24/37 (64%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 674 GENSAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 710
>gi|154302551|ref|XP_001551685.1| hypothetical protein BC1G_09852 [Botryotinia fuckeliana B05.10]
Length = 989
Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 21/31 (67%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G P+CNACGL +
Sbjct: 701 TTCTNCFTQTTPLWRRNPEGHPLCNACGLFL 731
>gi|406699318|gb|EKD02524.1| iron regulator 1 [Trichosporon asahii var. asahii CBS 8904]
Length = 977
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G+SC NC T+ T LWRR+ G P CNACGL
Sbjct: 347 GMSCRNCGTSTTPLWRRDEEGRPQCNACGL 376
>gi|405122586|gb|AFR97352.1| iron regulator 1 [Cryptococcus neoformans var. grubii H99]
Length = 1019
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G+SC NC T+ T LWRR+ G P CNACGL
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGL 334
>gi|108802602|gb|ABG21304.1| iron regulator 1 [Cryptococcus neoformans var. grubii]
Length = 952
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G+SC NC T+ T LWRR+ G P CNACGL
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGL 334
>gi|108802600|gb|ABG21303.1| iron regulator 1 [Cryptococcus neoformans var. neoformans]
Length = 963
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G+SC NC T+ T LWRR+ G P CNACGL
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGL 334
>gi|58260324|ref|XP_567572.1| gata factor srep [Cryptococcus neoformans var. neoformans JEC21]
gi|134116222|ref|XP_773282.1| hypothetical protein CNBJ0600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50255904|gb|EAL18635.1| hypothetical protein CNBJ0600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229622|gb|AAW46055.1| gata factor srep, putative [Cryptococcus neoformans var. neoformans
JEC21]
Length = 1060
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G+SC NC T+ T LWRR+ G P CNACGL
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGL 334
>gi|391327583|ref|XP_003738277.1| PREDICTED: GATA-binding factor C-like [Metaseiulus occidentalis]
Length = 227
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 31/49 (63%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
RE+ +G+ C NC+TT TTLWRRN G PVCNACGL + N P
Sbjct: 38 REKRNGGSAGMVCHNCNTTTTTLWRRNIEGFPVCNACGLYYKLHNTNRP 86
>gi|406698929|gb|EKD02150.1| transcriptional activator [Trichosporon asahii var. asahii CBS 8904]
Length = 1273
Score = 48.1 bits (113), Expect = 7e-04, Method: Composition-based stats.
Identities = 28/86 (32%), Positives = 38/86 (44%), Gaps = 11/86 (12%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLI-----------MNTVDINAPEKIM 60
G+ + C NC TT T LWRR+ G+P+CNACGL + T I +
Sbjct: 1140 GDGENPTMCTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPLSLKTDVIKKRNRAA 1199
Query: 61 PPIESMISRNGDALSVITSTPNIHHS 86
P ES R G + S+P +
Sbjct: 1200 PTKESSSGRKGRGRNSKASSPQAQRT 1225
>gi|237512877|dbj|BAH58791.1| GATA-binding protein 5 [Polypterus senegalus]
Length = 395
Score = 48.1 bits (113), Expect = 7e-04, Method: Composition-based stats.
Identities = 29/49 (59%), Positives = 34/49 (69%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
R QT ++R+G+ C NC TT TTLWRRN GEPVCNACGL M + P
Sbjct: 232 RLQTSSRRAGLCCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 280
Score = 44.3 bits (103), Expect = 0.010, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC + T LWRR+ G +CNACGL IN P
Sbjct: 186 GRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRP 225
>gi|358394095|gb|EHK43496.1| hypothetical protein TRIATDRAFT_174504, partial [Trichoderma
atroviride IMI 206040]
Length = 567
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALSV 76
G C+NC+TT T LWRR+ G +CNACGL + + + P + P +++S G A
Sbjct: 109 GQICSNCNTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKKP-PNLVSAGGSAQPP 167
Query: 77 ITST 80
S+
Sbjct: 168 AKSS 171
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 24/39 (61%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ G +CNACGL ++ P
Sbjct: 267 IACQNCGTTITPLWRRDEGGHTICNACGLYYKLHGVHRP 305
>gi|354683559|gb|AER34940.1| GATA-type transcriptional repressor SRE [Aureobasidium pullulans]
Length = 333
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 24/33 (72%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNT 50
V+C NC TT T LWRR++NG +CNACG+ T
Sbjct: 289 VACQNCGTTITPLWRRDDNGHTICNACGMFATT 321
Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 26/38 (68%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
TG +G C+NC TT T LWRR+ G+ +CNACGL +
Sbjct: 118 TGMGGAGQVCSNCGTTRTPLWRRSPAGDTICNACGLYL 155
>gi|389745400|gb|EIM86581.1| hypothetical protein STEHIDRAFT_57406, partial [Stereum hirsutum
FP-91666 SS1]
Length = 76
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 31/58 (53%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPI 63
G R G + +SCANC T+ T LWRR++ G +CNACGL + P + +
Sbjct: 13 GRRNAKGGQVGALSCANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRPNSMKKTV 70
>gi|401889257|gb|EJT53195.1| transcriptional activator [Trichosporon asahii var. asahii CBS 2479]
Length = 1273
Score = 48.1 bits (113), Expect = 7e-04, Method: Composition-based stats.
Identities = 28/86 (32%), Positives = 38/86 (44%), Gaps = 11/86 (12%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLI-----------MNTVDINAPEKIM 60
G+ + C NC TT T LWRR+ G+P+CNACGL + T I +
Sbjct: 1140 GDGENPTMCTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPLSLKTDVIKKRNRAA 1199
Query: 61 PPIESMISRNGDALSVITSTPNIHHS 86
P ES R G + S+P +
Sbjct: 1200 PTKESSSGRKGRGRNSKASSPQAQRT 1225
>gi|358382685|gb|EHK20356.1| hypothetical protein TRIVIDRAFT_192933 [Trichoderma virens Gv29-8]
Length = 567
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 26/39 (66%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC+TT T LWRR+ +G +CNACGL ++ P
Sbjct: 267 IACQNCATTITPLWRRDESGHTICNACGLYYKLHGVHRP 305
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 32/60 (53%), Gaps = 5/60 (8%)
Query: 3 LTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
L ++EQ G C+NC+TT T LWRR+ G +CNACGL + + P + P
Sbjct: 99 LAANVQEQGGQ-----VCSNCNTTRTPLWRRSPQGATICNACGLYLKARNAARPTSLKKP 153
>gi|402079663|gb|EJT74928.1| GATA factor SREP [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 581
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 29/53 (54%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
Q +SG C+NC TT T LWRR+ G +CNACGL + + P + P
Sbjct: 102 QGPQGQSGQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARPTNLKRP 154
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 265 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 303
>gi|440639377|gb|ELR09296.1| hypothetical protein GMDG_03864 [Geomyces destructans 20631-21]
Length = 555
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
V+C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 268 VACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 306
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 24/40 (60%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C+NC TT T LWRR+ G +CNACGL + + P
Sbjct: 111 GQICSNCGTTSTPLWRRSPQGATICNACGLYQKARNASRP 150
>gi|116266310|gb|ABJ91394.1| glutathione S-transferase/GATA-1 fusion protein [Cloning vector
pGATA]
Length = 359
Score = 48.1 bits (113), Expect = 7e-04, Method: Composition-based stats.
Identities = 27/45 (60%), Positives = 32/45 (71%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G+KR+G C NC TT TTLWRRN +G+PVCNACGL +N P
Sbjct: 284 GSKRAGTQCTNCQTTTTTLWRRNASGDPVCNACGLYFKLHQVNRP 328
>gi|344228669|gb|EGV60555.1| hypothetical protein CANTEDRAFT_137034 [Candida tenuis ATCC 10573]
Length = 424
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 25/37 (67%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G + + C+NC T T LWR++NNG+ +CNACGL
Sbjct: 212 GGPKKLLQCSNCQTKTTPLWRKSNNGDLLCNACGLFY 248
>gi|322694341|gb|EFY86173.1| GATA transcription factor [Metarhizium acridum CQMa 102]
Length = 523
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 27/45 (60%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
++C NC TT T LWRR+ +G +CNACGL ++ P + P
Sbjct: 228 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKP 272
>gi|302422454|ref|XP_003009057.1| siderophore regulation protein [Verticillium albo-atrum VaMs.102]
gi|261352203|gb|EEY14631.1| siderophore regulation protein [Verticillium albo-atrum VaMs.102]
Length = 581
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDA 73
G C+NC TT T LWRR+ G +CNACGL + + P + ++S N A
Sbjct: 120 GQVCSNCGTTQTPLWRRSPQGTTICNACGLYQKARNASRPTSLKKKPPQLVSANSRA 176
Score = 47.8 bits (112), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 284 IACQNCGTTTTPLWRRDESGHTICNACGLYYKLHGVHRP 322
>gi|390600081|gb|EIN09476.1| hypothetical protein PUNSTDRAFT_143032 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 697
Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
SC NC TT T LWRR+ G+ +CNACGL + + + P +
Sbjct: 95 SCTNCGTTTTPLWRRDAEGKSICNACGLYLKSRRVPRPSSL 135
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 18/30 (60%), Positives = 23/30 (76%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
+SCANC T+ T LWRR++ G +CNACGL
Sbjct: 282 ALSCANCGTSTTPLWRRDDVGNNICNACGL 311
>gi|400602130|gb|EJP69755.1| siderophore regulation protein [Beauveria bassiana ARSEF 2860]
Length = 532
Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 43/85 (50%), Gaps = 11/85 (12%)
Query: 8 REQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIE-- 64
++Q+G S G C+NC TT T LWRR+ G +CNACGL + + P + P +
Sbjct: 74 KQQSGASSSHGQICSNCGTTETPLWRRSPQGATICNACGLYLRARNSARPTNLKKPPKVV 133
Query: 65 ------SMISRNGDALSVITSTPNI 83
+ SR GD S STP +
Sbjct: 134 STELPLASTSRAGD--SGQDSTPKV 156
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C+NC TT T LWRR+ G +CNACGL ++ P
Sbjct: 238 IACSNCGTTITPLWRRDGEGNMICNACGLYYRLHGVHRP 276
>gi|346970218|gb|EGY13670.1| siderophore regulation protein [Verticillium dahliae VdLs.17]
Length = 581
Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDA 73
G C+NC TT T LWRR+ G +CNACGL + + P + ++S N A
Sbjct: 120 GQVCSNCGTTQTPLWRRSPQGTTICNACGLYQKARNASRPTSLKKKPPQLVSANSRA 176
Score = 47.8 bits (112), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 284 IACQNCGTTTTPLWRRDESGHTICNACGLYYKLHGVHRP 322
>gi|355726392|gb|AES08857.1| trichorhinophalangeal syndrome I [Mustela putorius furo]
Length = 387
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 21/29 (72%), Positives = 23/29 (79%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGL 46
V CANC TT T+LWR+N NG VCNACGL
Sbjct: 1 VFCANCLTTKTSLWRKNANGGYVCNACGL 29
>gi|443898378|dbj|GAC75713.1| GATA-4/5/6 transcription factors [Pseudozyma antarctica T-34]
Length = 1386
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 18/29 (62%), Positives = 23/29 (79%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C+NC TT T LWRR+ G+P+CNACGL +
Sbjct: 1239 CSNCHTTKTPLWRRDPEGQPLCNACGLFL 1267
>gi|268554043|ref|XP_002635009.1| C. briggsae CBR-ELT-6 protein [Caenorhabditis briggsae]
Length = 405
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 26/41 (63%)
Query: 6 GIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
G E + G C+NCSTT TT WRR+ G+ VCNACGL
Sbjct: 274 GSPEDDSSNSGGSRCSNCSTTKTTAWRRDLAGKLVCNACGL 314
>gi|390602192|gb|EIN11585.1| hypothetical protein PUNSTDRAFT_83261 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1041
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 18/27 (66%), Positives = 22/27 (81%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
C NC TT T LWRR+ +G+P+CNACGL
Sbjct: 919 CTNCQTTNTPLWRRDPSGQPLCNACGL 945
>gi|380484354|emb|CCF40053.1| GATA zinc finger [Colletotrichum higginsianum]
Length = 961
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 19/37 (51%), Positives = 24/37 (64%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G + +C NC T T LWRRN G+P+CNACGL +
Sbjct: 677 GEGNAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFL 713
>gi|2914227|pdb|4GAT|A Chain A, Solution Nmr Structure Of The Wild Type Dna Binding
Domain Of Area Complexed To A 13bp Dna Containing A
Cgata Site, Regularized Mean Structure
gi|2914241|pdb|5GAT|A Chain A, Solution Nmr Structure Of The Wild Type Dna Binding
Domain Of Area Complexed To A 13bp Dna Containing A
Cgata Site, 35 Structures
Length = 66
Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 32/61 (52%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISR 69
+ G + +C NC T T LWRRN G+P+CNACGL + + P + + +R
Sbjct: 2 KNGEQNGPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRNR 61
Query: 70 N 70
N
Sbjct: 62 N 62
>gi|409044810|gb|EKM54291.1| hypothetical protein PHACADRAFT_258054 [Phanerochaete carnosa
HHB-10118-sp]
Length = 872
Score = 47.8 bits (112), Expect = 9e-04, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 25/39 (64%), Gaps = 4/39 (10%)
Query: 12 GNKRSGVS----CANCSTTCTTLWRRNNNGEPVCNACGL 46
GN G S C NC TT T LWRR+ G+P+CNACGL
Sbjct: 724 GNGEDGESSPTVCTNCQTTNTPLWRRDPEGQPLCNACGL 762
>gi|194224624|ref|XP_001915060.1| PREDICTED: transcription factor GATA-5-like [Equus caballus]
Length = 255
Score = 47.8 bits (112), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 33/46 (71%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ ++R+G+ C NC TT TTLWRRN +GEPVCNACGL M + P
Sbjct: 91 SSSRRAGLCCTNCHTTTTTLWRRNADGEPVCNACGLYMKLHGVPRP 136
>gi|393245125|gb|EJD52636.1| hypothetical protein AURDEDRAFT_158369 [Auricularia delicata
TFB-10046 SS5]
Length = 1111
Score = 47.8 bits (112), Expect = 9e-04, Method: Composition-based stats.
Identities = 18/27 (66%), Positives = 22/27 (81%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
C NC TT T LWRR+ +G+P+CNACGL
Sbjct: 1033 CTNCQTTNTPLWRRDPDGQPLCNACGL 1059
>gi|73992691|ref|XP_543086.2| PREDICTED: transcription factor GATA-5 [Canis lupus familiaris]
Length = 400
Score = 47.8 bits (112), Expect = 9e-04, Method: Composition-based stats.
Identities = 32/72 (44%), Positives = 44/72 (61%), Gaps = 2/72 (2%)
Query: 11 TGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRN 70
+ ++R+G+ C NC TT TTLWRRN +GEPVCNACGL M + P + ES+ +R
Sbjct: 237 SSSRRAGLCCTNCHTTTTTLWRRNADGEPVCNACGLYMKLHGV--PRPLAMKKESIQTRK 294
Query: 71 GDALSVITSTPN 82
SV+ + N
Sbjct: 295 RKPKSVVKTKSN 306
Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 189 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 228
>gi|395331932|gb|EJF64312.1| hypothetical protein DICSQDRAFT_101190 [Dichomitus squalens
LYAD-421 SS1]
Length = 534
Score = 47.8 bits (112), Expect = 9e-04, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 27/40 (67%), Gaps = 4/40 (10%)
Query: 7 IREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
IR Q G +SCANC T+ T LWRR++ G +CNACGL
Sbjct: 110 IRSQVG----ALSCANCGTSTTPLWRRDDVGNNICNACGL 145
>gi|384490311|gb|EIE81533.1| hypothetical protein RO3G_06238 [Rhizopus delemar RA 99-880]
Length = 462
Score = 47.8 bits (112), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 27/42 (64%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMP 61
C+NCSTT T LWRR+ E +CNACGL + + P+ + P
Sbjct: 377 CSNCSTTTTPLWRRSAEDELLCNACGLYLKLHNTPRPKHLKP 418
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 23/27 (85%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
C+NC+T+ T LWRR+ +G P+CNACGL
Sbjct: 436 CSNCATSTTPLWRRDVDGSPLCNACGL 462
>gi|310792185|gb|EFQ27712.1| GATA zinc finger [Glomerella graminicola M1.001]
Length = 592
Score = 47.8 bits (112), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 30/55 (54%)
Query: 2 ILTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
I + Q N+ ++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 271 IAALQAQGQNTNQTVVIACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 325
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 28/48 (58%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
++G C+NC TT T LWRR+ G +CNACGL + + P + P
Sbjct: 117 QTGQVCSNCGTTRTPLWRRSPQGSTICNACGLYQKARNASRPTNLKKP 164
>gi|13620173|emb|CAC36427.1| GATA transcription factor [Botryotinia fuckeliana]
Length = 388
Score = 47.8 bits (112), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
V+C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 275 VACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 313
Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 28/52 (53%), Gaps = 1/52 (1%)
Query: 22 NCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM-PPIESMISRNGD 72
NC TT T LWRR+ G +CNACGL + + P + PP + I GD
Sbjct: 120 NCGTTRTPLWRRSPQGATICNACGLYQKARNASRPTNLKRPPTTAPIGYPGD 171
>gi|357625879|gb|EHJ76168.1| hypothetical protein KGM_07907 [Danaus plexippus]
Length = 514
Score = 47.8 bits (112), Expect = 9e-04, Method: Composition-based stats.
Identities = 20/32 (62%), Positives = 24/32 (75%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
R +SC+NC T TT+WRR+ GE VCNACGL
Sbjct: 350 RVELSCSNCGTHTTTIWRRDARGEMVCNACGL 381
>gi|343425356|emb|CBQ68892.1| related to transcription factor ScGATA-6 [Sporisorium reilianum SRZ2]
Length = 1405
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 18/29 (62%), Positives = 23/29 (79%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C+NC TT T LWRR+ G+P+CNACGL +
Sbjct: 1251 CSNCHTTKTPLWRRDPEGQPLCNACGLFL 1279
>gi|1107471|gb|AAB03891.1| nitrogen catabolic enzyme regulatory protein [Neurospora crassa]
Length = 1036
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 18/30 (60%), Positives = 23/30 (76%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN +G+P+CNACGL +
Sbjct: 742 TCTNCFTQTTPLWRRNPDGQPLCNACGLFL 771
>gi|68486067|ref|XP_713050.1| hypothetical protein CaO19.8743 [Candida albicans SC5314]
gi|68486134|ref|XP_713017.1| hypothetical protein CaO19.1150 [Candida albicans SC5314]
gi|46434483|gb|EAK93891.1| hypothetical protein CaO19.1150 [Candida albicans SC5314]
gi|46434522|gb|EAK93929.1| hypothetical protein CaO19.8743 [Candida albicans SC5314]
Length = 472
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 19/37 (51%), Positives = 25/37 (67%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
Q+G + + C NC T T LWR+ NNG+ +CNACGL
Sbjct: 247 QSGQPKKVLQCTNCQTRTTPLWRKANNGDLLCNACGL 283
>gi|166214957|sp|P19212.2|NIT2_NEUCR RecName: Full=Nitrogen catabolic enzyme regulatory protein
Length = 1036
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 18/30 (60%), Positives = 23/30 (76%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN +G+P+CNACGL +
Sbjct: 742 TCTNCFTQTTPLWRRNPDGQPLCNACGLFL 771
>gi|448101527|ref|XP_004199582.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
gi|359381004|emb|CCE81463.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
Length = 546
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 25/40 (62%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C+NC TT T LWRR +G +CNACGL + + N P
Sbjct: 26 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNSTNRP 65
Score = 44.3 bits (103), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++C NC +T T LWRR++ G +CNACGL
Sbjct: 139 AIACFNCGSTITPLWRRDDAGNTICNACGLYY 170
>gi|389740007|gb|EIM81199.1| hypothetical protein STEHIDRAFT_142426 [Stereum hirsutum FP-91666
SS1]
Length = 1696
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 19/32 (59%), Positives = 23/32 (71%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
+S C NC TT T LWRR+ G+P+CNACGL
Sbjct: 1548 QSPTVCTNCQTTNTPLWRRDPEGQPLCNACGL 1579
>gi|403419010|emb|CCM05710.1| predicted protein [Fibroporia radiculosa]
Length = 1271
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 19/32 (59%), Positives = 23/32 (71%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
+S C NC TT T LWRR+ G+P+CNACGL
Sbjct: 1132 QSPTVCTNCQTTNTPLWRRDPEGQPLCNACGL 1163
>gi|322712581|gb|EFZ04154.1| GATA transcription factor [Metarhizium anisopliae ARSEF 23]
Length = 554
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 259 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 297
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 25/46 (54%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
G C+NC TT T LWRR+ G +CNACGL + P + P
Sbjct: 100 GQVCSNCGTTRTPLWRRSPQGATICNACGLYQKARNTARPTSLKKP 145
>gi|448097680|ref|XP_004198732.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
gi|359380154|emb|CCE82395.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
Length = 539
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 25/40 (62%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C+NC TT T LWRR +G +CNACGL + + N P
Sbjct: 22 GQQCSNCGTTKTPLWRRAPDGTMICNACGLYLRSNSTNRP 61
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++C NC +T T LWRR++ G +CNACGL
Sbjct: 135 AIACYNCGSTITPLWRRDDAGNTICNACGLYY 166
>gi|363748562|ref|XP_003644499.1| hypothetical protein Ecym_1456 [Eremothecium cymbalariae
DBVPG#7215]
gi|356888131|gb|AET37682.1| hypothetical protein Ecym_1456 [Eremothecium cymbalariae
DBVPG#7215]
Length = 657
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 22/34 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+S C NC T T LWRR+ G P+CNACGL +
Sbjct: 440 KSDTKCTNCHTKTTPLWRRDPQGNPLCNACGLFL 473
>gi|367042150|ref|XP_003651455.1| hypothetical protein THITE_37279 [Thielavia terrestris NRRL 8126]
gi|346998717|gb|AEO65119.1| hypothetical protein THITE_37279 [Thielavia terrestris NRRL 8126]
Length = 988
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 721 TCTNCFTQTTPLWRRNPEGQPLCNACGLFL 750
>gi|302894399|ref|XP_003046080.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256727007|gb|EEU40367.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 559
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 256 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 294
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 26/46 (56%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
G C+NC TT T LWRR+ G +CNACGL + + P + P
Sbjct: 98 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNAARPTSLKRP 143
>gi|50312349|ref|XP_456208.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49645344|emb|CAG98916.1| KLLA0F25300p [Kluyveromyces lactis]
Length = 692
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 23/34 (67%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++ C+NC T T LWRR+ G P+CNACGL +
Sbjct: 466 KTETQCSNCHTKTTPLWRRDPQGNPLCNACGLFL 499
>gi|408399802|gb|EKJ78894.1| hypothetical protein FPSE_00936 [Fusarium pseudograminearum CS3096]
Length = 557
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 28/48 (58%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+Q N ++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 249 QQNSNATVVIACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 296
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
G C+NC TT T LWRR+ G +CNACGL + + + P + P
Sbjct: 101 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKRP 146
>gi|384498142|gb|EIE88633.1| hypothetical protein RO3G_13344 [Rhizopus delemar RA 99-880]
Length = 228
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
CANC TT T LWRR+ G +CNACGL ++ P
Sbjct: 78 CANCRTTTTPLWRRDEAGNTICNACGLYYKLHQVHRP 114
Score = 39.7 bits (91), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGL 46
C+ C T T LWRR +GE +CNACGL
Sbjct: 20 KCSLCGATKTPLWRRGPHGEALCNACGL 47
>gi|355562973|gb|EHH19535.1| GATA-binding factor 5, partial [Macaca mulatta]
Length = 237
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 36/49 (73%)
Query: 8 REQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++++ ++R+G+ C NC TT TTLWRRN+ GEPVCNACGL M + P
Sbjct: 71 QKRSSSRRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRP 119
Score = 43.5 bits (101), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
G C NC T LWRR+ G +CNACGL +N P
Sbjct: 27 GRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRP 66
>gi|443895398|dbj|GAC72744.1| GATA-4/5/6 transcription factors [Pseudozyma antarctica T-34]
Length = 1025
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDIN 54
+G+ C+NC T T LWRR +G +CNACGL M + N
Sbjct: 325 AGLRCSNCGVTSTPLWRRAPDGSTICNACGLYMKSHSTN 363
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 18/38 (47%), Positives = 23/38 (60%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
+ N + C NC TT T LWRR+ +G +CNACGL
Sbjct: 464 DDKSNGVGALRCTNCQTTTTPLWRRDEDGNNICNACGL 501
>gi|392593705|gb|EIW83030.1| hypothetical protein CONPUDRAFT_81125 [Coniophora puteana
RWD-64-598 SS2]
Length = 1103
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 18/32 (56%), Positives = 24/32 (75%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
++ +C NC TT T LWRR+ G+P+CNACGL
Sbjct: 968 QTPTACTNCQTTNTPLWRRDPEGQPLCNACGL 999
>gi|19075466|ref|NP_587966.1| transcription factor, zf-GATA type (predicted) [Schizosaccharomyces
pombe 972h-]
gi|74627051|sp|O94720.1|YCF8_SCHPO RecName: Full=GATA zinc finger domain-containing protein C1393.08
gi|4468727|emb|CAB38164.1| transcription factor, zf-GATA type (predicted) [Schizosaccharomyces
pombe]
Length = 557
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 20/37 (54%), Positives = 26/37 (70%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
CANCS+T T+LWR++ +G+ VCNACGL N P
Sbjct: 419 CANCSSTKTSLWRKDRHGQTVCNACGLYARLHGHNRP 455
>gi|403214757|emb|CCK69257.1| hypothetical protein KNAG_0C01440 [Kazachstania naganishii CBS
8797]
Length = 460
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 23/33 (69%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SG C NC+T T LWRR+N G +CNACGL +
Sbjct: 175 SGSICKNCATVNTPLWRRDNEGNTLCNACGLFL 207
>gi|340517662|gb|EGR47905.1| Zn-finger gata type domain-containing protein [Trichoderma reesei
QM6a]
Length = 554
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 251 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 289
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 27/49 (55%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
K + C+NC TT T LWRR+ G +CNACGL + + P + P
Sbjct: 89 KSADAPCSNCGTTRTPLWRRSPQGATICNACGLYLKARNAARPTSLKKP 137
>gi|146420469|ref|XP_001486190.1| hypothetical protein PGUG_01861 [Meyerozyma guilliermondii ATCC
6260]
Length = 352
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 20/44 (45%), Positives = 26/44 (59%)
Query: 3 LTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
L I++ + + C NC T T LWRR NNG+ +CNACGL
Sbjct: 205 LESSIKKSENSPKKSFQCNNCQTRTTPLWRRANNGDLLCNACGL 248
>gi|255715743|ref|XP_002554153.1| KLTH0E15488p [Lachancea thermotolerans]
gi|238935535|emb|CAR23716.1| KLTH0E15488p [Lachancea thermotolerans CBS 6340]
Length = 541
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 20/40 (50%), Positives = 23/40 (57%)
Query: 9 EQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
E GN C NC T T LWRRN +G +CNACGL +
Sbjct: 18 ESNGNAAGSPMCKNCHTLTTPLWRRNEHGAVLCNACGLFL 57
>gi|336263677|ref|XP_003346618.1| hypothetical protein SMAC_04791 [Sordaria macrospora k-hell]
Length = 542
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 24/39 (61%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ G +CNACGL ++ P
Sbjct: 234 IACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRP 272
>gi|410077767|ref|XP_003956465.1| hypothetical protein KAFR_0C03380 [Kazachstania africana CBS 2517]
gi|372463049|emb|CCF57330.1| hypothetical protein KAFR_0C03380 [Kazachstania africana CBS 2517]
Length = 302
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 20/54 (37%), Positives = 29/54 (53%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDA 73
C NC T+ T LWR+++ G P+CNACGL + + P + + RN A
Sbjct: 204 CHNCQTSTTPLWRKDSEGHPLCNACGLFLRLHGVMRPLSLKTDVIKKRQRNSTA 257
>gi|384489810|gb|EIE81032.1| hypothetical protein RO3G_05737 [Rhizopus delemar RA 99-880]
Length = 393
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 18/27 (66%), Positives = 22/27 (81%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
C+NCST T LWRR++ G P+CNACGL
Sbjct: 292 CSNCSTIKTPLWRRDDEGAPLCNACGL 318
Score = 39.7 bits (91), Expect = 0.20, Method: Composition-based stats.
Identities = 17/42 (40%), Positives = 23/42 (54%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMP 61
C NC T T LWRR+ + +CNACGL + P+ + P
Sbjct: 234 CTNCGATKTPLWRRSIEDDLLCNACGLYQKLHNAPRPKSLKP 275
>gi|342881901|gb|EGU82684.1| hypothetical protein FOXB_06796 [Fusarium oxysporum Fo5176]
Length = 555
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 255 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 293
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
G C+NC TT T LWRR+ G +CNACGL + + + P + P
Sbjct: 99 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKRP 144
>gi|5002306|gb|AAD37409.1|AF148539_1 major nitrogen regulatory protein [Aspergillus parasiticus]
Length = 866
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 663 TCTNCFTQTTPLWRRNPEGQPLCNACGLFL 692
>gi|406861428|gb|EKD14482.1| GATA zinc finger protein [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 2130
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 18/30 (60%), Positives = 21/30 (70%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G P+CNACGL +
Sbjct: 1842 TCTNCFTQTTPLWRRNPEGHPLCNACGLFL 1871
>gi|171690816|ref|XP_001910333.1| hypothetical protein [Podospora anserina S mat+]
gi|170945356|emb|CAP71468.1| unnamed protein product [Podospora anserina S mat+]
Length = 536
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 30/52 (57%), Gaps = 6/52 (11%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNT------VDINAPEKIMP 61
+G C+NC TT T LWRR+ G +CNACGL + V++ P ++P
Sbjct: 97 AGQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNTHRPVNLKRPPNVIP 148
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 34/67 (50%), Gaps = 3/67 (4%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALSVI 77
++C NC TT T LWRR+ G +CNACGL ++ P + +S+I R +
Sbjct: 255 IACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVTMK---KSVIKRRKRVIPAA 311
Query: 78 TSTPNIH 84
+P +
Sbjct: 312 GGSPEME 318
>gi|169764959|ref|XP_001816951.1| nitrogen regulatory protein areA [Aspergillus oryzae RIB40]
gi|3182913|sp|O13415.1|AREA_ASPOR RecName: Full=Nitrogen regulatory protein areA
gi|2632173|emb|CAA05776.1| areA [Aspergillus oryzae]
gi|83764805|dbj|BAE54949.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 866
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 663 TCTNCFTQTTPLWRRNPEGQPLCNACGLFL 692
>gi|1584351|prf||2122392A GATA factor
Length = 156
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ C+NC+T+ T LWR++ G P+CNACGL + + P
Sbjct: 99 IKCSNCTTSTTPLWRKDPKGLPLCNACGLFLKLHGVTRP 137
>gi|390516511|emb|CCE73649.1| GATA-type transcription factor, iron regulation [Fusarium
fujikuroi]
Length = 555
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 255 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 293
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
G C+NC TT T LWRR+ G +CNACGL + + + P + P
Sbjct: 99 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKRP 144
>gi|12802699|gb|AAK08066.1|AF320305_1 AreA [Aspergillus oryzae]
Length = 853
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 651 TCTNCFTQTTPLWRRNPEGQPLCNACGLFL 680
>gi|391863417|gb|EIT72728.1| GATA transcription factor [Aspergillus oryzae 3.042]
Length = 866
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T T LWRRN G+P+CNACGL +
Sbjct: 663 TCTNCFTQTTPLWRRNPEGQPLCNACGLFL 692
>gi|46136099|ref|XP_389741.1| hypothetical protein FG09565.1 [Gibberella zeae PH-1]
Length = 557
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
++C NC TT T LWRR+ +G +CNACGL ++ P
Sbjct: 258 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP 296
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
G C+NC TT T LWRR+ G +CNACGL + + + P + P
Sbjct: 101 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKRP 146
>gi|393222104|gb|EJD07588.1| hypothetical protein FOMMEDRAFT_100924 [Fomitiporia mediterranea
MF3/22]
Length = 676
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+SCANC T+ T LWRR++ G +CNACGL + P
Sbjct: 288 LSCANCGTSTTPLWRRDDAGNNICNACGLYYKLHGTHRP 326
Score = 42.4 bits (98), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 23/35 (65%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNT 50
S CA+C+ T T+ WRR+ +G +CNACGL T
Sbjct: 93 SSSQCASCNATKTSQWRRDADGRQICNACGLSQKT 127
>gi|398392283|ref|XP_003849601.1| hypothetical protein MYCGRDRAFT_110870 [Zymoseptoria tritici
IPO323]
gi|339469478|gb|EGP84577.1| hypothetical protein MYCGRDRAFT_110870 [Zymoseptoria tritici
IPO323]
Length = 352
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 33/65 (50%), Gaps = 11/65 (16%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLI-----------MNTVDINAPEKIMPPIESM 66
V+C NC TT T LWRR+ G +CNACGL M +I ++++P +
Sbjct: 265 VACQNCGTTITPLWRRDEAGHTICNACGLYHKLHGAARPVQMKKAEIKRRKRVVPAAGNQ 324
Query: 67 ISRNG 71
I+ G
Sbjct: 325 IAEAG 329
Score = 43.9 bits (102), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 23/38 (60%)
Query: 12 GNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
G +G C+NC TT T LWRR+ G +CNACGL
Sbjct: 111 GAAPAGQMCSNCGTTKTPLWRRSPAGAVICNACGLYYK 148
>gi|315053355|ref|XP_003176051.1| nitrogen regulatory protein areA [Arthroderma gypseum CBS 118893]
gi|311337897|gb|EFQ97099.1| nitrogen regulatory protein areA [Arthroderma gypseum CBS 118893]
Length = 801
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
SC+NC T T LWRRN G+ +CNACGL +
Sbjct: 584 TSCSNCFTQTTPLWRRNAEGQFLCNACGLFLK 615
>gi|380090512|emb|CCC11809.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 554
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
Q+ N ++C NC TT T LWRR+ G +CNACGL ++ P
Sbjct: 238 QSQNTTVVIACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRP 284
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 29/53 (54%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMIS 68
+G C+NC TT T LWRR+ G +CNACGL + + P I P M S
Sbjct: 81 AGQVCSNCGTTHTPLWRRSPLGAIICNACGLYLKARNAARPANIRRPPSVMAS 133
>gi|238878590|gb|EEQ42228.1| hypothetical protein CAWG_00430 [Candida albicans WO-1]
Length = 517
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRN 70
++C NC TT T LWRR++ G +CNACGL + P K+ P RN
Sbjct: 181 AIACFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMKRPTIKRRKRN 234
Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
G C+NC TT T LWRR +G +CNACGL + + + P + P
Sbjct: 57 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRP 102
>gi|344302057|gb|EGW32362.1| hypothetical protein SPAPADRAFT_55830 [Spathaspora passalidarum
NRRL Y-27907]
Length = 413
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 30/51 (58%), Gaps = 6/51 (11%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNT------VDINAPEKIMP 61
G C+NC TT T LWRR +G +CNACGL + + V++ P I+P
Sbjct: 39 GQQCSNCGTTKTPLWRRAPDGSLICNACGLYLRSNNTHRPVNLKRPPNIIP 89
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++C NC TT T LWRR+++G +CNACGL
Sbjct: 163 AIACFNCGTTITPLWRRDDSGNTICNACGLFY 194
>gi|151940742|gb|EDN59129.1| transcriptional activator with GATA-1-type Zn finger DNA-binding
motif [Saccharomyces cerevisiae YJM789]
Length = 510
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ C+NC+T+ T LWR++ G P+CNACGL + + P
Sbjct: 308 IKCSNCTTSTTPLWRKDPKGLPLCNACGLFLKLHGVTRP 346
>gi|440634869|gb|ELR04788.1| hypothetical protein GMDG_07015 [Geomyces destructans 20631-21]
Length = 996
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 24/65 (36%), Positives = 34/65 (52%), Gaps = 2/65 (3%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALSVIT 78
+C NC T T LWRRN G P+CNACGL + + P + + + RN + + T
Sbjct: 697 TCTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPLSLKTDV--IKKRNRGSGATTT 754
Query: 79 STPNI 83
+ P I
Sbjct: 755 AVPLI 759
>gi|14318498|ref|NP_116632.1| Gat1p [Saccharomyces cerevisiae S288c]
gi|1169843|sp|P43574.1|GAT1_YEAST RecName: Full=Transcriptional regulatory protein GAT1
gi|836733|dbj|BAA09217.1| unnamed protein product [Saccharomyces cerevisiae]
gi|1049059|gb|AAB03516.1| GATA-type transcriptional factor [Saccharomyces cerevisiae]
gi|285811874|tpg|DAA12419.1| TPA: Gat1p [Saccharomyces cerevisiae S288c]
Length = 510
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ C+NC+T+ T LWR++ G P+CNACGL + + P
Sbjct: 308 IKCSNCTTSTTPLWRKDPKGLPLCNACGLFLKLHGVTRP 346
>gi|453081543|gb|EMF09592.1| hypothetical protein SEPMUDRAFT_72436 [Mycosphaerella populorum
SO2202]
Length = 489
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 25/41 (60%)
Query: 19 SCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
+C NC TT T LWRR+ NG +CNACGL + P ++
Sbjct: 288 ACQNCGTTITPLWRRDENGHTICNACGLYHKLHGAHRPVQM 328
Score = 42.4 bits (98), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 20/30 (66%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
C+NC TT T LWRR+ G +CNACGL
Sbjct: 132 CSNCGTTKTPLWRRSPAGAVICNACGLYYK 161
>gi|387594150|gb|EIJ89174.1| hypothetical protein NEQG_00993 [Nematocida parisii ERTm3]
gi|387595654|gb|EIJ93277.1| hypothetical protein NEPG_01619 [Nematocida parisii ERTm1]
Length = 122
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM 60
K ++C NC T T LWRR+ G +CNACGL + + P K M
Sbjct: 5 KKGDDITCENCKTNITPLWRRSETGNYLCNACGLYLKIHKKHRPVKFM 52
>gi|302308916|ref|NP_986060.2| AFR513Cp [Ashbya gossypii ATCC 10895]
gi|299790856|gb|AAS53884.2| AFR513Cp [Ashbya gossypii ATCC 10895]
gi|374109291|gb|AEY98197.1| FAFR513Cp [Ashbya gossypii FDAG1]
Length = 468
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 17/29 (58%), Positives = 21/29 (72%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIM 48
C+NC T T LWRR G+P+CNACGL +
Sbjct: 289 CSNCMTKTTPLWRRGPQGDPLCNACGLFL 317
>gi|241949727|ref|XP_002417586.1| negative regulator of iron uptake genes, putative [Candida
dubliniensis CD36]
gi|223640924|emb|CAX45241.1| negative regulator of iron uptake genes, putative [Candida
dubliniensis CD36]
Length = 523
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRN 70
++C NC TT T LWRR++ G +CNACGL + P K+ P RN
Sbjct: 176 AIACFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMKRPTIKRRKRN 229
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
G C+NC TT T LWRR +G +CNACGL + + + P + P
Sbjct: 53 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRP 98
>gi|21745321|gb|AAM77345.1|AF520973_1 transcription factor SFU1 [Candida albicans]
Length = 518
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRN 70
++C NC TT T LWRR++ G +CNACGL + P K+ P RN
Sbjct: 182 AIACFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMKRPTIKRRKRN 235
Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
G C+NC TT T LWRR +G +CNACGL + + + P + P
Sbjct: 58 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRP 103
>gi|395332895|gb|EJF65273.1| hypothetical protein DICSQDRAFT_132832 [Dichomitus squalens LYAD-421
SS1]
Length = 1227
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 18/27 (66%), Positives = 21/27 (77%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
C NC TT T LWRR+ G+P+CNACGL
Sbjct: 1100 CTNCQTTITPLWRRDPEGQPLCNACGL 1126
>gi|259146169|emb|CAY79428.1| Gat1p [Saccharomyces cerevisiae EC1118]
Length = 473
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ C+NC+T+ T LWR++ G P+CNACGL + + P
Sbjct: 271 IKCSNCTTSTTPLWRKDPKGLPLCNACGLFLKLHGVTRP 309
>gi|207345697|gb|EDZ72434.1| YFL021Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 473
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ C+NC+T+ T LWR++ G P+CNACGL + + P
Sbjct: 271 IKCSNCTTSTTPLWRKDPKGLPLCNACGLFLKLHGVTRP 309
>gi|241958258|ref|XP_002421848.1| GATA zinc finger protein, putative; protein gzf3 homologue,
putative [Candida dubliniensis CD36]
gi|223645193|emb|CAX39792.1| GATA zinc finger protein, putative [Candida dubliniensis CD36]
Length = 662
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 22/33 (66%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SG C NC T T LWRR+ G+ +CNACGL +
Sbjct: 108 SGPVCGNCQTQTTPLWRRDETGQVLCNACGLFL 140
>gi|302661634|ref|XP_003022483.1| GATA transcriptional activator AreA [Trichophyton verrucosum HKI
0517]
gi|291186429|gb|EFE41865.1| GATA transcriptional activator AreA [Trichophyton verrucosum HKI
0517]
Length = 905
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
SC+NC T T LWRRN G+ +CNACGL +
Sbjct: 675 TSCSNCFTQTTPLWRRNAEGQFLCNACGLFLK 706
>gi|68464951|ref|XP_723553.1| negative regulator of iron uptake genes [Candida albicans SC5314]
gi|68465330|ref|XP_723364.1| negative regulator of iron uptake genes [Candida albicans SC5314]
gi|46445393|gb|EAL04662.1| negative regulator of iron uptake genes [Candida albicans SC5314]
gi|46445589|gb|EAL04857.1| negative regulator of iron uptake genes [Candida albicans SC5314]
Length = 517
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRN 70
++C NC TT T LWRR++ G +CNACGL + P K+ P RN
Sbjct: 181 AIACFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMKRPTIKRRKRN 234
Score = 44.7 bits (104), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
G C+NC TT T LWRR +G +CNACGL + + + P + P
Sbjct: 57 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRP 102
>gi|378755937|gb|EHY65962.1| hypothetical protein NERG_00658 [Nematocida sp. 1 ERTm2]
Length = 124
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM 60
K ++C NC T T LWRR+ G +CNACGL + + P K M
Sbjct: 5 KKGDDITCENCKTNITPLWRRSETGNYLCNACGLYLKIHKKHRPVKFM 52
>gi|296814256|ref|XP_002847465.1| nitrogen regulatory protein areA [Arthroderma otae CBS 113480]
gi|238840490|gb|EEQ30152.1| nitrogen regulatory protein areA [Arthroderma otae CBS 113480]
Length = 774
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SC+NC T T LWRRN G+ +CNACGL +
Sbjct: 543 TSCSNCFTQTTPLWRRNAEGQFLCNACGLFL 573
>gi|190345815|gb|EDK37763.2| hypothetical protein PGUG_01861 [Meyerozyma guilliermondii ATCC
6260]
Length = 352
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 20/44 (45%), Positives = 26/44 (59%)
Query: 3 LTFGIREQTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGL 46
L I++ + + C NC T T LWRR NNG+ +CNACGL
Sbjct: 205 LESSIKKSENSPKKSFQCNNCQTRTTPLWRRANNGDLLCNACGL 248
>gi|354548341|emb|CCE45077.1| hypothetical protein CPAR2_700810 [Candida parapsilosis]
Length = 642
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 1/54 (1%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM-PPIESMISR 69
G C+NC TT T LWRR +G +CNACGL + + + P + PP ISR
Sbjct: 61 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNHTHRPVNLKRPPNTIAISR 114
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 26/43 (60%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
++C NC TT T LWRR++ G +CNACGL + P K+
Sbjct: 231 AIACFNCDTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKM 273
>gi|238595515|ref|XP_002393788.1| hypothetical protein MPER_06424 [Moniliophthora perniciosa FA553]
gi|215461799|gb|EEB94718.1| hypothetical protein MPER_06424 [Moniliophthora perniciosa FA553]
Length = 147
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 21/30 (70%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
C NC TT T LWRR+ G+P+CNACGL
Sbjct: 55 CTNCQTTNTPLWRRDPEGQPLCNACGLFYK 84
>gi|113374055|dbj|BAF03588.1| zinc-finger protein [Arthroderma otae]
Length = 761
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SC+NC T T LWRRN G+ +CNACGL +
Sbjct: 530 TSCSNCFTQTTPLWRRNAEGQFLCNACGLFL 560
>gi|341876112|gb|EGT32047.1| hypothetical protein CAEBREN_07808 [Caenorhabditis brenneri]
Length = 1407
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 24/40 (60%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
C+NCS +T WR+N +GE VCN C L T + PE +
Sbjct: 1137 CSNCSVKESTCWRKNKDGEQVCNPCALYEKTTGVKRPESL 1176
>gi|326483661|gb|EGE07671.1| zinc-finger protein [Trichophyton equinum CBS 127.97]
Length = 709
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SC+NC T T LWRRN G+ +CNACGL +
Sbjct: 478 TSCSNCFTQTTPLWRRNAEGQFLCNACGLFL 508
>gi|255710611|ref|XP_002551589.1| KLTH0A03036p [Lachancea thermotolerans]
gi|238932966|emb|CAR21147.1| KLTH0A03036p [Lachancea thermotolerans CBS 6340]
Length = 720
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%)
Query: 15 RSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
++ C NC T T LWRR+ G P+CNACGL +
Sbjct: 456 KTDTKCTNCHTKTTPLWRRDPQGNPLCNACGLFL 489
>gi|68480451|ref|XP_715824.1| hypothetical protein CaO19.10361 [Candida albicans SC5314]
gi|46437465|gb|EAK96811.1| hypothetical protein CaO19.10361 [Candida albicans SC5314]
Length = 715
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 22/33 (66%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SG C NC T T LWRR+ G+ +CNACGL +
Sbjct: 133 SGPVCGNCQTQTTPLWRRDETGQVLCNACGLFL 165
>gi|328709165|ref|XP_003243884.1| PREDICTED: hypothetical protein LOC100167435 isoform 2
[Acyrthosiphon pisum]
Length = 588
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 27/43 (62%), Gaps = 6/43 (13%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
C+NC TTC+T+WRR N +PVCNACGL KI PP
Sbjct: 533 CSNCGTTCSTIWRRLNE-DPVCNACGLYYKLHG-----KIRPP 569
>gi|358056188|dbj|GAA97928.1| hypothetical protein E5Q_04608 [Mixia osmundae IAM 14324]
Length = 1047
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 24/37 (64%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC T+ T LWRR+ G+P+CN+CGL +N P
Sbjct: 946 CMNCKTSNTPLWRRDELGQPLCNSCGLFKKLHGVNRP 982
>gi|256268890|gb|EEU04239.1| Gat1p [Saccharomyces cerevisiae JAY291]
Length = 514
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ C+NC+T+ T LWR++ G P+CNACGL + + P
Sbjct: 312 IKCSNCTTSTTPLWRKDPKGLPLCNACGLFLKLHGVTRP 350
>gi|213406309|ref|XP_002173926.1| transcription factor gaf1 [Schizosaccharomyces japonicus yFS275]
gi|212001973|gb|EEB07633.1| transcription factor gaf1 [Schizosaccharomyces japonicus yFS275]
Length = 867
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 20/43 (46%), Positives = 27/43 (62%)
Query: 14 KRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
K++ +C NC T T LWRR+ +G P+CNACGL M + P
Sbjct: 652 KQNVPTCTNCQTRTTPLWRRSPDGHPLCNACGLFMKINGVVRP 694
>gi|703468|gb|AAB38022.1| homologous to GATA-binding transcription factor
[Schizosaccharomyces pombe]
Length = 564
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 24/37 (64%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
C NC+TT T WRR+ +G P+CNACGL ++ P
Sbjct: 172 CQNCATTNTPKWRRDESGNPICNACGLYYKIHGVHRP 208
Score = 42.0 bits (97), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 21/32 (65%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
G SC+NC T T+LWRR + +CNACGL
Sbjct: 9 GQSCSNCHKTTTSLWRRGPDNSLLCNACGLYQ 40
>gi|389745401|gb|EIM86582.1| hypothetical protein STEHIDRAFT_39829, partial [Stereum hirsutum
FP-91666 SS1]
Length = 93
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 17/26 (65%), Positives = 22/26 (84%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACG 45
CANC T+ T LWRR+ +G+P+CNACG
Sbjct: 67 CANCGTSSTPLWRRDGDGKPICNACG 92
>gi|302499979|ref|XP_003011984.1| GATA transcriptional activator AreA [Arthroderma benhamiae CBS
112371]
gi|291175539|gb|EFE31344.1| GATA transcriptional activator AreA [Arthroderma benhamiae CBS
112371]
Length = 811
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SC+NC T T LWRRN G+ +CNACGL +
Sbjct: 581 TSCSNCFTQTTPLWRRNAEGQFLCNACGLFL 611
>gi|149239598|ref|XP_001525675.1| hypothetical protein LELG_03603 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146451168|gb|EDK45424.1| hypothetical protein LELG_03603 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 515
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 31/51 (60%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMI 67
G C+NC TT T LWRR +G +CNACGL + + + + P + P +++
Sbjct: 67 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNNSHRPVNLKRPPNTIV 117
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 1/52 (1%)
Query: 9 EQTGNKRS-GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKI 59
E +G+ S ++C NC TT T LWRR++ G +CNACGL + P K+
Sbjct: 217 EDSGDTSSFAIACFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKM 268
>gi|344302128|gb|EGW32433.1| hypothetical protein SPAPADRAFT_153357 [Spathaspora passalidarum
NRRL Y-27907]
Length = 429
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 13 NKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
N + + C NC T T LWR+ NNG+ +CNACGL
Sbjct: 157 NSKKVLQCTNCQTKTTPLWRKANNGDLLCNACGLFY 192
>gi|219687608|dbj|BAH09312.1| GATA transcription factor [Trichophyton interdigitale]
Length = 796
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SC+NC T T LWRRN G+ +CNACGL +
Sbjct: 565 TSCSNCFTQTTPLWRRNAEGQFLCNACGLFL 595
>gi|8096353|dbj|BAA96108.1| transcription factor ScGATA-6 [Schizophyllum commune]
Length = 1075
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 18/29 (62%), Positives = 22/29 (75%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGL 46
+ C NC TT T LWRR+ G+P+CNACGL
Sbjct: 955 ILCTNCQTTNTPLWRRDPEGQPLCNACGL 983
>gi|392299652|gb|EIW10745.1| Gat1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 469
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ C+NC+T+ T LWR++ G P+CNACGL + + P
Sbjct: 267 IKCSNCTTSTTPLWRKDPKGLPLCNACGLFLKLHGVTRP 305
>gi|302680356|ref|XP_003029860.1| transcription factor scgata-6 [Schizophyllum commune H4-8]
gi|300103550|gb|EFI94957.1| transcription factor scgata-6 [Schizophyllum commune H4-8]
Length = 1075
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 18/29 (62%), Positives = 22/29 (75%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGL 46
+ C NC TT T LWRR+ G+P+CNACGL
Sbjct: 955 ILCTNCQTTNTPLWRRDPEGQPLCNACGL 983
>gi|193636743|ref|XP_001951486.1| PREDICTED: hypothetical protein LOC100167435 isoform 1
[Acyrthosiphon pisum]
Length = 639
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 27/43 (62%), Gaps = 6/43 (13%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPP 62
C+NC TTC+T+WRR N +PVCNACGL KI PP
Sbjct: 527 CSNCGTTCSTIWRRLNE-DPVCNACGLYYKLHG-----KIRPP 563
>gi|349577896|dbj|GAA23063.1| K7_Gat1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 493
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+ C+NC+T+ T LWR++ G P+CNACGL + + P
Sbjct: 291 IKCSNCTTSTTPLWRKDPKGLPLCNACGLFLKLHGVTRP 329
>gi|238879747|gb|EEQ43385.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 701
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 22/33 (66%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SG C NC T T LWRR+ G+ +CNACGL +
Sbjct: 128 SGPVCGNCQTQTTPLWRRDETGQVLCNACGLFL 160
>gi|327309432|ref|XP_003239407.1| hypothetical protein TERG_08692 [Trichophyton rubrum CBS 118892]
gi|326459663|gb|EGD85116.1| hypothetical protein TERG_08692 [Trichophyton rubrum CBS 118892]
Length = 704
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SC+NC T T LWRRN G+ +CNACGL +
Sbjct: 474 TSCSNCFTQTTPLWRRNAEGQFLCNACGLFL 504
>gi|367011361|ref|XP_003680181.1| hypothetical protein TDEL_0C00810 [Torulaspora delbrueckii]
gi|359747840|emb|CCE90970.1| hypothetical protein TDEL_0C00810 [Torulaspora delbrueckii]
Length = 345
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 25/40 (62%)
Query: 10 QTGNKRSGVSCANCSTTCTTLWRRNNNGEPVCNACGLIMN 49
+ G+ C+NC+T T LWRR+ G P+CNACGL +
Sbjct: 178 EPGDSNRDTRCSNCNTRTTPLWRRDPAGNPLCNACGLFLK 217
>gi|255728505|ref|XP_002549178.1| predicted protein [Candida tropicalis MYA-3404]
gi|240133494|gb|EER33050.1| predicted protein [Candida tropicalis MYA-3404]
Length = 383
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 32/64 (50%), Gaps = 5/64 (7%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIMPPIESMISRNGDALSVI 77
+ C NC T T LWR+ NNG+ +CNACGL + + PP +S+ S N
Sbjct: 203 LQCTNCQTKTTPLWRKTNNGDLLCNACGLFHKLHGV-----VRPPRKSVSSNNNSKQQRS 257
Query: 78 TSTP 81
TP
Sbjct: 258 KPTP 261
>gi|326471709|gb|EGD95718.1| zinc-finger protein [Trichophyton tonsurans CBS 112818]
Length = 745
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SC+NC T T LWRRN G+ +CNACGL +
Sbjct: 522 TSCSNCFTQTTPLWRRNAEGQFLCNACGLFL 552
>gi|302880934|ref|XP_003039394.1| hypothetical protein NECHADRAFT_56277 [Nectria haematococca mpVI
77-13-4]
gi|256720228|gb|EEU33681.1| hypothetical protein NECHADRAFT_56277 [Nectria haematococca mpVI
77-13-4]
Length = 262
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 22/31 (70%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
+C NC T + LWRRN G+P+CNACGL +
Sbjct: 77 TACTNCLTKASPLWRRNPQGQPLCNACGLFL 107
>gi|255722205|ref|XP_002546037.1| hypothetical protein CTRG_00818 [Candida tropicalis MYA-3404]
gi|240136526|gb|EER36079.1| hypothetical protein CTRG_00818 [Candida tropicalis MYA-3404]
Length = 667
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 22/33 (66%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SG C NC T T LWRR+ G+ +CNACGL +
Sbjct: 119 SGPICGNCQTQTTPLWRRDETGQVLCNACGLFL 151
>gi|384491561|gb|EIE82757.1| hypothetical protein RO3G_07462 [Rhizopus delemar RA 99-880]
Length = 243
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 26/44 (59%)
Query: 17 GVSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAPEKIM 60
++C+NC T T LWRR+ G +CNACGL ++ P +M
Sbjct: 66 NITCSNCQATNTPLWRRDEAGNTICNACGLYYKLHHVHRPRTMM 109
>gi|68480557|ref|XP_715773.1| hypothetical protein CaO19.2842 [Candida albicans SC5314]
gi|46437412|gb|EAK96759.1| hypothetical protein CaO19.2842 [Candida albicans SC5314]
Length = 710
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 22/33 (66%)
Query: 16 SGVSCANCSTTCTTLWRRNNNGEPVCNACGLIM 48
SG C NC T T LWRR+ G+ +CNACGL +
Sbjct: 130 SGPVCGNCQTQTTPLWRRDETGQVLCNACGLFL 162
>gi|403415920|emb|CCM02620.1| predicted protein [Fibroporia radiculosa]
Length = 746
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 18 VSCANCSTTCTTLWRRNNNGEPVCNACGLIMNTVDINAP 56
+SCANC T+ T LWRR++ G +CNACGL + P
Sbjct: 287 LSCANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRP 325
>gi|402217264|gb|EJT97345.1| hypothetical protein DACRYDRAFT_102444 [Dacryopinax sp. DJM-731 SS1]
Length = 1326
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 18/27 (66%), Positives = 20/27 (74%)
Query: 20 CANCSTTCTTLWRRNNNGEPVCNACGL 46
C NC TT T LWRR+ G P+CNACGL
Sbjct: 1199 CTNCQTTNTPLWRRDAEGNPLCNACGL 1225
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.131 0.407
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,429,953,953
Number of Sequences: 23463169
Number of extensions: 47868364
Number of successful extensions: 119509
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1571
Number of HSP's successfully gapped in prelim test: 413
Number of HSP's that attempted gapping in prelim test: 117001
Number of HSP's gapped (non-prelim): 2792
length of query: 88
length of database: 8,064,228,071
effective HSP length: 58
effective length of query: 30
effective length of database: 6,703,364,269
effective search space: 201100928070
effective search space used: 201100928070
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 69 (31.2 bits)