BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy12567
(142 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|307186218|gb|EFN71902.1| Transcription factor BCFI [Camponotus floridanus]
Length = 830
Score = 130 bits (328), Expect = 1e-28, Method: Composition-based stats.
Identities = 54/59 (91%), Positives = 55/59 (93%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL R
Sbjct: 557 EAEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASR 615
Score = 123 bits (309), Expect = 2e-26, Method: Composition-based stats.
Identities = 59/105 (56%), Positives = 67/105 (63%), Gaps = 4/105 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
+ EYFTEGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P
Sbjct: 557 EAEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLVKQ-PRRLS 612
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NC T LWRR+ +G +CNACGLY K++G+NRP
Sbjct: 613 ASRRAGLSCSNCETTMTSLWRRNASGEPVCNACGLYFKLHGVNRP 657
>gi|328786707|ref|XP_001121273.2| PREDICTED: hypothetical protein LOC725422 [Apis mellifera]
Length = 942
Score = 130 bits (328), Expect = 1e-28, Method: Composition-based stats.
Identities = 54/59 (91%), Positives = 55/59 (93%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL R
Sbjct: 608 EAEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASR 666
Score = 121 bits (304), Expect = 8e-26, Method: Composition-based stats.
Identities = 59/105 (56%), Positives = 66/105 (62%), Gaps = 4/105 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
+ EYFTEGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P
Sbjct: 608 EAEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLVKQ-PRRLS 663
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NC T LWRR+ G +CNACGLY K++G+NRP
Sbjct: 664 ASRRVGTSCSNCQTTMTSLWRRNTLGEPVCNACGLYFKLHGVNRP 708
>gi|332016551|gb|EGI57432.1| Transcription factor GATA-4 [Acromyrmex echinatior]
Length = 833
Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats.
Identities = 54/59 (91%), Positives = 55/59 (93%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL R
Sbjct: 560 EAEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSATR 618
Score = 122 bits (305), Expect = 7e-26, Method: Composition-based stats.
Identities = 59/105 (56%), Positives = 66/105 (62%), Gaps = 4/105 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
+ EYFTEGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P
Sbjct: 560 EAEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLVKQ-PRRLS 615
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NC T LWRR+ G +CNACGLY K++G+NRP
Sbjct: 616 ATRRVGLSCSNCETTMTSLWRRNAQGEPVCNACGLYFKLHGVNRP 660
>gi|340720273|ref|XP_003398565.1| PREDICTED: hypothetical protein LOC100651336 [Bombus terrestris]
Length = 985
Score = 130 bits (326), Expect = 2e-28, Method: Composition-based stats.
Identities = 54/59 (91%), Positives = 55/59 (93%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL R
Sbjct: 686 EAEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASR 744
Score = 120 bits (302), Expect = 1e-25, Method: Composition-based stats.
Identities = 59/105 (56%), Positives = 66/105 (62%), Gaps = 4/105 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
+ EYFTEGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P
Sbjct: 686 EAEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLVKQ-PRRLS 741
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NC T LWRR+ G +CNACGLY K++G+NRP
Sbjct: 742 ASRRVGTSCSNCQTTMTSLWRRNTLGEPVCNACGLYFKLHGVNRP 786
>gi|383863205|ref|XP_003707072.1| PREDICTED: uncharacterized protein LOC100875205 [Megachile
rotundata]
Length = 941
Score = 129 bits (325), Expect = 3e-28, Method: Composition-based stats.
Identities = 54/59 (91%), Positives = 55/59 (93%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL R
Sbjct: 621 EAEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASR 679
Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats.
Identities = 59/105 (56%), Positives = 66/105 (62%), Gaps = 4/105 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
+ EYFTEGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P
Sbjct: 621 EAEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLVKQ-PRRLS 676
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NC T LWRR+ G +CNACGLY K++G+NRP
Sbjct: 677 ASRRVGTSCSNCQTTMTSLWRRNTLGEPVCNACGLYFKLHGVNRP 721
>gi|307198088|gb|EFN79141.1| Transcription factor GATA-4 [Harpegnathos saltator]
Length = 785
Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats.
Identities = 53/59 (89%), Positives = 55/59 (93%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ EY+TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL R
Sbjct: 557 EAEYYTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSATR 615
Score = 120 bits (300), Expect = 3e-25, Method: Composition-based stats.
Identities = 58/105 (55%), Positives = 67/105 (63%), Gaps = 4/105 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
+ EY+TEGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P
Sbjct: 557 EAEYYTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLVKQ-PRRLS 612
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NC T LWRR+ G +CNACGLY+K++G+NRP
Sbjct: 613 ATRRMGLACSNCETTMTSLWRRNTMGEPVCNACGLYYKLHGVNRP 657
>gi|347967443|ref|XP_003436067.1| AGAP002238-PB [Anopheles gambiae str. PEST]
gi|333466293|gb|EGK96194.1| AGAP002238-PB [Anopheles gambiae str. PEST]
Length = 1157
Score = 125 bits (314), Expect = 5e-27, Method: Composition-based stats.
Identities = 52/59 (88%), Positives = 53/59 (89%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
D + FTEGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL R
Sbjct: 754 DADLFTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSSAR 812
Score = 120 bits (300), Expect = 3e-25, Method: Composition-based stats.
Identities = 57/106 (53%), Positives = 67/106 (63%), Gaps = 4/106 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
D + FTEGRECVNCGAI TPLWRRDGTGHYLCNA G M+R L P
Sbjct: 754 DADLFTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLVKQ-PRRLS 809
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 810 SARRVGLQCSNCNTTNTSLWRRNQVGEPVCNACGLYYKLHNVNRPL 855
>gi|345491199|ref|XP_001607828.2| PREDICTED: hypothetical protein LOC100124017 [Nasonia vitripennis]
Length = 461
Score = 125 bits (313), Expect = 8e-27, Method: Composition-based stats.
Identities = 50/55 (90%), Positives = 53/55 (96%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
+ EYFTEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ RR+
Sbjct: 402 EAEYFTEGRECVNCGAVSTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQSRRM 456
>gi|56849575|gb|AAW31646.1| GATAa2 transcription factor [Aedes aegypti]
Length = 1034
Score = 124 bits (312), Expect = 1e-26, Method: Composition-based stats.
Identities = 61/110 (55%), Positives = 68/110 (61%), Gaps = 18/110 (16%)
Query: 30 STPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVE--------------DIEYF 75
S P W HY +A +ST+ + R S+P + + F
Sbjct: 557 SEPAW----PSHYEASAIAYSTASSNNSSSGRRPSTSVPSTTPFSAAASLTAMGIEADLF 612
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
TEGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL
Sbjct: 613 TEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 662
Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats.
Identities = 62/117 (52%), Positives = 72/117 (61%), Gaps = 8/117 (6%)
Query: 7 ASLPVEDIE--YFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF 64
ASL IE FTEGRECVNCGAI TPLWRRDGTGHYLCNA G M+R L
Sbjct: 600 ASLTAMGIEADLFTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLV 656
Query: 65 AS---LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
L E G +C NC ++T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 657 KQPRRLVKEPSSARRVGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRPL 713
>gi|157125536|ref|XP_001654375.1| transcription factor GATA-4 (GATA binding factor-4) [Aedes aegypti]
gi|108873610|gb|EAT37835.1| AAEL010222-PA [Aedes aegypti]
Length = 1034
Score = 124 bits (312), Expect = 1e-26, Method: Composition-based stats.
Identities = 61/110 (55%), Positives = 68/110 (61%), Gaps = 18/110 (16%)
Query: 30 STPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVE--------------DIEYF 75
S P W HY +A +ST+ + R S+P + + F
Sbjct: 557 SEPAW----PSHYEASAIAYSTASSNNSSSGRRPSTSVPSTTPFSAAASLTAMGIEADLF 612
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
TEGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL
Sbjct: 613 TEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 662
Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats.
Identities = 62/117 (52%), Positives = 72/117 (61%), Gaps = 8/117 (6%)
Query: 7 ASLPVEDIE--YFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF 64
ASL IE FTEGRECVNCGAI TPLWRRDGTGHYLCNA G M+R L
Sbjct: 600 ASLTAMGIEADLFTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLV 656
Query: 65 AS---LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
L E G +C NC ++T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 657 KQPRRLVKEPSSARRVGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRPL 713
>gi|7649393|emb|CAB88873.1| GATA transcription factor [Aedes aegypti]
Length = 868
Score = 124 bits (312), Expect = 1e-26, Method: Composition-based stats.
Identities = 61/110 (55%), Positives = 68/110 (61%), Gaps = 18/110 (16%)
Query: 30 STPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVE--------------DIEYF 75
S P W HY +A +ST+ + R S+P + + F
Sbjct: 558 SEPAW----PSHYEASAIAYSTASSNNSSSGRRPSTSVPSTTPFSAAASLTAMGIEADLF 613
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
TEGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL
Sbjct: 614 TEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 663
Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats.
Identities = 62/117 (52%), Positives = 72/117 (61%), Gaps = 8/117 (6%)
Query: 7 ASLPVEDIE--YFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF 64
ASL IE FTEGRECVNCGAI TPLWRRDGTGHYLCNA G M+R L
Sbjct: 601 ASLTAMGIEADLFTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLV 657
Query: 65 AS---LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
L E G +C NC ++T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 658 KQPRRLVKEPSSARRVGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRPL 714
>gi|170043918|ref|XP_001849614.1| GATAa2 transcription factor [Culex quinquefasciatus]
gi|167867189|gb|EDS30572.1| GATAa2 transcription factor [Culex quinquefasciatus]
Length = 496
Score = 124 bits (311), Expect = 1e-26, Method: Composition-based stats.
Identities = 52/59 (88%), Positives = 53/59 (89%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
D + FTEGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL R
Sbjct: 147 DADLFTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSSAR 205
Score = 118 bits (296), Expect = 7e-25, Method: Composition-based stats.
Identities = 57/106 (53%), Positives = 67/106 (63%), Gaps = 4/106 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
D + FTEGRECVNCGAI TPLWRRDGTGHYLCNA G M+R L P
Sbjct: 147 DADLFTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLVKQ-PRRLS 202
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 203 SARRVGLQCSNCNTTNTSLWRRNQVGEPVCNACGLYYKLHNVNRPL 248
>gi|347967445|ref|XP_307946.4| AGAP002238-PA [Anopheles gambiae str. PEST]
gi|333466292|gb|EAA03724.4| AGAP002238-PA [Anopheles gambiae str. PEST]
Length = 542
Score = 124 bits (311), Expect = 1e-26, Method: Composition-based stats.
Identities = 52/59 (88%), Positives = 53/59 (89%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
D + FTEGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL R
Sbjct: 139 DADLFTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSSAR 197
Score = 118 bits (296), Expect = 7e-25, Method: Composition-based stats.
Identities = 57/106 (53%), Positives = 67/106 (63%), Gaps = 4/106 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
D + FTEGRECVNCGAI TPLWRRDGTGHYLCNA G M+R L P
Sbjct: 139 DADLFTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLVKQ-PRRLS 194
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 195 SARRVGLQCSNCNTTNTSLWRRNQVGEPVCNACGLYYKLHNVNRPL 240
>gi|24647320|ref|NP_732098.1| serpent, isoform B [Drosophila melanogaster]
gi|23171429|gb|AAN13691.1| serpent, isoform B [Drosophila melanogaster]
Length = 1249
Score = 124 bits (310), Expect = 2e-26, Method: Composition-based stats.
Identities = 58/86 (67%), Positives = 62/86 (72%), Gaps = 5/86 (5%)
Query: 54 TGYFIMSRFLFASLPVE----DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYH 109
TG FL AS + D +YFTEGRECVNCGAISTPLWRRD TGHYLCNACGLY
Sbjct: 703 TGRVDPGTFLSASASLSATLFDADYFTEGRECVNCGAISTPLWRRDNTGHYLCNACGLYM 762
Query: 110 KMNGMNRPLVKQPRRLRFD-RGNFNC 134
KMNGMNRPL+KQPRRL R +C
Sbjct: 763 KMNGMNRPLIKQPRRLSASKRAGLSC 788
Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats.
Identities = 57/106 (53%), Positives = 65/106 (61%), Gaps = 4/106 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
D +YFTEGRECVNCGAISTPLWRRD TGHYLCNA G M+R L P
Sbjct: 724 DADYFTEGRECVNCGAISTPLWRRDNTGHYLCNACGLYMKMNG---MNRPLIKQ-PRRLS 779
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC T LWRR+ G +CNACGLY+K++ + RPL
Sbjct: 780 ASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPL 825
>gi|189237326|ref|XP_973133.2| PREDICTED: similar to GATA transcription factor [Tribolium
castaneum]
Length = 368
Score = 124 bits (310), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/74 (78%), Positives = 60/74 (81%), Gaps = 4/74 (5%)
Query: 60 SRFLFASLPVEDI----EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMN 115
+R L AS + I EYFTEGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMN
Sbjct: 97 NRPLSASASLSAIDPRAEYFTEGRECVNCGAIDTPLWRRDGTGHYLCNACGLYHKMNGMN 156
Query: 116 RPLVKQPRRLRFDR 129
RPLVKQPRRL R
Sbjct: 157 RPLVKQPRRLSASR 170
Score = 115 bits (289), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 69/147 (46%), Positives = 83/147 (56%), Gaps = 12/147 (8%)
Query: 2 SRFLFASLPVEDI----EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYF 57
+R L AS + I EYFTEGRECVNCGAI TPLWRRDGTGHYLCNA G
Sbjct: 97 NRPLSASASLSAIDPRAEYFTEGRECVNCGAIDTPLWRRDGTGHYLCNACGLYHKMNG-- 154
Query: 58 IMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
M+R L P G C NC +T LWRR+ G +CNACGLY K++G+NRP
Sbjct: 155 -MNRPLVKQ-PRRLSASRRVGLTCTNCHTSTTSLWRRNTVGEPVCNACGLYFKLHGVNRP 212
Query: 118 LVKQP----RRLRFDRGNFNCYERSTL 140
L + R R +G+ + R+ L
Sbjct: 213 LAMKKDSIQTRKRKPKGSKDSNSRNAL 239
>gi|386765869|ref|NP_001247128.1| serpent, isoform E [Drosophila melanogaster]
gi|383292736|gb|AFH06446.1| serpent, isoform E [Drosophila melanogaster]
Length = 731
Score = 122 bits (307), Expect = 4e-26, Method: Composition-based stats.
Identities = 58/86 (67%), Positives = 62/86 (72%), Gaps = 5/86 (5%)
Query: 54 TGYFIMSRFLFASLPVE----DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYH 109
TG FL AS + D +YFTEGRECVNCGAISTPLWRRD TGHYLCNACGLY
Sbjct: 185 TGRVDPGTFLSASASLSATLFDADYFTEGRECVNCGAISTPLWRRDNTGHYLCNACGLYM 244
Query: 110 KMNGMNRPLVKQPRRLRFD-RGNFNC 134
KMNGMNRPL+KQPRRL R +C
Sbjct: 245 KMNGMNRPLIKQPRRLSASKRAGLSC 270
Score = 117 bits (292), Expect = 2e-24, Method: Composition-based stats.
Identities = 57/106 (53%), Positives = 65/106 (61%), Gaps = 4/106 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
D +YFTEGRECVNCGAISTPLWRRD TGHYLCNA G M+R L P
Sbjct: 206 DADYFTEGRECVNCGAISTPLWRRDNTGHYLCNACGLYMKMNG---MNRPLIKQ-PRRLS 261
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC T LWRR+ G +CNACGLY+K++ + RPL
Sbjct: 262 ASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPL 307
>gi|390179289|ref|XP_003736856.1| GA26230, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859789|gb|EIM52929.1| GA26230, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 1295
Score = 122 bits (307), Expect = 4e-26, Method: Composition-based stats.
Identities = 50/61 (81%), Positives = 53/61 (86%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRG 130
D +YFTEGRECVNCGAI TPLWRRD TGHYLCNACGLY KMNGMNRPL+KQPRRL +
Sbjct: 751 DADYFTEGRECVNCGAIQTPLWRRDNTGHYLCNACGLYMKMNGMNRPLIKQPRRLSASKR 810
Query: 131 N 131
N
Sbjct: 811 N 811
Score = 115 bits (287), Expect = 8e-24, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 63/106 (59%), Gaps = 4/106 (3%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
D +YFTEGRECVNCGAI TPLWRRD TGHYLCNA G M+R L P
Sbjct: 751 DADYFTEGRECVNCGAIQTPLWRRDNTGHYLCNACGLYMKMNG---MNRPLIKQ-PRRLS 806
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC T LWRR+ G +CNACGLY K++ + RPL
Sbjct: 807 ASKRNGLSCSNCLTTYTSLWRRNPAGEPVCNACGLYFKLHSVTRPL 852
>gi|285157761|gb|ADC35036.1| GATA456a [Chaetopterus sp. MB-2010a]
Length = 597
Score = 118 bits (295), Expect = 9e-25, Method: Composition-based stats.
Identities = 52/73 (71%), Positives = 58/73 (79%), Gaps = 1/73 (1%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
+E +YF EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K RRL
Sbjct: 325 LEGPDYFAEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPQRRL-VG 383
Query: 129 RGNFNCYERSTLF 141
+ N Y + + F
Sbjct: 384 QPELNSYNQYSQF 396
Score = 115 bits (287), Expect = 9e-24, Method: Composition-based stats.
Identities = 58/131 (44%), Positives = 73/131 (55%), Gaps = 23/131 (17%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA-------------------RLFS- 50
+E +YF EGRECVNCGAISTPLWRRDGTGHYLCNA RL
Sbjct: 325 LEGPDYFAEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPQRRLVGQ 384
Query: 51 ---TSFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGL 107
S+ Y ++ + ++ G C NC +T LWRR+ G +CNACGL
Sbjct: 385 PELNSYNQYSQFNQSVCIGSSIKKSASRRVGLSCANCQTTTTTLWRRNNEGEPVCNACGL 444
Query: 108 YHKMNGMNRPL 118
Y+K++G+NRPL
Sbjct: 445 YYKLHGVNRPL 455
>gi|285157832|gb|ADC35039.1| GATA456a [Themiste lageniformis]
Length = 730
Score = 118 bits (295), Expect = 9e-25, Method: Composition-based stats.
Identities = 56/115 (48%), Positives = 68/115 (59%), Gaps = 8/115 (6%)
Query: 9 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLP 68
+PV+ EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L
Sbjct: 376 MPVDPSNMMAEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIKPTR 432
Query: 69 VEDIEYFTEGR-----ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NCG +T LWRR+ G +CNACGLY+K++ +NRP+
Sbjct: 433 RMSNNVLQSGSRRMGLQCANCGTSTTTLWRRNNEGEPVCNACGLYYKLHQVNRPM 487
Score = 117 bits (292), Expect = 2e-24, Method: Composition-based stats.
Identities = 47/59 (79%), Positives = 51/59 (86%)
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
+PV+ EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K RR+
Sbjct: 376 MPVDPSNMMAEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPTRRM 434
>gi|407025371|gb|AFS65552.1| Gata4/5/6, partial [Parastichopus parvimensis]
Length = 302
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/111 (56%), Positives = 69/111 (62%), Gaps = 6/111 (5%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT-EG 78
GRECVNCGAISTPLWRRDGTGHYLCNA GY +R L P + EG
Sbjct: 195 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGY---NRPLLK--PQRRMSGSRREG 249
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
C NC +T LWRR+ G +CNACG YHKMNG NRPL+K RRL R
Sbjct: 250 ITCANCHTSTTTLWRRNKEGEPVCNACGFYHKMNGQNRPLIKPKRRLSSKR 300
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/68 (69%), Positives = 51/68 (75%), Gaps = 4/68 (5%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGN----FN 133
GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RR+ R N
Sbjct: 195 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGYNRPLLKPQRRMSGSRREGITCAN 254
Query: 134 CYERSTLF 141
C+ +T
Sbjct: 255 CHTSTTTL 262
>gi|19554935|gb|AAB29876.2| achaete and scute repressor/zinc finger protein [Drosophila
melanogaster]
Length = 540
Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 69/106 (65%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 163 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSAT 217
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 218 ATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 263
Score = 111 bits (278), Expect = 8e-23, Method: Composition-based stats.
Identities = 56/98 (57%), Positives = 66/98 (67%), Gaps = 6/98 (6%)
Query: 34 WRRDGTGH-----YLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAI 87
W +G G Y NA + S+ + S F +S +++ F EGRECVNCGAI
Sbjct: 116 WTTEGFGSAHAQFYSPNAAVMMGSWRSAYDPSGFQRSSPYESAMDFQFGEGRECVNCGAI 175
Query: 88 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
STPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 176 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 213
>gi|17136408|ref|NP_476685.1| pannier, isoform A [Drosophila melanogaster]
gi|1709699|sp|P52168.1|PNR_DROME RecName: Full=GATA-binding factor A; AltName: Full=Protein pannier;
AltName: Full=Transcription factor GATA-A; AltName:
Full=dGATA-A
gi|545347|gb|AAB29874.1| C4 zinc finger DNA-binding protein/GATA transcription factor
[Drosophila melanogaster]
gi|23171432|gb|AAN13693.1| pannier, isoform A [Drosophila melanogaster]
gi|159884201|gb|ABX00779.1| RE33290p [Drosophila melanogaster]
Length = 540
Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 69/106 (65%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 163 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSAT 217
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 218 ATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 263
Score = 111 bits (278), Expect = 8e-23, Method: Composition-based stats.
Identities = 56/98 (57%), Positives = 66/98 (67%), Gaps = 6/98 (6%)
Query: 34 WRRDGTGH-----YLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAI 87
W +G G Y NA + S+ + S F +S +++ F EGRECVNCGAI
Sbjct: 116 WTTEGFGSAHAQFYSPNAAVMMGSWRSAYDPSGFQRSSPYESAMDFQFGEGRECVNCGAI 175
Query: 88 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
STPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 176 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 213
>gi|157169239|gb|ABV25954.1| GATA-binding transcription factor B1 [Capitella teleta]
Length = 750
Score = 117 bits (293), Expect = 2e-24, Method: Composition-based stats.
Identities = 47/60 (78%), Positives = 54/60 (90%)
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
+S+ +D +Y+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLVK RR
Sbjct: 388 SSVNADDPDYYVEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVKPQRR 447
Score = 117 bits (292), Expect = 2e-24, Method: Composition-based stats.
Identities = 55/124 (44%), Positives = 75/124 (60%), Gaps = 15/124 (12%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF-- 64
+S+ +D +Y+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L
Sbjct: 388 SSVNADDPDYYVEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPLVKP 444
Query: 65 ----------ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM 114
+ PV ++ G +C NC +T LWRR+ G +CNACGLY+K++ +
Sbjct: 445 QRRSTGGGSSTTSPVPNLGSRRLGLQCANCSTTTTTLWRRNNEGEPVCNACGLYYKLHQV 504
Query: 115 NRPL 118
RP+
Sbjct: 505 ARPI 508
>gi|195111290|ref|XP_002000212.1| GI22649 [Drosophila mojavensis]
gi|193916806|gb|EDW15673.1| GI22649 [Drosophila mojavensis]
Length = 517
Score = 117 bits (292), Expect = 2e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 68/106 (64%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 156 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSAT 210
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 211 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPL 256
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 46/51 (90%), Positives = 48/51 (94%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 156 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 206
>gi|390179286|ref|XP_003736855.1| GA17817, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859788|gb|EIM52928.1| GA17817, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 496
Score = 117 bits (292), Expect = 2e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 69/106 (65%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 115 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSAT 169
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 170 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 215
Score = 110 bits (276), Expect = 1e-22, Method: Composition-based stats.
Identities = 46/51 (90%), Positives = 48/51 (94%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 115 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 165
>gi|225544205|gb|ACN91521.1| pnr [Bactrocera dorsalis]
gi|225544207|gb|ACN91522.1| pnr [Bactrocera dorsalis]
Length = 540
Score = 117 bits (292), Expect = 2e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 68/106 (64%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 105 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSAT 159
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 160 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPL 205
Score = 112 bits (280), Expect = 5e-23, Method: Composition-based stats.
Identities = 56/96 (58%), Positives = 67/96 (69%), Gaps = 4/96 (4%)
Query: 34 WRRD--GTGHYLCNARLFST-SFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAIST 89
W D GT H A+ ++ S+ + + F +S I++ F EGRECVNCGAIST
Sbjct: 60 WTTDSFGTAHAQLPAQFYTMGSWRAAYDPTGFQRSSPYENAIDFQFGEGRECVNCGAIST 119
Query: 90 PLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
PLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 120 PLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 155
>gi|198454786|ref|XP_001359720.2| GA17817, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132953|gb|EAL28872.2| GA17817, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 532
Score = 116 bits (291), Expect = 2e-24, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 69/106 (65%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 154 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 205
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 206 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 251
Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats.
Identities = 51/69 (73%), Positives = 54/69 (78%), Gaps = 4/69 (5%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRG 130
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 154 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRRLGLC 213
Query: 131 NFNCYERST 139
NC R+T
Sbjct: 214 CTNCGTRTT 222
>gi|195157146|ref|XP_002019457.1| GL12215 [Drosophila persimilis]
gi|194116048|gb|EDW38091.1| GL12215 [Drosophila persimilis]
Length = 532
Score = 116 bits (291), Expect = 2e-24, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 69/106 (65%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 154 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 205
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 206 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 251
Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats.
Identities = 51/69 (73%), Positives = 54/69 (78%), Gaps = 4/69 (5%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRG 130
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 154 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRRLGLC 213
Query: 131 NFNCYERST 139
NC R+T
Sbjct: 214 CTNCGTRTT 222
>gi|18032000|gb|AAL32068.1| pannier-like protein [Calliphora vicina]
Length = 511
Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 68/106 (64%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 117 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSAT 171
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 172 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPL 217
Score = 111 bits (278), Expect = 9e-23, Method: Composition-based stats.
Identities = 46/51 (90%), Positives = 48/51 (94%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 117 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 167
>gi|225544236|gb|ACN91538.1| pnr [Rhagoletis juglandis]
Length = 556
Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 68/106 (64%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 105 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSTT 159
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 160 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPL 205
Score = 113 bits (282), Expect = 3e-23, Method: Composition-based stats.
Identities = 57/96 (59%), Positives = 67/96 (69%), Gaps = 4/96 (4%)
Query: 34 WRRD--GTGHYLCNARLFST-SFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAIST 89
W D GT H A+ +S S+ + + F +S I++ F EGRECVNCGAIST
Sbjct: 60 WTTDSFGTAHAQLPAQFYSMGSWRAAYDPTGFQRSSPYENAIDFQFGEGRECVNCGAIST 119
Query: 90 PLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
PLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 120 PLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 155
>gi|225544221|gb|ACN91531.1| pnr [Ceratitis capitata]
Length = 539
Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 68/106 (64%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 105 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSAT 159
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 160 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPL 205
Score = 112 bits (279), Expect = 6e-23, Method: Composition-based stats.
Identities = 56/96 (58%), Positives = 67/96 (69%), Gaps = 4/96 (4%)
Query: 34 WRRD--GTGHYLCNARLFST-SFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAIST 89
W D GT H A+ ++ S+ + + F +S I++ F EGRECVNCGAIST
Sbjct: 60 WTTDSFGTAHAQLPAQFYTMGSWRAAYDPTGFQRSSPYENAIDFQFGEGRECVNCGAIST 119
Query: 90 PLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
PLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 120 PLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 155
>gi|194767723|ref|XP_001965964.1| GF11891 [Drosophila ananassae]
gi|190619807|gb|EDV35331.1| GF11891 [Drosophila ananassae]
Length = 551
Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats.
Identities = 58/103 (56%), Positives = 68/103 (66%), Gaps = 6/103 (5%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + +
Sbjct: 175 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLTATR 229
Query: 77 E-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 230 RLGLRCTNCGTHTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 272
Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats.
Identities = 47/55 (85%), Positives = 49/55 (89%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 175 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATR 229
>gi|24647330|ref|NP_732102.1| pannier, isoform B [Drosophila melanogaster]
gi|442619352|ref|NP_001262621.1| pannier, isoform D [Drosophila melanogaster]
gi|23171433|gb|AAF55264.3| pannier, isoform B [Drosophila melanogaster]
gi|218506027|gb|ACK77655.1| RE17974p [Drosophila melanogaster]
gi|440217480|gb|AGB96001.1| pannier, isoform D [Drosophila melanogaster]
Length = 488
Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 69/106 (65%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 111 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSAT 165
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 166 ATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 211
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/98 (57%), Positives = 66/98 (67%), Gaps = 6/98 (6%)
Query: 34 WRRDGTGH-----YLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAI 87
W +G G Y NA + S+ + S F +S +++ F EGRECVNCGAI
Sbjct: 64 WTTEGFGSAHAQFYSPNAAVMMGSWRSAYDPSGFQRSSPYESAMDFQFGEGRECVNCGAI 123
Query: 88 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
STPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 124 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 161
>gi|20152127|gb|AAM11423.1| SD02611p [Drosophila melanogaster]
Length = 486
Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 69/106 (65%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 111 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSAT 165
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 166 ATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 211
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/98 (57%), Positives = 66/98 (67%), Gaps = 6/98 (6%)
Query: 34 WRRDGTGH-----YLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAI 87
W +G G Y NA + S+ + S F +S +++ F EGRECVNCGAI
Sbjct: 64 WTTEGFGSAHAQFYSPNAAVMMGSWRSAYDPSGFQRSSPYESAMDFQFGEGRECVNCGAI 123
Query: 88 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
STPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 124 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 161
>gi|195328561|ref|XP_002030983.1| GM24279 [Drosophila sechellia]
gi|194119926|gb|EDW41969.1| GM24279 [Drosophila sechellia]
Length = 494
Score = 116 bits (290), Expect = 3e-24, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 69/106 (65%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 163 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 214
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 215 ATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 260
Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats.
Identities = 61/116 (52%), Positives = 72/116 (62%), Gaps = 10/116 (8%)
Query: 34 WRRDGTGH-----YLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAI 87
W +G G Y NA + S+ + S F +S +++ F EGRECVNCGAI
Sbjct: 116 WTTEGFGSAHAQFYSPNAAVMMGSWRSAYDPSGFQRSSPYESAMDFQFGEGRECVNCGAI 175
Query: 88 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRGNFNCYERST 139
STPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R NC R+T
Sbjct: 176 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRRMGLCCTNCGTRTT 231
>gi|119112909|ref|XP_307949.3| AGAP002235-PA [Anopheles gambiae str. PEST]
gi|15420800|gb|AAK97462.1|AF395080_1 zinc finger transcription factor pannier [Anopheles gambiae]
gi|116132794|gb|EAA03705.3| AGAP002235-PA [Anopheles gambiae str. PEST]
Length = 537
Score = 116 bits (290), Expect = 4e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 68/106 (64%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 114 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSQT 168
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 169 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPL 214
Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats.
Identities = 46/51 (90%), Positives = 48/51 (94%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 114 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 164
>gi|225544200|gb|ACN91518.1| pnr [Bactrocera cucurbitae]
Length = 517
Score = 115 bits (289), Expect = 4e-24, Method: Composition-based stats.
Identities = 59/106 (55%), Positives = 68/106 (64%), Gaps = 9/106 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 105 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSAT 159
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 160 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPL 205
Score = 111 bits (278), Expect = 9e-23, Method: Composition-based stats.
Identities = 56/96 (58%), Positives = 67/96 (69%), Gaps = 4/96 (4%)
Query: 34 WRRD--GTGHYLCNARLFST-SFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAIST 89
W D GT H A+ ++ S+ + + F +S I++ F EGRECVNCGAIST
Sbjct: 60 WTTDSFGTAHAQLPAQFYTMGSWRAAYDPTGFQRSSPYENAIDFQFGEGRECVNCGAIST 119
Query: 90 PLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
PLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 120 PLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 155
>gi|442619350|ref|NP_001262620.1| pannier, isoform C [Drosophila melanogaster]
gi|440217479|gb|AGB96000.1| pannier, isoform C [Drosophila melanogaster]
Length = 537
Score = 115 bits (289), Expect = 4e-24, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 69/106 (65%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 163 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 214
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 215 ATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 260
Score = 112 bits (280), Expect = 6e-23, Method: Composition-based stats.
Identities = 61/116 (52%), Positives = 72/116 (62%), Gaps = 10/116 (8%)
Query: 34 WRRDGTGH-----YLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAI 87
W +G G Y NA + S+ + S F +S +++ F EGRECVNCGAI
Sbjct: 116 WTTEGFGSAHAQFYSPNAAVMMGSWRSAYDPSGFQRSSPYESAMDFQFGEGRECVNCGAI 175
Query: 88 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRGNFNCYERST 139
STPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R NC R+T
Sbjct: 176 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRRMGLCCTNCGTRTT 231
>gi|195501108|ref|XP_002097661.1| GE24361 [Drosophila yakuba]
gi|194183762|gb|EDW97373.1| GE24361 [Drosophila yakuba]
Length = 540
Score = 115 bits (289), Expect = 5e-24, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 69/106 (65%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 169 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 220
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 221 ATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 266
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 61/116 (52%), Positives = 72/116 (62%), Gaps = 10/116 (8%)
Query: 34 WRRDGTGH-----YLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAI 87
W +G G Y NA + S+ + S F +S +++ F EGRECVNCGAI
Sbjct: 122 WTTEGFGSAHAQFYPPNAAVMMGSWRSAYDPSGFQRSSPYESAMDFQFGEGRECVNCGAI 181
Query: 88 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRGNFNCYERST 139
STPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R NC R+T
Sbjct: 182 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRRMGLCCTNCGTRTT 237
>gi|194901188|ref|XP_001980134.1| GG16973 [Drosophila erecta]
gi|190651837|gb|EDV49092.1| GG16973 [Drosophila erecta]
Length = 535
Score = 115 bits (289), Expect = 5e-24, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 69/106 (65%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 165 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 216
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 217 ATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 262
Score = 112 bits (279), Expect = 6e-23, Method: Composition-based stats.
Identities = 61/116 (52%), Positives = 72/116 (62%), Gaps = 10/116 (8%)
Query: 34 WRRDGTGH-----YLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAI 87
W +G G Y NA + S+ + S F +S +++ F EGRECVNCGAI
Sbjct: 118 WTTEGFGSAHAQFYSPNAAVMMGSWRSAYDPSGFQRSSPYESAMDFQFGEGRECVNCGAI 177
Query: 88 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRGNFNCYERST 139
STPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R NC R+T
Sbjct: 178 STPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRRMGLCCTNCGTRTT 233
>gi|195452164|ref|XP_002073240.1| GK13255 [Drosophila willistoni]
gi|194169325|gb|EDW84226.1| GK13255 [Drosophila willistoni]
Length = 550
Score = 115 bits (288), Expect = 5e-24, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 69/106 (65%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 159 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 210
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 211 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVNRPL 256
Score = 111 bits (278), Expect = 8e-23, Method: Composition-based stats.
Identities = 51/69 (73%), Positives = 54/69 (78%), Gaps = 4/69 (5%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRG 130
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 159 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRRLGLC 218
Query: 131 NFNCYERST 139
NC R+T
Sbjct: 219 CTNCGTRTT 227
>gi|1078989|pir||S53811 BmGATA beta isoform 2 - silkworm (fragment)
gi|603165|gb|AAA67885.1| BmGATA beta isoform 2, partial [Bombyx mori]
Length = 327
Score = 115 bits (288), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 57/120 (47%), Positives = 68/120 (56%), Gaps = 9/120 (7%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY---------FIMSR 61
VE E+FTEGRECVNCGAI TPLW RDGTGHYLCNA G + R
Sbjct: 62 VELAEFFTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPLKQPRRLVRQR 121
Query: 62 FLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
+ P D+ G C NC +T LWRR+ G +CNACGLY K++ +NRPL +
Sbjct: 122 HAALAAPPHDMAAKRPGTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRPLTMK 181
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 51/61 (83%), Positives = 53/61 (86%), Gaps = 1/61 (1%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
VE E+FTEGRECVNCGAI TPLW RDGTGHYLCNACGLY+KMNGMNRPL KQPRRL
Sbjct: 62 VELAEFFTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPL-KQPRRLVRQ 120
Query: 129 R 129
R
Sbjct: 121 R 121
>gi|156538733|ref|XP_001607848.1| PREDICTED: GATA-binding factor 5-A-like [Nasonia vitripennis]
Length = 442
Score = 115 bits (287), Expect = 7e-24, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 69/106 (65%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F+EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 169 FSEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKING---MNRPL-----IKPTKRLT 220
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 221 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPL 266
Score = 110 bits (274), Expect = 2e-22, Method: Composition-based stats.
Identities = 50/69 (72%), Positives = 55/69 (79%), Gaps = 4/69 (5%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRG 130
F+EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHK+NGMNRPL+K +RL R
Sbjct: 169 FSEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKINGMNRPLIKPTKRLTATRRLGLC 228
Query: 131 NFNCYERST 139
NC R+T
Sbjct: 229 CTNCGTRTT 237
>gi|195389402|ref|XP_002053366.1| GJ23378 [Drosophila virilis]
gi|194151452|gb|EDW66886.1| GJ23378 [Drosophila virilis]
Length = 520
Score = 114 bits (286), Expect = 9e-24, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 68/106 (64%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 156 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 207
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 208 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPL 253
Score = 112 bits (279), Expect = 6e-23, Method: Composition-based stats.
Identities = 51/69 (73%), Positives = 54/69 (78%), Gaps = 4/69 (5%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRG 130
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 156 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRRLGLC 215
Query: 131 NFNCYERST 139
NC R+T
Sbjct: 216 CTNCGTRTT 224
>gi|54262117|ref|NP_001005725.1| GATA transcription factor e [Strongylocentrotus purpuratus]
gi|52001519|gb|AAU21562.1| GATA transcription factor [Strongylocentrotus purpuratus]
Length = 567
Score = 114 bits (286), Expect = 9e-24, Method: Composition-based stats.
Identities = 55/99 (55%), Positives = 63/99 (63%), Gaps = 3/99 (3%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR 79
GRECVNCGAISTPLWRRDGTGHYLCNA GY +R L + EG
Sbjct: 259 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGY---NRPLIKNPRRLQSGSRREGI 315
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 316 TCANCHTSTTTLWRRNKDGEPVCNACGLYFKLHGVNRPL 354
Score = 113 bits (282), Expect = 3e-23, Method: Composition-based stats.
Identities = 45/49 (91%), Positives = 47/49 (95%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG NRPL+K PRRL+
Sbjct: 259 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGYNRPLIKNPRRLQ 307
>gi|37781291|gb|AAP35028.1| GATA transcription factor alpha [Patiria miniata]
Length = 644
Score = 114 bits (286), Expect = 1e-23, Method: Composition-based stats.
Identities = 57/104 (54%), Positives = 67/104 (64%), Gaps = 7/104 (6%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F GRECVNCGAISTPLWRRDGTGHYLCNA G MSR L P ++ +
Sbjct: 354 FGIGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MSRPLIK--PQRRLQSGS 408
Query: 77 --EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
EG C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 409 RREGIVCANCHTTTTTLWRRNKEGEPVCNACGLYYKLHSVNRPL 452
Score = 111 bits (278), Expect = 9e-23, Method: Composition-based stats.
Identities = 52/96 (54%), Positives = 65/96 (67%), Gaps = 6/96 (6%)
Query: 37 DGTGHYLCNARL--FSTSFTGYFIMSRFLFASLPVEDIEYFTE----GRECVNCGAISTP 90
+G G Y + + +S F + + A P + + +T+ GRECVNCGAISTP
Sbjct: 310 NGLGAYYVSPEIASWSAPFDSSVALHPNMIARRPSPESDMWTQDFGIGRECVNCGAISTP 369
Query: 91 LWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
LWRRDGTGHYLCNACGLYHKMNGM+RPL+K RRL+
Sbjct: 370 LWRRDGTGHYLCNACGLYHKMNGMSRPLIKPQRRLQ 405
>gi|195036110|ref|XP_001989514.1| GH18759 [Drosophila grimshawi]
gi|193893710|gb|EDV92576.1| GH18759 [Drosophila grimshawi]
Length = 528
Score = 114 bits (286), Expect = 1e-23, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 68/106 (64%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 155 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 206
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 207 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPL 252
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 51/69 (73%), Positives = 54/69 (78%), Gaps = 4/69 (5%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRG 130
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 155 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRRLGLC 214
Query: 131 NFNCYERST 139
NC R+T
Sbjct: 215 CTNCGTRTT 223
>gi|340720269|ref|XP_003398563.1| PREDICTED: hypothetical protein LOC100651088 [Bombus terrestris]
Length = 440
Score = 114 bits (285), Expect = 1e-23, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 68/106 (64%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 178 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 229
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 230 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPL 275
Score = 111 bits (278), Expect = 9e-23, Method: Composition-based stats.
Identities = 51/69 (73%), Positives = 54/69 (78%), Gaps = 4/69 (5%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRG 130
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 178 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRRLGLC 237
Query: 131 NFNCYERST 139
NC R+T
Sbjct: 238 CTNCGTRTT 246
>gi|170043914|ref|XP_001849612.1| GATA transcription factor GATAd [Culex quinquefasciatus]
gi|167867187|gb|EDS30570.1| GATA transcription factor GATAd [Culex quinquefasciatus]
Length = 520
Score = 114 bits (284), Expect = 2e-23, Method: Composition-based stats.
Identities = 58/106 (54%), Positives = 68/106 (64%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 111 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 162
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 163 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPL 208
Score = 111 bits (278), Expect = 9e-23, Method: Composition-based stats.
Identities = 51/69 (73%), Positives = 54/69 (78%), Gaps = 4/69 (5%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRG 130
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 111 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRRLGLC 170
Query: 131 NFNCYERST 139
NC R+T
Sbjct: 171 CTNCGTRTT 179
>gi|157125522|ref|XP_001654368.1| GATA transcription factor (GATAd) [Aedes aegypti]
gi|108873603|gb|EAT37828.1| AAEL010221-PA [Aedes aegypti]
Length = 483
Score = 114 bits (284), Expect = 2e-23, Method: Composition-based stats.
Identities = 58/103 (56%), Positives = 66/103 (64%), Gaps = 4/103 (3%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA-SLPVEDIEYF 75
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L S +
Sbjct: 114 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIKPSKRLVSTATR 170
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 171 RLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPL 213
Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats.
Identities = 51/71 (71%), Positives = 54/71 (76%), Gaps = 6/71 (8%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL------RFD 128
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 114 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSTATRRLG 173
Query: 129 RGNFNCYERST 139
NC R+T
Sbjct: 174 LCCTNCGTRTT 184
>gi|56900910|gb|AAW31748.1| GATA transcription factor GATAd [Aedes aegypti]
Length = 483
Score = 114 bits (284), Expect = 2e-23, Method: Composition-based stats.
Identities = 58/103 (56%), Positives = 66/103 (64%), Gaps = 4/103 (3%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA-SLPVEDIEYF 75
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L S +
Sbjct: 114 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIKPSKRLVSTATR 170
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 171 RLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPL 213
Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats.
Identities = 51/71 (71%), Positives = 54/71 (76%), Gaps = 6/71 (8%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL------RFD 128
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 114 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSTATRRLG 173
Query: 129 RGNFNCYERST 139
NC R+T
Sbjct: 174 LCCTNCGTRTT 184
>gi|242020217|ref|XP_002430552.1| GATA-binding factor, putative [Pediculus humanus corporis]
gi|212515716|gb|EEB17814.1| GATA-binding factor, putative [Pediculus humanus corporis]
Length = 409
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 48/67 (71%), Positives = 56/67 (83%)
Query: 63 LFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP 122
+ AS + ++EGRECVNCG +STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP
Sbjct: 138 ISASASLSAFVDYSEGRECVNCGTVSTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP 197
Query: 123 RRLRFDR 129
+RL ++
Sbjct: 198 KRLTNNK 204
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/115 (48%), Positives = 70/115 (60%), Gaps = 4/115 (3%)
Query: 5 LFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF 64
+ AS + ++EGRECVNCG +STPLWRRDGTGHYLCNA G M+R L
Sbjct: 138 ISASASLSAFVDYSEGRECVNCGTVSTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLV 194
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
P G +C NC +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 195 KQ-PKRLTNNKRLGLQCSNCNTATTSLWRRNSLGEPVCNACGLYYKLHGVNRPLA 248
>gi|242002772|ref|XP_002436029.1| GATA binding factor-1B, putative [Ixodes scapularis]
gi|215499365|gb|EEC08859.1| GATA binding factor-1B, putative [Ixodes scapularis]
Length = 425
Score = 113 bits (282), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 56/84 (66%), Positives = 63/84 (75%), Gaps = 4/84 (4%)
Query: 51 TSFTGYFIMSRFLFASLPVEDIEYFT-EGRECVNCGAISTPLWRRDGTGHYLCNACGLYH 109
T T Y + ASL D+EYF EGRECVNCGAISTPLWRRDGTGHYLCNACGLY+
Sbjct: 338 TGLTSYTALVGDKRASL--ADLEYFGGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYN 395
Query: 110 KMNGMNRPLVKQPRRLRFDRGNFN 133
KMNG +RP++K PRRL RG +N
Sbjct: 396 KMNGAHRPIIKTPRRL-VSRGKWN 418
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/35 (91%), Positives = 33/35 (94%), Gaps = 1/35 (2%)
Query: 13 DIEYFT-EGRECVNCGAISTPLWRRDGTGHYLCNA 46
D+EYF EGRECVNCGAISTPLWRRDGTGHYLCNA
Sbjct: 356 DLEYFGGEGRECVNCGAISTPLWRRDGTGHYLCNA 390
>gi|285157858|gb|ADC35040.1| GATA456b [Themiste lageniformis]
Length = 496
Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats.
Identities = 57/114 (50%), Positives = 60/114 (52%), Gaps = 49/114 (42%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
+YF EGRECVNCGAISTPLWRRDGTGHYL
Sbjct: 222 DYFGEGRECVNCGAISTPLWRRDGTGHYL------------------------------- 250
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
RRDGTGHYLCNACGLYHKMNG+NRPL+K RRL D
Sbjct: 251 ------------------RRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLVND 286
>gi|161511589|gb|ABX71821.1| GATA1/2/3 [Paracentrotus lividus]
gi|270154815|gb|ACZ62636.1| GATA1/2/3 [Paracentrotus lividus]
Length = 430
Score = 112 bits (280), Expect = 4e-23, Method: Composition-based stats.
Identities = 55/101 (54%), Positives = 64/101 (63%), Gaps = 5/101 (4%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDG GHYLCNA G +R L P +
Sbjct: 215 TEGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAKRT 269
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC A +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 270 GTSCANCQATATTLWRRNPNGDPVCNACGLYYKLHGVNRPL 310
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
TEGRECVNCGA STPLWRRDG GHYLCNACGLYHKMNG NRPL+K RRL R +C
Sbjct: 215 TEGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNGQNRPLIKPKRRLSAKRTGTSC 273
>gi|47551055|ref|NP_999704.1| GATA transcription factor [Strongylocentrotus purpuratus]
gi|3702856|gb|AAC62960.1| GATA transcription factor [Strongylocentrotus purpuratus]
Length = 431
Score = 112 bits (280), Expect = 5e-23, Method: Composition-based stats.
Identities = 55/101 (54%), Positives = 64/101 (63%), Gaps = 5/101 (4%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDG GHYLCNA G +R L P +
Sbjct: 216 TEGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAKRT 270
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC A +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 271 GTSCANCQATATTLWRRNPNGDPVCNACGLYYKLHGVNRPL 311
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
TEGRECVNCGA STPLWRRDG GHYLCNACGLYHKMNG NRPL+K RRL R +C
Sbjct: 216 TEGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNGQNRPLIKPKRRLSAKRTGTSC 274
>gi|291045150|dbj|BAI82362.1| GATA transcription factor [Haemaphysalis longicornis]
Length = 319
Score = 112 bits (279), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 46/59 (77%), Positives = 54/59 (91%)
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
++EYF EGRECVNCG+ISTPLWRRDGTGHYLCNACGLY+KMNG++RP++K PRRL R
Sbjct: 8 ELEYFGEGRECVNCGSISTPLWRRDGTGHYLCNACGLYNKMNGVHRPVIKTPRRLSASR 66
Score = 111 bits (278), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 58/113 (51%), Positives = 69/113 (61%), Gaps = 12/113 (10%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLP 68
++EYF EGRECVNCG+ISTPLWRRDGTGHYLCNA G R L AS
Sbjct: 8 ELEYFGEGRECVNCGSISTPLWRRDGTGHYLCNACGLYNKMNGVHRPVIKTPRRLSASRR 67
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
V G C NC +T LWRR+ G +CNACGLY +++G+NRPL +
Sbjct: 68 V--------GLTCSNCQTGTTSLWRRNNVGEPVCNACGLYFRLHGVNRPLAMK 112
>gi|338722454|ref|XP_001495866.2| PREDICTED: transcription factor GATA-4-like [Equus caballus]
Length = 315
Score = 112 bits (279), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 49/64 (76%), Positives = 55/64 (85%)
Query: 66 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
S PV+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 75 SHPVDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRL 134
Query: 126 RFDR 129
R
Sbjct: 135 SASR 138
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/116 (50%), Positives = 69/116 (59%), Gaps = 12/116 (10%)
Query: 8 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFL 63
S PV+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L
Sbjct: 75 SHPVDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRL 134
Query: 64 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
AS V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 135 SASRRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 182
>gi|219518827|gb|AAI43480.1| GATA4 protein [Homo sapiens]
Length = 443
Score = 112 bits (279), Expect = 6e-23, Method: Composition-based stats.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 206 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 265
Query: 129 R 129
R
Sbjct: 266 R 266
Score = 110 bits (276), Expect = 1e-22, Method: Composition-based stats.
Identities = 57/112 (50%), Positives = 67/112 (59%), Gaps = 12/112 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 206 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 265
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 266 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 309
>gi|402877493|ref|XP_003902460.1| PREDICTED: transcription factor GATA-4 [Papio anubis]
Length = 443
Score = 112 bits (279), Expect = 6e-23, Method: Composition-based stats.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 206 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 265
Query: 129 R 129
R
Sbjct: 266 R 266
Score = 110 bits (276), Expect = 2e-22, Method: Composition-based stats.
Identities = 57/112 (50%), Positives = 67/112 (59%), Gaps = 12/112 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 206 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 265
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 266 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 309
>gi|297682265|ref|XP_002818845.1| PREDICTED: transcription factor GATA-4 isoform 1 [Pongo abelii]
Length = 443
Score = 112 bits (279), Expect = 6e-23, Method: Composition-based stats.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 206 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 265
Query: 129 R 129
R
Sbjct: 266 R 266
Score = 110 bits (276), Expect = 2e-22, Method: Composition-based stats.
Identities = 57/112 (50%), Positives = 67/112 (59%), Gaps = 12/112 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 206 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 265
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 266 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 309
>gi|297298911|ref|XP_001087008.2| PREDICTED: transcription factor GATA-4 isoform 1 [Macaca mulatta]
Length = 443
Score = 112 bits (279), Expect = 6e-23, Method: Composition-based stats.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 206 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 265
Query: 129 R 129
R
Sbjct: 266 R 266
Score = 110 bits (276), Expect = 2e-22, Method: Composition-based stats.
Identities = 57/112 (50%), Positives = 67/112 (59%), Gaps = 12/112 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 206 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 265
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 266 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 309
>gi|444525948|gb|ELV14225.1| Transcription factor GATA-4 [Tupaia chinensis]
Length = 253
Score = 112 bits (279), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 48/62 (77%), Positives = 54/62 (87%)
Query: 68 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRF 127
PV+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 15 PVDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSA 74
Query: 128 DR 129
R
Sbjct: 75 SR 76
Score = 110 bits (274), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/114 (50%), Positives = 68/114 (59%), Gaps = 12/114 (10%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFA 65
PV+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L A
Sbjct: 15 PVDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSA 74
Query: 66 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
S V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 75 SRRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 120
>gi|17402599|dbj|BAB78731.1| transcription factor GATA-4 [Mus musculus]
Length = 442
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 205 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 264
Query: 129 R 129
R
Sbjct: 265 R 265
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 57/112 (50%), Positives = 67/112 (59%), Gaps = 12/112 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 205 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 264
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 265 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|117276623|gb|ABK32792.1| GATA transcription factor 456 [Platynereis dumerilii]
Length = 364
Score = 112 bits (279), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 59/119 (49%), Positives = 71/119 (59%), Gaps = 12/119 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVE 70
+YF EGRECVNCGAISTPLWRRDGTGHYLCNA G R L AS V
Sbjct: 252 DYFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGLNRPLIKPQRRLSASRRV- 310
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
G C NC +T LWRR+ G +CNACGLY+K++G+NRPL + ++ +
Sbjct: 311 -------GLSCANCHTTTTTLWRRNNEGEPVCNACGLYYKLHGVNRPLAMKKEGIQTKK 362
>gi|344254252|gb|EGW10356.1| Endonuclease VIII-like 2 [Cricetulus griseus]
Length = 509
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 20 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 79
Query: 129 R 129
R
Sbjct: 80 R 80
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 57/112 (50%), Positives = 67/112 (59%), Gaps = 12/112 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 20 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 79
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 80 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 123
>gi|284005110|ref|NP_001164701.1| gata4 transcription factor [Saccoglossus kowalevskii]
gi|283462246|gb|ADB22417.1| gata4 transcription factor [Saccoglossus kowalevskii]
Length = 516
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 63/147 (42%), Positives = 80/147 (54%), Gaps = 24/147 (16%)
Query: 6 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA 65
F+S P EGRECVNCGA+STPLWRRDGTGHYLCNA G M+R L
Sbjct: 290 FSSFP-------AEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK 339
Query: 66 SLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV----- 119
P + G +C NC T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 340 --PQRRLSASRRVGLQCANCHTTQTTLWRRNNEGEPVCNACGLYFKLHSVNRPLALKKDT 397
Query: 120 -----KQPRRLRFDR-GNFNCYERSTL 140
++P+ D+ GN N ++S +
Sbjct: 398 IQTRKRKPKNTNSDKNGNQNMDDKSNI 424
>gi|57546103|gb|AAW51922.1| GATA binding protein 4 [Homo sapiens]
Length = 442
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 265
Score = 110 bits (276), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTALWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|33188461|ref|NP_002043.2| transcription factor GATA-4 [Homo sapiens]
gi|215274105|sp|P43694.2|GATA4_HUMAN RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|75517731|gb|AAI01581.1| GATA binding protein 4 [Homo sapiens]
gi|85397947|gb|AAI05109.1| GATA binding protein 4 [Homo sapiens]
gi|219518812|gb|AAI43435.1| GATA binding protein 4 [Homo sapiens]
Length = 442
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 265
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|395842329|ref|XP_003793970.1| PREDICTED: transcription factor GATA-4 [Otolemur garnettii]
Length = 442
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 210 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 264
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 210 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 266
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 267 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 307
>gi|332244648|ref|XP_003271486.1| PREDICTED: transcription factor GATA-4 [Nomascus leucogenys]
Length = 442
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 265
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|395739374|ref|XP_003777251.1| PREDICTED: transcription factor GATA-4 [Pongo abelii]
Length = 442
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 265
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|387541612|gb|AFJ71433.1| transcription factor GATA-4 [Macaca mulatta]
Length = 442
Score = 112 bits (279), Expect = 7e-23, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 265
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|300797798|ref|NP_001179806.1| transcription factor GATA-4 [Bos taurus]
gi|296484958|tpg|DAA27073.1| TPA: GATA binding protein 4 [Bos taurus]
Length = 442
Score = 111 bits (278), Expect = 8e-23, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 265
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|114326345|ref|NP_001041577.1| transcription factor GATA-4 [Canis lupus familiaris]
gi|122142221|sp|Q0Q0E4.1|GATA4_CANFA RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|110554951|gb|ABG75570.1| GATA binding protein 4 [Canis lupus familiaris]
Length = 442
Score = 111 bits (278), Expect = 8e-23, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 265
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|508484|gb|AAA58496.1| putative [Homo sapiens]
Length = 442
Score = 111 bits (278), Expect = 8e-23, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 265
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|403307497|ref|XP_003944229.1| PREDICTED: transcription factor GATA-4 [Saimiri boliviensis
boliviensis]
Length = 442
Score = 111 bits (278), Expect = 8e-23, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 265
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|291385798|ref|XP_002709484.1| PREDICTED: GATA-4 zinc-finger transcription factor [Oryctolagus
cuniculus]
Length = 443
Score = 111 bits (278), Expect = 8e-23, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 265
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|410956538|ref|XP_003984899.1| PREDICTED: transcription factor GATA-4 [Felis catus]
Length = 441
Score = 111 bits (278), Expect = 9e-23, Method: Composition-based stats.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 204 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 263
Query: 129 R 129
R
Sbjct: 264 R 264
Score = 110 bits (274), Expect = 2e-22, Method: Composition-based stats.
Identities = 57/112 (50%), Positives = 67/112 (59%), Gaps = 12/112 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 204 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 263
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 264 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 307
>gi|46909569|ref|NP_032118.2| transcription factor GATA-4 [Mus musculus]
gi|5882286|gb|AAD55266.1|AF179424_1 transcription factor GATA-4 [Mus musculus]
gi|148704120|gb|EDL36067.1| GATA binding protein 4 [Mus musculus]
gi|223460984|gb|AAI37825.1| GATA binding protein 4 [Mus musculus]
Length = 441
Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 210 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 264
Score = 110 bits (274), Expect = 3e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 210 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 266
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 267 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 307
>gi|25282465|ref|NP_653331.1| transcription factor GATA-4 [Rattus norvegicus]
gi|1169847|sp|P46152.1|GATA4_RAT RecName: Full=Transcription factor GATA-4; AltName:
Full=DNA-binding protein GATA-GT2; AltName:
Full=GATA-binding factor 4
gi|437670|gb|AAA16159.1| DNA binding protein [Rattus norvegicus]
gi|149030263|gb|EDL85319.1| GATA binding protein 4 [Rattus norvegicus]
Length = 440
Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 210 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 264
Score = 110 bits (274), Expect = 3e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 210 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 266
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 267 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 307
>gi|307198085|gb|EFN79138.1| GATA-binding factor A [Harpegnathos saltator]
Length = 375
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/107 (55%), Positives = 67/107 (62%), Gaps = 7/107 (6%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L E+ T
Sbjct: 165 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIKPSKRLMSEFQT 221
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 222 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPLA 268
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/51 (90%), Positives = 48/51 (94%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 165 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 215
>gi|344281251|ref|XP_003412393.1| PREDICTED: transcription factor GATA-4-like [Loxodonta africana]
Length = 611
Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 265
Score = 109 bits (273), Expect = 3e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
>gi|148222826|ref|NP_001084335.1| GATA-binding factor 3 [Xenopus laevis]
gi|83406000|gb|AAI10755.1| XGATA-3 protein [Xenopus laevis]
Length = 435
Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats.
Identities = 57/114 (50%), Positives = 67/114 (58%), Gaps = 6/114 (5%)
Query: 6 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA 65
F+S P TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L
Sbjct: 239 FSSKPRPKTRSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK 295
Query: 66 SLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
P + G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 296 --PKRRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 347
Score = 110 bits (276), Expect = 1e-22, Method: Composition-based stats.
Identities = 49/66 (74%), Positives = 51/66 (77%)
Query: 64 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR 123
F+S P TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K R
Sbjct: 239 FSSKPRPKTRSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKR 298
Query: 124 RLRFDR 129
RL R
Sbjct: 299 RLSAAR 304
>gi|603164|gb|AAA67886.1| BmGATA beta isoform 1, partial [Bombyx mori]
Length = 313
Score = 110 bits (276), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/111 (52%), Positives = 68/111 (61%), Gaps = 5/111 (4%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVE 70
VE E+FTEGRECVNCGAI TPLW RDGTGHYLCNA G M+R L P
Sbjct: 62 VELAEFFTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNG---MNRPL--KQPRR 116
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
+ G C NC +T LWRR+ G +CNACGLY K++ +NRPL +
Sbjct: 117 LMAAKRPGTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRPLTMK 167
Score = 107 bits (267), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/57 (87%), Positives = 52/57 (91%), Gaps = 1/57 (1%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
VE E+FTEGRECVNCGAI TPLW RDGTGHYLCNACGLY+KMNGMNRPL KQPRRL
Sbjct: 62 VELAEFFTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPL-KQPRRL 117
>gi|37781177|gb|AAP34298.1| GATA transcription factor [Patiria miniata]
Length = 676
Score = 110 bits (276), Expect = 1e-22, Method: Composition-based stats.
Identities = 52/95 (54%), Positives = 64/95 (67%), Gaps = 6/95 (6%)
Query: 37 DGTGHYLCNARL--FSTSFTGYFIMSRFLFASLPVEDIEYFTE----GRECVNCGAISTP 90
+G G Y + + +S F + + A P + + +T+ GRECVNCGAISTP
Sbjct: 310 NGLGAYYVSPEIASWSAPFDSSVALHPNMIARRPSPESDMWTQDFGIGRECVNCGAISTP 369
Query: 91 LWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
LWRRDGTGHYLCNACGLYHKMNGM+RPL+K RRL
Sbjct: 370 LWRRDGTGHYLCNACGLYHKMNGMSRPLIKPQRRL 404
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 59/134 (44%), Positives = 66/134 (49%), Gaps = 35/134 (26%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA----------- 65
F GRECVNCGAISTPLWRRDGTGHYLCNA G MSR L
Sbjct: 354 FGIGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MSRPLIKPQRRLDELGEL 410
Query: 66 --SLPVEDIEYFT-------------------EGRECVNCGAISTPLWRRDGTGHYLCNA 104
P IE EG C NC +T LWRR+ G +CNA
Sbjct: 411 CEKAPDMTIEGLDPNLLRFKDRRWKPQSGSRREGIVCANCHTTTTTLWRRNKEGEPVCNA 470
Query: 105 CGLYHKMNGMNRPL 118
CGLY+K++ +NRPL
Sbjct: 471 CGLYYKLHSVNRPL 484
>gi|313224233|emb|CBY20022.1| unnamed protein product [Oikopleura dioica]
Length = 483
Score = 110 bits (276), Expect = 1e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 65/102 (63%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G+ +R L P + +
Sbjct: 255 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGH---NRPLIK--PKKRLSAARR 309
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 310 AGTSCSNCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVNRPL 351
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 49/72 (68%), Positives = 56/72 (77%), Gaps = 5/72 (6%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRGN 131
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K +RL R
Sbjct: 255 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRPLIKPKKRLSAARRAGTSC 314
Query: 132 FNCY-ERSTLFR 142
NC+ ++TL+R
Sbjct: 315 SNCHTTQTTLWR 326
>gi|126303987|ref|XP_001381638.1| PREDICTED: transcription factor GATA-4 [Monodelphis domestica]
Length = 451
Score = 110 bits (276), Expect = 1e-22, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 222 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 276
Score = 109 bits (272), Expect = 4e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 222 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 278
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 279 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 319
>gi|3183530|sp|Q08369.3|GATA4_MOUSE RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|1815679|gb|AAB42015.1| GATA-4 [Mus musculus]
Length = 440
Score = 110 bits (276), Expect = 1e-22, Method: Composition-based stats.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 210 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 264
Score = 109 bits (272), Expect = 4e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 210 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 266
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 267 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 307
>gi|112984186|ref|NP_001037446.1| transcription factor BCFI [Bombyx mori]
gi|483331|gb|AAA65734.1| transcription factor BCFI [Bombyx mori]
Length = 509
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/109 (53%), Positives = 67/109 (61%), Gaps = 5/109 (4%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVE 70
VE E+FTEGRECVNCGAI TPLW RDGTGHYLCNA G M+R L P
Sbjct: 257 VELAEFFTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNG---MNRPL--KQPRR 311
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 312 LMAAKRPGTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRPLT 360
Score = 107 bits (267), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/57 (87%), Positives = 52/57 (91%), Gaps = 1/57 (1%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
VE E+FTEGRECVNCGAI TPLW RDGTGHYLCNACGLY+KMNGMNRPL KQPRRL
Sbjct: 257 VELAEFFTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPL-KQPRRL 312
>gi|1730200|sp|P52167.1|GATB_BOMMO RecName: Full=Transcription factor BCFI; AltName: Full=BmGATA-beta;
Short=GATA-beta
Length = 508
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/109 (53%), Positives = 67/109 (61%), Gaps = 5/109 (4%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVE 70
VE E+FTEGRECVNCGAI TPLW RDGTGHYLCNA G M+R L P
Sbjct: 257 VELAEFFTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNG---MNRPL--KQPRR 311
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 312 LMAAKRPGTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRPLT 360
Score = 107 bits (267), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/57 (87%), Positives = 52/57 (91%), Gaps = 1/57 (1%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
VE E+FTEGRECVNCGAI TPLW RDGTGHYLCNACGLY+KMNGMNRPL KQPRRL
Sbjct: 257 VELAEFFTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPL-KQPRRL 312
>gi|327288072|ref|XP_003228752.1| PREDICTED: transcription factor GATA-4-like [Anolis carolinensis]
Length = 411
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/62 (77%), Positives = 54/62 (87%), Gaps = 3/62 (4%)
Query: 71 DIEYF---TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRF 127
++EYF +EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 175 NLEYFDDYSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSA 234
Query: 128 DR 129
R
Sbjct: 235 SR 236
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 68/114 (59%), Gaps = 15/114 (13%)
Query: 13 DIEYF---TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFA 65
++EYF +EGRECVNCGA+STPLWRRDGTGHYLCNA G R L A
Sbjct: 175 NLEYFDDYSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSA 234
Query: 66 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
S + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 235 SRRI--------GLMCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 280
>gi|49256546|gb|AAH71107.1| Gata4 protein [Xenopus laevis]
Length = 392
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
+E + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 172 IEFFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 231
Query: 129 R 129
R
Sbjct: 232 R 232
Score = 106 bits (265), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
+E + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 172 IEFFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 231
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 232 RRV--------GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 276
>gi|148235016|ref|NP_001084098.1| transcription factor GATA-4 [Xenopus laevis]
gi|2494683|sp|Q91677.1|GATA4_XENLA RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4; Short=xGATA-4
gi|1209878|gb|AAB05647.1| transcription factor xGATA-4 [Xenopus laevis]
Length = 392
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
+E + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 172 IEFFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 231
Query: 129 R 129
R
Sbjct: 232 R 232
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
+E + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 172 IEFFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 231
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 232 RRV--------GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 276
>gi|52345846|ref|NP_001004967.1| GATA binding protein 3 [Xenopus (Silurana) tropicalis]
gi|49522444|gb|AAH75479.1| GATA binding protein 3 [Xenopus (Silurana) tropicalis]
Length = 437
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 57/114 (50%), Positives = 66/114 (57%), Gaps = 6/114 (5%)
Query: 6 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA 65
F S P TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L
Sbjct: 240 FPSKPRPKTRSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK 296
Query: 66 SLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
P + G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 297 --PKRRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 348
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 49/66 (74%), Positives = 50/66 (75%)
Query: 64 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR 123
F S P TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K R
Sbjct: 240 FPSKPRPKTRSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKR 299
Query: 124 RLRFDR 129
RL R
Sbjct: 300 RLSAAR 305
>gi|189442167|gb|AAI67280.1| GATA binding protein 4 [Xenopus (Silurana) tropicalis]
Length = 394
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
+E + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 174 IEFFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 129 R 129
R
Sbjct: 234 R 234
Score = 106 bits (265), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
+E + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 174 IEFFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 234 RRV--------GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 278
>gi|238863840|gb|ACR66214.1| transcription factor GATA123 [Branchiostoma floridae]
Length = 473
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 64/102 (62%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 271 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 325
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 326 AGTQCANCKTTTTTLWRRNQNGDPVCNACGLYYKLHAVNRPL 367
Score = 110 bits (274), Expect = 3e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 271 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 324
>gi|47523656|ref|NP_999458.1| transcription factor GATA-4 [Sus scrofa]
gi|22651761|gb|AAM63546.1| transcription factor GATA-4 [Sus scrofa]
Length = 442
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGISRPLIKPQRRLSASRRV--- 267
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 268 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 308
Score = 109 bits (273), Expect = 3e-22, Method: Composition-based stats.
Identities = 45/55 (81%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG++RPL+K RRL R
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGISRPLIKPQRRLSASR 265
>gi|119220894|gb|ABL61528.1| GATA transcription factor pannier [Drosophila quadrilineata]
Length = 344
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 59/107 (55%), Positives = 68/107 (63%), Gaps = 9/107 (8%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + T
Sbjct: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLVSAT 59
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 60 ATRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPLA 106
Score = 105 bits (263), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 46/51 (90%), Positives = 48/51 (94%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL
Sbjct: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRL 55
>gi|62858609|ref|NP_001016949.1| GATA binding protein 4 [Xenopus (Silurana) tropicalis]
gi|89266736|emb|CAJ83964.1| GATA binding protein 4 [Xenopus (Silurana) tropicalis]
Length = 394
Score = 110 bits (274), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
+E + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 174 IEFFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 129 R 129
R
Sbjct: 234 R 234
Score = 106 bits (265), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
+E + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 174 IEFFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 234 RRV--------GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 278
>gi|147905907|ref|NP_001085355.1| GATA binding protein 4 [Xenopus laevis]
gi|49256044|gb|AAH71122.1| MGC81427 protein [Xenopus laevis]
Length = 394
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
+E + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 174 IEFFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 129 R 129
R
Sbjct: 234 R 234
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 68/115 (59%), Gaps = 12/115 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
+E + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 174 IEFFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL +
Sbjct: 234 RRV--------GLCCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMK 280
>gi|7381420|gb|AAF61481.1|AF184154_1 zinc finger DNA binding protein pannier [Ceratitis capitata]
Length = 324
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 58/106 (54%), Positives = 70/106 (66%), Gaps = 4/106 (3%)
Query: 28 AISTPLWRRD--GTGHYLCNARLFST-SFTGYFIMSRFLFASLPVEDIEY-FTEGRECVN 83
A + W D GT H A+ ++ S+ + + F +S I++ F EGRECVN
Sbjct: 54 AAAQNAWTTDSFGTAHAQLPAQFYTMGSWRAAYDPTGFQRSSPYENAIDFQFGEGRECVN 113
Query: 84 CGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
CGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 114 CGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATR 159
Score = 108 bits (270), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 62/118 (52%), Positives = 75/118 (63%), Gaps = 14/118 (11%)
Query: 8 SLPVED-IEY-FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA 65
S P E+ I++ F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L
Sbjct: 94 SSPYENAIDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-- 148
Query: 66 SLPVEDIEYFTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
++ + T R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 149 ---IKPSKRLTATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPLA 203
>gi|328721865|ref|XP_001943353.2| PREDICTED: endothelial transcription factor GATA-2-like
[Acyrthosiphon pisum]
Length = 386
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 58/109 (53%), Positives = 68/109 (62%), Gaps = 11/109 (10%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F +GRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + + F
Sbjct: 161 FGDGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLVK--PAKRLVAFE 215
Query: 77 E------GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NCG T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 216 TASNRRLGLSCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHGVNRPLT 264
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 46/51 (90%), Positives = 48/51 (94%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
F +GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL
Sbjct: 161 FGDGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPAKRL 211
>gi|332862667|ref|XP_528070.3| PREDICTED: transcription factor GATA-4 [Pan troglodytes]
Length = 408
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 171 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 230
Query: 129 R 129
R
Sbjct: 231 R 231
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 171 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 230
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 231 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 275
>gi|426358838|ref|XP_004046698.1| PREDICTED: transcription factor GATA-4 [Gorilla gorilla gorilla]
Length = 382
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 145 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 204
Query: 129 R 129
R
Sbjct: 205 R 205
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 145 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 204
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 205 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 249
>gi|397467354|ref|XP_003805387.1| PREDICTED: transcription factor GATA-4 [Pan paniscus]
Length = 386
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 149 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 208
Query: 129 R 129
R
Sbjct: 209 R 209
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 149 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 208
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 209 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 253
>gi|383863209|ref|XP_003707074.1| PREDICTED: GATA-binding factor A-like [Megachile rotundata]
Length = 369
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/107 (54%), Positives = 68/107 (63%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 111 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 162
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 163 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPLA 209
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 47/55 (85%), Positives = 49/55 (89%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 111 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATR 165
>gi|270008083|gb|EFA04531.1| pannier [Tribolium castaneum]
Length = 407
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/107 (54%), Positives = 68/107 (63%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 119 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 170
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 171 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPLA 217
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 47/55 (85%), Positives = 49/55 (89%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 119 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATR 173
>gi|119586025|gb|EAW65621.1| GATA binding protein 4, isoform CRA_b [Homo sapiens]
Length = 417
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 180 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 239
Query: 129 R 129
R
Sbjct: 240 R 240
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 180 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 239
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 240 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 284
>gi|91094885|ref|XP_973051.1| PREDICTED: similar to AGAP002235-PA [Tribolium castaneum]
Length = 395
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/107 (54%), Positives = 68/107 (63%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 119 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 170
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 171 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPLA 217
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 47/55 (85%), Positives = 49/55 (89%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 119 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATR 173
>gi|393910073|gb|EJD75720.1| hypothetical protein LOAG_17191 [Loa loa]
Length = 395
Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 53/103 (51%), Positives = 67/103 (65%), Gaps = 4/103 (3%)
Query: 16 YFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF 75
+ +EGRECVNCGA+ TPLWRRDGTGHYLCNA G +R L + +
Sbjct: 225 FTSEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLVKPKKRQSAQKR 281
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
T G ECVNC +T LWRR+ G +CNACGLYHK++ ++RP+
Sbjct: 282 T-GIECVNCKTNNTTLWRRNSLGQPVCNACGLYHKLHNISRPI 323
Score = 102 bits (253), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 42/56 (75%), Positives = 47/56 (83%)
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ +EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG NRPLVK +R +
Sbjct: 225 FTSEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLVKPKKRQSAQK 280
>gi|350423466|ref|XP_003493491.1| PREDICTED: GATA-binding factor 5-A-like, partial [Bombus impatiens]
Length = 317
Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 57/105 (54%), Positives = 68/105 (64%), Gaps = 12/105 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 49 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 100
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
R C NCG ++T LWRR+ G +CNACGLY K++G+NRP
Sbjct: 101 ATRRLGLCCTNCGTLTTTLWRRNNEGEPVCNACGLYFKLHGVNRP 145
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 47/55 (85%), Positives = 49/55 (89%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 49 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATR 103
>gi|332016555|gb|EGI57436.1| GATA-binding factor A [Acromyrmex echinatior]
Length = 407
Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/107 (54%), Positives = 68/107 (63%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 159 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 210
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 211 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPLA 257
Score = 108 bits (269), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 47/55 (85%), Positives = 49/55 (89%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 159 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATR 213
>gi|195037647|ref|XP_001990272.1| GH19248 [Drosophila grimshawi]
gi|193894468|gb|EDV93334.1| GH19248 [Drosophila grimshawi]
Length = 696
Score = 109 bits (272), Expect = 4e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 467 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 523
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 524 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 568
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 467 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 515
>gi|281347691|gb|EFB23275.1| hypothetical protein PANDA_012893 [Ailuropoda melanoleuca]
Length = 347
Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 110 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 169
Query: 129 R 129
R
Sbjct: 170 R 170
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 110 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 169
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 170 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 214
>gi|440913584|gb|ELR63022.1| Transcription factor GATA-4, partial [Bos grunniens mutus]
Length = 268
Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 31 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 90
Query: 129 R 129
R
Sbjct: 91 R 91
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 31 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 90
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 91 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 135
>gi|322782943|gb|EFZ10661.1| hypothetical protein SINV_07216 [Solenopsis invicta]
Length = 407
Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/107 (54%), Positives = 68/107 (63%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 157 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 208
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 209 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPLA 255
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 47/55 (85%), Positives = 49/55 (89%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 157 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATR 211
>gi|324507780|gb|ADY43293.1| Transcription factor GATA-3 [Ascaris suum]
Length = 397
Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 53/101 (52%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA+ TPLWRRDGTGHYLCNA G +R L + + T
Sbjct: 227 SEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLVKPKKRQSAQKRT- 282
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G ECVNC +T LWRR+ G +CNACGLYHK++ ++RP+
Sbjct: 283 GIECVNCKTNNTTLWRRNAHGQPVCNACGLYHKLHNISRPI 323
Score = 101 bits (252), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 42/54 (77%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG NRPLVK +R +
Sbjct: 227 SEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLVKPKKRQSAQK 280
>gi|221378310|ref|NP_001138026.1| grain, isoform B [Drosophila melanogaster]
gi|220903030|gb|ACL83485.1| grain, isoform B [Drosophila melanogaster]
Length = 699
Score = 109 bits (272), Expect = 4e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 470 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 526
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 527 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 571
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 470 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 518
>gi|195390588|ref|XP_002053950.1| GJ23061 [Drosophila virilis]
gi|194152036|gb|EDW67470.1| GJ23061 [Drosophila virilis]
Length = 686
Score = 109 bits (272), Expect = 4e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 457 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 513
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 514 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 558
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 457 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 505
>gi|195330796|ref|XP_002032089.1| GM26366 [Drosophila sechellia]
gi|194121032|gb|EDW43075.1| GM26366 [Drosophila sechellia]
Length = 703
Score = 109 bits (272), Expect = 4e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 474 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 530
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 531 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 575
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 474 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 522
>gi|195499000|ref|XP_002096762.1| GE24873 [Drosophila yakuba]
gi|194182863|gb|EDW96474.1| GE24873 [Drosophila yakuba]
Length = 707
Score = 109 bits (272), Expect = 4e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 478 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 534
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 535 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 579
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 478 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 526
>gi|15593990|emb|CAC69835.1| gata4 protein [Bos taurus]
Length = 266
Score = 109 bits (272), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 29 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 88
Query: 129 R 129
R
Sbjct: 89 R 89
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 29 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 88
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 89 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 133
>gi|426222407|ref|XP_004005383.1| PREDICTED: transcription factor GATA-4 [Ovis aries]
Length = 353
Score = 109 bits (272), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 52/61 (85%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL
Sbjct: 116 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAS 175
Query: 129 R 129
R
Sbjct: 176 R 176
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 116 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAS 175
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 176 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 220
>gi|162416003|gb|ABX89306.1| pannier protein, partial [Megaselia abdita]
Length = 206
Score = 108 bits (271), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 58/104 (55%), Positives = 67/104 (64%), Gaps = 6/104 (5%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L P + +
Sbjct: 102 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPLIK--PSKRLTATR 156
Query: 77 E-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 157 RLGLSCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPLA 200
Score = 108 bits (270), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 57/105 (54%), Positives = 68/105 (64%), Gaps = 5/105 (4%)
Query: 42 YLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY-FTEGRECVNCGAISTPLWRRDGTGHY 100
Y NA + + T + + F A+ +++ F EGRECVNCGAISTPLWRRDGTGHY
Sbjct: 68 YAQNAMMMGSWRTAFDPTAAFHQANPYDSSMDFQFGEGRECVNCGAISTPLWRRDGTGHY 127
Query: 101 LCNACGLYHKMNGMNRPLVKQPRRL----RFDRGNFNCYERSTLF 141
LCNACGLYHKMNGMNRPL+K +RL R NC R+T
Sbjct: 128 LCNACGLYHKMNGMNRPLIKPSKRLTATRRLGLSCTNCGTRTTTL 172
>gi|195145358|ref|XP_002013663.1| GL23278 [Drosophila persimilis]
gi|194102606|gb|EDW24649.1| GL23278 [Drosophila persimilis]
Length = 705
Score = 108 bits (271), Expect = 5e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 476 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 532
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 533 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 577
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 476 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 524
>gi|442617963|ref|NP_001262366.1| grain, isoform C [Drosophila melanogaster]
gi|440217190|gb|AGB95748.1| grain, isoform C [Drosophila melanogaster]
Length = 712
Score = 108 bits (271), Expect = 6e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 470 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 526
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 527 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 571
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 470 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 518
>gi|198452168|ref|XP_001358657.2| GA21945, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198131814|gb|EAL27798.2| GA21945, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 699
Score = 108 bits (271), Expect = 6e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 470 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 526
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 527 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 571
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 470 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 518
>gi|224048927|ref|XP_002186842.1| PREDICTED: transcription factor GATA-4 [Taeniopygia guttata]
Length = 411
Score = 108 bits (271), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 51/75 (68%), Positives = 58/75 (77%), Gaps = 3/75 (4%)
Query: 58 IMSRFLFASLPVEDIEYF---TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM 114
+ SR A+ +IE+F +EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+
Sbjct: 161 LQSRGTPAAARHANIEFFDDYSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGI 220
Query: 115 NRPLVKQPRRLRFDR 129
NRPL K RRL R
Sbjct: 221 NRPLFKPQRRLSASR 235
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/114 (51%), Positives = 69/114 (60%), Gaps = 15/114 (13%)
Query: 13 DIEYF---TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFA 65
+IE+F +EGRECVNCGA+STPLWRRDGTGHYLCNA G F R L A
Sbjct: 174 NIEFFDDYSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLFKPQRRLSA 233
Query: 66 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
S V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 234 SRRV--------GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 279
>gi|194746249|ref|XP_001955593.1| GF16170 [Drosophila ananassae]
gi|190628630|gb|EDV44154.1| GF16170 [Drosophila ananassae]
Length = 703
Score = 108 bits (271), Expect = 6e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 474 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 530
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 531 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 575
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 474 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 522
>gi|117276621|gb|ABK32791.1| GATA transcription factor 123 [Platynereis dumerilii]
Length = 523
Score = 108 bits (271), Expect = 6e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 64/102 (62%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 287 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---ANRPLIK--PKRRLSAARR 341
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 342 AGTSCANCGTTTTTLWRRNHNGDPVCNACGLYYKLHSVNRPL 383
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 287 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGANRPLIKPKRRLSAAR 340
>gi|326916751|ref|XP_003204668.1| PREDICTED: transcription factor GATA-4-like [Meleagris gallopavo]
Length = 410
Score = 108 bits (270), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 51/77 (66%), Positives = 59/77 (76%), Gaps = 3/77 (3%)
Query: 56 YFIMSRFLFASLPVEDIEYF---TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 112
+ + SR A+ +IE+F +EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMN
Sbjct: 159 HNLQSRGTPAAARHANIEFFDDYSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMN 218
Query: 113 GMNRPLVKQPRRLRFDR 129
G+NRPL K RRL R
Sbjct: 219 GINRPLFKPQRRLSASR 235
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/126 (49%), Positives = 73/126 (57%), Gaps = 15/126 (11%)
Query: 1 MSRFLFASLPVEDIEYF---TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY- 56
SR A+ +IE+F +EGRECVNCGA+STPLWRRDGTGHYLCNA G
Sbjct: 162 QSRGTPAAARHANIEFFDDYSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGIN 221
Query: 57 ---FIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG 113
F R L AS V G C NC +T LWRR+ G +CNACGLY K++G
Sbjct: 222 RPLFKPQRRLSASRRV--------GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHG 273
Query: 114 MNRPLV 119
+ RPL
Sbjct: 274 VPRPLA 279
>gi|195453771|ref|XP_002073935.1| GK12883 [Drosophila willistoni]
gi|194170020|gb|EDW84921.1| GK12883 [Drosophila willistoni]
Length = 491
Score = 108 bits (270), Expect = 6e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 263 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 319
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 320 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 364
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 263 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 311
>gi|307170386|gb|EFN62698.1| GATA-binding factor A [Camponotus floridanus]
Length = 638
Score = 108 bits (270), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 58/107 (54%), Positives = 68/107 (63%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 154 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 205
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 206 ATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPLA 252
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 47/55 (85%), Positives = 49/55 (89%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 154 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATR 208
>gi|390178038|ref|XP_003736551.1| GA21945, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859299|gb|EIM52624.1| GA21945, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 488
Score = 108 bits (270), Expect = 7e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 259 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 315
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 316 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 360
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 259 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 307
>gi|50745252|ref|XP_420041.1| PREDICTED: transcription factor GATA-4 [Gallus gallus]
Length = 410
Score = 108 bits (270), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 51/77 (66%), Positives = 59/77 (76%), Gaps = 3/77 (3%)
Query: 56 YFIMSRFLFASLPVEDIEYF---TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 112
+ + SR A+ +IE+F +EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMN
Sbjct: 159 HNLQSRGTPAAARHANIEFFDDYSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMN 218
Query: 113 GMNRPLVKQPRRLRFDR 129
G+NRPL K RRL R
Sbjct: 219 GINRPLFKPQRRLSASR 235
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/126 (49%), Positives = 73/126 (57%), Gaps = 15/126 (11%)
Query: 1 MSRFLFASLPVEDIEYF---TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY- 56
SR A+ +IE+F +EGRECVNCGA+STPLWRRDGTGHYLCNA G
Sbjct: 162 QSRGTPAAARHANIEFFDDYSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGIN 221
Query: 57 ---FIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG 113
F R L AS V G C NC +T LWRR+ G +CNACGLY K++G
Sbjct: 222 RPLFKPQRRLSASRRV--------GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHG 273
Query: 114 MNRPLV 119
+ RPL
Sbjct: 274 VPRPLA 279
>gi|328786704|ref|XP_001121210.2| PREDICTED: transcription factor GATA-5 [Apis mellifera]
Length = 444
Score = 108 bits (270), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 58/107 (54%), Positives = 68/107 (63%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L ++ + T
Sbjct: 182 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPSKRLT 233
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 234 ATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPLA 280
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 47/55 (85%), Positives = 49/55 (89%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 182 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATR 236
>gi|119586024|gb|EAW65620.1| GATA binding protein 4, isoform CRA_a [Homo sapiens]
Length = 416
Score = 108 bits (270), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 185 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 239
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 64/107 (59%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 185 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 241
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 242 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 283
>gi|24644977|ref|NP_731211.1| grain, isoform A [Drosophila melanogaster]
gi|3183017|sp|P91623.1|GATC_DROME RecName: Full=GATA-binding factor C; AltName: Full=Protein grain;
AltName: Full=Transcription factor GATA-C; AltName:
Full=dGATA-C
gi|1815600|dbj|BAA09102.1| GATA transcription factor [Drosophila melanogaster]
gi|7298992|gb|AAF54195.1| grain, isoform A [Drosophila melanogaster]
gi|17945890|gb|AAL48991.1| RE40104p [Drosophila melanogaster]
gi|220942548|gb|ACL83817.1| grn-PA [synthetic construct]
gi|220952714|gb|ACL88900.1| grn-PA [synthetic construct]
Length = 486
Score = 108 bits (270), Expect = 7e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 257 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 313
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 314 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 358
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 257 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 305
>gi|194904144|ref|XP_001981009.1| GG17473 [Drosophila erecta]
gi|190652712|gb|EDV49967.1| GG17473 [Drosophila erecta]
Length = 487
Score = 108 bits (270), Expect = 7e-22, Method: Composition-based stats.
Identities = 53/105 (50%), Positives = 67/105 (63%), Gaps = 8/105 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA---SLPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 258 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 314
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 315 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPL 359
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 258 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 306
>gi|1169844|sp|P43691.1|GATA4_CHICK RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|511480|gb|AAA57503.1| GATA-4 transcription factor, partial [Gallus gallus]
Length = 380
Score = 108 bits (270), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 51/77 (66%), Positives = 59/77 (76%), Gaps = 3/77 (3%)
Query: 56 YFIMSRFLFASLPVEDIEYF---TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 112
+ + SR A+ +IE+F +EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMN
Sbjct: 129 HNLQSRGTPAAARHANIEFFDDYSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMN 188
Query: 113 GMNRPLVKQPRRLRFDR 129
G+NRPL K RRL R
Sbjct: 189 GINRPLFKPQRRLSASR 205
Score = 108 bits (269), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 62/126 (49%), Positives = 73/126 (57%), Gaps = 15/126 (11%)
Query: 1 MSRFLFASLPVEDIEYF---TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY- 56
SR A+ +IE+F +EGRECVNCGA+STPLWRRDGTGHYLCNA G
Sbjct: 132 QSRGTPAAARHANIEFFDDYSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGIN 191
Query: 57 ---FIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG 113
F R L AS V G C NC +T LWRR+ G +CNACGLY K++G
Sbjct: 192 RPLFKPQRRLSASRRV--------GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHG 243
Query: 114 MNRPLV 119
+ RPL
Sbjct: 244 VPRPLA 249
>gi|301776575|ref|XP_002923708.1| PREDICTED: transcription factor GATA-4-like [Ailuropoda
melanoleuca]
Length = 353
Score = 108 bits (270), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 122 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 176
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 64/107 (59%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 122 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 178
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 179 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 220
>gi|383417433|gb|AFH31930.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Macaca mulatta]
gi|383417435|gb|AFH31931.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Macaca mulatta]
gi|383417437|gb|AFH31932.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Macaca mulatta]
Length = 444
Score = 108 bits (270), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|109088183|ref|XP_001108337.1| PREDICTED: trans-acting T-cell-specific transcription factor
GATA-3-like isoform 3 [Macaca mulatta]
gi|355562284|gb|EHH18878.1| hypothetical protein EGK_19445 [Macaca mulatta]
Length = 444
Score = 108 bits (270), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|60827672|gb|AAX36808.1| GATA binding protein 3 [synthetic construct]
gi|61368371|gb|AAX43165.1| GATA binding protein 3 [synthetic construct]
Length = 445
Score = 108 bits (270), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|348575369|ref|XP_003473462.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3-like [Cavia porcellus]
Length = 444
Score = 108 bits (270), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|332217060|ref|XP_003257671.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Nomascus leucogenys]
Length = 444
Score = 108 bits (270), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|291401984|ref|XP_002717407.1| PREDICTED: GATA binding protein 3 [Oryctolagus cuniculus]
Length = 444
Score = 108 bits (270), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|449272886|gb|EMC82589.1| GATA-binding factor 3 [Columba livia]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|440906268|gb|ELR56550.1| Trans-acting T-cell-specific transcription factor GATA-3 [Bos
grunniens mutus]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|426363955|ref|XP_004049093.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Gorilla gorilla gorilla]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|55633439|ref|XP_507651.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
isoform 3 [Pan troglodytes]
gi|397515219|ref|XP_003827854.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Pan paniscus]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|18959228|ref|NP_579827.1| GATA binding protein 3 [Rattus norvegicus]
gi|12704536|gb|AAK00586.1| GATA-3 [Rattus norvegicus]
gi|149021023|gb|EDL78630.1| GATA binding protein 3 [Rattus norvegicus]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|56605890|ref|NP_001008444.1| GATA-binding factor 3 [Gallus gallus]
gi|120961|sp|P23825.1|GATA3_CHICK RecName: Full=GATA-binding factor 3; Short=GATA-3; AltName:
Full=Transcription factor NF-E1c
gi|62968|emb|CAA40253.1| NF-E1 [Gallus gallus]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|402879607|ref|XP_003903424.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Papio anubis]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|351708641|gb|EHB11560.1| Trans-acting T-cell-specific transcription factor GATA-3
[Heterocephalus glaber]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|297686007|ref|XP_002820561.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Pongo abelii]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|113205712|ref|NP_001038032.1| trans-acting T-cell-specific transcription factor GATA-3 [Sus
scrofa]
gi|91177057|gb|ABE26883.1| GATA binding protein 3 [Sus scrofa]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|6679951|ref|NP_032117.1| trans-acting T-cell-specific transcription factor GATA-3 [Mus
musculus]
gi|120963|sp|P23772.1|GATA3_MOUSE RecName: Full=Trans-acting T-cell-specific transcription factor
GATA-3; AltName: Full=GATA-binding factor 3
gi|51053|emb|CAA38917.1| GATA-3 factor [Mus musculus]
gi|38566046|gb|AAH62915.1| GATA binding protein 3 [Mus musculus]
gi|74140120|dbj|BAE33786.1| unnamed protein product [Mus musculus]
gi|89275361|gb|ABD66080.1| GATA binding protein 3 [Mus musculus]
gi|89275371|gb|ABD66081.1| GATA binding protein 3 [Mus musculus]
gi|148676049|gb|EDL07996.1| GATA binding protein 3 [Mus musculus]
Length = 443
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 258 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 312
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 313 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 354
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 258 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 311
>gi|120964|sp|P23773.1|GATA3_XENLA RecName: Full=GATA-binding factor 3; AltName: Full=Transcription
factor xGATA-3
gi|214173|gb|AAA49724.1| GATA binding factor-3 [Xenopus laevis]
Length = 435
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 251 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 305
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 306 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 347
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 251 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 304
>gi|432089717|gb|ELK23534.1| Trans-acting T-cell-specific transcription factor GATA-3 [Myotis
davidii]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|50541959|ref|NP_001002295.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Homo sapiens]
gi|31664|emb|CAA38916.1| hGATA-3 factor [Homo sapiens]
gi|13905020|gb|AAH06793.1| GATA binding protein 3 [Homo sapiens]
gi|61358245|gb|AAX41534.1| GATA binding protein 3 [synthetic construct]
gi|119606774|gb|EAW86368.1| GATA binding protein 3, isoform CRA_b [Homo sapiens]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|403296353|ref|XP_003939076.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Saimiri boliviensis boliviensis]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|395827355|ref|XP_003786870.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Otolemur garnettii]
Length = 445
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 260 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 314
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 315 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 356
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 260 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 313
>gi|355782626|gb|EHH64547.1| hypothetical protein EGM_17792 [Macaca fascicularis]
Length = 444
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|354465002|ref|XP_003494969.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Cricetulus griseus]
gi|344238968|gb|EGV95071.1| Trans-acting T-cell-specific transcription factor GATA-3
[Cricetulus griseus]
Length = 443
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 258 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 312
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 313 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 354
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 258 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 311
>gi|395507323|ref|XP_003757975.1| PREDICTED: transcription factor GATA-4 isoform 2 [Sarcophilus
harrisii]
Length = 452
Score = 108 bits (269), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 53/61 (86%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 217 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 276
Query: 129 R 129
R
Sbjct: 277 R 277
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
V+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 217 VDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 276
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 277 RRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 321
>gi|126340357|ref|XP_001363328.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Monodelphis domestica]
Length = 444
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|391331261|ref|XP_003740068.1| PREDICTED: GATA-binding factor 2-like [Metaseiulus occidentalis]
Length = 480
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 54/111 (48%), Positives = 62/111 (55%), Gaps = 3/111 (2%)
Query: 8 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASL 67
S P EGRECVNCGA STPLWRRDGTGHYLCNA G +R L
Sbjct: 234 SKPKNKTRSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIKPK 290
Query: 68 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 291 RRLQSAARRAGTSCANCKTTTTTLWRRNHNGEPVCNACGLYYKLHNVNRPL 341
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 46/61 (75%), Positives = 48/61 (78%)
Query: 66 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
S P EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL
Sbjct: 234 SKPKNKTRSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRL 293
Query: 126 R 126
+
Sbjct: 294 Q 294
>gi|395829517|ref|XP_003787903.1| PREDICTED: transcription factor GATA-5 [Otolemur garnettii]
Length = 435
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 219 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSTR 275
Score = 93.2 bits (230), Expect = 3e-17, Method: Composition-based stats.
Identities = 54/113 (47%), Positives = 69/113 (61%), Gaps = 6/113 (5%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L P + +
Sbjct: 219 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPLVR--PQKRLSS 273
Query: 75 FTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
G C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 274 TRRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 326
>gi|417401032|gb|JAA47421.1| Putative trans-acting t-cell-specific transcription factor gata-3
[Desmodus rotundus]
Length = 444
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|395538986|ref|XP_003771455.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Sarcophilus harrisii]
Length = 444
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|344277697|ref|XP_003410636.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3-like [Loxodonta africana]
Length = 444
Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|348554147|ref|XP_003462887.1| PREDICTED: transcription factor GATA-5-like [Cavia porcellus]
Length = 337
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/65 (75%), Positives = 54/65 (83%), Gaps = 2/65 (3%)
Query: 67 LPVEDI--EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
LPV D E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +R
Sbjct: 113 LPVSDFLEEFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKR 172
Query: 125 LRFDR 129
L R
Sbjct: 173 LSSSR 177
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 57/117 (48%), Positives = 68/117 (58%), Gaps = 14/117 (11%)
Query: 9 LPVEDI--EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFAS 66
LPV D E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L
Sbjct: 113 LPVSDFLEEFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL--- 166
Query: 67 LPVEDIEYFTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V + + R NC T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 167 --VRPQKRLSSSRRAGLCCTNCHTTHTTLWRRNTDGEPVCNACGLYMKLHGVPRPLA 221
>gi|351695298|gb|EHA98216.1| Transcription factor GATA-4 [Heterocephalus glaber]
Length = 410
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/70 (70%), Positives = 57/70 (81%), Gaps = 1/70 (1%)
Query: 61 RFLFASLPVEDI-EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R F+ + D+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+
Sbjct: 164 RLWFSKGHLADMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLI 223
Query: 120 KQPRRLRFDR 129
K RRL R
Sbjct: 224 KPQRRLSASR 233
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 59/122 (48%), Positives = 71/122 (58%), Gaps = 13/122 (10%)
Query: 3 RFLFASLPVEDI-EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----F 57
R F+ + D+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G
Sbjct: 164 RLWFSKGHLADMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLI 223
Query: 58 IMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
R L AS V G C NC +T LWRR+ G +CNACGLY K++G+ RP
Sbjct: 224 KPQRRLSASRRV--------GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 275
Query: 118 LV 119
L
Sbjct: 276 LA 277
>gi|390473525|ref|XP_003734617.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-4
[Callithrix jacchus]
Length = 379
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 148 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 202
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 64/107 (59%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 148 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 204
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 205 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 246
>gi|432946176|ref|XP_004083805.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-4
[Oryzias latipes]
Length = 388
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 46/61 (75%), Positives = 52/61 (85%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
++ + F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 174 IDLFDDFAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 129 R 129
R
Sbjct: 234 R 234
Score = 101 bits (252), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 56/113 (49%), Positives = 66/113 (58%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
++ + F EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 174 IDLFDDFAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 234 RRV--------GLSCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 278
>gi|73949076|ref|XP_849153.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
isoform 1 [Canis lupus familiaris]
Length = 444
Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|193788522|dbj|BAG53416.1| unnamed protein product [Homo sapiens]
Length = 236
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 59
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 64/107 (59%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 61
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 62 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 103
>gi|357619365|gb|EHJ71973.1| hypothetical protein KGM_02427 [Danaus plexippus]
Length = 355
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 54/107 (50%), Positives = 70/107 (65%), Gaps = 6/107 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E + EGRECVNCGA +TPLWRRD TGHYLCNA G ++R L P + +
Sbjct: 73 LEAYEEGRECVNCGANNTPLWRRDSTGHYLCNACGLYHKING---VNRPLVK--PSKRLS 127
Query: 74 YF-TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G+ C NCG+ +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 128 AARRHGQSCTNCGSRNTTLWRRNNEGEPVCNACGLYYKLHGINRPLA 174
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/74 (63%), Positives = 55/74 (74%), Gaps = 4/74 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E + EGRECVNCGA +TPLWRRD TGHYLCNACGLYHK+NG+NRPLVK +RL R
Sbjct: 73 LEAYEEGRECVNCGANNTPLWRRDSTGHYLCNACGLYHKINGVNRPLVKPSKRLSAARRH 132
Query: 128 DRGNFNCYERSTLF 141
+ NC R+T
Sbjct: 133 GQSCTNCGSRNTTL 146
>gi|387540446|gb|AFJ70850.1| transcription factor GATA-6 [Macaca mulatta]
Length = 595
Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats.
Identities = 57/119 (47%), Positives = 71/119 (59%), Gaps = 12/119 (10%)
Query: 7 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMS 60
A LPV + +E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +S
Sbjct: 368 APLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LS 424
Query: 61 RFLFASLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R L P + + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 425 RPLIK--PQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 481
Score = 100 bits (248), Expect = 3e-19, Method: Composition-based stats.
Identities = 47/85 (55%), Positives = 58/85 (68%), Gaps = 10/85 (11%)
Query: 65 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
A LPV + +E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL
Sbjct: 368 APLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPL 427
Query: 119 VKQPRRL----RFDRGNFNCYERST 139
+K +R+ R NC+ +T
Sbjct: 428 IKPQKRVPSSRRLGLSCANCHTTTT 452
>gi|402902790|ref|XP_003914278.1| PREDICTED: transcription factor GATA-6 [Papio anubis]
Length = 595
Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats.
Identities = 57/119 (47%), Positives = 71/119 (59%), Gaps = 12/119 (10%)
Query: 7 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMS 60
A LPV + +E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +S
Sbjct: 368 APLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LS 424
Query: 61 RFLFASLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R L P + + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 425 RPLIK--PQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 481
Score = 100 bits (248), Expect = 3e-19, Method: Composition-based stats.
Identities = 47/85 (55%), Positives = 58/85 (68%), Gaps = 10/85 (11%)
Query: 65 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
A LPV + +E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL
Sbjct: 368 APLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPL 427
Query: 119 VKQPRRL----RFDRGNFNCYERST 139
+K +R+ R NC+ +T
Sbjct: 428 IKPQKRVPSSRRLGLSCANCHTTTT 452
>gi|296222377|ref|XP_002807542.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Callithrix jacchus]
Length = 592
Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats.
Identities = 57/119 (47%), Positives = 71/119 (59%), Gaps = 12/119 (10%)
Query: 7 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMS 60
A LPV + +E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +S
Sbjct: 365 APLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LS 421
Query: 61 RFLFASLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R L P + + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 422 RPLIK--PQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 478
Score = 100 bits (248), Expect = 3e-19, Method: Composition-based stats.
Identities = 47/85 (55%), Positives = 58/85 (68%), Gaps = 10/85 (11%)
Query: 65 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
A LPV + +E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL
Sbjct: 365 APLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPL 424
Query: 119 VKQPRRL----RFDRGNFNCYERST 139
+K +R+ R NC+ +T
Sbjct: 425 IKPQKRVPSSRRLGLSCANCHTTTT 449
>gi|47224312|emb|CAG09158.1| unnamed protein product [Tetraodon nigroviridis]
Length = 365
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 47/59 (79%), Positives = 51/59 (86%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFN 133
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL DR +
Sbjct: 174 FAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLVGDRAGLH 232
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 28/30 (93%), Positives = 29/30 (96%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
F EGRECVNCGA+STPLWRRDGTGHYLCNA
Sbjct: 174 FAEGRECVNCGAMSTPLWRRDGTGHYLCNA 203
>gi|348524793|ref|XP_003449907.1| PREDICTED: transcription factor GATA-4 [Oreochromis niloticus]
Length = 389
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 46/61 (75%), Positives = 52/61 (85%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
++ + F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 174 IDLFDDFAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 129 R 129
R
Sbjct: 234 R 234
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/115 (48%), Positives = 67/115 (58%), Gaps = 12/115 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
++ + F EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 174 IDLFDDFAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL +
Sbjct: 234 RRV--------GLSCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMK 280
>gi|73949078|ref|XP_858071.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
isoform 2 [Canis lupus familiaris]
Length = 439
Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats.
Identities = 45/50 (90%), Positives = 46/50 (92%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRL 308
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 51/101 (50%), Positives = 61/101 (60%), Gaps = 9/101 (8%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G + P+ +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---------QNRPLIKPKRRLV 309
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 310 RTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 350
>gi|328715574|ref|XP_003245664.1| PREDICTED: GATA-binding factor A-like, partial [Acyrthosiphon
pisum]
Length = 232
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/113 (53%), Positives = 66/113 (58%), Gaps = 6/113 (5%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV-EDIEYF 75
F EGRECVNCGAISTPLWRRDG GHYLCNA G S L E +
Sbjct: 7 FGEGRECVNCGAISTPLWRRDGCGHYLCNACGLYHKMNGMNRPSGRPAKRLGAFETVSSR 66
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
G C NCG T LWRRD G +CNACGLY+K++G NRPL R+R D
Sbjct: 67 RYGVSCTNCGTRMTTLWRRDNDGKPVCNACGLYYKLHGFNRPL-----RMRKD 114
>gi|53147411|dbj|BAD52301.1| transcription factor Gata-binding protein 2/3 [Eptatretus burgeri]
Length = 491
Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats.
Identities = 45/55 (81%), Positives = 46/55 (83%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 295 LAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 349
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/103 (50%), Positives = 60/103 (58%), Gaps = 6/103 (5%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 295 LAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAAR 349
Query: 77 E-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC T LWRR+ G +CNACGLY K++ +NRP+
Sbjct: 350 RAGTSCANCQTTITTLWRRNANGDPVCNACGLYFKLHNVNRPM 392
>gi|1845570|dbj|BAA11334.1| GATA-4 transcription factor [Homo sapiens]
gi|122893044|gb|ABM67539.1| Homo sapiens GATA4 [Shuttle vector pUCAG.GATA4]
Length = 439
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 208 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 262
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 64/107 (59%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 208 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 264
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 265 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 306
>gi|354471588|ref|XP_003498023.1| PREDICTED: transcription factor GATA-4-like [Cricetulus griseus]
Length = 238
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 59
Score = 105 bits (263), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 64/107 (59%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 61
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 62 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 103
>gi|348551454|ref|XP_003461545.1| PREDICTED: endothelial transcription factor GATA-2-like [Cavia
porcellus]
Length = 455
Score = 107 bits (267), Expect = 1e-21, Method: Composition-based stats.
Identities = 43/50 (86%), Positives = 46/50 (92%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRL 339
>gi|5882288|gb|AAD55267.1|AF179425_1 transcription factor GATA-6 [Mus musculus]
Length = 443
Score = 107 bits (267), Expect = 1e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 229 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 283
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 284 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 329
Score = 99.8 bits (247), Expect = 3e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 229 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 288
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 289 GLSCANCHTTTT 300
>gi|345803446|ref|XP_547642.3| PREDICTED: transcription factor GATA-6 [Canis lupus familiaris]
Length = 627
Score = 107 bits (267), Expect = 1e-21, Method: Composition-based stats.
Identities = 54/112 (48%), Positives = 68/112 (60%), Gaps = 6/112 (5%)
Query: 8 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASL 67
+L + +E E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L
Sbjct: 403 ALAADLLEDLPESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK-- 457
Query: 68 PVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
P + + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 458 PQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 509
Score = 99.8 bits (247), Expect = 3e-19, Method: Composition-based stats.
Identities = 44/78 (56%), Positives = 55/78 (70%), Gaps = 4/78 (5%)
Query: 66 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
+L + +E E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+
Sbjct: 403 ALAADLLEDLPESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRV 462
Query: 126 ----RFDRGNFNCYERST 139
R NC+ +T
Sbjct: 463 PSSRRLGLSCANCHTTTT 480
>gi|297702367|ref|XP_002828153.1| PREDICTED: transcription factor GATA-6 [Pongo abelii]
Length = 595
Score = 107 bits (267), Expect = 1e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 381 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 435
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 436 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 481
Score = 99.8 bits (247), Expect = 3e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 381 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 440
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 441 GLSCANCHTTTT 452
>gi|74145051|dbj|BAE22221.1| unnamed protein product [Mus musculus]
Length = 236
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 59
Score = 105 bits (263), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 64/107 (59%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 61
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 62 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 103
>gi|426385574|ref|XP_004059281.1| PREDICTED: transcription factor GATA-6 [Gorilla gorilla gorilla]
Length = 596
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 382 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 436
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 437 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 482
Score = 99.8 bits (247), Expect = 4e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 382 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 441
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 442 GLSCANCHTTTT 453
>gi|332225943|ref|XP_003262148.1| PREDICTED: transcription factor GATA-6-like [Nomascus leucogenys]
Length = 476
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 262 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 316
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 317 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 362
Score = 99.8 bits (247), Expect = 4e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 262 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 321
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 322 GLSCANCHTTTT 333
>gi|40288197|ref|NP_005248.2| transcription factor GATA-6 [Homo sapiens]
gi|215273987|sp|Q92908.2|GATA6_HUMAN RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
gi|119621542|gb|EAX01137.1| GATA binding protein 6 [Homo sapiens]
gi|167887577|gb|ACA05995.1| transcription factor GATA-6 [Homo sapiens]
Length = 595
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 381 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 435
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 436 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 481
Score = 99.8 bits (247), Expect = 4e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 381 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 440
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 441 GLSCANCHTTTT 452
>gi|11875205|ref|NP_062058.1| transcription factor GATA-6 [Rattus norvegicus]
gi|1235614|gb|AAA92577.1| DNA binding protein [Rattus norvegicus]
Length = 441
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 229 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 283
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 284 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 329
Score = 99.8 bits (247), Expect = 4e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 229 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 288
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 289 GLSCANCHTTTT 300
>gi|339247095|ref|XP_003375181.1| GATA-binding factor 2 [Trichinella spiralis]
gi|316971560|gb|EFV55318.1| GATA-binding factor 2 [Trichinella spiralis]
Length = 215
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 55/101 (54%), Positives = 64/101 (63%), Gaps = 4/101 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G+ +R L T
Sbjct: 1 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGH---NRPLIKPKKRSSASKRT- 56
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 57 GINCANCGTNTTTLWRRNQNGDPVCNACGLYYKLHNVNRPL 97
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/60 (76%), Positives = 48/60 (80%), Gaps = 1/60 (1%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD-RGNFNC 134
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K +R R NC
Sbjct: 1 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRPLIKPKKRSSASKRTGINC 60
>gi|62898826|dbj|BAD97267.1| GATA binding protein 6 variant [Homo sapiens]
Length = 595
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 381 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 435
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 436 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 481
Score = 99.8 bits (247), Expect = 4e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 381 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 440
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 441 GLSCANCHTTTT 452
>gi|410920237|ref|XP_003973590.1| PREDICTED: GATA-binding factor 2-like [Takifugu rubripes]
Length = 457
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 267 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 325
Score = 106 bits (264), Expect = 3e-21, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 267 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 321
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+G G +CNACGLY K++ +NRPL
Sbjct: 322 AGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRPL 363
>gi|348510371|ref|XP_003442719.1| PREDICTED: GATA-binding factor 2-like [Oreochromis niloticus]
Length = 458
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 268 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 326
Score = 106 bits (264), Expect = 3e-21, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 268 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 322
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+G G +CNACGLY K++ +NRPL
Sbjct: 323 AGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRPL 364
>gi|147906903|ref|NP_001081962.1| GATA-binding factor 5-A [Xenopus laevis]
gi|1169806|sp|P43695.1|GAT5A_XENLA RecName: Full=GATA-binding factor 5-A; AltName: Full=Transcription
factor xGATA-5A
gi|311050|gb|AAA63686.1| transcription factor xGATA-5a [Xenopus laevis]
gi|54648481|gb|AAH84985.1| GATA-5a protein [Xenopus laevis]
Length = 390
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/57 (80%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNGMNRPL+K +RL R
Sbjct: 175 EFPGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPQKRLSSSR 231
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 70/116 (60%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G M+R L ++ +
Sbjct: 175 EFPGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----IKPQKR 226
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 227 LSSSRRAGLCCTNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 282
>gi|194678210|ref|XP_001253597.2| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6 [Bos
taurus]
Length = 497
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 57/119 (47%), Positives = 70/119 (58%), Gaps = 12/119 (10%)
Query: 7 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMS 60
A LPV + +E E RECVNCG+I TPLWRRDGTGHYLCNA + G +S
Sbjct: 270 APLPVPRGPGTDLLEDLPESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LS 326
Query: 61 RFLFASLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R L P + + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 327 RPLIK--PQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 383
Score = 99.4 bits (246), Expect = 4e-19, Method: Composition-based stats.
Identities = 47/85 (55%), Positives = 57/85 (67%), Gaps = 10/85 (11%)
Query: 65 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
A LPV + +E E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL
Sbjct: 270 APLPVPRGPGTDLLEDLPESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPL 329
Query: 119 VKQPRRL----RFDRGNFNCYERST 139
+K +R+ R NC+ +T
Sbjct: 330 IKPQKRVPSSRRLGLSCANCHTTTT 354
>gi|355697729|gb|EHH28277.1| GATA-binding factor 4 [Macaca mulatta]
Length = 304
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/69 (71%), Positives = 54/69 (78%), Gaps = 2/69 (2%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL--RFDRGNF 132
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 76 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSRRVGLSCA 135
Query: 133 NCYERSTLF 141
NC +T
Sbjct: 136 NCQTTTTTL 144
Score = 104 bits (260), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 65/103 (63%), Gaps = 6/103 (5%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L P +
Sbjct: 76 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---INRPLIK--PQRRLSRRV 130
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 131 -GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 172
>gi|395507321|ref|XP_003757974.1| PREDICTED: transcription factor GATA-4 isoform 1 [Sarcophilus
harrisii]
Length = 451
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 222 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 276
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 64/107 (59%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 222 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 278
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 279 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 320
>gi|46909571|ref|NP_034388.2| transcription factor GATA-6 [Mus musculus]
gi|215274120|sp|Q61169.3|GATA6_MOUSE RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
gi|74210838|dbj|BAE25049.1| unnamed protein product [Mus musculus]
gi|148691038|gb|EDL22985.1| GATA binding protein 6 [Mus musculus]
Length = 589
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 375 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 429
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 430 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 475
Score = 99.4 bits (246), Expect = 4e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 375 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 434
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 435 GLSCANCHTTTT 446
>gi|1655915|gb|AAC50941.1| hGATA-6 [Homo sapiens]
gi|2506076|dbj|BAA22621.1| GATA-6 [Homo sapiens]
Length = 449
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 235 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 289
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 290 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 335
Score = 99.4 bits (246), Expect = 4e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 235 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 294
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 295 GLSCANCHTTTT 306
>gi|215274094|sp|P46153.2|GATA6_RAT RecName: Full=Transcription factor GATA-6; AltName:
Full=DNA-binding protein GATA-GT2; AltName:
Full=GATA-binding factor 6
Length = 587
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 375 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 429
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 430 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 475
Score = 99.4 bits (246), Expect = 4e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 375 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 434
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 435 GLSCANCHTTTT 446
>gi|1877210|emb|CAA64997.1| GATA-6 DNA binding protein [Homo sapiens]
Length = 449
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 235 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 289
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 290 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 335
Score = 99.4 bits (246), Expect = 4e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 235 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 294
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 295 GLSCANCHTTTT 306
>gi|47218982|emb|CAG02020.1| unnamed protein product [Tetraodon nigroviridis]
Length = 449
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 267 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 325
Score = 105 bits (263), Expect = 4e-21, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 267 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 321
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+G G +CNACGLY K++ +NRPL
Sbjct: 322 AGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRPL 363
>gi|40538758|ref|NP_571308.1| GATA-binding protein 2a [Danio rerio]
gi|31544961|gb|AAH53131.1| GATA-binding protein 2a [Danio rerio]
Length = 456
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 64/102 (62%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 266 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 320
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+G G +CNACGLY+K++ +NRPL
Sbjct: 321 AGTCCANCQTTTTTLWRRNGNGDPVCNACGLYYKLHNVNRPL 362
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 266 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 324
>gi|355779507|gb|EHH63983.1| GATA-binding factor 4 [Macaca fascicularis]
Length = 253
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 51/75 (68%), Positives = 58/75 (77%), Gaps = 3/75 (4%)
Query: 70 EDI-EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL--R 126
ED+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 19 EDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSRR 78
Query: 127 FDRGNFNCYERSTLF 141
NC +T
Sbjct: 79 VGLSCANCQTTTTTL 93
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 56/109 (51%), Positives = 69/109 (63%), Gaps = 7/109 (6%)
Query: 12 EDI-EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVE 70
ED+ + F+EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L P
Sbjct: 19 EDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---INRPLIK--PQR 73
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 74 RLSRRV-GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 121
>gi|432858527|ref|XP_004068890.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Oryzias latipes]
Length = 456
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 266 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 324
Score = 105 bits (263), Expect = 4e-21, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 266 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 320
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+G G +CNACGLY K++ +NRPL
Sbjct: 321 AGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRPL 362
>gi|410916493|ref|XP_003971721.1| PREDICTED: transcription factor GATA-4-like [Takifugu rubripes]
Length = 386
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 45/61 (73%), Positives = 52/61 (85%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
++ + + EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 174 IDLFDDYAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 129 R 129
R
Sbjct: 234 R 234
Score = 100 bits (248), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 55/115 (47%), Positives = 67/115 (58%), Gaps = 12/115 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
++ + + EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 174 IDLFDDYAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL +
Sbjct: 234 RRV--------GLSCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMK 280
>gi|432858529|ref|XP_004068891.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Oryzias latipes]
Length = 447
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 315
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 54/102 (52%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 311
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+G G +CNACGLY K++ +NRPL
Sbjct: 312 AGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRPL 353
>gi|114319150|gb|ABI63575.1| GATA4 [Danio rerio]
Length = 352
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/62 (77%), Positives = 53/62 (85%), Gaps = 3/62 (4%)
Query: 71 DIEYFT---EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRF 127
++E+F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLVK RRL
Sbjct: 157 NLEFFDDLGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLVKPQRRLSA 216
Query: 128 DR 129
R
Sbjct: 217 SR 218
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 67/114 (58%), Gaps = 15/114 (13%)
Query: 13 DIEYFT---EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFA 65
++E+F EGRECVNCGA+STPLWRRDGTGHYLCNA G R L A
Sbjct: 157 NLEFFDDLGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLVKPQRRLSA 216
Query: 66 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
S V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 217 SRRV--------GLSCTNCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 262
>gi|269785247|ref|NP_001161551.1| GATA2 transcription factor [Saccoglossus kowalevskii]
gi|268054087|gb|ACY92530.1| GATA2 transcription factor [Saccoglossus kowalevskii]
Length = 352
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 55/103 (53%), Positives = 64/103 (62%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 136 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 190
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC A T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 191 AGTSCANCQATQTTLWRRNANGDPVCNACGLYYKLHGVNRPLT 233
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 136 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 189
>gi|242005616|ref|XP_002423660.1| transcription factor GATA-4, putative [Pediculus humanus corporis]
gi|212506820|gb|EEB10922.1| transcription factor GATA-4, putative [Pediculus humanus corporis]
Length = 231
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/107 (54%), Positives = 68/107 (63%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
F EGRECVNCGAISTPLWRRDGTGHYLCNA G M+R L V+ + T
Sbjct: 88 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----VKPSKRMT 139
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 140 ATKRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGINRPLA 186
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 46/51 (90%), Positives = 48/51 (94%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +R+
Sbjct: 88 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRM 138
>gi|18858733|ref|NP_571286.1| transcription factor GATA-3 [Danio rerio]
gi|3183010|sp|Q91428.1|GATA3_DANRE RecName: Full=Transcription factor GATA-3; AltName:
Full=GATA-binding factor 3
gi|1245717|gb|AAA93491.1| transcription factor [Danio rerio]
gi|190336965|gb|AAI62389.1| GATA-binding protein 3 [Danio rerio]
gi|190336980|gb|AAI62401.1| GATA-binding protein 3 [Danio rerio]
Length = 438
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 251 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 305
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 306 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 347
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 251 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 304
>gi|297259418|ref|XP_001115055.2| PREDICTED: hypothetical protein LOC720048 [Macaca mulatta]
Length = 780
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 201 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 257
Score = 91.3 bits (225), Expect = 1e-16, Method: Composition-based stats.
Identities = 53/116 (45%), Positives = 68/116 (58%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 201 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 252
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 253 LSSSRRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 308
>gi|348521214|ref|XP_003448121.1| PREDICTED: transcription factor GATA-3-like isoform 1 [Oreochromis
niloticus]
Length = 443
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 256 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 309
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 256 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 310
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 311 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNINRPL 352
>gi|307775397|ref|NP_001182721.1| transcription factor GATA-3 [Oncorhynchus mykiss]
gi|307342716|emb|CAR95099.1| GATA binding protein 3 [Oncorhynchus mykiss]
Length = 441
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 310
Score = 106 bits (264), Expect = 3e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 311
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 312 AGTSCANCQTSTTTLWRRNANGDPVCNACGLYYKLHNINRPL 353
>gi|157278177|ref|NP_001098188.1| transcription factor GATA-3 [Oryzias latipes]
gi|62005802|dbj|BAD91312.1| transcription factor GATA-3 [Oryzias latipes]
Length = 440
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 253 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 306
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 253 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 307
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 308 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNINRPL 349
>gi|51242135|ref|NP_001003797.1| GATA-binding factor 2 [Gallus gallus]
gi|326927940|ref|XP_003210145.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Meleagris
gallopavo]
gi|120958|sp|P23824.1|GATA2_CHICK RecName: Full=GATA-binding factor 2; Short=GATA-2; AltName:
Full=Transcription factor NF-E1b
gi|62966|emb|CAA40252.1| NF-E1 [Gallus gallus]
Length = 466
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 47/71 (66%), Positives = 50/71 (70%)
Query: 64 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR 123
F P +EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K R
Sbjct: 264 FTPKPRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKR 323
Query: 124 RLRFDRGNFNC 134
RL R C
Sbjct: 324 RLSAARRAGTC 334
Score = 104 bits (260), Expect = 9e-21, Method: Composition-based stats.
Identities = 54/114 (47%), Positives = 65/114 (57%), Gaps = 6/114 (5%)
Query: 6 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA 65
F P +EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L
Sbjct: 264 FTPKPRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK 320
Query: 66 SLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
P + G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 321 --PKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 372
>gi|410918522|ref|XP_003972734.1| PREDICTED: transcription factor GATA-3-like isoform 1 [Takifugu
rubripes]
Length = 444
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 310
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 311
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 312 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNINRPL 353
>gi|284520963|ref|NP_001165271.1| transcription factor GATA-3 [Salmo salar]
gi|180038038|gb|ACB87012.1| transcription factor GATA-3 [Salmo salar]
Length = 441
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 310
Score = 106 bits (264), Expect = 3e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 311
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 312 AGTSCANCQTSTTTLWRRNANGDPVCNACGLYYKLHNINRPL 353
>gi|410918524|ref|XP_003972735.1| PREDICTED: transcription factor GATA-3-like isoform 2 [Takifugu
rubripes]
Length = 451
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 264 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 317
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 264 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 318
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 319 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNINRPL 360
>gi|359322060|ref|XP_541740.3| PREDICTED: LOW QUALITY PROTEIN: endothelial transcription factor
GATA-2 isoform 1 [Canis lupus familiaris]
Length = 480
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 101 bits (252), Expect = 9e-20, Method: Composition-based stats.
Identities = 51/102 (50%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWR + G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRGNANGDPVCNACGLYYKLHNVNRPL 386
>gi|301764531|ref|XP_002917681.1| PREDICTED: endothelial transcription factor GATA-2-like isoform 1
[Ailuropoda melanoleuca]
gi|281347007|gb|EFB22591.1| hypothetical protein PANDA_006028 [Ailuropoda melanoleuca]
Length = 480
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|347971495|ref|XP_562743.4| AGAP004228-PA [Anopheles gambiae str. PEST]
gi|333468694|gb|EAL40667.4| AGAP004228-PA [Anopheles gambiae str. PEST]
Length = 603
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 54/104 (51%), Positives = 62/104 (59%), Gaps = 7/104 (6%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDIEY 74
EGRECVNCGA STPLWRRDGTGHYLCNA G R L +SL
Sbjct: 334 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLVSSLQSAARR- 392
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 393 --AGTSCANCKTTTTTLWRRNQGGEPVCNACGLYYKLHNVNRPL 434
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 334 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 382
>gi|204306643|gb|ACH99859.1| GATA-3 [Cyprinus carpio]
Length = 442
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 255 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 309
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 310 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 351
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 255 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 308
>gi|165292335|dbj|BAF98873.1| GATA binding protein 3 [Carassius auratus langsdorfii]
Length = 441
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 254 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 308
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 309 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 350
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 254 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 307
>gi|1699253|gb|AAB37426.1| zinc finger transcription factor [Mus sp.]
Length = 444
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 230 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 284
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 285 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 330
Score = 99.0 bits (245), Expect = 5e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 230 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 289
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 290 GLSCANCHTTTT 301
>gi|410953434|ref|XP_003983375.1| PREDICTED: transcription factor GATA-5 [Felis catus]
Length = 391
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 57/81 (70%), Gaps = 2/81 (2%)
Query: 51 TSFTGYFIMSRFLFASLPVEDI--EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLY 108
T G + +A PV D E EGRECVNCGA+STPLWRRDGTGHYLCNACGLY
Sbjct: 150 TELGGPGEAGSWRWAPSPVPDFLEELPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLY 209
Query: 109 HKMNGMNRPLVKQPRRLRFDR 129
HKMNG+NRPLV+ +RL R
Sbjct: 210 HKMNGVNRPLVRPQKRLTSSR 230
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/147 (42%), Positives = 79/147 (53%), Gaps = 24/147 (16%)
Query: 4 FLFASLPVEDI--EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSR 61
+ +A PV D E EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R
Sbjct: 161 WRWAPSPVPDFLEELPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNR 217
Query: 62 FLFASLPVEDIEYFTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
L V + T R C NC +T LWRR+ G +CNACGLY K++G+ RP
Sbjct: 218 PL-----VRPQKRLTSSRRAGLCCSNCHTTNTTLWRRNADGEPVCNACGLYMKLHGVPRP 272
Query: 118 LV----------KQPRRLRFDRGNFNC 134
L ++P+ +G+ C
Sbjct: 273 LAMKKESIQTRKRKPKNAAKTKGSSGC 299
>gi|374671149|gb|AEZ56247.1| transcription factor GATA-3 [Gadus morhua]
Length = 440
Score = 106 bits (265), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 310
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 311
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 312 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNINRPL 353
>gi|18033511|gb|AAL57180.1| transcription factor GATA-2 [Rattus norvegicus]
Length = 480
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 49/59 (83%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R +C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGSC 348
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGSCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|293345|gb|AAA37662.1| GATA-binding transcription factor [Mus musculus]
Length = 439
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 209 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 263
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 64/107 (59%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 209 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 265
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 266 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 307
>gi|410951840|ref|XP_003982601.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Felis
catus]
Length = 480
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|410052588|ref|XP_003954430.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6 [Pan
troglodytes]
Length = 472
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 258 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 312
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 313 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 358
Score = 99.0 bits (245), Expect = 6e-19, Method: Composition-based stats.
Identities = 43/72 (59%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 258 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 317
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 318 GLSCANCHTTTT 329
>gi|126302721|ref|XP_001368185.1| PREDICTED: GATA-binding factor 5-A-like [Monodelphis domestica]
Length = 397
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL R
Sbjct: 183 EFPGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSTR 239
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L P + +
Sbjct: 183 EFPGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---INRPLIK--PQKRLSS 237
Query: 75 F-TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 238 TRRAGLCCTNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 283
>gi|58332620|ref|NP_001011384.1| GATA binding protein 5 [Xenopus (Silurana) tropicalis]
gi|56789762|gb|AAH88567.1| GATA binding protein 5 [Xenopus (Silurana) tropicalis]
Length = 389
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL R
Sbjct: 176 EFHGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSSR 232
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 70/116 (60%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L ++ +
Sbjct: 176 EFHGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---INRPL-----IKPQKR 227
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 228 LSSSRRAGLCCTNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 283
>gi|66730290|ref|NP_001019487.1| transcription factor GATA-5 [Rattus norvegicus]
gi|55250430|gb|AAH85855.1| GATA binding protein 5 [Rattus norvegicus]
gi|149034029|gb|EDL88812.1| similar to Transcription factor GATA-5 (GATA binding factor-5)
[Rattus norvegicus]
Length = 404
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/73 (65%), Positives = 56/73 (76%), Gaps = 4/73 (5%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGN- 131
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R +
Sbjct: 188 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSG 247
Query: 132 ---FNCYERSTLF 141
NC+ +T
Sbjct: 248 LCCSNCHTATTTL 260
Score = 102 bits (253), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 69/116 (59%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 188 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 239
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 240 LSSSRRSGLCCSNCHTATTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 295
>gi|355689765|gb|AER98939.1| GATA binding protein 2 [Mustela putorius furo]
Length = 480
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|348507926|ref|XP_003441506.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Oreochromis
niloticus]
Length = 452
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 47/59 (79%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
TEGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 268 TEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 326
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRD TGHYLCNA G +R L P +
Sbjct: 268 TEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 322
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 323 AGTCCANCQTTTTTLWRRNAHGDPVCNACGLYYKLHNVNRPL 364
>gi|348507924|ref|XP_003441505.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Oreochromis
niloticus]
Length = 455
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 47/59 (79%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
TEGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 271 TEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 329
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRD TGHYLCNA G +R L P +
Sbjct: 271 TEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 325
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 326 AGTCCANCQTTTTTLWRRNAHGDPVCNACGLYYKLHNVNRPL 367
>gi|157278509|ref|NP_001098356.1| transcription factor GATA-2 [Oryzias latipes]
gi|57157777|dbj|BAD83861.1| transcription factor GATA-2 [Oryzias latipes]
Length = 455
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 47/59 (79%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
TEGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 270 TEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 328
Score = 105 bits (261), Expect = 9e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRD TGHYLCNA G +R L P +
Sbjct: 270 TEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 324
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 325 AGTCCANCQTTTTTLWRRNASGDPVCNACGLYYKLHNVNRPL 366
>gi|126321769|ref|XP_001363044.1| PREDICTED: transcription factor GATA-6 [Monodelphis domestica]
Length = 590
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 53/108 (49%), Positives = 67/108 (62%), Gaps = 6/108 (5%)
Query: 12 EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVED 71
E ++ +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P +
Sbjct: 374 ELLDDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKR 428
Query: 72 IEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 429 VPSTRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 476
Score = 98.6 bits (244), Expect = 7e-19, Method: Composition-based stats.
Identities = 43/74 (58%), Positives = 54/74 (72%), Gaps = 4/74 (5%)
Query: 70 EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL---- 125
E ++ +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+
Sbjct: 374 ELLDDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSTR 433
Query: 126 RFDRGNFNCYERST 139
R NC+ +T
Sbjct: 434 RLGLSCANCHTTTT 447
>gi|182996|gb|AAA35868.1| GATA-binding protein [Homo sapiens]
Length = 480
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|12836018|dbj|BAB23463.1| unnamed protein product [Mus musculus]
Length = 480
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|15559217|ref|NP_254277.1| endothelial transcription factor GATA-2 [Rattus norvegicus]
gi|62286680|sp|Q924Y4.1|GATA2_RAT RecName: Full=Endothelial transcription factor GATA-2; AltName:
Full=GATA-binding protein 2
gi|14029161|gb|AAK51128.1| GATA2 [Rattus norvegicus]
gi|38197660|gb|AAH61745.1| GATA binding protein 2 [Rattus norvegicus]
gi|149036689|gb|EDL91307.1| GATA binding protein 2, isoform CRA_a [Rattus norvegicus]
gi|149036690|gb|EDL91308.1| GATA binding protein 2, isoform CRA_a [Rattus norvegicus]
gi|184185940|dbj|BAG30820.1| GATA-binding protein 2 [Rattus norvegicus]
gi|184185942|dbj|BAG30821.1| GATA-binding protein 2 [Rattus norvegicus]
Length = 480
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|226530726|ref|NP_032116.4| endothelial transcription factor GATA-2 [Mus musculus]
gi|21264417|sp|O09100.2|GATA2_MOUSE RecName: Full=Endothelial transcription factor GATA-2; AltName:
Full=GATA-binding protein 2
gi|18181872|dbj|BAA19053.2| GATA-2 protein [Mus musculus]
gi|74191748|dbj|BAE32831.1| unnamed protein product [Mus musculus]
gi|74196104|dbj|BAE32970.1| unnamed protein product [Mus musculus]
gi|76827995|gb|AAI07010.1| GATA binding protein 2 [Mus musculus]
gi|148666827|gb|EDK99243.1| GATA binding protein 2 [Mus musculus]
Length = 480
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|111493996|gb|AAI05656.1| Gata5 protein [Mus musculus]
Length = 320
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/73 (65%), Positives = 56/73 (76%), Gaps = 4/73 (5%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGN- 131
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R +
Sbjct: 104 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSG 163
Query: 132 ---FNCYERSTLF 141
NC+ +T
Sbjct: 164 LCCSNCHTATTTL 176
Score = 100 bits (250), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 69/116 (59%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 104 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 155
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 156 LSSSRRSGLCCSNCHTATTTLWRRNSEGGPVCNACGLYMKLHGVPRPLAMKKESIQ 211
>gi|126336247|ref|XP_001366872.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Monodelphis
domestica]
Length = 473
Score = 106 bits (264), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 283 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 341
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 283 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 337
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 338 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 379
>gi|1841756|gb|AAB47506.1| GATA-5 cardiac transcription factor [Mus musculus]
Length = 404
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/73 (65%), Positives = 56/73 (76%), Gaps = 4/73 (5%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGN- 131
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R +
Sbjct: 188 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSG 247
Query: 132 ---FNCYERSTLF 141
NC+ +T
Sbjct: 248 LCCSNCHTATTTL 260
Score = 101 bits (252), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 69/116 (59%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 188 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 239
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 240 LSSSRRSGLCCSNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 295
>gi|33416800|gb|AAH55963.1| Gata-5b protein [Xenopus laevis]
Length = 388
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL R
Sbjct: 176 EFHGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSSR 232
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 70/116 (60%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L ++ +
Sbjct: 176 EFHGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---INRPL-----IKPQKR 227
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 228 LSSSRRAGLCCTNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 283
>gi|148230811|ref|NP_001079831.1| GATA-binding factor 5-B [Xenopus laevis]
gi|1169807|sp|P43696.1|GAT5B_XENLA RecName: Full=GATA-binding factor 5-B; AltName: Full=Transcription
factor xGATA-5B
gi|311052|gb|AAA63687.1| transcription factor xGATA-5b [Xenopus laevis]
Length = 388
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL R
Sbjct: 176 EFHGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSSR 232
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 70/116 (60%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L ++ +
Sbjct: 176 EFHGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---INRPL-----IKPQKR 227
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 228 LSSSRRAGLCCTNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 283
>gi|426392409|ref|XP_004062545.1| PREDICTED: transcription factor GATA-5 [Gorilla gorilla gorilla]
Length = 479
Score = 106 bits (264), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 263 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 319
Score = 90.5 bits (223), Expect = 2e-16, Method: Composition-based stats.
Identities = 54/113 (47%), Positives = 69/113 (61%), Gaps = 6/113 (5%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L P + +
Sbjct: 263 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPLVR--PQKRLSS 317
Query: 75 FTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
G C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 318 SRRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 370
>gi|181337117|ref|NP_032119.2| transcription factor GATA-5 [Mus musculus]
gi|341940731|sp|P97489.2|GATA5_MOUSE RecName: Full=Transcription factor GATA-5; AltName:
Full=GATA-binding factor 5
gi|74204896|dbj|BAE20945.1| unnamed protein product [Mus musculus]
gi|74209194|dbj|BAE24979.1| unnamed protein product [Mus musculus]
gi|148675383|gb|EDL07330.1| GATA binding protein 5 [Mus musculus]
Length = 404
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/73 (65%), Positives = 56/73 (76%), Gaps = 4/73 (5%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGN- 131
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R +
Sbjct: 188 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSG 247
Query: 132 ---FNCYERSTLF 141
NC+ +T
Sbjct: 248 LCCSNCHTATTTL 260
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 69/116 (59%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 188 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 239
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 240 LSSSRRSGLCCSNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 295
>gi|300797896|ref|NP_001179043.1| endothelial transcription factor GATA-2 [Bos taurus]
Length = 480
Score = 106 bits (264), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|402887121|ref|XP_003906953.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Papio
anubis]
gi|402887123|ref|XP_003906954.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Papio
anubis]
Length = 480
Score = 106 bits (264), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|109098094|ref|XP_001097801.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Macaca mulatta]
Length = 480
Score = 106 bits (264), Expect = 3e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|426341992|ref|XP_004036301.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Gorilla gorilla gorilla]
gi|426341996|ref|XP_004036303.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Gorilla gorilla gorilla]
Length = 480
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|47523100|ref|NP_999044.1| endothelial transcription factor GATA-2 [Sus scrofa]
gi|29470193|gb|AAO73945.1| transcription factor GATA-2 [Sus scrofa]
Length = 480
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|351706178|gb|EHB09097.1| Endothelial transcription factor GATA-2 [Heterocephalus glaber]
Length = 480
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|149728223|ref|XP_001488214.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Equus
caballus]
Length = 480
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|12835796|dbj|BAB23365.1| unnamed protein product [Mus musculus]
Length = 480
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|157169243|gb|ABV25956.1| GATA-binding transcription factor B3 [Capitella teleta]
Length = 372
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 46/74 (62%), Positives = 58/74 (78%), Gaps = 3/74 (4%)
Query: 55 GYFIMSRFLFA---SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKM 111
G+F +++ A + +YF EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKM
Sbjct: 133 GFFSSQQYMAACRSGSAYDQSDYFCEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKM 192
Query: 112 NGMNRPLVKQPRRL 125
NG+NRP++K +R+
Sbjct: 193 NGLNRPMMKPQKRM 206
Score = 105 bits (261), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 70/108 (64%), Gaps = 6/108 (5%)
Query: 12 EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVED 71
+ +YF EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R + P +
Sbjct: 151 DQSDYFCEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---LNRPMMK--PQKR 205
Query: 72 I-EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G +C NC +T LWRR+ G +CNACGLY K++G++RP+
Sbjct: 206 MCASRRMGLQCANCQTSTTTLWRRNADGEPVCNACGLYFKLHGVHRPM 253
>gi|407025369|gb|AFS65551.1| Gata1/2/3, partial [Parastichopus parvimensis]
Length = 261
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 47/70 (67%), Positives = 51/70 (72%)
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
++ P TEGRECVNCGA STPLWRRDG GHYLCNACGLYHKMNG NRPL+K RR
Sbjct: 169 STKPRAKSRSSTEGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNGQNRPLIKPKRR 228
Query: 125 LRFDRGNFNC 134
L R +C
Sbjct: 229 LSSKRTGTSC 238
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 53/98 (54%), Gaps = 5/98 (5%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFAS 66
++ P TEGRECVNCGA STPLWRRDG GHYLCNA G +R L
Sbjct: 169 STKPRAKSRSSTEGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNG---QNRPLIK- 224
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 104
P + G C NC A +T LWRR+ G +CNA
Sbjct: 225 -PKRRLSSKRTGTSCANCQATTTTLWRRNPNGEPVCNA 261
>gi|395847143|ref|XP_003796243.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Otolemur garnettii]
gi|395847147|ref|XP_003796245.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Otolemur garnettii]
Length = 480
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|345308422|ref|XP_001505950.2| PREDICTED: GATA-binding factor 5-A-like [Ornithorhynchus anatinus]
Length = 397
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL R
Sbjct: 182 EFPGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSSR 238
Score = 90.1 bits (222), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 69/116 (59%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L ++ +
Sbjct: 182 EFPGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---INRPL-----IKPQKR 233
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 234 LSSSRRAGLCCTNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 289
>gi|194044318|ref|XP_001924217.1| PREDICTED: transcription factor GATA-5-like [Sus scrofa]
Length = 400
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 183 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 239
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 183 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 234
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 235 LSSSRRAGLCCTNCRTTNTTLWRRNADGEPVCNACGLYMKLHGVPRPLA 283
>gi|328697578|ref|XP_001945293.2| PREDICTED: hypothetical protein LOC100159936 [Acyrthosiphon pisum]
Length = 657
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 62/102 (60%), Gaps = 5/102 (4%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE- 77
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L ++
Sbjct: 418 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLSLQSAARR 474
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 475 AGTSCANCKTSTTTLWRRNQNGEPVCNACGLYYKLHNVNRPL 516
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 418 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 466
>gi|258645094|ref|NP_001158260.1| grain [Tribolium castaneum]
Length = 492
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 62/102 (60%), Gaps = 5/102 (4%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE- 77
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L ++
Sbjct: 265 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLSLQSAARR 321
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 322 AGTSCANCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVNRPL 363
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 265 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 313
>gi|296225992|ref|XP_002758738.1| PREDICTED: endothelial transcription factor GATA-2 [Callithrix
jacchus]
gi|403268240|ref|XP_003926186.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Saimiri boliviensis boliviensis]
gi|403268242|ref|XP_003926187.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Saimiri boliviensis boliviensis]
gi|403268244|ref|XP_003926188.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Saimiri boliviensis boliviensis]
Length = 480
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|344306282|ref|XP_003421817.1| PREDICTED: transcription factor GATA-5-like [Loxodonta africana]
Length = 409
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 193 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGINRPLVQPQKRLSSSR 249
Score = 91.7 bits (226), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V+ +
Sbjct: 193 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---INRPL-----VQPQKR 244
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR G +CNACGLY K++G+ RPL
Sbjct: 245 LSSSRRAGLCCTNCRTTNTTLWRRSAEGEPVCNACGLYMKLHGVPRPLA 293
>gi|156147115|gb|ABU53701.1| GATA4 [Oreochromis niloticus]
Length = 392
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 45/61 (73%), Positives = 51/61 (83%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
++ + F EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 174 IDLFDDFAEGRECVNCGAMPTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 129 R 129
R
Sbjct: 234 R 234
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/115 (47%), Positives = 66/115 (57%), Gaps = 12/115 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFAS 66
++ + F EGRECVNCGA+ TPLWRRDGTGHYLCNA G R L AS
Sbjct: 174 IDLFDDFAEGRECVNCGAMPTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 233
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL +
Sbjct: 234 RRV--------GLSCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMK 280
>gi|30584557|gb|AAP36531.1| Homo sapiens hypothetical protein MGC2306 [synthetic construct]
gi|61371794|gb|AAX43732.1| GATA binding protein 2 [synthetic construct]
gi|61371798|gb|AAX43733.1| GATA binding protein 2 [synthetic construct]
Length = 481
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|291393346|ref|XP_002713202.1| PREDICTED: GATA binding protein 2 isoform 2 [Oryctolagus cuniculus]
Length = 480
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|20070352|ref|NP_116027.2| endothelial transcription factor GATA-2 isoform 1 [Homo sapiens]
gi|224611699|ref|NP_001139133.1| endothelial transcription factor GATA-2 isoform 1 [Homo sapiens]
gi|114589043|ref|XP_516728.2| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Pan
troglodytes]
gi|332261773|ref|XP_003279941.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Nomascus leucogenys]
gi|397518553|ref|XP_003829449.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Pan
paniscus]
gi|397518555|ref|XP_003829450.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Pan
paniscus]
gi|397518557|ref|XP_003829451.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3 [Pan
paniscus]
gi|229462971|sp|P23769.3|GATA2_HUMAN RecName: Full=Endothelial transcription factor GATA-2; AltName:
Full=GATA-binding protein 2
gi|15990463|gb|AAH15613.1| GATA binding protein 2 [Homo sapiens]
gi|30410997|gb|AAH51342.1| GATA binding protein 2 [Homo sapiens]
gi|119599719|gb|EAW79313.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|119599721|gb|EAW79315.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|119599722|gb|EAW79316.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|119599723|gb|EAW79317.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|208966348|dbj|BAG73188.1| GATA binding protein 2 [synthetic construct]
gi|410209098|gb|JAA01768.1| GATA binding protein 2 [Pan troglodytes]
gi|410267030|gb|JAA21481.1| GATA binding protein 2 [Pan troglodytes]
gi|410288416|gb|JAA22808.1| GATA binding protein 2 [Pan troglodytes]
Length = 480
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|12803465|gb|AAH02557.1| GATA binding protein 2 [Homo sapiens]
gi|30582181|gb|AAP35317.1| hypothetical protein MGC2306 [Homo sapiens]
gi|60655087|gb|AAX32107.1| GATA binding protein 2 [synthetic construct]
gi|123982652|gb|ABM83067.1| GATA binding protein 2 [synthetic construct]
gi|123997319|gb|ABM86261.1| GATA binding protein 2 [synthetic construct]
gi|189054715|dbj|BAG37347.1| unnamed protein product [Homo sapiens]
Length = 480
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|354482833|ref|XP_003503600.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Cricetulus griseus]
gi|344253365|gb|EGW09469.1| Endothelial transcription factor GATA-2 [Cricetulus griseus]
Length = 480
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|395506703|ref|XP_003757670.1| PREDICTED: GATA-binding factor 5-A-like [Sarcophilus harrisii]
Length = 398
Score = 105 bits (263), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL R
Sbjct: 183 EFPGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSTR 239
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L P + +
Sbjct: 183 EFPGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---INRPLIK--PQKRLSS 237
Query: 75 F-TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 238 TRRAGLCCTNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 283
>gi|148234322|ref|NP_001084043.1| GATA-binding factor 2 [Xenopus laevis]
gi|80476387|gb|AAI08545.1| XGATA-2 protein [Xenopus laevis]
Length = 453
Score = 105 bits (263), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 263 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 321
Score = 103 bits (256), Expect = 3e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 263 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 317
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 318 AGTCCANCQTSTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 359
>gi|301607037|ref|XP_002933110.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Xenopus (Silurana)
tropicalis]
Length = 451
Score = 105 bits (263), Expect = 4e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 261 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 319
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 261 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 315
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 316 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 357
>gi|354481949|ref|XP_003503163.1| PREDICTED: transcription factor GATA-5 [Cricetulus griseus]
gi|344254951|gb|EGW11055.1| Transcription factor GATA-5 [Cricetulus griseus]
Length = 406
Score = 105 bits (263), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 48/73 (65%), Positives = 56/73 (76%), Gaps = 4/73 (5%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGN- 131
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R +
Sbjct: 189 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSTRRSG 248
Query: 132 ---FNCYERSTLF 141
NC+ +T
Sbjct: 249 LCCSNCHTATTTL 261
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 69/116 (59%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 189 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 240
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 241 LSSTRRSGLCCSNCHTATTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 296
>gi|120974910|gb|ABM46761.1| GATA4 [Gorilla gorilla]
Length = 176
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 59
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 64/107 (59%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 61
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 62 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 103
>gi|410919499|ref|XP_003973222.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Takifugu rubripes]
Length = 456
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 46/59 (77%), Positives = 47/59 (79%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
TEGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 271 TEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 329
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRD TGHYLCNA G +R L P +
Sbjct: 271 TEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 325
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 326 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 367
>gi|348501136|ref|XP_003438126.1| PREDICTED: transcription factor GATA-6-like [Oreochromis niloticus]
Length = 491
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 52/100 (52%), Positives = 62/100 (62%), Gaps = 4/100 (4%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCG+ISTPLWRRDGTGH+LCNA + G +SR L P G
Sbjct: 287 ESRECVNCGSISTPLWRRDGTGHFLCNACGLYSKMNG---LSRPLIKP-PKRTSTSRRIG 342
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 343 LSCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVPRPL 382
Score = 99.8 bits (247), Expect = 3e-19, Method: Composition-based stats.
Identities = 40/53 (75%), Positives = 46/53 (86%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCG+ISTPLWRRDGTGH+LCNACGLY KMNG++RPL+K P+R R
Sbjct: 287 ESRECVNCGSISTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPPKRTSTSR 339
>gi|182998|gb|AAA35869.1| transcription factor GATA-2 [Homo sapiens]
Length = 474
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 64/102 (62%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA F G +R L P +
Sbjct: 284 SEGRECVNCGATATPLWRRDGTGHYLCNACGFYHKMKG---QNRPLIK--PKRRLSAARR 338
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 339 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 380
Score = 102 bits (254), Expect = 6e-20, Method: Composition-based stats.
Identities = 43/59 (72%), Positives = 46/59 (77%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACG YHKM G NRPL+K RRL R C
Sbjct: 284 SEGRECVNCGATATPLWRRDGTGHYLCNACGFYHKMKGQNRPLIKPKRRLSAARRAGTC 342
>gi|301607039|ref|XP_002933111.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Xenopus (Silurana)
tropicalis]
Length = 453
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 263 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 321
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 263 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 317
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 318 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 359
>gi|395752542|ref|XP_002830553.2| PREDICTED: transcription factor GATA-5 [Pongo abelii]
Length = 397
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 181 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 237
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 181 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 232
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 233 LSSSRRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLA 281
>gi|260809775|ref|XP_002599680.1| hypothetical protein BRAFLDRAFT_205741 [Branchiostoma floridae]
gi|229284961|gb|EEN55692.1| hypothetical protein BRAFLDRAFT_205741 [Branchiostoma floridae]
Length = 473
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 52/100 (52%), Positives = 62/100 (62%), Gaps = 6/100 (6%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE-G 78
GRECVNCGA STPLWRRDGTGHYLCNA G +R L P + G
Sbjct: 273 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARRAG 327
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 328 TQCANCKTTTTTLWRRNQNGDPVCNACGLYYKLHAVNRPL 367
Score = 105 bits (262), Expect = 7e-21, Method: Composition-based stats.
Identities = 44/52 (84%), Positives = 45/52 (86%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 273 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 324
>gi|111493951|gb|AAI05654.1| Gata5 protein [Mus musculus]
Length = 350
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 48/73 (65%), Positives = 56/73 (76%), Gaps = 4/73 (5%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGN- 131
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R +
Sbjct: 134 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSG 193
Query: 132 ---FNCYERSTLF 141
NC+ +T
Sbjct: 194 LCCSNCHTATTTL 206
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 69/116 (59%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 134 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 185
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 186 LSSSRRSGLCCSNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 241
>gi|77735401|ref|NP_001029393.1| transcription factor GATA-5 [Bos taurus]
gi|122140236|sp|Q3SZJ5.1|GATA5_BOVIN RecName: Full=Transcription factor GATA-5; AltName:
Full=GATA-binding factor 5
gi|74354092|gb|AAI02822.1| GATA binding protein 5 [Bos taurus]
gi|296481044|tpg|DAA23159.1| TPA: transcription factor GATA-5 [Bos taurus]
Length = 403
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 186 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 242
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 186 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 237
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 238 LSSSRRAGLCCTNCHTTTTTLWRRNVDGEPVCNACGLYMKLHGVPRPLA 286
>gi|3702858|gb|AAC62961.1| GATA transcription factor [Strongylocentrotus purpuratus]
Length = 119
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 55/100 (55%), Positives = 63/100 (63%), Gaps = 3/100 (3%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR 79
GRECVNCGAISTPLWRRDGTGHYLCNA GY +R L + EG
Sbjct: 1 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGY---NRPLIKNPRRLQSGSRREGI 57
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
C NC +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 58 TCANCHTSTTTLWRRNKDGEPVCNACGLYFKLHGVNRPLA 97
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 45/49 (91%), Positives = 47/49 (95%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG NRPL+K PRRL+
Sbjct: 1 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGYNRPLIKNPRRLQ 49
>gi|301630516|ref|XP_002944363.1| PREDICTED: GATA-binding factor 5-B-like, partial [Xenopus
(Silurana) tropicalis]
Length = 217
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL R
Sbjct: 4 EFHGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSSR 60
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 70/116 (60%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L ++ +
Sbjct: 4 EFHGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---INRPL-----IKPQKR 55
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 56 LSSSRRAGLCCTNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 111
>gi|133777354|gb|AAI07011.2| Gata2 protein [Mus musculus]
Length = 447
Score = 105 bits (263), Expect = 5e-21, Method: Composition-based stats.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 257 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 315
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 257 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 311
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 312 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 353
>gi|384942494|gb|AFI34852.1| transcription factor GATA-5 [Macaca mulatta]
Length = 397
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 181 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 237
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/131 (42%), Positives = 73/131 (55%), Gaps = 22/131 (16%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 181 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 232
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV----------K 120
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL +
Sbjct: 233 LSSSRRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQTRKR 292
Query: 121 QPRRLRFDRGN 131
+P+ + RG+
Sbjct: 293 KPKTIAKTRGS 303
>gi|426241173|ref|XP_004014466.1| PREDICTED: transcription factor GATA-5 [Ovis aries]
Length = 403
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 186 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 242
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 186 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 237
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 238 LSSSRRAGLCCTNCHTTTTTLWRRNADGEPVCNACGLYMKLHGVPRPLA 286
>gi|332858914|ref|XP_514767.3| PREDICTED: transcription factor GATA-5 [Pan troglodytes]
Length = 397
Score = 105 bits (262), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 181 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 237
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 181 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 232
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 233 LSSSRRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLA 281
>gi|74185986|dbj|BAE34137.1| unnamed protein product [Mus musculus]
Length = 408
Score = 105 bits (262), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 223 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 276
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 63/103 (61%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 223 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 277
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 278 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 320
>gi|37595582|gb|AAQ94635.1| Gata6 protein [Danio rerio]
gi|111185536|gb|AAH67710.2| Gata6 protein [Danio rerio]
Length = 501
Score = 105 bits (262), Expect = 6e-21, Method: Composition-based stats.
Identities = 52/106 (49%), Positives = 66/106 (62%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E E RECVNCG++STPLWRRDGTGH+LCNA + G +SR L P + +
Sbjct: 288 LEEMVESRECVNCGSMSTPLWRRDGTGHFLCNACGLYSKMNG---LSRPLIK--PQKRMS 342
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 343 SSRRIGLSCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVPRPL 388
Score = 98.6 bits (244), Expect = 7e-19, Method: Composition-based stats.
Identities = 39/58 (67%), Positives = 48/58 (82%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+E E RECVNCG++STPLWRRDGTGH+LCNACGLY KMNG++RPL+K +R+ R
Sbjct: 288 LEEMVESRECVNCGSMSTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRMSSSR 345
>gi|111494048|gb|AAI05655.1| Gata5 protein [Mus musculus]
Length = 271
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 48/73 (65%), Positives = 56/73 (76%), Gaps = 4/73 (5%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGN- 131
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R +
Sbjct: 55 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSG 114
Query: 132 ---FNCYERSTLF 141
NC+ +T
Sbjct: 115 LCCSNCHTATTTL 127
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 69/116 (59%), Gaps = 12/116 (10%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 55 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 106
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 107 LSSSRRSGLCCSNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 162
>gi|387538427|gb|AFJ79490.1| GATA binding protein 1/2/3, partial [Branchiostoma lanceolatum]
Length = 278
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 64/103 (62%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 115 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 169
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 170 AGTQCANCKTTTTTLWRRNQNGDPVCNACGLYYKLHAVNRPLT 212
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 115 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 168
>gi|348540838|ref|XP_003457894.1| PREDICTED: GATA-binding factor 5-A-like [Oreochromis niloticus]
Length = 383
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 44/58 (75%), Positives = 53/58 (91%), Gaps = 2/58 (3%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
++D+ TEGRECVNCG++STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL+
Sbjct: 176 IDDMP--TEGRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLQ 231
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 69/116 (59%), Gaps = 5/116 (4%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVE 70
++D+ TEGRECVNCG++STPLWRRDGTGHYLCNA G ++R L
Sbjct: 176 IDDMP--TEGRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNG---INRPLIKPQKRL 230
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
G C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 231 QTTSRRAGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 286
>gi|17998698|ref|NP_536721.1| transcription factor GATA-5 [Homo sapiens]
gi|20138325|sp|Q9BWX5.1|GATA5_HUMAN RecName: Full=Transcription factor GATA-5; AltName:
Full=GATA-binding factor 5
gi|109658580|gb|AAI17359.1| GATA binding protein 5 [Homo sapiens]
gi|109658884|gb|AAI17357.1| GATA binding protein 5 [Homo sapiens]
gi|119595766|gb|EAW75360.1| GATA binding protein 5, isoform CRA_a [Homo sapiens]
gi|119595767|gb|EAW75361.1| GATA binding protein 5, isoform CRA_a [Homo sapiens]
gi|302313183|gb|ADL14516.1| GATA binding protein 5 [Homo sapiens]
Length = 397
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 181 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 237
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 181 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 232
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 233 LSSSRRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLA 281
>gi|326911086|ref|XP_003201893.1| PREDICTED: GATA-binding factor 3-like [Meleagris gallopavo]
Length = 408
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 223 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 276
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 63/103 (61%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 223 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 277
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 278 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 320
>gi|342671956|dbj|BAK57316.1| transcription factor GATA5 [Ovis aries]
Length = 333
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 116 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 172
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 116 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 167
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 168 LSSSRRAGLCCTNCHTTTTTLWRRNADGEPVCNACGLYMKLHGVPRPLA 216
>gi|345495111|ref|XP_003427437.1| PREDICTED: hypothetical protein LOC100680343 [Nasonia vitripennis]
Length = 478
Score = 105 bits (262), Expect = 6e-21, Method: Composition-based stats.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 413 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 461
>gi|29126838|gb|AAH47790.1| GATA5 protein, partial [Homo sapiens]
Length = 417
Score = 105 bits (262), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 201 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 257
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 201 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 252
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 253 LSSSRRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLA 301
>gi|449473953|ref|XP_002194063.2| PREDICTED: GATA-binding factor 2-like [Taeniopygia guttata]
Length = 347
Score = 105 bits (262), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 61/99 (61%), Gaps = 9/99 (9%)
Query: 42 YLCNARLFSTSFTGYFIMSRFL------FASLPVEDIEYFTEGRECVNCGAISTPLWRRD 95
Y+ A ++S+S F FL F P +EGRECVNCGA +TPLWRRD
Sbjct: 120 YVPAAHVYSSSL---FHPGSFLGGPASSFTPKPRSKARSCSEGRECVNCGATATPLWRRD 176
Query: 96 GTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
GTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 177 GTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 215
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/120 (44%), Positives = 67/120 (55%), Gaps = 12/120 (10%)
Query: 6 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA 65
F P +EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L
Sbjct: 145 FTPKPRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPL-- 199
Query: 66 SLPVEDIEYFTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
++ + R C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 200 ---IKPKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMK 256
>gi|237512877|dbj|BAH58791.1| GATA-binding protein 5 [Polypterus senegalus]
Length = 395
Score = 105 bits (262), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 50/54 (92%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
E+ EGRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL+
Sbjct: 181 EFPGEGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLQ 234
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 53/112 (47%), Positives = 66/112 (58%), Gaps = 3/112 (2%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCG+ISTPLWRRDGTGHYLCNA G ++R L
Sbjct: 181 EFPGEGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNG---INRPLIKPQKRLQTSS 237
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
G C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 238 RRAGLCCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 289
>gi|162316932|gb|ABX84143.1| GATA-binding protein 6 [Microtus levis]
Length = 248
Score = 105 bits (262), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 57/120 (47%), Positives = 71/120 (59%), Gaps = 12/120 (10%)
Query: 7 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMS 60
A LPV + +E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +S
Sbjct: 28 APLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LS 84
Query: 61 RFLFASLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R L P + + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 85 RPLIK--PQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 142
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/87 (54%), Positives = 58/87 (66%), Gaps = 10/87 (11%)
Query: 65 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
A LPV + +E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL
Sbjct: 28 APLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPL 87
Query: 119 VKQPRRL----RFDRGNFNCYERSTLF 141
+K +R+ R NC+ +T
Sbjct: 88 IKPQKRVPSSRRLGLSCANCHTTTTTL 114
>gi|268551963|ref|XP_002633964.1| C. briggsae CBR-ELT-1 protein [Caenorhabditis briggsae]
Length = 483
Score = 105 bits (261), Expect = 7e-21, Method: Composition-based stats.
Identities = 52/104 (50%), Positives = 65/104 (62%), Gaps = 9/104 (8%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIM---SRFLFASLPVEDIEY 74
TE RECVNCG +TPLWRRDG+G+YLCNA YF M +R L +
Sbjct: 279 TEDRECVNCGVHATPLWRRDGSGNYLCNACGL------YFKMNHQARPLVKPKKRQQNAQ 332
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G ECVNC +T LWRR+G GH +CNACGLY+K++ + RP+
Sbjct: 333 KRTGIECVNCHTNNTTLWRRNGEGHPVCNACGLYYKLHKVERPM 376
Score = 87.0 bits (214), Expect = 3e-15, Method: Composition-based stats.
Identities = 36/49 (73%), Positives = 40/49 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
TE RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVK +R
Sbjct: 279 TEDRECVNCGVHATPLWRRDGSGNYLCNACGLYFKMNHQARPLVKPKKR 327
>gi|238863842|gb|ACR66215.1| transcription factor GATA123s [Branchiostoma floridae]
Length = 337
Score = 105 bits (261), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 135 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 188
Score = 104 bits (260), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 64/103 (62%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 135 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 189
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 190 AGTQCANCKTTTTTLWRRNQNGDPVCNACGLYYKLHAVNRPLT 232
>gi|356582220|ref|NP_001239112.1| trans-acting T-cell-specific transcription factor GATA-3 [Ovis
aries]
gi|342671952|dbj|BAK57314.1| transcription factor GATA3 [Ovis aries]
Length = 444
Score = 105 bits (261), Expect = 7e-21, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
T GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 259 TLGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 355
Score = 105 bits (261), Expect = 8e-21, Method: Composition-based stats.
Identities = 45/54 (83%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
T GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 259 TLGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 312
>gi|397479159|ref|XP_003810895.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-5 [Pan
paniscus]
Length = 532
Score = 105 bits (261), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 316 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 372
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 316 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 367
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 368 LSSSRRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLA 416
>gi|281339813|gb|EFB15397.1| hypothetical protein PANDA_008702 [Ailuropoda melanoleuca]
Length = 412
Score = 105 bits (261), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 227 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 280
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/105 (51%), Positives = 64/105 (60%), Gaps = 6/105 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 227 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 281
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 282 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLTMK 326
>gi|147903431|ref|NP_001083725.1| GATA-binding factor 6-B [Xenopus laevis]
gi|51950143|gb|AAH82349.1| Gata-6 protein [Xenopus laevis]
Length = 502
Score = 105 bits (261), Expect = 8e-21, Method: Composition-based stats.
Identities = 52/110 (47%), Positives = 67/110 (60%), Gaps = 6/110 (5%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV 69
P + ++ E RECVNCG++ TPLWRRDGTGHYLCNA + G +SR L P
Sbjct: 280 PGDVLDELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQ 334
Query: 70 EDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 335 KRVPSSRRIGLACANCHTTTTTLWRRNTEGEPVCNACGLYMKLHGVPRPL 384
Score = 98.2 bits (243), Expect = 1e-18, Method: Composition-based stats.
Identities = 42/76 (55%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 68 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL-- 125
P + ++ E RECVNCG++ TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+
Sbjct: 280 PGDVLDELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPS 339
Query: 126 --RFDRGNFNCYERST 139
R NC+ +T
Sbjct: 340 SRRIGLACANCHTTTT 355
>gi|431917650|gb|ELK16915.1| Trans-acting T-cell-specific transcription factor GATA-3 [Pteropus
alecto]
Length = 409
Score = 105 bits (261), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 224 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 277
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/105 (51%), Positives = 64/105 (60%), Gaps = 6/105 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 224 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 278
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 279 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLTMK 323
>gi|238863844|gb|ACR66216.1| transcription factor GATA456a, partial [Branchiostoma floridae]
Length = 424
Score = 105 bits (261), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 45/54 (83%), Positives = 48/54 (88%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 289 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLIKPQRRLSASR 342
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 62/106 (58%), Gaps = 12/106 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIE 73
+EGRECVNCGA STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 289 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLIKPQRRLSASRRV---- 344
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G +C NC T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 345 ----GLQCANCRTTQTTLWRRNNEGEPVCNACGLYFKLHNVNRPLA 386
>gi|19386566|gb|AAL86577.1| GATA-3 [Raja eglanteria]
Length = 444
Score = 105 bits (261), Expect = 9e-21, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+ GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 254 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 308
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 309 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 350
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 44/54 (81%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 254 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 307
>gi|238863846|gb|ACR66217.1| transcription factor GATA456b, partial [Branchiostoma floridae]
Length = 380
Score = 105 bits (261), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 45/54 (83%), Positives = 48/54 (88%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 289 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLIKPQRRLSASR 342
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 58/99 (58%), Gaps = 12/99 (12%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIE 73
+EGRECVNCGA STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 289 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLIKPQRRLSASRRV---- 344
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 112
G +C NC T LWRR+ G +CNACGLY+K++
Sbjct: 345 ----GLQCANCRTTQTTLWRRNNEGEPVCNACGLYYKLH 379
>gi|326917513|ref|XP_003205043.1| PREDICTED: transcription factor GATA-6-like [Meleagris gallopavo]
Length = 402
Score = 104 bits (260), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 55/112 (49%), Positives = 68/112 (60%), Gaps = 6/112 (5%)
Query: 9 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLP 68
L E +E +E RECVNCG+I TPLWRRDGTG+YLCNA T G +SR L P
Sbjct: 182 LSAELLEDLSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNG---LSRPLIK--P 236
Query: 69 VEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 237 QKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 288
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 44/79 (55%), Positives = 55/79 (69%), Gaps = 4/79 (5%)
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL- 125
L E +E +E RECVNCG+I TPLWRRDGTG+YLCNACGLY KMNG++RPL+K +R+
Sbjct: 182 LSAELLEDLSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRPLIKPQKRVP 241
Query: 126 ---RFDRGNFNCYERSTLF 141
R NC+ +T
Sbjct: 242 SSRRLGLSCANCHTTTTTL 260
>gi|45361689|ref|NP_989422.1| GATA binding protein 6 [Xenopus (Silurana) tropicalis]
gi|32442468|gb|AAP82292.1| zinc-finger transcription factor Gata6 [Xenopus (Silurana)
tropicalis]
Length = 524
Score = 104 bits (260), Expect = 9e-21, Method: Composition-based stats.
Identities = 52/110 (47%), Positives = 67/110 (60%), Gaps = 6/110 (5%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV 69
P + ++ E RECVNCG++ TPLWRRDGTGHYLCNA + G +SR L P
Sbjct: 303 PGDVLDELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQ 357
Query: 70 EDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 358 KRVPSSRRIGLACANCHTTTTTLWRRNTEGEPVCNACGLYMKLHGVPRPL 407
Score = 97.8 bits (242), Expect = 1e-18, Method: Composition-based stats.
Identities = 42/76 (55%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 68 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL-- 125
P + ++ E RECVNCG++ TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+
Sbjct: 303 PGDVLDELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPS 362
Query: 126 --RFDRGNFNCYERST 139
R NC+ +T
Sbjct: 363 SRRIGLACANCHTTTT 378
>gi|301769219|ref|XP_002920028.1| PREDICTED: trans-acting T-cell-specific transcription factor
GATA-3-like [Ailuropoda melanoleuca]
Length = 411
Score = 104 bits (260), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 226 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 279
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/105 (51%), Positives = 64/105 (60%), Gaps = 6/105 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 226 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 280
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 281 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLTMK 325
>gi|395823099|ref|XP_003784834.1| PREDICTED: transcription factor GATA-6 [Otolemur garnettii]
Length = 589
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 53/108 (49%), Positives = 64/108 (59%), Gaps = 10/108 (9%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFAS---LPVE 70
+E E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L +P
Sbjct: 375 LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIKQQKRVPSS 431
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 432 R----RLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 475
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 44/72 (61%), Positives = 53/72 (73%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KQ +R+ R
Sbjct: 375 LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKQQKRVPSSRRL 434
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 435 GLSCANCHTTTT 446
>gi|327290258|ref|XP_003229840.1| PREDICTED: trans-acting T-cell-specific transcription factor
GATA-3-like, partial [Anolis carolinensis]
Length = 364
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 179 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 232
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/105 (51%), Positives = 64/105 (60%), Gaps = 6/105 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 179 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 233
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 234 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLTMK 278
>gi|30584633|gb|AAP36569.1| Homo sapiens GATA binding protein 3 [synthetic construct]
gi|61372534|gb|AAX43861.1| GATA binding protein 3 [synthetic construct]
gi|61372538|gb|AAX43862.1| GATA binding protein 3 [synthetic construct]
Length = 444
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+ GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 312
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 313 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 354
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 44/54 (81%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 311
>gi|403282679|ref|XP_003932769.1| PREDICTED: transcription factor GATA-5 [Saimiri boliviensis
boliviensis]
Length = 288
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/68 (69%), Positives = 54/68 (79%), Gaps = 5/68 (7%)
Query: 67 LPVEDI-----EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
LPV + E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL++
Sbjct: 61 LPVPSVSDFLEEFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLLRP 120
Query: 122 PRRLRFDR 129
+RL R
Sbjct: 121 QKRLSSSR 128
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/117 (48%), Positives = 69/117 (58%), Gaps = 11/117 (9%)
Query: 9 LPVEDI-----EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFL 63
LPV + E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L
Sbjct: 61 LPVPSVSDFLEEFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL 117
Query: 64 FASLPVEDIEYF-TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
P + + G C NC +T LWRR G +CNACGLY K++G+ RPL
Sbjct: 118 L--RPQKRLSSSRRAGLCCTNCHTTTTTLWRRSSEGEPVCNACGLYMKLHGVPRPLA 172
>gi|74207086|dbj|BAE33319.1| unnamed protein product [Mus musculus]
Length = 442
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+ GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 257 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 311
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 312 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 353
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 44/54 (81%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 257 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 310
>gi|4503929|ref|NP_002042.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 2
[Homo sapiens]
gi|120962|sp|P23771.1|GATA3_HUMAN RecName: Full=Trans-acting T-cell-specific transcription factor
GATA-3; AltName: Full=GATA-binding factor 3
gi|31666|emb|CAA41102.1| hGATA3 transcription factor [Homo sapiens]
gi|13111766|gb|AAH03070.1| GATA binding protein 3 [Homo sapiens]
gi|39843077|gb|AAR32096.1| GATA binding protein 3 [Homo sapiens]
gi|119606773|gb|EAW86367.1| GATA binding protein 3, isoform CRA_a [Homo sapiens]
Length = 443
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+ GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 312
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 313 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 354
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 44/54 (81%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 311
>gi|116003829|ref|NP_001070272.1| trans-acting T-cell-specific transcription factor GATA-3 [Bos
taurus]
gi|122132376|sp|Q08DV0.1|GATA3_BOVIN RecName: Full=Trans-acting T-cell-specific transcription factor
GATA-3; AltName: Full=GATA-binding factor 3
gi|115305262|gb|AAI23556.1| GATA binding protein 3 [Bos taurus]
gi|296481519|tpg|DAA23634.1| TPA: trans-acting T-cell-specific transcription factor GATA-3 [Bos
taurus]
Length = 443
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+ GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 312
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 313 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 354
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 44/54 (81%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 311
>gi|117646902|emb|CAL37566.1| hypothetical protein [synthetic construct]
Length = 443
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+ GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 312
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 313 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 354
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 44/54 (81%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 311
>gi|31662|emb|CAA38877.1| GATA-3 [Homo sapiens]
Length = 443
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 52/102 (50%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+ GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 312
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 313 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 354
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 44/54 (81%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 311
>gi|355689768|gb|AER98940.1| GATA binding protein 3 [Mustela putorius furo]
Length = 328
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/84 (61%), Positives = 55/84 (65%), Gaps = 8/84 (9%)
Query: 46 ARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNAC 105
+ L S TG+ SR TEGRECVNCGA STPLWRRDGTGHYLCNAC
Sbjct: 241 SSLLGGSPTGFGCKSR--------PKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNAC 292
Query: 106 GLYHKMNGMNRPLVKQPRRLRFDR 129
GLYHKMNG NRPL+K RRL R
Sbjct: 293 GLYHKMNGQNRPLIKPKRRLSAAR 316
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 28/29 (96%), Positives = 28/29 (96%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
TEGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 263 TEGRECVNCGATSTPLWRRDGTGHYLCNA 291
>gi|410923801|ref|XP_003975370.1| PREDICTED: GATA-binding factor 6-A-like [Takifugu rubripes]
Length = 494
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 55/125 (44%), Positives = 70/125 (56%), Gaps = 14/125 (11%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCG+ISTPLWRRDGTGH+LCNA + G +SR L G
Sbjct: 291 ESRECVNCGSISTPLWRRDGTGHFLCNACGLYSKMNG---LSRPLIKPQKRTSTSRRI-G 346
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV----------KQPRRLRFD 128
C NC +T LWRR+ G +CNACGLY K++G+ RPL ++P+ L
Sbjct: 347 LSCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVPRPLAMKKEGIQTRKRKPKTLNKA 406
Query: 129 RGNFN 133
+G+ N
Sbjct: 407 KGSGN 411
>gi|390462790|ref|XP_003732908.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-5
[Callithrix jacchus]
Length = 338
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 122 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 178
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 64/109 (58%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 122 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 173
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR G +CNACGLY K++G+ RPL
Sbjct: 174 LSSSRRTGLCCTNCHTTNTTLWRRSSEGEPVCNACGLYMKLHGVPRPLA 222
>gi|441637994|ref|XP_004090097.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-5
[Nomascus leucogenys]
Length = 369
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 150 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 206
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 150 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 201
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 202 LSSSRRAGLCCTNCRTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLA 250
>gi|117646668|emb|CAL37449.1| hypothetical protein [synthetic construct]
gi|208966350|dbj|BAG73189.1| GATA binding protein 3 [synthetic construct]
Length = 443
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 44/54 (81%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 311
Score = 103 bits (256), Expect = 3e-20, Method: Composition-based stats.
Identities = 51/102 (50%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+ GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 258 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 312
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY++++ +NRPL
Sbjct: 313 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYELHNINRPL 354
>gi|351714890|gb|EHB17809.1| Transcription factor GATA-5, partial [Heterocephalus glaber]
Length = 223
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 7 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 63
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 64/109 (58%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 7 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 58
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 59 LSSSRRAGLCCTNCHTTHTTLWRRNTEGEPVCNACGLYMKLHGVPRPLA 107
>gi|444724388|gb|ELW64993.1| Trans-acting T-cell-specific transcription factor GATA-3 [Tupaia
chinensis]
Length = 363
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/84 (61%), Positives = 55/84 (65%), Gaps = 8/84 (9%)
Query: 46 ARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNAC 105
+ L S TG+ SR TEGRECVNCGA STPLWRRDGTGHYLCNAC
Sbjct: 236 SSLLGGSPTGFGCKSR--------PKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNAC 287
Query: 106 GLYHKMNGMNRPLVKQPRRLRFDR 129
GLYHKMNG NRPL+K RRL R
Sbjct: 288 GLYHKMNGQNRPLIKPKRRLSAAR 311
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/102 (49%), Positives = 59/102 (57%), Gaps = 12/102 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIE 73
TEGRECVNCGA STPLWRRDGTGHYLCNA G R L A+
Sbjct: 258 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR------ 311
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMN 115
G C NC +T LWRR+ G +CNACGLY+K++ ++
Sbjct: 312 --RAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVS 351
>gi|355564543|gb|EHH21043.1| hypothetical protein EGK_04020 [Macaca mulatta]
Length = 480
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 51/101 (50%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIKP-KRRRSAARRA 345
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 346 GTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
Score = 100 bits (249), Expect = 2e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|54306071|ref|NP_571310.2| transcription factor GATA-5 [Danio rerio]
gi|30519592|emb|CAB43400.2| transcription factor gata5 [Danio rerio]
Length = 383
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 43/50 (86%), Positives = 48/50 (96%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
EGRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL+
Sbjct: 182 EGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLQ 231
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 69/110 (62%), Gaps = 7/110 (6%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT-- 76
EGRECVNCG+ISTPLWRRDGTGHYLCNA G ++R L P + ++ +
Sbjct: 182 EGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNG---INRPLIK--PQKRLQSTSRR 236
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
G C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 237 AGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 286
>gi|73992691|ref|XP_543086.2| PREDICTED: transcription factor GATA-5 [Canis lupus familiaris]
Length = 400
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 49/57 (85%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 184 ELPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 240
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 64/109 (58%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 184 ELPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 235
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 236 LSSSRRAGLCCTNCHTTTTTLWRRNADGEPVCNACGLYMKLHGVPRPLA 284
>gi|94573490|gb|AAI16538.1| GATA-binding protein 5 [Danio rerio]
Length = 383
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 43/50 (86%), Positives = 48/50 (96%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
EGRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL+
Sbjct: 182 EGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLQ 231
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 69/110 (62%), Gaps = 7/110 (6%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT-- 76
EGRECVNCG+ISTPLWRRDGTGHYLCNA G ++R L P + ++ +
Sbjct: 182 EGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNG---INRPLIK--PQKRLQSTSRR 236
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
G C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 237 AGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKESIQ 286
>gi|432917241|ref|XP_004079469.1| PREDICTED: transcription factor GATA-6-like [Oryzias latipes]
Length = 501
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 51/100 (51%), Positives = 61/100 (61%), Gaps = 4/100 (4%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCG+ISTPLWRRDGTGH+LCNA + G +SR L G
Sbjct: 300 ESRECVNCGSISTPLWRRDGTGHFLCNACGLYSKMNG---LSRPLIKPQKRTSTSRRI-G 355
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 356 LSCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVPRPL 395
Score = 96.7 bits (239), Expect = 2e-18, Method: Composition-based stats.
Identities = 39/53 (73%), Positives = 45/53 (84%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCG+ISTPLWRRDGTGH+LCNACGLY KMNG++RPL+K +R R
Sbjct: 300 ESRECVNCGSISTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRTSTSR 352
>gi|426240701|ref|XP_004014232.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3 [Ovis aries]
Length = 421
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 46/54 (85%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 236 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 289
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 63/103 (61%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 236 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 290
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 291 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 333
>gi|335353907|dbj|BAK39710.1| GATA binding protein 3 [Tursiops truncatus]
Length = 251
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/84 (61%), Positives = 55/84 (65%), Gaps = 8/84 (9%)
Query: 46 ARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNAC 105
+ L S TG+ SR TEGRECVNCGA STPLWRRDGTGHYLCNAC
Sbjct: 94 SSLLGGSPTGFGCKSR--------PKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNAC 145
Query: 106 GLYHKMNGMNRPLVKQPRRLRFDR 129
GLYHKMNG NRPL+K RRL R
Sbjct: 146 GLYHKMNGQNRPLIKPKRRLSAAR 169
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 63/103 (61%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 116 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 170
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 171 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 213
>gi|8648977|emb|CAB94842.1| GATA-4 zinc-finger transcription factor [Oryctolagus cuniculus]
Length = 116
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 1 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 55
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 64/107 (59%), Gaps = 12/107 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 1 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 57
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 58 -----GLSCANCQTTTTTLWRRNAEGKPVCNACGLYMKLHGVPRPLA 99
>gi|355786384|gb|EHH66567.1| hypothetical protein EGM_03585 [Macaca fascicularis]
Length = 480
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 44/59 (74%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRR+GTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 290 SEGRECVNCGATATPLWRRNGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 348
Score = 102 bits (254), Expect = 6e-20, Method: Composition-based stats.
Identities = 51/102 (50%), Positives = 63/102 (61%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRR+GTGHYLCNA G +R L P +
Sbjct: 290 SEGRECVNCGATATPLWRRNGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 344
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 345 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 386
>gi|343488529|ref|NP_999493.2| transcription factor GATA-6 [Sus scrofa]
Length = 592
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 50/105 (47%), Positives = 61/105 (58%), Gaps = 4/105 (3%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E E RECVNCG+I TPLWRRDGTGHYLCN + G + + PV
Sbjct: 378 LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNG--LSGPLIKPQKPVPSSR 435
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 436 RL--GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 478
Score = 92.0 bits (227), Expect = 8e-17, Method: Composition-based stats.
Identities = 41/72 (56%), Positives = 48/72 (66%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ----PRRLRF 127
+E E RECVNCG+I TPLWRRDGTGHYLCN CGLY KMNG++ PL+K P R
Sbjct: 378 LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNGLSGPLIKPQKPVPSSRRL 437
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 438 GLSCANCHTTTT 449
>gi|215274026|sp|Q95JA5.2|GATA6_PIG RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
Length = 451
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 50/105 (47%), Positives = 61/105 (58%), Gaps = 4/105 (3%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E E RECVNCG+I TPLWRRDGTGHYLCN + G + + PV
Sbjct: 237 LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNG--LSGPLIKPQKPVPSSR 294
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 295 RL--GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 337
Score = 92.0 bits (227), Expect = 8e-17, Method: Composition-based stats.
Identities = 41/72 (56%), Positives = 48/72 (66%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ----PRRLRF 127
+E E RECVNCG+I TPLWRRDGTGHYLCN CGLY KMNG++ PL+K P R
Sbjct: 237 LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNGLSGPLIKPQKPVPSSRRL 296
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 297 GLSCANCHTTTT 308
>gi|392900106|ref|NP_001255402.1| Protein ELT-1, isoform d [Caenorhabditis elegans]
gi|379657042|emb|CCG28208.1| Protein ELT-1, isoform d [Caenorhabditis elegans]
Length = 487
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 53/107 (49%), Positives = 69/107 (64%), Gaps = 10/107 (9%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIM---SRFLFASLPVED 71
++ TE RECVNCG +TPLWRRDG+G+YLCNA YF M +R L ++
Sbjct: 281 QFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGL------YFKMNHHARPLVKPKKRQN 334
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ T G ECVNC +T LWRR+G GH +CNACGLY K++ + RP+
Sbjct: 335 AQKRT-GIECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVERPI 380
Score = 86.3 bits (212), Expect = 4e-15, Method: Composition-based stats.
Identities = 36/52 (69%), Positives = 42/52 (80%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
++ TE RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVK +R
Sbjct: 281 QFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKR 332
>gi|15824330|gb|AAL09304.1|AF295687_1 transcription factor GATA-6 [Sus scrofa]
Length = 446
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 50/105 (47%), Positives = 61/105 (58%), Gaps = 4/105 (3%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E E RECVNCG+I TPLWRRDGTGHYLCN + G + + PV
Sbjct: 232 LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNG--LSGPLIKPQKPVPSSR 289
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 290 RL--GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 332
Score = 91.7 bits (226), Expect = 8e-17, Method: Composition-based stats.
Identities = 41/72 (56%), Positives = 48/72 (66%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ----PRRLRF 127
+E E RECVNCG+I TPLWRRDGTGHYLCN CGLY KMNG++ PL+K P R
Sbjct: 232 LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNGLSGPLIKPQKPVPSSRRL 291
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 292 GLSCANCHTTTT 303
>gi|405960917|gb|EKC26787.1| Trans-acting T-cell-specific transcription factor GATA-3
[Crassostrea gigas]
Length = 518
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/64 (73%), Positives = 48/64 (75%)
Query: 66 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
S P EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL
Sbjct: 199 SRPRTKARSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRL 258
Query: 126 RFDR 129
R
Sbjct: 259 SAAR 262
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/106 (50%), Positives = 61/106 (57%), Gaps = 6/106 (5%)
Query: 8 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASL 67
S P EGRECVNCGA STPLWRRDGTGHYLCNA G +R L
Sbjct: 199 SRPRTKARSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK-- 253
Query: 68 PVEDIEYF-TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 112
P + G C NCG +T LWRR+G G +CNACGLY+K++
Sbjct: 254 PKRRLSAARRAGTSCANCGTNTTTLWRRNGNGDPVCNACGLYYKLH 299
>gi|397489370|ref|XP_003815701.1| PREDICTED: transcription factor GATA-6 [Pan paniscus]
Length = 402
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/107 (49%), Positives = 66/107 (61%), Gaps = 6/107 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 188 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 242
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 243 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 289
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 40/58 (68%), Positives = 48/58 (82%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 188 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSR 245
>gi|432865839|ref|XP_004070639.1| PREDICTED: GATA-binding factor 5-A-like [Oryzias latipes]
Length = 217
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/63 (71%), Positives = 53/63 (84%), Gaps = 4/63 (6%)
Query: 68 PVEDIEYF----TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR 123
P D++ TEGRECVNCG++STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +
Sbjct: 3 PRNDLDLLDDMPTEGRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQK 62
Query: 124 RLR 126
RL+
Sbjct: 63 RLQ 65
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 55/116 (47%), Positives = 71/116 (61%), Gaps = 11/116 (9%)
Query: 10 PVEDIEYF----TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA 65
P D++ TEGRECVNCG++STPLWRRDGTGHYLCNA G ++R L
Sbjct: 3 PRNDLDLLDDMPTEGRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNG---INRPLIK 59
Query: 66 SLPVEDIEYFT--EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
P + ++ + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 60 --PQKRLQSTSRRAGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 113
>gi|195108873|ref|XP_001999017.1| GI24284 [Drosophila mojavensis]
gi|193915611|gb|EDW14478.1| GI24284 [Drosophila mojavensis]
Length = 378
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 53/106 (50%), Positives = 67/106 (63%), Gaps = 8/106 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFAS---LPVEDIEYF 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L L ++ ++
Sbjct: 149 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRRLTLQSLQSA 205
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 206 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLT 251
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 43/52 (82%), Positives = 45/52 (86%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 149 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQ 200
>gi|335371115|gb|AEH57086.1| GATA123 [Bugula neritina]
Length = 281
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/100 (54%), Positives = 63/100 (63%), Gaps = 4/100 (4%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDGTGHYLCNA G SR L +
Sbjct: 75 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---SSRPLIKPKRRQSATR-RA 130
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG +T LWRR+ +G +CNACGLY+K++ +NRP
Sbjct: 131 GTCCANCGTFATTLWRRNQSGDPVCNACGLYYKLHNVNRP 170
Score = 100 bits (250), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG +RPL+K RR R
Sbjct: 75 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGSSRPLIKPKRRQSATR 128
>gi|449494011|ref|XP_002194999.2| PREDICTED: transcription factor GATA-6 [Taeniopygia guttata]
Length = 390
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/119 (47%), Positives = 71/119 (59%), Gaps = 11/119 (9%)
Query: 7 ASLPV-----EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSR 61
A +PV E +E +E RECVNCG+I TPLWRRDGTG+YLCNA T G +SR
Sbjct: 163 APIPVRAPSAELLEDLSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNG---LSR 219
Query: 62 FLFASLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
L P + + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 220 PLIK--PQKRVPSSRRMGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 276
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 43/70 (61%), Positives = 53/70 (75%), Gaps = 5/70 (7%)
Query: 65 ASLPV-----EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
A +PV E +E +E RECVNCG+I TPLWRRDGTG+YLCNACGLY KMNG++RPL+
Sbjct: 163 APIPVRAPSAELLEDLSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRPLI 222
Query: 120 KQPRRLRFDR 129
K +R+ R
Sbjct: 223 KPQKRVPSSR 232
>gi|357609507|gb|EHJ66487.1| BmGATA beta isoform 2 - silkworm [Danaus plexippus]
Length = 267
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/57 (85%), Positives = 49/57 (85%), Gaps = 1/57 (1%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E FTEGRECVNCGAI TPLWRRDGTGHYLCNACGLY KMNGMNRPL K PRRL R
Sbjct: 3 EVFTEGRECVNCGAIDTPLWRRDGTGHYLCNACGLYTKMNGMNRPL-KPPRRLVRQR 58
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 67/149 (44%), Gaps = 49/149 (32%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFL----------- 63
E FTEGRECVNCGAI TPLWRRDGTGHYLCNA T G M+R L
Sbjct: 3 EVFTEGRECVNCGAIDTPLWRRDGTGHYLCNACGLYTKMNG---MNRPLKPPRRLVRQRH 59
Query: 64 -----FASLPVEDIEYFTEGRE------------------------------CVNCGAIS 88
+ V + T R C NC
Sbjct: 60 AAQAPAPAPDVRSLALTTSARPTLPLHHPATLALPAPARNPRPSMGTKRQGVCSNCETTI 119
Query: 89 TPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
T LWRR+ G +CNACGLY K++G+NRP
Sbjct: 120 TTLWRRNPLGENVCNACGLYFKLHGINRP 148
>gi|157169245|gb|ABV25957.1| GATA-binding transcription factor A1 [Capitella teleta]
Length = 372
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 65/103 (63%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
+EGRECVNCGA STPLWRRDGTGHYLCNA +G +R L P +
Sbjct: 268 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMSG---QNRPLIK--PKRRLSAARR 322
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NCG +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 323 AGTNCANCGTTTTTLWRRNHNGDPVCNACGLYYKLHNVNRPLT 365
Score = 100 bits (250), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKM+G NRPL+K RRL R
Sbjct: 268 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMSGQNRPLIKPKRRLSAAR 321
>gi|297489684|ref|XP_002697773.1| PREDICTED: transcription factor GATA-6, partial [Bos taurus]
gi|296473832|tpg|DAA15947.1| TPA: GATA-6 [Bos taurus]
Length = 344
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/120 (47%), Positives = 70/120 (58%), Gaps = 12/120 (10%)
Query: 7 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMS 60
A LPV + +E E RECVNCG+I TPLWRRDGTGHYLCNA + G +S
Sbjct: 117 APLPVPRGPGTDLLEDLPESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LS 173
Query: 61 RFLFASLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R L P + + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 174 RPLIK--PQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 231
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/109 (49%), Positives = 64/109 (58%), Gaps = 18/109 (16%)
Query: 27 GAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV------EDIEYFTEGRE 80
GA TP W A F T + + SR A LPV + +E E RE
Sbjct: 91 GAPLTPAWP----------AGPFETPVL-HSLQSRA-GAPLPVPRGPGTDLLEDLPESRE 138
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
CVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 139 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSR 187
>gi|124001863|gb|ABM87879.1| GATA4 [Papio hamadryas]
Length = 97
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 46/55 (83%), Positives = 50/55 (90%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 59
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 53/101 (52%), Positives = 60/101 (59%), Gaps = 12/101 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDI 72
F+EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--- 61
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG 113
G C NC +T LWRR+ G +CNACGLY K++G
Sbjct: 62 -----GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHG 97
>gi|342671958|dbj|BAK57317.1| transcription factor GATA6 [Ovis aries]
Length = 217
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 53/107 (49%), Positives = 65/107 (60%), Gaps = 6/107 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 3 LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 57
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 58 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 104
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 43/74 (58%), Positives = 52/74 (70%), Gaps = 4/74 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 3 LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 62
Query: 128 DRGNFNCYERSTLF 141
NC+ +T
Sbjct: 63 GLSCANCHTTTTTL 76
>gi|50540446|ref|NP_001002689.1| GATA-binding protein 2b [Danio rerio]
gi|49900268|gb|AAH76462.1| GATA-binding protein 2b [Danio rerio]
gi|182890930|gb|AAI65802.1| Gata2b protein [Danio rerio]
Length = 427
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 44/57 (77%), Positives = 46/57 (80%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL++ RRL R C
Sbjct: 261 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKRRLSASRRAGTC 317
Score = 101 bits (252), Expect = 8e-20, Method: Composition-based stats.
Identities = 53/103 (51%), Positives = 60/103 (58%), Gaps = 12/103 (11%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFASLPVEDIEYF 75
GRECVNCGA STPLWRRDGTGHYLCNA G R L AS
Sbjct: 261 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKRRLSASR-------- 312
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 313 RAGTCCANCQTGTTTLWRRNANGEPVCNACGLYYKLHNVNRPL 355
>gi|313224235|emb|CBY20024.1| unnamed protein product [Oikopleura dioica]
Length = 265
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 65/103 (63%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
+EGRECVNCGA STPLWRRDGTGHYLCNA G+ +R L P + +
Sbjct: 37 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGH---NRPLIK--PKKRLSAARR 91
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 92 AGTSCSNCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVNRPLT 134
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K +RL R
Sbjct: 37 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRPLIKPKKRLSAAR 90
>gi|443686031|gb|ELT89440.1| hypothetical protein CAPTEDRAFT_149492, partial [Capitella teleta]
Length = 347
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 55/120 (45%), Positives = 69/120 (57%), Gaps = 6/120 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 215 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 269
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNCYE 136
G C NCG +T LWRR+ G +CNACGLY+K++ +NRPL + ++ + Y
Sbjct: 270 AGTNCANCGTTTTTLWRRNHNGDPVCNACGLYYKLHNVNRPLTMKKDGIQTPNPKMSTYS 329
>gi|410933019|ref|XP_003979890.1| PREDICTED: GATA-binding factor 5-A-like, partial [Takifugu
rubripes]
Length = 226
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 52/57 (91%), Gaps = 2/57 (3%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
+ED+ TEGRECVNCG++STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL
Sbjct: 172 MEDV--CTEGRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRL 226
Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/36 (80%), Positives = 33/36 (91%), Gaps = 2/36 (5%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+ED+ TEGRECVNCG++STPLWRRDGTGHYLCNA
Sbjct: 172 MEDV--CTEGRECVNCGSVSTPLWRRDGTGHYLCNA 205
>gi|86575041|ref|NP_001033435.1| Protein ELT-1, isoform a [Caenorhabditis elegans]
gi|74834771|emb|CAJ30234.1| Protein ELT-1, isoform a [Caenorhabditis elegans]
Length = 488
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/107 (48%), Positives = 66/107 (61%), Gaps = 9/107 (8%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIM---SRFLFASLPVED 71
++ TE RECVNCG +TPLWRRDG+G+YLCNA YF M +R L +
Sbjct: 281 QFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGL------YFKMNHHARPLVKPKKRQQ 334
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G ECVNC +T LWRR+G GH +CNACGLY K++ + RP+
Sbjct: 335 NAQKRTGIECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVERPI 381
Score = 86.3 bits (212), Expect = 4e-15, Method: Composition-based stats.
Identities = 36/52 (69%), Positives = 42/52 (80%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
++ TE RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVK +R
Sbjct: 281 QFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKR 332
>gi|355701861|gb|EHH29214.1| GATA-binding factor 6 [Macaca mulatta]
Length = 398
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 67/109 (61%), Gaps = 6/109 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P + +
Sbjct: 184 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 238
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY K++G+ RPL +
Sbjct: 239 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMK 287
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 40/58 (68%), Positives = 48/58 (82%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 184 LEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSR 241
>gi|392900108|ref|NP_001255403.1| Protein ELT-1, isoform c [Caenorhabditis elegans]
gi|379657041|emb|CCG28207.1| Protein ELT-1, isoform c [Caenorhabditis elegans]
Length = 487
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/107 (48%), Positives = 66/107 (61%), Gaps = 9/107 (8%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIM---SRFLFASLPVED 71
++ TE RECVNCG +TPLWRRDG+G+YLCNA YF M +R L +
Sbjct: 280 QFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGL------YFKMNHHARPLVKPKKRQQ 333
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G ECVNC +T LWRR+G GH +CNACGLY K++ + RP+
Sbjct: 334 NAQKRTGIECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVERPI 380
Score = 86.3 bits (212), Expect = 4e-15, Method: Composition-based stats.
Identities = 36/52 (69%), Positives = 42/52 (80%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
++ TE RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVK +R
Sbjct: 280 QFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKR 331
>gi|432094055|gb|ELK25847.1| Transcription factor GATA-5, partial [Myotis davidii]
Length = 226
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 50/57 (87%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 9 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSTR 65
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 75/139 (53%), Gaps = 22/139 (15%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 9 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 60
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV----------K 120
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL +
Sbjct: 61 LSSTRRAGLCCTNCHTTTTTLWRRNADGEPVCNACGLYTKLHGVPRPLAMKKESIQTRKR 120
Query: 121 QPRRLRFDRGNFNCYERST 139
+P+ + +G C +T
Sbjct: 121 KPKNVAKTKGPSGCPGNTT 139
>gi|313240837|emb|CBY33127.1| unnamed protein product [Oikopleura dioica]
Length = 249
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 65/103 (63%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
+EGRECVNCGA STPLWRRDGTGHYLCNA G+ +R L P + +
Sbjct: 21 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGH---NRPLIK--PKKRLSAARR 75
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 76 AGTSCSNCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVNRPLT 118
Score = 100 bits (248), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K +RL R
Sbjct: 21 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRPLIKPKKRLSAAR 74
>gi|297263167|ref|XP_001097903.2| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Macaca mulatta]
Length = 479
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 44/57 (77%), Positives = 46/57 (80%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
GRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 291 GRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 347
Score = 101 bits (252), Expect = 9e-20, Method: Composition-based stats.
Identities = 51/100 (51%), Positives = 61/100 (61%), Gaps = 6/100 (6%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE-G 78
GRECVNCGA +TPLWRRDGTGHYLCNA G +R L P + G
Sbjct: 291 GRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARRAG 345
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 346 TCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPL 385
>gi|395511636|ref|XP_003760062.1| PREDICTED: transcription factor GATA-6 [Sarcophilus harrisii]
Length = 461
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 67/110 (60%), Gaps = 6/110 (5%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVE 70
E +E ++ RECVNCG+I TPLWRRDGTGHYLCNA + G +SR L P +
Sbjct: 244 AELLEDLSDSRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQK 298
Query: 71 DIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 299 RVPSTRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 348
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 43/77 (55%), Positives = 54/77 (70%), Gaps = 4/77 (5%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL--- 125
E +E ++ RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+
Sbjct: 244 AELLEDLSDSRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPST 303
Query: 126 -RFDRGNFNCYERSTLF 141
R NC+ +T
Sbjct: 304 RRLGLSCANCHTTTTTL 320
>gi|449276666|gb|EMC85098.1| GATA-binding factor 2, partial [Columba livia]
Length = 233
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/71 (66%), Positives = 50/71 (70%)
Query: 64 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR 123
F P +EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K R
Sbjct: 31 FTPKPRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKR 90
Query: 124 RLRFDRGNFNC 134
RL R C
Sbjct: 91 RLSAARRAGTC 101
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/120 (44%), Positives = 67/120 (55%), Gaps = 12/120 (10%)
Query: 6 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA 65
F P +EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L
Sbjct: 31 FTPKPRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPL-- 85
Query: 66 SLPVEDIEYFTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
++ + R C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 86 ---IKPKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMK 142
>gi|45382241|ref|NP_990751.1| transcription factor GATA-6 [Gallus gallus]
gi|1169849|sp|P43693.1|GATA6_CHICK RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
gi|511484|gb|AAA57505.1| GATA-6 transcription factor [Gallus gallus]
Length = 387
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 67/110 (60%), Gaps = 6/110 (5%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVE 70
E +E +E RECVNCG+I TPLWRRDGTG+YLCNA T G +SR L P +
Sbjct: 169 AELLEDLSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNG---LSRPLIK--PQK 223
Query: 71 DIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 224 RVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 273
Score = 95.1 bits (235), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 40/61 (65%), Positives = 49/61 (80%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
E +E +E RECVNCG+I TPLWRRDGTG+YLCNACGLY KMNG++RPL+K +R+
Sbjct: 169 AELLEDLSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRPLIKPQKRVPSS 228
Query: 129 R 129
R
Sbjct: 229 R 229
>gi|440907455|gb|ELR57603.1| Transcription factor GATA-5 [Bos grunniens mutus]
Length = 403
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 44/57 (77%), Positives = 49/57 (85%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKM G+NRPLV+ +RL R
Sbjct: 186 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMKGVNRPLVRPQKRLSSSR 242
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 186 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMKG---VNRPL-----VRPQKR 237
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 238 LSSSRRAGLCCTNCHTTTTTLWRRNVDGEPVCNACGLYMKLHGVPRPLA 286
>gi|285157785|gb|ADC35037.1| GATA123a [Chaetopterus sp. MB-2010a]
Length = 489
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 45/53 (84%), Positives = 46/53 (86%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 264 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGANRPLIKPKRRLSAAR 316
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 54/104 (51%), Positives = 64/104 (61%), Gaps = 6/104 (5%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-TE 77
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 264 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---ANRPLIK--PKRRLSAARRA 318
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NCG +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 319 GTSCANCGTSTTTLWRRNPNGDPVCNACGLYYKLHNVNRPLTMK 362
>gi|391332619|ref|XP_003740730.1| PREDICTED: GATA-binding factor 5-A-like [Metaseiulus occidentalis]
Length = 158
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 55/116 (47%), Positives = 66/116 (56%), Gaps = 6/116 (5%)
Query: 4 FLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFL 63
F +A + +E Y EGRECVNCGAISTPLWRRDGTGHYLCNA G +
Sbjct: 2 FQYAGIDME--MYAGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYNKMNG----AHRP 55
Query: 64 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
P G C NC +T LWRR+ G +CNACGLY +++G+ RPL
Sbjct: 56 IIKTPRRLSASRRAGLTCSNCQTTNTSLWRRNNVGEPVCNACGLYFRLHGVPRPLT 111
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/68 (70%), Positives = 55/68 (80%), Gaps = 2/68 (2%)
Query: 62 FLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
F +A + +E Y EGRECVNCGAISTPLWRRDGTGHYLCNACGLY+KMNG +RP++K
Sbjct: 2 FQYAGIDME--MYAGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYNKMNGAHRPIIKT 59
Query: 122 PRRLRFDR 129
PRRL R
Sbjct: 60 PRRLSASR 67
>gi|449278190|gb|EMC86134.1| Transcription factor GATA-6, partial [Columba livia]
Length = 396
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/109 (49%), Positives = 67/109 (61%), Gaps = 6/109 (5%)
Query: 12 EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVED 71
E +E +E RECVNCG+I TPLWRRDGTG+YLCNA T G +SR L P +
Sbjct: 179 ELLEDLSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNG---LSRPLIK--PQKR 233
Query: 72 IEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 234 VPSSRRVGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 282
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 40/60 (66%), Positives = 49/60 (81%)
Query: 70 EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E +E +E RECVNCG+I TPLWRRDGTG+YLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 179 ELLEDLSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRPLIKPQKRVPSSR 238
>gi|402589688|gb|EJW83619.1| GATA zinc finger family protein, partial [Wuchereria bancrofti]
Length = 276
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 42/51 (82%), Positives = 46/51 (90%)
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
+ +EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG NRPLVK +R
Sbjct: 225 FTSEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLVKPKKR 275
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 26/31 (83%), Positives = 29/31 (93%)
Query: 16 YFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+ +EGRECVNCGA+ TPLWRRDGTGHYLCNA
Sbjct: 225 FTSEGRECVNCGAVQTPLWRRDGTGHYLCNA 255
>gi|241998888|ref|XP_002434087.1| endothelial transcription factor GATA-2, putative [Ixodes
scapularis]
gi|215495846|gb|EEC05487.1| endothelial transcription factor GATA-2, putative [Ixodes
scapularis]
Length = 385
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/65 (72%), Positives = 49/65 (75%)
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
A+ P EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RR
Sbjct: 164 AAKPRTKGRSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRR 223
Query: 125 LRFDR 129
L R
Sbjct: 224 LSAAR 228
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/114 (48%), Positives = 64/114 (56%), Gaps = 6/114 (5%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFAS 66
A+ P EGRECVNCGA STPLWRRDGTGHYLCNA G +R L
Sbjct: 164 AAKPRTKGRSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK- 219
Query: 67 LPVEDIEYF-TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
P + G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 220 -PKRRLSAARRAGTSCANCKTTTTTLWRRNQNGEPVCNACGLYFKLHNVNRPLT 272
>gi|170589177|ref|XP_001899350.1| GATA zinc finger family protein [Brugia malayi]
gi|158593563|gb|EDP32158.1| GATA zinc finger family protein [Brugia malayi]
Length = 278
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 42/51 (82%), Positives = 46/51 (90%)
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
+ +EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG NRPLVK +R
Sbjct: 225 FTSEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLVKPKKR 275
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 26/31 (83%), Positives = 29/31 (93%)
Query: 16 YFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+ +EGRECVNCGA+ TPLWRRDGTGHYLCNA
Sbjct: 225 FTSEGRECVNCGAVQTPLWRRDGTGHYLCNA 255
>gi|47208918|emb|CAF91187.1| unnamed protein product [Tetraodon nigroviridis]
Length = 334
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/54 (83%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 239 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 292
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 49/103 (47%), Positives = 59/103 (57%), Gaps = 12/103 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIE 73
+EGRECVNCGA STPLWRRDGTGHYLCNA G R L A+
Sbjct: 239 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR------ 292
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNR 116
G C NC +T LWRR+ G +CNACGLY K++ +++
Sbjct: 293 --RAGTSCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNVSK 333
>gi|395733176|ref|XP_002813217.2| PREDICTED: endothelial transcription factor GATA-2 [Pongo abelii]
Length = 406
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 216 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 274
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/106 (48%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L ++ +
Sbjct: 216 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPL-----IKPKRRLSA 267
Query: 78 GRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 268 ARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLT 313
>gi|301780742|ref|XP_002925788.1| PREDICTED: transcription factor GATA-5-like, partial [Ailuropoda
melanoleuca]
gi|281352342|gb|EFB27926.1| hypothetical protein PANDA_015338 [Ailuropoda melanoleuca]
Length = 292
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/57 (78%), Positives = 49/57 (85%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +RL R
Sbjct: 76 ELPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSR 132
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/109 (49%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L V +
Sbjct: 76 ELPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPL-----VRPQKR 127
Query: 75 FTEGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 128 LSSSRRAGLCCSNCRTTTTTLWRRNADGDPVCNACGLYTKLHGVPRPLA 176
>gi|165292343|dbj|BAF98877.1| GATA binding protein 3 isoform 4 [Carassius auratus langsdorfii]
Length = 277
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/55 (81%), Positives = 46/55 (83%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 101 LIEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 155
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 53/106 (50%), Positives = 63/106 (59%), Gaps = 6/106 (5%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF- 75
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 101 LIEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAAR 155
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 156 RAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLTMK 201
>gi|402887127|ref|XP_003906956.1| PREDICTED: endothelial transcription factor GATA-2 isoform 4 [Papio
anubis]
Length = 568
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 378 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 436
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/106 (48%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L ++ +
Sbjct: 378 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPL-----IKPKRRLSA 429
Query: 78 GRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 430 ARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLT 475
>gi|291461544|dbj|BAI83406.1| GATA-C [Parasteatoda tepidariorum]
Length = 528
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/53 (84%), Positives = 46/53 (86%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 313 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 365
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/104 (50%), Positives = 63/104 (60%), Gaps = 6/104 (5%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-TE 77
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 313 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARRA 367
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 368 GTSCANCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVNRPLTMK 411
>gi|360042733|emb|CCD78143.1| putative gata binding factor [Schistosoma mansoni]
Length = 565
Score = 103 bits (256), Expect = 3e-20, Method: Composition-based stats.
Identities = 48/72 (66%), Positives = 54/72 (75%), Gaps = 5/72 (6%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR--GNF- 132
T+GRECVNCGA TPLWRRD GHYLCNACGLYHKMNG NRPL+K RR+ +R G F
Sbjct: 284 TDGRECVNCGATQTPLWRRDEAGHYLCNACGLYHKMNGTNRPLIKPKRRISANRKLGTFC 343
Query: 133 -NC-YERSTLFR 142
NC +TL+R
Sbjct: 344 ANCRTSHTTLWR 355
Score = 98.2 bits (243), Expect = 1e-18, Method: Composition-based stats.
Identities = 51/102 (50%), Positives = 60/102 (58%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
T+GRECVNCGA TPLWRRD GHYLCNA G +R L P I +
Sbjct: 284 TDGRECVNCGATQTPLWRRDEAGHYLCNACGLYHKMNG---TNRPLIK--PKRRISANRK 338
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 339 LGTFCANCRTSHTTLWRRNQQGDSVCNACGLYYKLHHINRPL 380
>gi|431896291|gb|ELK05707.1| Transcription factor GATA-6 [Pteropus alecto]
Length = 291
Score = 102 bits (255), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/120 (48%), Positives = 71/120 (59%), Gaps = 12/120 (10%)
Query: 7 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMS 60
A LPV + +E TE RECVNCG+I TPLWRRDGTGHYLCNA + G +S
Sbjct: 63 APLPVPRGPSADLLEDMTESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LS 119
Query: 61 RFLFASLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R L P + + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 120 RPLIK--PQKRVPSSRRLGLSCANCRTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 177
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/71 (63%), Positives = 53/71 (74%), Gaps = 6/71 (8%)
Query: 65 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
A LPV + +E TE RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL
Sbjct: 63 APLPVPRGPSADLLEDMTESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPL 122
Query: 119 VKQPRRLRFDR 129
+K +R+ R
Sbjct: 123 IKPQKRVPSSR 133
>gi|18858735|ref|NP_571311.1| transcription factor GATA-4 [Danio rerio]
gi|6524988|gb|AAF15275.1|AF191577_1 zinc finger transcription factor Gata4 [Danio rerio]
Length = 338
Score = 102 bits (255), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/52 (86%), Positives = 47/52 (90%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
GRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLVK RRL R
Sbjct: 164 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLVKPQRRLSASR 215
Score = 100 bits (248), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/104 (51%), Positives = 61/104 (58%), Gaps = 12/104 (11%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIEYF 75
GRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 164 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLVKPQRRLSASRRV------ 217
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 218 --GLSCTNCQPTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 259
>gi|334350414|ref|XP_001371963.2| PREDICTED: histone deacetylase 6-like [Monodelphis domestica]
Length = 1117
Score = 102 bits (255), Expect = 3e-20, Method: Composition-based stats.
Identities = 41/54 (75%), Positives = 46/54 (85%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRG 130
+ RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL RG
Sbjct: 29 KARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLGQKRG 82
>gi|327271947|ref|XP_003220748.1| PREDICTED: GATA-binding factor 5-A-like [Anolis carolinensis]
Length = 399
Score = 102 bits (255), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 43/57 (75%), Positives = 49/57 (85%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ E RECVNCGA+STPLWR+DGTGHYLCNACGLYHKMNG+NRPL+K +RL R
Sbjct: 183 EFPGESRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSSR 239
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/109 (45%), Positives = 65/109 (59%), Gaps = 12/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ E RECVNCGA+STPLWR+DGTGHYLCNA G ++R L ++ +
Sbjct: 183 EFPGESRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNG---INRPL-----IKPQKR 234
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 235 LSSSRRAGLCCTNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 283
>gi|317419025|emb|CBN81063.1| GATA-binding factor 2 [Dicentrarchus labrax]
Length = 345
Score = 102 bits (255), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 46/59 (77%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 154 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 212
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/106 (50%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA STPLWRRDGTGHYLCNA G +R L ++ +
Sbjct: 154 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPL-----IKPKRRLSA 205
Query: 78 GRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NC +T LWRR+G G +CNACGLY K++ +NRPL
Sbjct: 206 ARRAGTCCANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVNRPLT 251
>gi|301349225|gb|ADK74261.1| Gata 4-like protein [Salmo salar]
Length = 181
Score = 102 bits (255), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 46/61 (75%), Positives = 52/61 (85%)
Query: 69 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
+E + F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHK+NG+NRPL+K RRL
Sbjct: 56 LELYDDFAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKVNGINRPLIKPQRRLSAS 115
Query: 129 R 129
R
Sbjct: 116 R 116
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 66/113 (58%), Gaps = 12/113 (10%)
Query: 11 VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG----YFIMSRFLFAS 66
+E + F EGRECVNCGA+STPLWRRDGTGHYLCNA G R L AS
Sbjct: 56 LELYDDFAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKVNGINRPLIKPQRRLSAS 115
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
V G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 116 RRV--------GLLCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 160
>gi|256073972|ref|XP_002573301.1| endothelial transcription factor GATA-2 [Schistosoma mansoni]
Length = 565
Score = 102 bits (255), Expect = 4e-20, Method: Composition-based stats.
Identities = 48/72 (66%), Positives = 54/72 (75%), Gaps = 5/72 (6%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR--GNF- 132
T+GRECVNCGA TPLWRRD GHYLCNACGLYHKMNG NRPL+K RR+ +R G F
Sbjct: 284 TDGRECVNCGATQTPLWRRDEAGHYLCNACGLYHKMNGTNRPLIKPKRRISANRKLGTFC 343
Query: 133 -NC-YERSTLFR 142
NC +TL+R
Sbjct: 344 ANCRTSHTTLWR 355
Score = 97.8 bits (242), Expect = 1e-18, Method: Composition-based stats.
Identities = 51/102 (50%), Positives = 60/102 (58%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
T+GRECVNCGA TPLWRRD GHYLCNA G +R L P I +
Sbjct: 284 TDGRECVNCGATQTPLWRRDEAGHYLCNACGLYHKMNG---TNRPLIK--PKRRISANRK 338
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 339 LGTFCANCRTSHTTLWRRNQQGDSVCNACGLYYKLHHINRPL 380
>gi|170060991|ref|XP_001866045.1| GATA-binding factor-C [Culex quinquefasciatus]
gi|167879282|gb|EDS42665.1| GATA-binding factor-C [Culex quinquefasciatus]
Length = 474
Score = 102 bits (255), Expect = 4e-20, Method: Composition-based stats.
Identities = 52/104 (50%), Positives = 63/104 (60%), Gaps = 7/104 (6%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF--ASLPVEDIEYFT 76
EGRECVNCGA STPLWRRD TGHYLCNA G +R L PV +
Sbjct: 252 EGRECVNCGATSTPLWRRDATGHYLCNACGLYYKMNG---QNRPLIKPKRKPVTSQQSAA 308
Query: 77 E--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ +G +CNACGLY+K++ ++RPL
Sbjct: 309 RRAGTSCANCKTTTTSLWRRNQSGEPVCNACGLYYKLHNVDRPL 352
Score = 98.2 bits (243), Expect = 9e-19, Method: Composition-based stats.
Identities = 40/48 (83%), Positives = 43/48 (89%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
EGRECVNCGA STPLWRRD TGHYLCNACGLY+KMNG NRPL+K R+
Sbjct: 252 EGRECVNCGATSTPLWRRDATGHYLCNACGLYYKMNGQNRPLIKPKRK 299
>gi|312084748|ref|XP_003144401.1| GATA zinc finger family protein [Loa loa]
Length = 278
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 42/51 (82%), Positives = 46/51 (90%)
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
+ +EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG NRPLVK +R
Sbjct: 225 FTSEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLVKPKKR 275
Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 26/31 (83%), Positives = 29/31 (93%)
Query: 16 YFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+ +EGRECVNCGA+ TPLWRRDGTGHYLCNA
Sbjct: 225 FTSEGRECVNCGAVQTPLWRRDGTGHYLCNA 255
>gi|70569766|dbj|BAE06473.1| transcription factor protein [Ciona intestinalis]
Length = 485
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K +RL R
Sbjct: 258 TEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKKRLSAAR 311
Score = 91.7 bits (226), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 62/145 (42%), Positives = 77/145 (53%), Gaps = 26/145 (17%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P + +
Sbjct: 258 TEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKKRLSAARR 312
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP-------------- 122
G C NC +T LWRR+ +G +CNACGLY K++G+NRPL +
Sbjct: 313 AGTSCSNCSTTTTTLWRRNASGDPVCNACGLYFKLHGVNRPLTMKKEGIQTRNRKISTKL 372
Query: 123 ------RRLRFDRGNFNCYERSTLF 141
R RFD NF ++ S+ F
Sbjct: 373 KKSSVCRDPRFDATNFKFFDGSSGF 397
>gi|148236237|ref|NP_001081452.1| GATA-binding factor 6-A [Xenopus laevis]
gi|49899014|gb|AAH76718.1| LOC397845 protein [Xenopus laevis]
Length = 502
Score = 102 bits (255), Expect = 4e-20, Method: Composition-based stats.
Identities = 51/110 (46%), Positives = 67/110 (60%), Gaps = 6/110 (5%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV 69
P + ++ E RECVNCG++ TPLWRRDGTGH+LCNA + G +SR L P
Sbjct: 280 PGDVLDELPESRECVNCGSVQTPLWRRDGTGHFLCNACGLYSKMNG---LSRPLIK--PQ 334
Query: 70 EDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 335 KRVPSSRRIGLACANCHTSTTTLWRRNTEGEPVCNACGLYMKLHGVPRPL 384
Score = 96.7 bits (239), Expect = 3e-18, Method: Composition-based stats.
Identities = 41/76 (53%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 68 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL-- 125
P + ++ E RECVNCG++ TPLWRRDGTGH+LCNACGLY KMNG++RPL+K +R+
Sbjct: 280 PGDVLDELPESRECVNCGSVQTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRVPS 339
Query: 126 --RFDRGNFNCYERST 139
R NC+ +T
Sbjct: 340 SRRIGLACANCHTSTT 355
>gi|3183008|sp|P70005.1|GAT6B_XENLA RecName: Full=GATA-binding factor 6-B; AltName: Full=Transcription
factor xGATA-6b
gi|1621281|emb|CAA70088.1| GATA-6 protein [Xenopus laevis]
Length = 391
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 52/111 (46%), Positives = 67/111 (60%), Gaps = 6/111 (5%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV 69
P + ++ E RECVNCG++ TPLWRRDGTGHYLCNA + G +SR L P
Sbjct: 169 PGDVLDELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQ 223
Query: 70 EDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 224 KRVPSSRRIGLACANCHTTTTTLWRRNTEGEPVCNACGLYMKLHGVPRPLA 274
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 42/78 (53%), Positives = 54/78 (69%), Gaps = 4/78 (5%)
Query: 68 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL-- 125
P + ++ E RECVNCG++ TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+
Sbjct: 169 PGDVLDELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPS 228
Query: 126 --RFDRGNFNCYERSTLF 141
R NC+ +T
Sbjct: 229 SRRIGLACANCHTTTTTL 246
>gi|431894591|gb|ELK04391.1| Transcription factor GATA-5 [Pteropus alecto]
Length = 243
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 57/118 (48%), Positives = 70/118 (59%), Gaps = 11/118 (9%)
Query: 8 SLPVEDI-----EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRF 62
+LPV + E+ EGRECVNCGA+STPLWRRD TGHYLCNA G ++R
Sbjct: 15 ALPVPAVSDVLGEFPGEGRECVNCGALSTPLWRRDTTGHYLCNACGLYHKMNG---VNRP 71
Query: 63 LFASLPVEDIEYF-TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
L P + + G C NC +T LWRRD G +CNACGLY K++G+ RPL
Sbjct: 72 LL--RPQKRLSSSRRAGLCCTNCHTATTTLWRRDAEGQPVCNACGLYAKLHGVPRPLA 127
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 49/85 (57%), Positives = 59/85 (69%), Gaps = 9/85 (10%)
Query: 66 SLPVEDI-----EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+LPV + E+ EGRECVNCGA+STPLWRRD TGHYLCNACGLYHKMNG+NRPL++
Sbjct: 15 ALPVPAVSDVLGEFPGEGRECVNCGALSTPLWRRDTTGHYLCNACGLYHKMNGVNRPLLR 74
Query: 121 QPRRLRFDRGN----FNCYERSTLF 141
+RL R NC+ +T
Sbjct: 75 PQKRLSSSRRAGLCCTNCHTATTTL 99
>gi|198415760|ref|XP_002122567.1| PREDICTED: zinc finger protein [Ciona intestinalis]
gi|93003144|tpd|FAA00155.1| TPA: zinc finger protein [Ciona intestinalis]
Length = 518
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K +RL R
Sbjct: 291 TEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKKRLSAAR 344
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/145 (42%), Positives = 77/145 (53%), Gaps = 26/145 (17%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P + +
Sbjct: 291 TEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKKRLSAARR 345
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP-------------- 122
G C NC +T LWRR+ +G +CNACGLY K++G+NRPL +
Sbjct: 346 AGTSCSNCSTTTTTLWRRNASGDPVCNACGLYFKLHGVNRPLTMKKEGIQTRNRKISTKL 405
Query: 123 ------RRLRFDRGNFNCYERSTLF 141
R RFD NF ++ S+ F
Sbjct: 406 KKSSVCRDPRFDATNFKFFDGSSGF 430
>gi|120960|sp|P23770.1|GATA2_XENLA RecName: Full=GATA-binding factor 2; Short=GATA-2; AltName:
Full=Transcription factor xGATA-2
gi|214171|gb|AAA49723.1| GATA binding factor-2 [Xenopus laevis]
Length = 452
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 262 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 320
Score = 98.6 bits (244), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 51/106 (48%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L ++ +
Sbjct: 262 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPL-----IKPKRRLSA 313
Query: 78 GRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 ARRAGTCCANCQTSTTTLWRRNANGDPVCNACGLYYKLHNVNRPLT 359
>gi|395516754|ref|XP_003762552.1| PREDICTED: endothelial transcription factor GATA-2 [Sarcophilus
harrisii]
Length = 402
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 212 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 270
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/106 (48%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L ++ +
Sbjct: 212 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPL-----IKPKRRLSA 263
Query: 78 GRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 264 ARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLT 309
>gi|348521216|ref|XP_003448122.1| PREDICTED: transcription factor GATA-3-like isoform 2 [Oreochromis
niloticus]
Length = 429
Score = 102 bits (255), Expect = 4e-20, Method: Composition-based stats.
Identities = 41/45 (91%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 256 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 300
>gi|332016552|gb|EGI57433.1| GATA-binding factor A [Acromyrmex echinatior]
Length = 704
Score = 102 bits (254), Expect = 5e-20, Method: Composition-based stats.
Identities = 48/104 (46%), Positives = 62/104 (59%), Gaps = 5/104 (4%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI---EY 74
TE +ECVNC A TPLWRRDGTGHYLCNA G + + + P + +
Sbjct: 477 TEPKECVNCAASMTPLWRRDGTGHYLCNACGLYNKMNG--VNRPPMRCTKPKQSVAPTNV 534
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 535 RRAGVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVNRPL 578
>gi|157115508|ref|XP_001658239.1| gata-binding factor-c (transcription factor gata-c) [Aedes aegypti]
gi|108876905|gb|EAT41130.1| AAEL007205-PA [Aedes aegypti]
Length = 509
Score = 102 bits (254), Expect = 5e-20, Method: Composition-based stats.
Identities = 52/104 (50%), Positives = 63/104 (60%), Gaps = 7/104 (6%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF--ASLPVEDIEYFT 76
EGRECVNCGA STPLWRRD TGHYLCNA G +R L PV +
Sbjct: 242 EGRECVNCGATSTPLWRRDATGHYLCNACGLYYKMNG---QNRPLIKPKRKPVTSQQSAA 298
Query: 77 E--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ +G +CNACGLY+K++ ++RPL
Sbjct: 299 RRAGTSCANCKTTTTSLWRRNQSGEPVCNACGLYYKLHNVDRPL 342
Score = 97.8 bits (242), Expect = 1e-18, Method: Composition-based stats.
Identities = 40/48 (83%), Positives = 43/48 (89%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
EGRECVNCGA STPLWRRD TGHYLCNACGLY+KMNG NRPL+K R+
Sbjct: 242 EGRECVNCGATSTPLWRRDATGHYLCNACGLYYKMNGQNRPLIKPKRK 289
>gi|390465072|ref|XP_002750080.2| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Callithrix jacchus]
Length = 567
Score = 102 bits (254), Expect = 5e-20, Method: Composition-based stats.
Identities = 42/45 (93%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
TEGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 396 TEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 440
>gi|432103480|gb|ELK30584.1| Endothelial transcription factor GATA-2 [Myotis davidii]
Length = 355
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 165 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 223
Score = 99.0 bits (245), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 65/108 (60%), Gaps = 12/108 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L ++ +
Sbjct: 165 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPL-----IKPKRRLSA 216
Query: 78 GRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
R C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 217 ARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMK 264
>gi|410918526|ref|XP_003972736.1| PREDICTED: transcription factor GATA-3-like isoform 3 [Takifugu
rubripes]
Length = 430
Score = 102 bits (254), Expect = 5e-20, Method: Composition-based stats.
Identities = 41/45 (91%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 301
>gi|321467981|gb|EFX78968.1| hypothetical protein DAPPUDRAFT_52915 [Daphnia pulex]
Length = 108
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 53/112 (47%), Positives = 66/112 (58%), Gaps = 9/112 (8%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE- 77
+GRECVNCG+ISTPLWRRDGTGHYLCNA Y M+ L + T
Sbjct: 1 DGRECVNCGSISTPLWRRDGTGHYLCNACGL------YHKMNNGTQRPLIKQTRRLSTTR 54
Query: 78 --GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRF 127
G C NC +T LWRR+ G +CNACGLY K++G+NRPL + ++
Sbjct: 55 RLGLRCANCATTTTSLWRRNNQGETVCNACGLYFKLHGVNRPLAMKKDNIQV 106
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 47/54 (87%), Gaps = 1/54 (1%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKM-NGMNRPLVKQPRRLRFDR 129
+GRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKM NG RPL+KQ RRL R
Sbjct: 1 DGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNNGTQRPLIKQTRRLSTTR 54
>gi|344276401|ref|XP_003409997.1| PREDICTED: endothelial transcription factor GATA-2-like [Loxodonta
africana]
Length = 404
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 45/60 (75%), Positives = 49/60 (81%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
++EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 213 WSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 272
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/109 (46%), Positives = 66/109 (60%), Gaps = 12/109 (11%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
++EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L ++ +
Sbjct: 213 WSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPL-----IKPKRRLS 264
Query: 77 EGRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
R C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 265 AARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMK 313
>gi|417079353|gb|AFX60071.1| GATA binding protein 2, partial [Mesocricetus auratus]
Length = 309
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 247 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 305
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/29 (89%), Positives = 28/29 (96%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+EGRECVNCGA +TPLWRRDGTGHYLCNA
Sbjct: 247 SEGRECVNCGATATPLWRRDGTGHYLCNA 275
>gi|70569760|dbj|BAE06472.1| transcription factor protein [Ciona intestinalis]
Length = 553
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 47/54 (87%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K +RL R
Sbjct: 326 TEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKKRLSAAR 379
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/145 (42%), Positives = 77/145 (53%), Gaps = 26/145 (17%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P + +
Sbjct: 326 TEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKKRLSAARR 380
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP-------------- 122
G C NC +T LWRR+ +G +CNACGLY K++G+NRPL +
Sbjct: 381 AGTSCSNCSTTTTTLWRRNASGDPVCNACGLYFKLHGVNRPLTMKKEGIQTRNRKISTKL 440
Query: 123 ------RRLRFDRGNFNCYERSTLF 141
R RFD NF ++ S+ F
Sbjct: 441 KKSSVCRDPRFDATNFKFFDGSSGF 465
>gi|62205319|gb|AAH93138.1| Gata5 protein, partial [Danio rerio]
Length = 285
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 43/50 (86%), Positives = 48/50 (96%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
EGRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K +RL+
Sbjct: 182 EGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLQ 231
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 69/108 (63%), Gaps = 7/108 (6%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT-- 76
EGRECVNCG+ISTPLWRRDGTGHYLCNA G ++R L P + ++ +
Sbjct: 182 EGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNG---INRPLIK--PQKRLQSTSRR 236
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
G C NC +T LWRR+ G +CNACGLY K++G+ RPL + ++
Sbjct: 237 AGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKKK 284
>gi|405957946|gb|EKC24122.1| GATA-binding factor A [Crassostrea gigas]
Length = 358
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 74/136 (54%), Gaps = 28/136 (20%)
Query: 11 VEDIEYFT-EGRECVNCGAISTPLWRRDGTGHYLCNA-RLFSTSFTGY------------ 56
+D +YF E RECVNCGAISTPLWRRDGTGHYLCNA L G
Sbjct: 5 ADDQDYFNNECRECVNCGAISTPLWRRDGTGHYLCNACGLLHKMNPGQSRTPVKQPNREE 64
Query: 57 FIMSRFLFASLPVEDIE--YFTE------------GRECVNCGAISTPLWRRDGTGHYLC 102
+S A P++ I+ F + G C NC +T LWRR+G G +C
Sbjct: 65 SPISEEEKAPPPIKRIDDAKFDKISGINNNNRSRMGLSCANCNTSTTTLWRRNGEGEPVC 124
Query: 103 NACGLYHKMNGMNRPL 118
NACGLY+K++ +NRPL
Sbjct: 125 NACGLYYKLHQVNRPL 140
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 44/58 (75%), Positives = 47/58 (81%), Gaps = 2/58 (3%)
Query: 69 VEDIEYFT-EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN-GMNRPLVKQPRR 124
+D +YF E RECVNCGAISTPLWRRDGTGHYLCNACGL HKMN G +R VKQP R
Sbjct: 5 ADDQDYFNNECRECVNCGAISTPLWRRDGTGHYLCNACGLLHKMNPGQSRTPVKQPNR 62
>gi|18858739|ref|NP_571632.1| transcription factor GATA-6 [Danio rerio]
gi|6524990|gb|AAF15276.1|AF191578_1 zinc finger transcription factor Gata6 [Danio rerio]
Length = 383
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/107 (48%), Positives = 66/107 (61%), Gaps = 6/107 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E E RECVNCG++STPLWRRDGTGH+LCNA + G +SR L P + +
Sbjct: 175 LEEMVESRECVNCGSMSTPLWRRDGTGHFLCNACGLYSKMNG---LSRPLIK--PQKRMS 229
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 230 SSRRIGLSCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVPRPLA 276
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 39/58 (67%), Positives = 48/58 (82%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+E E RECVNCG++STPLWRRDGTGH+LCNACGLY KMNG++RPL+K +R+ R
Sbjct: 175 LEEMVESRECVNCGSMSTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRMSSSR 232
>gi|122054007|gb|ABM66009.1| GATA4 [Ateles geoffroyi]
Length = 55
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 45/51 (88%), Positives = 49/51 (96%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
F+EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRL 55
Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 28/30 (93%), Positives = 30/30 (100%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
F+EGRECVNCGA+STPLWRRDGTGHYLCNA
Sbjct: 5 FSEGRECVNCGAMSTPLWRRDGTGHYLCNA 34
>gi|345310085|ref|XP_003428923.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3-like [Ornithorhynchus
anatinus]
Length = 440
Score = 102 bits (253), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 45/53 (84%), Positives = 46/53 (86%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 256 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 308
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/104 (50%), Positives = 63/104 (60%), Gaps = 6/104 (5%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-TE 77
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 256 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARRA 310
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 311 GTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLTMK 354
>gi|308478267|ref|XP_003101345.1| CRE-ELT-1 protein [Caenorhabditis remanei]
gi|308263246|gb|EFP07199.1| CRE-ELT-1 protein [Caenorhabditis remanei]
Length = 412
Score = 102 bits (253), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 52/115 (45%), Positives = 70/115 (60%), Gaps = 9/115 (7%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIM---SRFLFASLPVED 71
++ TE RECVNCG +TPLWRRDG+G+YLCNA YF M +R L +
Sbjct: 208 QFGTEDRECVNCGVHATPLWRRDGSGNYLCNACGL------YFKMNHHARPLVKPKKRQQ 261
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
G ECVNC +T LWRR+G GH +CNACGLY+K++ + RP+ + ++
Sbjct: 262 NAQKRTGIECVNCHTNTTTLWRRNGEGHPVCNACGLYYKLHKVERPMAMKKEGIQ 316
>gi|345314700|ref|XP_001514983.2| PREDICTED: transcription factor GATA-4-like, partial
[Ornithorhynchus anatinus]
Length = 315
Score = 102 bits (253), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 44/52 (84%), Positives = 47/52 (90%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
GRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 22 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 73
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/104 (51%), Positives = 61/104 (58%), Gaps = 12/104 (11%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIEYF 75
GRECVNCGA+STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 22 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV------ 75
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 76 --GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 117
>gi|326931899|ref|XP_003212061.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-5-like
[Meleagris gallopavo]
Length = 386
Score = 102 bits (253), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 47/66 (71%), Positives = 55/66 (83%), Gaps = 2/66 (3%)
Query: 65 ASLPVEDIEYFT-EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR 123
A+LP + +E F +GRECVNCGA+STPLWR+DGTGHYLCNACGLYHKMNG+NRPL Q +
Sbjct: 164 AALPGQYLEEFPGDGRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNGINRPLKPQ-K 222
Query: 124 RLRFDR 129
RL R
Sbjct: 223 RLSSSR 228
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 70/118 (59%), Gaps = 14/118 (11%)
Query: 7 ASLPVEDIEYFT-EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFA 65
A+LP + +E F +GRECVNCGA+STPLWR+DGTGHYLCNA G
Sbjct: 164 AALPGQYLEEFPGDGRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNGI--------- 214
Query: 66 SLPVEDIEYFTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ P++ + + R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 215 NRPLKPQKRLSSSRRAGLCCTNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 272
>gi|440913274|gb|ELR62746.1| Endothelial transcription factor GATA-2, partial [Bos grunniens
mutus]
Length = 385
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 222 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 280
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 51/106 (48%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L ++ +
Sbjct: 222 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPL-----IKPKRRLSA 273
Query: 78 GRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 274 ARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLT 319
>gi|327266039|ref|XP_003217814.1| PREDICTED: LOW QUALITY PROTEIN: endothelial transcription factor
GATA-2-like [Anolis carolinensis]
Length = 387
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 45/59 (76%), Positives = 48/59 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 197 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 255
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/106 (48%), Positives = 64/106 (60%), Gaps = 12/106 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L ++ +
Sbjct: 197 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPL-----IKPKRRLSA 248
Query: 78 GRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 249 ARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLT 294
>gi|326927942|ref|XP_003210146.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Meleagris
gallopavo]
Length = 452
Score = 102 bits (253), Expect = 7e-20, Method: Composition-based stats.
Identities = 42/57 (73%), Positives = 45/57 (78%)
Query: 64 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
F P +EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 264 FTPKPRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 320
>gi|260782140|ref|XP_002586149.1| hypothetical protein BRAFLDRAFT_255109 [Branchiostoma floridae]
gi|229271242|gb|EEN42160.1| hypothetical protein BRAFLDRAFT_255109 [Branchiostoma floridae]
Length = 107
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 45/54 (83%), Positives = 48/54 (88%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 1 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLIKPQRRLSASR 54
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 61/105 (58%), Gaps = 12/105 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIE 73
+EGRECVNCGA STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 1 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLIKPQRRLSASRRV---- 56
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC T LWRR+ G +CNACGLY+K++ ++ L
Sbjct: 57 ----GLQCANCRTTQTTLWRRNNEGEPVCNACGLYYKLHNVSTTL 97
>gi|387538429|gb|AFJ79491.1| GATA binding protein 4/5/6 isoform 2a, partial [Branchiostoma
lanceolatum]
Length = 184
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 48/54 (88%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYH+MNG+NRPL+K RRL R
Sbjct: 106 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHRMNGVNRPLIKPQRRLSASR 159
Score = 73.6 bits (179), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 46/85 (54%), Gaps = 12/85 (14%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIE 73
+EGRECVNCGA STPLWRRDGTGHYLCNA G R L AS V
Sbjct: 106 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHRMNGVNRPLIKPQRRLSASRRV---- 161
Query: 74 YFTEGRECVNCGAISTPLWRRDGTG 98
G +C NC T LWRR+ G
Sbjct: 162 ----GLQCANCRTTQTTLWRRNNEG 182
>gi|335371117|gb|AEH57087.1| GATA456 [Bugula neritina]
Length = 512
Score = 102 bits (253), Expect = 7e-20, Method: Composition-based stats.
Identities = 50/108 (46%), Positives = 60/108 (55%), Gaps = 6/108 (5%)
Query: 16 YFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF 75
YF E RECVNCGA+ TPLWRRD GHYLCNA G + A P +
Sbjct: 246 YFGENRECVNCGAMHTPLWRRDTDGHYLCNACGLYQKMNGLNRPLQVKPAPPPTKKQTNS 305
Query: 76 TEGRE------CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
T G C NC +T LWRR+ G +CNACGLY+K++ +NRP
Sbjct: 306 TAGLSRRTGLVCANCNTGTTTLWRRNNDGEPVCNACGLYYKLHNVNRP 353
Score = 95.1 bits (235), Expect = 8e-18, Method: Composition-based stats.
Identities = 38/49 (77%), Positives = 41/49 (83%)
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP 122
YF E RECVNCGA+ TPLWRRD GHYLCNACGLY KMNG+NRPL +P
Sbjct: 246 YFGENRECVNCGAMHTPLWRRDTDGHYLCNACGLYQKMNGLNRPLQVKP 294
>gi|301764533|ref|XP_002917682.1| PREDICTED: endothelial transcription factor GATA-2-like isoform 2
[Ailuropoda melanoleuca]
Length = 466
Score = 101 bits (252), Expect = 8e-20, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|355784333|gb|EHH65184.1| GATA-binding factor 5, partial [Macaca fascicularis]
Length = 283
Score = 101 bits (252), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 44/60 (73%), Positives = 49/60 (81%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNF 132
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +R R
Sbjct: 68 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRSSSRRAGL 127
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 54/105 (51%), Positives = 65/105 (61%), Gaps = 5/105 (4%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L P +
Sbjct: 68 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPLV--RPQKRSSS 122
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 123 RRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLA 167
>gi|410951842|ref|XP_003982602.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Felis
catus]
Length = 466
Score = 101 bits (252), Expect = 8e-20, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|348576623|ref|XP_003474086.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Cavia porcellus]
Length = 418
Score = 101 bits (252), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 67/113 (59%), Gaps = 6/113 (5%)
Query: 8 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASL 67
++ E +E EGREC NCG+ TPLWRRDGTGHYLCNA + G +SR L
Sbjct: 198 AVAAEVLEDPHEGRECANCGSFQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK-- 252
Query: 68 PVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
P + + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 253 PQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 305
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 44/89 (49%), Positives = 56/89 (62%), Gaps = 4/89 (4%)
Query: 57 FIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNR 116
+ S ++ E +E EGREC NCG+ TPLWRRDGTGHYLCNACGLY KMNG++R
Sbjct: 189 VVASPTSHPAVAAEVLEDPHEGRECANCGSFQTPLWRRDGTGHYLCNACGLYSKMNGLSR 248
Query: 117 PLVKQPRRL----RFDRGNFNCYERSTLF 141
PL+K +R+ R NC+ +T
Sbjct: 249 PLIKPQKRVPSSRRLGLSCANCHTTTTTL 277
>gi|165292337|dbj|BAF98874.1| GATA binding protein 3 isoform 1 [Carassius auratus langsdorfii]
Length = 178
Score = 101 bits (252), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 44/50 (88%), Positives = 46/50 (92%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL
Sbjct: 121 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRL 170
Score = 65.1 bits (157), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 27/29 (93%), Positives = 28/29 (96%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 121 SEGRECVNCGATSTPLWRRDGTGHYLCNA 149
>gi|165292345|dbj|BAF98878.1| GATA binding protein 3 isoform 5 [Carassius auratus langsdorfii]
Length = 159
Score = 101 bits (252), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 44/51 (86%), Positives = 45/51 (88%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL
Sbjct: 101 LIEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRL 151
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/30 (90%), Positives = 27/30 (90%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 101 LIEGRECVNCGATSTPLWRRDGTGHYLCNA 130
>gi|348507928|ref|XP_003441507.1| PREDICTED: GATA-binding factor 2-like isoform 3 [Oreochromis
niloticus]
Length = 441
Score = 101 bits (252), Expect = 9e-20, Method: Composition-based stats.
Identities = 41/45 (91%), Positives = 42/45 (93%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
TEGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+K
Sbjct: 271 TEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIK 315
>gi|345324774|ref|XP_001506274.2| PREDICTED: GATA-binding factor 2-like [Ornithorhynchus anatinus]
Length = 250
Score = 101 bits (252), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 45/61 (73%), Positives = 48/61 (78%)
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFN 133
+ EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 58 FHLEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGT 117
Query: 134 C 134
C
Sbjct: 118 C 118
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/107 (48%), Positives = 64/107 (59%), Gaps = 6/107 (5%)
Query: 16 YFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF 75
+ EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L P +
Sbjct: 58 FHLEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAA 112
Query: 76 -TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 113 RRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMK 159
>gi|307198089|gb|EFN79142.1| GATA-binding factor A [Harpegnathos saltator]
Length = 728
Score = 101 bits (252), Expect = 9e-20, Method: Composition-based stats.
Identities = 49/104 (47%), Positives = 61/104 (58%), Gaps = 5/104 (4%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE---Y 74
TE +EC NC A TPLWRRDGTGHYLCNA + G+ + P + I
Sbjct: 500 TEPKECANCAASLTPLWRRDGTGHYLCNACGLYSKMNGH--NRPPMRCPKPKQTITPTGV 557
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ +G +CNACGLY K++ MNRPL
Sbjct: 558 RRTGVQCANCKTSNTTLWRRNNSGEPVCNACGLYFKLHNMNRPL 601
Score = 88.2 bits (217), Expect = 9e-16, Method: Composition-based stats.
Identities = 35/48 (72%), Positives = 39/48 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR 123
TE +EC NC A TPLWRRDGTGHYLCNACGLY KMNG NRP ++ P+
Sbjct: 500 TEPKECANCAASLTPLWRRDGTGHYLCNACGLYSKMNGHNRPPMRCPK 547
>gi|341894024|gb|EGT49959.1| hypothetical protein CAEBREN_00985 [Caenorhabditis brenneri]
Length = 375
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 65/105 (61%), Gaps = 9/105 (8%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIM---SRFLFASLPVEDIEY 74
TE RECVNCG +TPLWRRDG+G+YLCNA YF M +R L +
Sbjct: 207 TEDRECVNCGVHNTPLWRRDGSGNYLCNACGL------YFKMNHHARPLVKPKKRQQNAQ 260
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G ECVNC +T LWRR+G GH +CNACGLY+K++ + RP+
Sbjct: 261 KRTGIECVNCHTNNTTLWRRNGEGHPVCNACGLYYKLHKVERPMA 305
Score = 82.0 bits (201), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 36/51 (70%), Positives = 41/51 (80%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
TE RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVK +R +
Sbjct: 207 TEDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKRQQ 257
>gi|410963189|ref|XP_003988148.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Felis catus]
Length = 492
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/84 (60%), Positives = 55/84 (65%), Gaps = 9/84 (10%)
Query: 46 ARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNAC 105
+ L S TG+ SR S + GRECVNCGA STPLWRRDGTGHYLCNAC
Sbjct: 286 SSLLGGSPTGFGCKSRPKARS---------STGRECVNCGATSTPLWRRDGTGHYLCNAC 336
Query: 106 GLYHKMNGMNRPLVKQPRRLRFDR 129
GLYHKMNG NRPL+K RRL R
Sbjct: 337 GLYHKMNGQNRPLIKPKRRLSAAR 360
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 52/106 (49%), Positives = 61/106 (57%), Gaps = 12/106 (11%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIE 73
+ GRECVNCGA STPLWRRDGTGHYLCNA G R L A+
Sbjct: 307 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR------ 360
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 361 --RAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 404
>gi|402887125|ref|XP_003906955.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3 [Papio
anubis]
Length = 466
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|334342417|ref|XP_003341811.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Monodelphis
domestica]
Length = 459
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 283 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 327
>gi|109098098|ref|XP_001097707.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Macaca mulatta]
Length = 466
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|296474637|tpg|DAA16752.1| TPA: GATA binding protein 2 [Bos taurus]
Length = 466
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|47229585|emb|CAG06781.1| unnamed protein product [Tetraodon nigroviridis]
Length = 456
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 44/59 (74%), Positives = 45/59 (76%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
TEGRECVNCGA STPLWRRD TGHYLC CGLYHKMNG NRPL+K RRL R C
Sbjct: 271 TEGRECVNCGATSTPLWRRDSTGHYLCKRCGLYHKMNGQNRPLIKPKRRLSAARRAGTC 329
Score = 97.8 bits (242), Expect = 1e-18, Method: Composition-based stats.
Identities = 51/102 (50%), Positives = 59/102 (57%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRD TGHYLC G +R L P +
Sbjct: 271 TEGRECVNCGATSTPLWRRDSTGHYLCKRCGLYHKMNG---QNRPLIK--PKRRLSAARR 325
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 326 AGTCCANCQTTTTTLWRRNANGDPVCNACGLYFKLHNVNRPL 367
>gi|341880193|gb|EGT36128.1| CBN-ELT-1 protein [Caenorhabditis brenneri]
Length = 393
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 65/105 (61%), Gaps = 9/105 (8%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIM---SRFLFASLPVEDIEY 74
TE RECVNCG +TPLWRRDG+G+YLCNA YF M +R L +
Sbjct: 207 TEDRECVNCGVHNTPLWRRDGSGNYLCNACGL------YFKMNHHARPLVKPKKRQQNAQ 260
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G ECVNC +T LWRR+G GH +CNACGLY+K++ + RP+
Sbjct: 261 KRTGIECVNCHTNNTTLWRRNGEGHPVCNACGLYYKLHKVERPMA 305
Score = 82.0 bits (201), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 36/51 (70%), Positives = 41/51 (80%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
TE RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVK +R +
Sbjct: 207 TEDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKRQQ 257
>gi|426341994|ref|XP_004036302.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Gorilla gorilla gorilla]
Length = 466
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|338714471|ref|XP_003363086.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Equus
caballus]
Length = 466
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|395847145|ref|XP_003796244.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Otolemur garnettii]
Length = 466
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|355562973|gb|EHH19535.1| GATA-binding factor 5, partial [Macaca mulatta]
Length = 237
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/60 (73%), Positives = 49/60 (81%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNF 132
E+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+ +R R
Sbjct: 22 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRSSSRRAGL 81
Score = 91.7 bits (226), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 54/105 (51%), Positives = 65/105 (61%), Gaps = 5/105 (4%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L P +
Sbjct: 22 EFPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPLV--RPQKRSSS 76
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 77 RRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLA 121
>gi|353228552|emb|CCD74723.1| putative gata binding factor [Schistosoma mansoni]
Length = 735
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 50/101 (49%), Positives = 59/101 (58%), Gaps = 3/101 (2%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDG G+YLCNA G +R L
Sbjct: 197 TEGRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNG---QNRPLIKPKRRLQSSSRRT 253
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC I+T LWRR+ G +CNACGLY K++ + RP+
Sbjct: 254 GTICSNCRTITTTLWRRNTNGEPVCNACGLYFKLHNIQRPI 294
Score = 100 bits (250), Expect = 1e-19, Method: Composition-based stats.
Identities = 42/51 (82%), Positives = 45/51 (88%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
TEGRECVNCGA STPLWRRDG G+YLCNACGLY KMNG NRPL+K RRL+
Sbjct: 197 TEGRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNGQNRPLIKPKRRLQ 247
>gi|291393344|ref|XP_002713201.1| PREDICTED: GATA binding protein 2 isoform 1 [Oryctolagus cuniculus]
Length = 466
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|224611701|ref|NP_001139134.1| endothelial transcription factor GATA-2 isoform 2 [Homo sapiens]
gi|114589045|ref|XP_001138192.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Pan
troglodytes]
gi|332261775|ref|XP_003279942.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Nomascus leucogenys]
gi|15990428|gb|AAH15577.1| GATA2 protein [Homo sapiens]
gi|17512029|gb|AAH18988.1| GATA2 protein [Homo sapiens]
gi|30410878|gb|AAH51272.1| GATA2 protein [Homo sapiens]
gi|119599720|gb|EAW79314.1| GATA binding protein 2, isoform CRA_b [Homo sapiens]
Length = 466
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|74192130|dbj|BAE34273.1| unnamed protein product [Mus musculus]
Length = 365
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 257 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 310
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 56/99 (56%), Gaps = 12/99 (12%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIE 73
+ GRECVNCGA STPLWRRDGTGHYLCNA G R L A+
Sbjct: 257 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR------ 310
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 112
G C NC +T LWRR+ G +CNACGLY+K++
Sbjct: 311 --RAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLH 347
>gi|256083783|ref|XP_002578117.1| transcription factor GATA-1 [Schistosoma mansoni]
Length = 735
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 50/101 (49%), Positives = 59/101 (58%), Gaps = 3/101 (2%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDG G+YLCNA G +R L
Sbjct: 197 TEGRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNG---QNRPLIKPKRRLQSSSRRT 253
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC I+T LWRR+ G +CNACGLY K++ + RP+
Sbjct: 254 GTICSNCRTITTTLWRRNTNGEPVCNACGLYFKLHNIQRPI 294
Score = 100 bits (250), Expect = 1e-19, Method: Composition-based stats.
Identities = 42/51 (82%), Positives = 45/51 (88%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
TEGRECVNCGA STPLWRRDG G+YLCNACGLY KMNG NRPL+K RRL+
Sbjct: 197 TEGRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNGQNRPLIKPKRRLQ 247
>gi|358334312|dbj|GAA52740.1| transcription factor GATA-3 [Clonorchis sinensis]
Length = 1004
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 42/51 (82%), Positives = 45/51 (88%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
TEGRECVNCGA STPLWRRDG G+YLCNACGLY KMNG NRPL+K RRL+
Sbjct: 610 TEGRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNGQNRPLIKPKRRLQ 660
Score = 100 bits (250), Expect = 2e-19, Method: Composition-based stats.
Identities = 49/101 (48%), Positives = 59/101 (58%), Gaps = 3/101 (2%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA STPLWRRDG G+YLCNA G +R L
Sbjct: 610 TEGRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNG---QNRPLIKPKRRLQSSSRRT 666
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ++T LWRR+ G +CNACGLY K++ + RP+
Sbjct: 667 GTICSNCRTVTTTLWRRNTNGEPVCNACGLYFKLHNIQRPI 707
>gi|354482835|ref|XP_003503601.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Cricetulus griseus]
Length = 466
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 334
>gi|403265451|ref|XP_003924952.1| PREDICTED: transcription factor GATA-6-like [Saimiri boliviensis
boliviensis]
Length = 452
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 51/106 (48%), Positives = 63/106 (59%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E E RECVNCG+I TPLWRR G GHYLCNA + G +SR L P + +
Sbjct: 238 LEDLPESRECVNCGSIQTPLWRRGGPGHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 292
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 293 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 338
Score = 93.2 bits (230), Expect = 3e-17, Method: Composition-based stats.
Identities = 41/72 (56%), Positives = 50/72 (69%), Gaps = 4/72 (5%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E E RECVNCG+I TPLWRR G GHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 238 LEDLPESRECVNCGSIQTPLWRRGGPGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 297
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 298 GLSCANCHTTTT 309
>gi|323652552|gb|ADX98527.1| GATA-binding protein 3 [Coturnix japonica]
Length = 281
Score = 100 bits (250), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 147 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 200
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 52/103 (50%), Positives = 62/103 (60%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
+ GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 147 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 201
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 202 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 244
>gi|301607041|ref|XP_002933112.1| PREDICTED: GATA-binding factor 2-like isoform 3 [Xenopus (Silurana)
tropicalis]
Length = 437
Score = 100 bits (250), Expect = 1e-19, Method: Composition-based stats.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 261 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 305
>gi|410919501|ref|XP_003973223.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Takifugu rubripes]
Length = 442
Score = 100 bits (250), Expect = 1e-19, Method: Composition-based stats.
Identities = 41/45 (91%), Positives = 42/45 (93%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
TEGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+K
Sbjct: 271 TEGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIK 315
>gi|344269962|ref|XP_003406816.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Loxodonta africana]
Length = 390
Score = 100 bits (250), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/109 (46%), Positives = 62/109 (56%), Gaps = 7/109 (6%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E E RECVNCG+I TPLWRRDGTGHYLCNA Y M+ P +
Sbjct: 176 LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNA------CGXYSKMNGLSGPDQPPKRAP 229
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY K++G+ RPL +
Sbjct: 230 ASRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMK 278
Score = 86.3 bits (212), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/88 (48%), Positives = 51/88 (57%), Gaps = 7/88 (7%)
Query: 57 FIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM-- 114
M R A L +E E RECVNCG+I TPLWRRDGTGHYLCNACG Y KMNG+
Sbjct: 165 LPMPRGPSADL----LEDLPESRECVNCGSIQTPLWRRDGTGHYLCNACGXYSKMNGLSG 220
Query: 115 -NRPLVKQPRRLRFDRGNFNCYERSTLF 141
++P + P R NC+ +T
Sbjct: 221 PDQPPKRAPASRRLGLSCANCHTTTTTL 248
>gi|242020471|ref|XP_002430677.1| endothelial transcription factor GATA-2, putative [Pediculus
humanus corporis]
gi|212515857|gb|EEB17939.1| endothelial transcription factor GATA-2, putative [Pediculus
humanus corporis]
Length = 399
Score = 100 bits (250), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/102 (51%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-TE 77
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 296 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIK--PKRRLSAARRA 350
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 351 GTSCANCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVNRPLT 392
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 44/53 (83%), Positives = 46/53 (86%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL R
Sbjct: 296 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLSAAR 348
>gi|270015717|gb|EFA12165.1| hypothetical protein TcasGA2_TC002315 [Tribolium castaneum]
Length = 409
Score = 100 bits (250), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/112 (47%), Positives = 63/112 (56%), Gaps = 5/112 (4%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV 69
P EGRECVNCGA STPLWRRDGTGHYLCNA G +R L
Sbjct: 173 PRNKSRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIKPKRR 229
Query: 70 EDIEYFT--EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
++ G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 230 LSLQSAARRAGTSCANCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVNRPLT 281
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 44/61 (72%), Positives = 46/61 (75%)
Query: 68 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRF 127
P EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 173 PRNKSRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLSL 232
Query: 128 D 128
Sbjct: 233 Q 233
>gi|327269885|ref|XP_003219723.1| PREDICTED: transcription factor GATA-6-like [Anolis carolinensis]
Length = 504
Score = 100 bits (250), Expect = 2e-19, Method: Composition-based stats.
Identities = 50/106 (47%), Positives = 64/106 (60%), Gaps = 6/106 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
++ E RECVNCG+ TPLWR+DGTG+YLCNA T G +SR L P + +
Sbjct: 287 LDELQESRECVNCGSFQTPLWRKDGTGNYLCNACGLYTKMNG---LSRPLIK--PQKRVP 341
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 342 SQRRLGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 387
Score = 90.5 bits (223), Expect = 2e-16, Method: Composition-based stats.
Identities = 36/58 (62%), Positives = 46/58 (79%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
++ E RECVNCG+ TPLWR+DGTG+YLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 287 LDELQESRECVNCGSFQTPLWRKDGTGNYLCNACGLYTKMNGLSRPLIKPQKRVPSQR 344
>gi|86575043|ref|NP_001033436.1| Protein ELT-1, isoform b [Caenorhabditis elegans]
gi|119299|sp|P28515.1|ELT1_CAEEL RecName: Full=Transcription factor elt-1
gi|6702|emb|CAA40967.1| elt-1 [Caenorhabditis elegans]
gi|74834772|emb|CAJ30235.1| Protein ELT-1, isoform b [Caenorhabditis elegans]
Length = 416
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 72/122 (59%), Gaps = 9/122 (7%)
Query: 1 MSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIM- 59
+S+ +S + ++ TE RECVNCG +TPLWRRDG+G+YLCNA YF M
Sbjct: 195 ISKANRSSGGANNSQFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGL------YFKMN 248
Query: 60 --SRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
+R L + G ECVNC +T LWRR+G GH +CNACGLY K++ + RP
Sbjct: 249 HHARPLVKPKKRQQNAQKRTGIECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVERP 308
Query: 118 LV 119
+
Sbjct: 309 IT 310
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 38/68 (55%), Positives = 49/68 (72%)
Query: 59 MSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+S+ +S + ++ TE RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPL
Sbjct: 195 ISKANRSSGGANNSQFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPL 254
Query: 119 VKQPRRLR 126
VK +R +
Sbjct: 255 VKPKKRQQ 262
>gi|340726756|ref|XP_003401719.1| PREDICTED: endothelial transcription factor GATA-2-like [Bombus
terrestris]
Length = 362
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 311 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 359
Score = 65.9 bits (159), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/28 (96%), Positives = 27/28 (96%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 311 EGRECVNCGATSTPLWRRDGTGHYLCNA 338
>gi|2494686|sp|Q91678.1|GAT6A_XENLA RecName: Full=GATA-binding factor 6-A; AltName: Full=Transcription
factor xGATA-6a
gi|1209880|gb|AAB05648.1| transcription factor xGATA-6 [Xenopus laevis]
Length = 391
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/111 (45%), Positives = 67/111 (60%), Gaps = 6/111 (5%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV 69
P + ++ E RECVNCG++ TPLWRRDGTGH+LCNA + G +SR L P
Sbjct: 169 PGDVLDELPESRECVNCGSVQTPLWRRDGTGHFLCNACGLYSKMNG---LSRPLIK--PQ 223
Query: 70 EDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ + G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 224 KRVPSSRRIGLACANCHTSTTTLWRRNTEGEPVCNACGLYMKLHGVPRPLA 274
Score = 95.5 bits (236), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 41/78 (52%), Positives = 54/78 (69%), Gaps = 4/78 (5%)
Query: 68 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL-- 125
P + ++ E RECVNCG++ TPLWRRDGTGH+LCNACGLY KMNG++RPL+K +R+
Sbjct: 169 PGDVLDELPESRECVNCGSVQTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRVPS 228
Query: 126 --RFDRGNFNCYERSTLF 141
R NC+ +T
Sbjct: 229 SRRIGLACANCHTSTTTL 246
>gi|328784695|ref|XP_003250485.1| PREDICTED: endothelial transcription factor GATA-2-like [Apis
mellifera]
Length = 311
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 308
Score = 65.9 bits (159), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/28 (96%), Positives = 27/28 (96%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNA 287
>gi|47206170|emb|CAF89866.1| unnamed protein product [Tetraodon nigroviridis]
Length = 306
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/65 (73%), Positives = 53/65 (81%), Gaps = 1/65 (1%)
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
ASL + D + TEGRECVNCG++STPLWRRDGTGHYLCNACGLYHKMNG NRPLV+ RR
Sbjct: 183 ASLDLMD-DGSTEGRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGSNRPLVRPQRR 241
Query: 125 LRFDR 129
R
Sbjct: 242 PAPSR 246
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 68/116 (58%), Gaps = 6/116 (5%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFAS 66
ASL + D + TEGRECVNCG++STPLWRRDGTGHYLCNA G S
Sbjct: 183 ASLDLMD-DGSTEGRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNG----SNRPLVR 237
Query: 67 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM-NRPLVKQ 121
G C NCG +T LWRR+ G +CNACGLY K++G+ +RPL +
Sbjct: 238 PQRRPAPSRRAGLCCTNCGTSTTTLWRRNAEGEPVCNACGLYMKLHGVGHRPLAMK 293
>gi|443718380|gb|ELU09032.1| hypothetical protein CAPTEDRAFT_121659, partial [Capitella teleta]
Length = 144
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/103 (50%), Positives = 67/103 (65%), Gaps = 6/103 (5%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI-EYF 75
F EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R + P + +
Sbjct: 1 FCEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNG---LNRPMMK--PQKRMCASR 55
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY K++G++RP+
Sbjct: 56 RMGLQCANCQTSTTTLWRRNADGEPVCNACGLYFKLHGVHRPM 98
Score = 99.0 bits (245), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 42/51 (82%), Positives = 48/51 (94%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRP++K +R+
Sbjct: 1 FCEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGLNRPMMKPQKRM 51
>gi|380020456|ref|XP_003694099.1| PREDICTED: trans-acting T-cell-specific transcription factor
GATA-3-like [Apis florea]
Length = 310
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 259 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 307
Score = 65.9 bits (159), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/28 (96%), Positives = 27/28 (96%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 259 EGRECVNCGATSTPLWRRDGTGHYLCNA 286
>gi|47223008|emb|CAG07095.1| unnamed protein product [Tetraodon nigroviridis]
Length = 225
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 66/112 (58%), Gaps = 5/112 (4%)
Query: 11 VEDI-EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV 69
V DI + E RECVNCG+ISTPLWRRDGTGH+LCNA + G +SR L
Sbjct: 71 VSDILDDMGESRECVNCGSISTPLWRRDGTGHFLCNACGLYSKMNG---LSRPLIKPQKR 127
Query: 70 EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY K++G+ RPL +
Sbjct: 128 TSTSRRI-GLSCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVPRPLAMK 178
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 42/62 (67%), Positives = 49/62 (79%), Gaps = 1/62 (1%)
Query: 69 VEDI-EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRF 127
V DI + E RECVNCG+ISTPLWRRDGTGH+LCNACGLY KMNG++RPL+K +R
Sbjct: 71 VSDILDDMGESRECVNCGSISTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRTST 130
Query: 128 DR 129
R
Sbjct: 131 SR 132
>gi|332029518|gb|EGI69407.1| Endothelial transcription factor GATA-2 [Acromyrmex echinatior]
Length = 303
Score = 100 bits (248), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 252 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 300
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/28 (96%), Positives = 27/28 (96%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 252 EGRECVNCGATSTPLWRRDGTGHYLCNA 279
>gi|444707682|gb|ELW48920.1| Transcription factor GATA-5 [Tupaia chinensis]
Length = 242
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 43/57 (75%), Positives = 49/57 (85%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHK+NG+NRPL++ +RL R
Sbjct: 19 ESPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKVNGVNRPLLRPQKRLPSSR 75
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/113 (48%), Positives = 67/113 (59%), Gaps = 6/113 (5%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E EGRECVNCGA+STPLWRRDGTGHYLCNA G ++R L P + +
Sbjct: 19 ESPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKVNG---VNRPLL--RPQKRLPS 73
Query: 75 F-TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
G C NC T LWRR G +CNACGLY K++G+ RPL + R++
Sbjct: 74 SRRAGLCCTNCHTSCTTLWRRSADGEPVCNACGLYMKLHGVPRPLAMKKERIQ 126
>gi|323319537|gb|ADX36139.1| GATA123a [Schmidtea polychroa]
Length = 101
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/54 (83%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 3 TEGRECVNCGATQTPLWRRDGTGHYLCNACGLYHKMNGTNRPLIKPKRRLSSAR 56
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 52/102 (50%), Positives = 64/102 (62%), Gaps = 6/102 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TEGRECVNCGA TPLWRRDGTGHYLCNA G +R L P + +
Sbjct: 3 TEGRECVNCGATQTPLWRRDGTGHYLCNACGLYHKMNG---TNRPLIK--PKRRLSSARK 57
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C+NCG T LWRR+ G +CNACGLY+K++ ++RP+
Sbjct: 58 VGTTCINCGTNHTTLWRRNQQGDSVCNACGLYYKLHHISRPI 99
>gi|357619685|gb|EHJ72160.1| grain [Danaus plexippus]
Length = 304
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 43/50 (86%), Positives = 45/50 (90%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 198 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 247
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/29 (93%), Positives = 27/29 (93%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 198 AEGRECVNCGATSTPLWRRDGTGHYLCNA 226
>gi|194475182|gb|ACF74542.1| GATA binding protein 3 [Cervus elaphus]
Length = 191
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/82 (62%), Positives = 54/82 (65%), Gaps = 9/82 (10%)
Query: 48 LFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGL 107
L S TG+ SR S + GRECVNCGA STPLWRRDGTGHYLCNACGL
Sbjct: 48 LLGGSPTGFGCKSRPKARS---------STGRECVNCGATSTPLWRRDGTGHYLCNACGL 98
Query: 108 YHKMNGMNRPLVKQPRRLRFDR 129
YHKMNG NRPL+K RRL R
Sbjct: 99 YHKMNGQNRPLIKPKRRLSAAR 120
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 63/105 (60%), Gaps = 6/105 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
+ GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 67 STGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARR 121
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 122 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLTMK 166
>gi|183000|gb|AAA35870.1| Gata3 enhancer-binding protein [Homo sapiens]
Length = 444
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 45/54 (83%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCGA STPLWRRDGTGHYLC CGLYHKMNG NRPL+K RRL R
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCRRCGLYHKMNGQNRPLIKPKRRLSAAR 312
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/103 (50%), Positives = 61/103 (59%), Gaps = 6/103 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-T 76
TEGRECVNCGA STPLWRRDGTGHYLC G +R L P +
Sbjct: 259 TEGRECVNCGATSTPLWRRDGTGHYLCRRCGLYHKMNG---QNRPLIK--PKRRLSAARR 313
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 356
>gi|350422369|ref|XP_003493143.1| PREDICTED: hypothetical protein LOC100742684 [Bombus impatiens]
Length = 215
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 164 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 212
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/28 (96%), Positives = 27/28 (96%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 164 EGRECVNCGATSTPLWRRDGTGHYLCNA 191
>gi|156408197|ref|XP_001641743.1| predicted protein [Nematostella vectensis]
gi|156228883|gb|EDO49680.1| predicted protein [Nematostella vectensis]
Length = 423
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/75 (64%), Positives = 56/75 (74%), Gaps = 5/75 (6%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR--- 129
Y +GRECVNCGA STPLWRRDG+GHYLCNACGLYHKMNG +RPL+K RRL R
Sbjct: 210 SYGADGRECVNCGATSTPLWRRDGSGHYLCNACGLYHKMNGSSRPLIKPKRRLSAARRAG 269
Query: 130 -GNFNCY-ERSTLFR 142
NC+ ++TL+R
Sbjct: 270 TSCANCHTTQTTLWR 284
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 53/105 (50%), Positives = 61/105 (58%), Gaps = 6/105 (5%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
Y +GRECVNCGA STPLWRRDG+GHYLCNA G SR L P +
Sbjct: 210 SYGADGRECVNCGATSTPLWRRDGSGHYLCNACGLYHKMNGS---SRPLIK--PKRRLSA 264
Query: 75 F-TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 265 ARRAGTSCANCHTTQTTLWRRNQNGDPVCNACGLYWKLHAVNRPL 309
>gi|307212940|gb|EFN88533.1| GATA-binding factor 2 [Harpegnathos saltator]
Length = 306
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 43/49 (87%), Positives = 45/49 (91%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 255 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 303
Score = 65.1 bits (157), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 27/28 (96%), Positives = 27/28 (96%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 255 EGRECVNCGATSTPLWRRDGTGHYLCNA 282
>gi|351706522|gb|EHB09441.1| Histone deacetylase 6 [Heterocephalus glaber]
Length = 1489
Score = 99.4 bits (246), Expect = 4e-19, Method: Composition-based stats.
Identities = 39/49 (79%), Positives = 43/49 (87%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL
Sbjct: 174 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRL 222
Score = 87.0 bits (214), Expect = 2e-15, Method: Composition-based stats.
Identities = 51/101 (50%), Positives = 63/101 (62%), Gaps = 6/101 (5%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVED-IEYFTE 77
E RECVNCGA +TPLWRRD TGHYLCNA G +R L P + I
Sbjct: 174 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIR--PKKRLIVSKRA 228
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 229 GTQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPL 269
>gi|116284294|gb|AAI24484.1| GATA-binding protein 1 [Danio rerio]
gi|182889206|gb|AAI64788.1| Gata1 protein [Danio rerio]
Length = 418
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRDGTGHYLCNA G +R L I G
Sbjct: 234 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 289
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY K++ +NRPL
Sbjct: 290 TQCANCHTSTTTLWRRNASGEPVCNACGLYFKLHNVNRPLT 330
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 40/53 (75%), Positives = 45/53 (84%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 234 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 286
>gi|427930996|pdb|4HC7|A Chain A, Crystal Structure Of The Full Dna Binding Domain Of
Gata3-Complex 2
gi|427930999|pdb|4HC7|B Chain B, Crystal Structure Of The Full Dna Binding Domain Of
Gata3-Complex 2
gi|427931003|pdb|4HC9|A Chain A, Dna Binding By Gata Transcription Factor-Complex 3
gi|427931007|pdb|4HCA|A Chain A, Dna Binding By Gata Transcription Factor-Complex 1
Length = 115
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 61/101 (60%), Gaps = 6/101 (5%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE-G 78
GRECVNCGA STPLWRRDGTGHYLCNA G +R L P + G
Sbjct: 5 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARRAG 59
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 60 TSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 100
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 44/52 (84%), Positives = 45/52 (86%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 5 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 56
>gi|327264202|ref|XP_003216904.1| PREDICTED: erythroid transcription factor-like [Anolis
carolinensis]
Length = 383
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 41/54 (75%), Positives = 46/54 (85%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TE RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 188 TEARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 241
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 52/102 (50%), Positives = 61/102 (59%), Gaps = 4/102 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TE RECVNCGA +TPLWRRDGTGHYLCNA G +R L I
Sbjct: 188 TEARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRA 243
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G +C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 244 GTQCSNCQTTTTTLWRRNVNGEPVCNACGLYFKLHNVNRPLA 285
>gi|45382243|ref|NP_990752.1| transcription factor GATA-5 [Gallus gallus]
gi|1169848|sp|P43692.1|GATA5_CHICK RecName: Full=Transcription factor GATA-5; AltName:
Full=GATA-binding factor 5
gi|511482|gb|AAA57504.1| GATA-5 transcription factor [Gallus gallus]
Length = 391
Score = 99.0 bits (245), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 43/57 (75%), Positives = 49/57 (85%), Gaps = 1/57 (1%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E+ +GRECVNCGA+STPLWR+DGTGHYLCNACGLYHKMNG+NRPL Q +RL R
Sbjct: 178 EFPGDGRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNGINRPLKPQ-KRLSSSR 233
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 49/109 (44%), Positives = 64/109 (58%), Gaps = 13/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ +GRECVNCGA+STPLWR+DGTGHYLCNA G + P++ +
Sbjct: 178 EFPGDGRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNGI---------NRPLKPQKR 228
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 229 LSSSRRAGLCCTNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 277
>gi|321467985|gb|EFX78972.1| hypothetical protein DAPPUDRAFT_28576 [Daphnia pulex]
Length = 115
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 46/56 (82%), Positives = 50/56 (89%), Gaps = 1/56 (1%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKM-NGMNRPLVKQPRRLRFDR 129
+ EGRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKM NG+NRPL+K PRRL R
Sbjct: 1 YLEGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNNGINRPLLKPPRRLSATR 56
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 57/104 (54%), Positives = 68/104 (65%), Gaps = 5/104 (4%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNA-RLFSTSFTGYFIMSRFLFASLPVEDIEYF 75
+ EGRECVNCG+ISTPLWRRDGTGHYLCNA L+ G ++R L P
Sbjct: 1 YLEGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNNG---INRPLLKP-PRRLSATR 56
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NCG +T LWRR+ G +CNACGLYHK++G+NRPL
Sbjct: 57 RLGLCCTNCGTTTTTLWRRNAEGEPVCNACGLYHKLHGVNRPLA 100
>gi|170044813|ref|XP_001850027.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867945|gb|EDS31328.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 110
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 45/58 (77%), Positives = 49/58 (84%), Gaps = 4/58 (6%)
Query: 68 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
P+ D+E GRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 39 PINDLE----GRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 92
Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/37 (78%), Positives = 31/37 (83%), Gaps = 4/37 (10%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
P+ D+E GRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 39 PINDLE----GRECVNCGATSTPLWRRDGTGHYLCNA 71
>gi|348510297|ref|XP_003442682.1| PREDICTED: GATA-binding factor 2-like [Oreochromis niloticus]
Length = 424
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 44/69 (63%), Positives = 51/69 (73%), Gaps = 4/69 (5%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR--GNF-- 132
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL + G
Sbjct: 239 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTLCA 298
Query: 133 NCYERSTLF 141
NC+ +T
Sbjct: 299 NCHTSTTTL 307
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 60/103 (58%), Gaps = 4/103 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRDGTGHYLCNA G +R L I G
Sbjct: 239 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 294
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
C NC +T LWRR+ G +CNACGLY K++ +NRPL +
Sbjct: 295 TLCANCHTSTTTLWRRNANGEPVCNACGLYFKLHNVNRPLTMK 337
>gi|321467569|gb|EFX78559.1| hypothetical protein DAPPUDRAFT_24785 [Daphnia pulex]
Length = 118
Score = 98.6 bits (244), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 53/102 (51%), Positives = 61/102 (59%), Gaps = 6/102 (5%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE- 77
EGRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 1 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIK--PKRRLSAARRA 55
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 56 GTTCANCKTTMTTLWRRNHNGEPVCNACGLYYKLHNVNRPLT 97
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/53 (83%), Positives = 46/53 (86%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL R
Sbjct: 1 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLSAAR 53
>gi|383851463|ref|XP_003701252.1| PREDICTED: GATA-binding factor C-like [Megachile rotundata]
Length = 292
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 47/79 (59%), Positives = 53/79 (67%)
Query: 51 TSFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHK 110
T + GY+ + Y +GRECVNCGA STPLWRRDGTGHYLCNACGLY+K
Sbjct: 37 TDYRGYYPHAGPTAHHPAAAAAHYAHKGRECVNCGATSTPLWRRDGTGHYLCNACGLYYK 96
Query: 111 MNGMNRPLVKQPRRLRFDR 129
MNG NRPL+K RRL R
Sbjct: 97 MNGQNRPLIKPKRRLSAAR 115
Score = 98.2 bits (243), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/104 (50%), Positives = 63/104 (60%), Gaps = 6/104 (5%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-TE 77
+GRECVNCGA STPLWRRDGTGHYLCNA G +R L P +
Sbjct: 63 KGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNG---QNRPLIK--PKRRLSAARRA 117
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 118 GTSCANCKTATTTLWRRNQAGEPVCNACGLYYKLHNVNRPLTMK 161
>gi|410920497|ref|XP_003973720.1| PREDICTED: GATA-binding factor 2-like [Takifugu rubripes]
Length = 421
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 44/69 (63%), Positives = 51/69 (73%), Gaps = 4/69 (5%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR--GNF-- 132
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL + G
Sbjct: 235 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTLCA 294
Query: 133 NCYERSTLF 141
NC+ +T
Sbjct: 295 NCHTSTTTL 303
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 61/103 (59%), Gaps = 4/103 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRDGTGHYLCNA G +R L I G
Sbjct: 235 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 290
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
C NC +T LWRR+ +G +CNACGLY K++ +NRPL +
Sbjct: 291 TLCANCHTSTTTLWRRNASGEPVCNACGLYFKLHNVNRPLTMK 333
>gi|38569869|gb|AAR24452.1| GATA transcription factor [Nematostella vectensis]
Length = 422
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 48/78 (61%), Positives = 57/78 (73%), Gaps = 5/78 (6%)
Query: 70 EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
D + +GRECVNCGA STPLWRRDG+GHYLCNACGLYHKMNG +RPL+K RRL R
Sbjct: 206 SDFFSYGDGRECVNCGATSTPLWRRDGSGHYLCNACGLYHKMNGSSRPLIKPKRRLSAAR 265
Query: 130 ----GNFNCY-ERSTLFR 142
NC+ ++TL+R
Sbjct: 266 RAGTSCANCHTTQTTLWR 283
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 62/108 (57%), Gaps = 6/108 (5%)
Query: 12 EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVED 71
D + +GRECVNCGA STPLWRRDG+GHYLCNA G SR L P
Sbjct: 206 SDFFSYGDGRECVNCGATSTPLWRRDGSGHYLCNACGLYHKMNGS---SRPLIK--PKRR 260
Query: 72 IEYF-TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 261 LSAARRAGTSCANCHTTQTTLWRRNQNGDPVCNACGLYWKLHAVNRPL 308
>gi|449486444|ref|XP_002194802.2| PREDICTED: transcription factor GATA-5 [Taeniopygia guttata]
Length = 260
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 41/51 (80%), Positives = 46/51 (90%)
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR 123
E+ +GRECVNCGA+STPLWR+DGTGHYLCNACGLYHKMNG+NRPL Q R
Sbjct: 45 EFPGDGRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNGINRPLKPQKR 95
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 49/109 (44%), Positives = 64/109 (58%), Gaps = 13/109 (11%)
Query: 15 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEY 74
E+ +GRECVNCGA+STPLWR+DGTGHYLCNA G + P++ +
Sbjct: 45 EFPGDGRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNGI---------NRPLKPQKR 95
Query: 75 FTEGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ R NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 96 LSSSRRAGLCCTNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 144
>gi|157278507|ref|NP_001098355.1| hematopietic transcription factor GATA-1 [Oryzias latipes]
gi|39748630|dbj|BAD04991.1| hematopietic transcription factor GATA-1 [Oryzias latipes]
Length = 417
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 40/53 (75%), Positives = 45/53 (84%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 234 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 286
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 61/103 (59%), Gaps = 4/103 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRDGTGHYLCNA G +R L I G
Sbjct: 234 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 289
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
+C NC +T LWRR+ G +CNACGLY K++ +NRPL +
Sbjct: 290 TQCANCHTSTTTLWRRNANGEPVCNACGLYFKLHNVNRPLTMK 332
>gi|477758|pir||B48099 transcription factor GATA-2, retinoic acid-inducible - mouse
(fragment)
Length = 118
Score = 98.2 bits (243), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 53/104 (50%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFAS---LPVEDIEYF 75
EGRECVNCGA +TPLWRRDGTGHYLCNA G +R L LP
Sbjct: 1 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIKPKRRLPAAR---- 53
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 54 RAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLT 97
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/53 (83%), Positives = 46/53 (86%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 1 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLPAAR 53
>gi|371501187|emb|CBN81009.2| GATA-binding factor 3 [Dicentrarchus labrax]
Length = 460
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/69 (63%), Positives = 51/69 (73%), Gaps = 4/69 (5%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR--GNF-- 132
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL + G
Sbjct: 241 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTLCA 300
Query: 133 NCYERSTLF 141
NC+ +T
Sbjct: 301 NCHTSTTTL 309
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 52/104 (50%), Positives = 62/104 (59%), Gaps = 6/104 (5%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVED-IEYFTE 77
E RECVNCGA +TPLWRRDGTGHYLCNA G +R L P + I
Sbjct: 241 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIR--PKKRLIVSKRA 295
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G C NC +T LWRR+ G +CNACGLY K++ +NRPL +
Sbjct: 296 GTLCANCHTSTTTLWRRNANGEPVCNACGLYFKLHNVNRPLTMK 339
>gi|126342847|ref|XP_001372037.1| PREDICTED: erythroid transcription factor-like [Monodelphis
domestica]
Length = 401
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 40/53 (75%), Positives = 45/53 (84%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 203 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 255
Score = 85.5 bits (210), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 62/101 (61%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRDGTGHYLCNA G +R L I G
Sbjct: 203 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 258
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 259 TQCTNCQTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLT 299
>gi|18858731|ref|NP_571309.1| GATA binding protein 1a [Danio rerio]
gi|1132419|gb|AAA86090.1| zg1 [Danio rerio]
Length = 418
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWR+DGTGHYLCNA G +R L I G
Sbjct: 234 EARECVNCGATATPLWRQDGTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 289
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY K++ +NRPL
Sbjct: 290 TQCANCHTSTTTLWRRNASGEPVCNACGLYFKLHNVNRPLT 330
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 45/53 (84%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWR+DGTGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 234 EARECVNCGATATPLWRQDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 286
>gi|68364574|ref|XP_693371.1| PREDICTED: GATA-binding factor 2 [Danio rerio]
Length = 372
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 62/103 (60%), Gaps = 8/103 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF--ASLPVEDIEYFT 76
+ RECVNCGA +TPLWRRDGTGHYLCNA G +R L PV
Sbjct: 194 DARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIRPKKRPVVSKRI-- 248
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G +C NC +T LWRR+ +G +CNACGLY K++ +NRPL
Sbjct: 249 -GTQCANCQTSTTTLWRRNSSGEPVCNACGLYFKLHNVNRPLA 290
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 38/48 (79%), Positives = 43/48 (89%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
+ RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +R
Sbjct: 194 DARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKR 241
>gi|196049282|dbj|BAG68610.1| GATA-binding protein 1 [Cyprinus carpio]
Length = 277
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 40/49 (81%), Positives = 44/49 (89%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL
Sbjct: 213 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRL 261
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 25/28 (89%), Positives = 26/28 (92%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
E RECVNCGA +TPLWRRDGTGHYLCNA
Sbjct: 213 EARECVNCGATATPLWRRDGTGHYLCNA 240
>gi|270008082|gb|EFA04530.1| grain [Tribolium castaneum]
Length = 537
Score = 97.8 bits (242), Expect = 1e-18, Method: Composition-based stats.
Identities = 46/98 (46%), Positives = 55/98 (56%), Gaps = 1/98 (1%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRE 80
+ECVNCGA TPLWRRDGTGHYLCNA G + G +
Sbjct: 285 KECVNCGASVTPLWRRDGTGHYLCNACGLYNKINGVNRPPVRPPKKPQAQPGPR-RNGVQ 343
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 344 CANCKTGNTTLWRRNNQGEPVCNACGLYFKLHNVNRPL 381
>gi|358335481|dbj|GAA54062.1| endothelial transcription factor GATA-2 [Clonorchis sinensis]
Length = 412
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/120 (46%), Positives = 66/120 (55%), Gaps = 11/120 (9%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF- 75
T+GRECVNCGA TPLWRRD TGHYLCNA G SR L P +
Sbjct: 62 LTDGRECVNCGATQTPLWRRDETGHYLCNACGLYHKMNG---TSRPLIK--PKRRMSATR 116
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-----VKQPRRLRFDRG 130
G C NC T LWRR+ G +CNACGLY+K++ +NRPL + Q R R +G
Sbjct: 117 KSGTICANCRTAHTTLWRRNQHGDSVCNACGLYYKLHHINRPLSMKKEIIQTRNRRLTQG 176
>gi|213511622|ref|NP_001133239.1| GATA-binding factor 1 [Salmo salar]
gi|209147311|gb|ACI32884.1| GATA-binding factor 3 [Salmo salar]
Length = 424
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
+ RECVNCGA +TPLWRRDGTGHYLCNA G +R L I G
Sbjct: 239 DARECVNCGATNTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 294
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY K++ +NRPL
Sbjct: 295 TQCANCNTSTTTLWRRNASGEPVCNACGLYFKLHNVNRPLT 335
Score = 95.9 bits (237), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 45/53 (84%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
+ RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 239 DARECVNCGATNTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 291
>gi|307190871|gb|EFN74708.1| GATA-binding factor C [Camponotus floridanus]
Length = 57
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 47/54 (87%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
+E EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+K RRL
Sbjct: 1 MEIPIEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 54
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/33 (84%), Positives = 29/33 (87%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+E EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 1 MEIPIEGRECVNCGATSTPLWRRDGTGHYLCNA 33
>gi|431913709|gb|ELK15199.1| Endothelial transcription factor GATA-2 [Pteropus alecto]
Length = 383
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 40/45 (88%), Positives = 43/45 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K
Sbjct: 207 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK 251
>gi|322782909|gb|EFZ10627.1| hypothetical protein SINV_01972 [Solenopsis invicta]
Length = 726
Score = 97.4 bits (241), Expect = 2e-18, Method: Composition-based stats.
Identities = 48/105 (45%), Positives = 63/105 (60%), Gaps = 6/105 (5%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA-RLFSTSFTGYFIMSRFLFASLPVEDI---E 73
TE +ECVNC A TPLWRRDG+GHYLCNA L++ G + + P + +
Sbjct: 498 TEPKECVNCAASMTPLWRRDGSGHYLCNACGLYNKMNNG--LNRPPMRCPKPKQTVAPAN 555
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 556 LRRTGVQCANCRTSNTTLWRRNNNGEPVCNACGLYFKLHNVNRPL 600
Score = 86.7 bits (213), Expect = 3e-15, Method: Composition-based stats.
Identities = 35/49 (71%), Positives = 42/49 (85%), Gaps = 1/49 (2%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKM-NGMNRPLVKQPR 123
TE +ECVNC A TPLWRRDG+GHYLCNACGLY+KM NG+NRP ++ P+
Sbjct: 498 TEPKECVNCAASMTPLWRRDGSGHYLCNACGLYNKMNNGLNRPPMRCPK 546
>gi|45382623|ref|NP_990795.1| erythroid transcription factor [Gallus gallus]
gi|120955|sp|P17678.1|GATA1_CHICK RecName: Full=Erythroid transcription factor; AltName: Full=Eryf1;
AltName: Full=GATA-binding factor 1; Short=GATA-1;
AltName: Full=NF-E1 DNA-binding protein; Short=NF-E1a
gi|212629|gb|AAA49055.1| Eryf1 protein [Gallus gallus]
Length = 304
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 63/116 (54%), Gaps = 14/116 (12%)
Query: 9 LPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA-----RLFSTSFTGYFIMSRFL 63
LP+ E RECVNCGA +TPLWRRDGTGHYLCNA RL + R L
Sbjct: 96 LPLSSGPPPCEARECVNCGATATPLWRRDGTGHYLCNACGLYHRLNGQNRPLIRPKKRLL 155
Query: 64 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ G C NC +T LWRR G +CNACGLY+K++ +NRPL
Sbjct: 156 VSK---------RAGTVCSNCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVNRPLT 202
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 62/110 (56%), Gaps = 9/110 (8%)
Query: 24 VNCGAISTPLWRRDGT----GHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR 79
+ G + TP W T HYL L + S L LP+ E R
Sbjct: 54 ADTGTLGTPQWVPPATQMEPPHYL--ELLQPPRGSPPHPSSGPL---LPLSSGPPPCEAR 108
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
ECVNCGA +TPLWRRDGTGHYLCNACGLYH++NG NRPL++ +RL +
Sbjct: 109 ECVNCGATATPLWRRDGTGHYLCNACGLYHRLNGQNRPLIRPKKRLLVSK 158
>gi|303387041|gb|ADM15548.1| GATA-1, partial [Carassius auratus langsdorfii]
Length = 214
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 44/69 (63%), Positives = 50/69 (72%), Gaps = 4/69 (5%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRGNF 132
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL R
Sbjct: 127 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCA 186
Query: 133 NCYERSTLF 141
NC+ +T
Sbjct: 187 NCHTSTTTL 195
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 47/92 (51%), Positives = 54/92 (58%), Gaps = 4/92 (4%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRDGTGHYLCNA G +R L I G
Sbjct: 127 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 182
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHK 110
+C NC +T LWRR+ +G +CNACGLY K
Sbjct: 183 TQCANCHTSTTTLWRRNASGEPVCNACGLYFK 214
>gi|7861539|dbj|BAA95683.1| transcription factor GATA-6 [Mus musculus]
Length = 470
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 67/114 (58%), Gaps = 12/114 (10%)
Query: 7 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMS 60
A LPV + +E +E RECVNCG+I TPLWRRDGTGHYLCNA + G +S
Sbjct: 362 APLPVPRGPSTDLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LS 418
Query: 61 RFLFASLPVEDIEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG 113
R L P + + G C NC +T LWRR+ G +CNACGLY K++G
Sbjct: 419 RPLIK--PQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHG 470
Score = 96.3 bits (238), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 44/71 (61%), Positives = 53/71 (74%), Gaps = 6/71 (8%)
Query: 65 ASLPV------EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
A LPV + +E +E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL
Sbjct: 362 APLPVPRGPSTDLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPL 421
Query: 119 VKQPRRLRFDR 129
+K +R+ R
Sbjct: 422 IKPQKRVPSSR 432
>gi|395548125|ref|XP_003775205.1| PREDICTED: erythroid transcription factor [Sarcophilus harrisii]
Length = 302
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 40/53 (75%), Positives = 45/53 (84%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 203 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 255
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 57/94 (60%), Gaps = 4/94 (4%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRDGTGHYLCNA G +R L I G
Sbjct: 203 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 258
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 112
+C NC +T LWRR+ +G +CNACGLY+K++
Sbjct: 259 TQCTNCQTTTTTLWRRNASGEPVCNACGLYYKLH 292
>gi|1132421|gb|AAA86091.1| zg2, partial [Danio rerio]
Length = 194
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 44/57 (77%), Positives = 45/57 (78%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNC 134
G ECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R C
Sbjct: 6 GAECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTC 62
Score = 96.3 bits (238), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 61/101 (60%), Gaps = 6/101 (5%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-TEG 78
G ECVNCGA STPLWRRDGTGHYLCNA G +R L P + G
Sbjct: 6 GAECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARRAG 60
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
C NC +T LWRR+G G +CNACGLY+K++ +NRPL
Sbjct: 61 TCCANCQTTTTTLWRRNGNGDPVCNACGLYYKLHNVNRPLT 101
>gi|355754933|gb|EHH58800.1| GATA-binding factor 6 [Macaca fascicularis]
Length = 410
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 51/107 (47%), Positives = 64/107 (59%), Gaps = 6/107 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E +E RECVNCG+I TPL R DGTGHYLCNA + G +SR L P + +
Sbjct: 196 LEDLSESRECVNCGSIQTPLGRGDGTGHYLCNACGLYSKMNG---ISRPLIK--PQKRVP 250
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 251 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 297
Score = 89.4 bits (220), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/125 (44%), Positives = 68/125 (54%), Gaps = 22/125 (17%)
Query: 27 GAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV------EDIEYFTEGRE 80
GA TP W A F T + + SR A LPV + +E +E RE
Sbjct: 157 GAPLTPAWP----------AGPFETPVL-HSLQSRA-GAPLPVPRGPSADLLEDLSESRE 204
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRGNFNCYE 136
CVNCG+I TPL R DGTGHYLCNACGLY KMNG++RPL+K +R+ R NC+
Sbjct: 205 CVNCGSIQTPLGRGDGTGHYLCNACGLYSKMNGISRPLIKPQKRVPSSRRLGLSCANCHT 264
Query: 137 RSTLF 141
+T
Sbjct: 265 TTTTL 269
>gi|349804083|gb|AEQ17514.1| putative gata binding protein 6 [Hymenochirus curtipes]
Length = 164
Score = 96.3 bits (238), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 51/107 (47%), Positives = 64/107 (59%), Gaps = 6/107 (5%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
++ E RECVNCG++ TPLWRRDGTGHYLCNA + G SR L P + +
Sbjct: 3 LDELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGP---SRPLIK--PQKRVP 57
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 58 SSRRIGLACTNCHTTTTTLWRRNTEGEPVCNACGLYMKLHGVPRPLA 104
Score = 90.1 bits (222), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 38/58 (65%), Positives = 46/58 (79%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
++ E RECVNCG++ TPLWRRDGTGHYLCNACGLY KMNG +RPL+K +R+ R
Sbjct: 3 LDELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGPSRPLIKPQKRVPSSR 60
>gi|47227325|emb|CAF96874.1| unnamed protein product [Tetraodon nigroviridis]
Length = 190
Score = 96.3 bits (238), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 44/69 (63%), Positives = 51/69 (73%), Gaps = 4/69 (5%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR--GNF-- 132
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++ +RL + G
Sbjct: 2 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTLCA 61
Query: 133 NCYERSTLF 141
NC+ +T
Sbjct: 62 NCHTSTTTL 70
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 61/103 (59%), Gaps = 4/103 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRDGTGHYLCNA G +R L I G
Sbjct: 2 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 57
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
C NC +T LWRR+ +G +CNACGLY K++ +NRPL +
Sbjct: 58 TLCANCHTSTTTLWRRNASGEPVCNACGLYFKLHNVNRPLTMK 100
>gi|196011732|ref|XP_002115729.1| hypothetical protein TRIADDRAFT_59720 [Trichoplax adhaerens]
gi|190581505|gb|EDV21581.1| hypothetical protein TRIADDRAFT_59720 [Trichoplax adhaerens]
Length = 468
Score = 96.3 bits (238), Expect = 4e-18, Method: Composition-based stats.
Identities = 45/69 (65%), Positives = 52/69 (75%), Gaps = 5/69 (7%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRGNFNC 134
R C NCG ISTPLWRRDG+G YLCNACGLYHK+NG+NRPLVK +RL R D NC
Sbjct: 290 RACANCGVISTPLWRRDGSGQYLCNACGLYHKINGVNRPLVKPNKRLSSTRRMDLICANC 349
Query: 135 -YERSTLFR 142
+R+TL+R
Sbjct: 350 NTDRTTLWR 358
Score = 89.0 bits (219), Expect = 6e-16, Method: Composition-based stats.
Identities = 46/102 (45%), Positives = 58/102 (56%), Gaps = 12/102 (11%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRE 80
R C NCG ISTPLWRRDG+G YLCNA G ++R L V+ + + R
Sbjct: 290 RACANCGVISTPLWRRDGSGQYLCNACGLYHKING---VNRPL-----VKPNKRLSSTRR 341
Query: 81 ----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC T LWRR+ G +CNACGLY K++ +NRP+
Sbjct: 342 MDLICANCNTDRTTLWRRNKYGQPVCNACGLYFKLHEINRPM 383
>gi|345491204|ref|XP_001607832.2| PREDICTED: hypothetical protein LOC100124020 [Nasonia vitripennis]
Length = 737
Score = 95.9 bits (237), Expect = 4e-18, Method: Composition-based stats.
Identities = 50/103 (48%), Positives = 61/103 (59%), Gaps = 8/103 (7%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR 79
GRECVNCGA +TPLWRRDGT YLCNA + G +SR + T GR
Sbjct: 508 GRECVNCGAHTTPLWRRDGTT-YLCNACGICSKTNG---ISRPPTQRAKPKTSVPPTGGR 563
Query: 80 E----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +T LWRR+ G +CNACGLY K++G+NRP+
Sbjct: 564 RLGVRCANCSTTTTTLWRRNNNGEPVCNACGLYFKLHGVNRPM 606
>gi|221127849|ref|XP_002159183.1| PREDICTED: uncharacterized protein LOC100208265 [Hydra
magnipapillata]
Length = 408
Score = 95.5 bits (236), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 50/106 (47%), Positives = 63/106 (59%), Gaps = 14/106 (13%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLP-VEDIEYFTE 77
EGREC+NCGA STPLWRRD GHYLCNA G A+ P ++ ++
Sbjct: 225 EGRECMNCGATSTPLWRRDTRGHYLCNACGLYHKMNG---------ANRPLIKPKRRLSQ 275
Query: 78 GRE----CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
R C NC T LWRR+G+G +CNACGLY+K++ +NRPL
Sbjct: 276 ARRTGIVCSNCKTSQTTLWRRNGSGEPVCNACGLYYKLHKVNRPLT 321
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 42/53 (79%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
EGREC+NCGA STPLWRRD GHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 225 EGRECMNCGATSTPLWRRDTRGHYLCNACGLYHKMNGANRPLIKPKRRLSQAR 277
>gi|354485961|ref|XP_003505150.1| PREDICTED: erythroid transcription factor-like [Cricetulus griseus]
Length = 412
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 59/125 (47%), Positives = 66/125 (52%), Gaps = 24/125 (19%)
Query: 5 LFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF 64
L ASLPV Y GA P TG L +A S F G
Sbjct: 151 LPASLPVNSSGY----------GAPDFPGPFFSPTGSPLNSAAYSSPKFHG--------- 191
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
SLP+ E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +R
Sbjct: 192 -SLPLAPCE----ARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKR 246
Query: 125 LRFDR 129
L +
Sbjct: 247 LIVSK 251
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 199 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 254
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 255 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 295
>gi|328721860|ref|XP_001943307.2| PREDICTED: hypothetical protein LOC100166174 isoform 1
[Acyrthosiphon pisum]
gi|328721862|ref|XP_003247419.1| PREDICTED: hypothetical protein LOC100166174 isoform 2
[Acyrthosiphon pisum]
Length = 637
Score = 95.1 bits (235), Expect = 9e-18, Method: Composition-based stats.
Identities = 47/106 (44%), Positives = 56/106 (52%), Gaps = 16/106 (15%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT---- 76
+ECVNC A TPLWRRDGTGH+LCNA G PV + T
Sbjct: 415 KECVNCAANVTPLWRRDGTGHHLCNACGLYNRING--------VNRPPVRSAQKKTTQQT 466
Query: 77 ----EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 467 GNKRSGVACANCSTNTTTLWRRNNNGEPVCNACGLYFKLHNVNRPL 512
Score = 82.8 bits (203), Expect = 4e-14, Method: Composition-based stats.
Identities = 31/46 (67%), Positives = 40/46 (86%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
+ECVNC A TPLWRRDGTGH+LCNACGLY+++NG+NRP V+ ++
Sbjct: 415 KECVNCAANVTPLWRRDGTGHHLCNACGLYNRINGVNRPPVRSAQK 460
>gi|30851257|gb|AAH52653.1| GATA binding protein 1 [Mus musculus]
Length = 413
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/125 (45%), Positives = 65/125 (52%), Gaps = 24/125 (19%)
Query: 5 LFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF 64
L ASLPV Y G P TG L +A S F G
Sbjct: 152 LPASLPVTGSAY----------GGADFPSPFFSPTGSPLSSAAYSSPKFHG--------- 192
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
SLP+ E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +R
Sbjct: 193 -SLPLAPCE----ARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKR 247
Query: 125 LRFDR 129
+ +
Sbjct: 248 MIVSK 252
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 60/101 (59%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRMIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYFKLHQVNRPLT 296
>gi|391347263|ref|XP_003747884.1| PREDICTED: uncharacterized protein LOC100905208 [Metaseiulus
occidentalis]
Length = 594
Score = 94.7 bits (234), Expect = 1e-17, Method: Composition-based stats.
Identities = 40/67 (59%), Positives = 48/67 (71%)
Query: 51 TSFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHK 110
T F+GY + L + I+Y+ + +ECVNCGA+STPLWRRD GHYLCNACGLY K
Sbjct: 242 TGFSGYPAATNPLGMPMDPAYIDYYQDSKECVNCGAVSTPLWRRDMAGHYLCNACGLYTK 301
Query: 111 MNGMNRP 117
MNG NRP
Sbjct: 302 MNGSNRP 308
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 26/42 (61%), Positives = 31/42 (73%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTG 55
I+Y+ + +ECVNCGA+STPLWRRD GHYLCNA T G
Sbjct: 263 IDYYQDSKECVNCGAVSTPLWRRDMAGHYLCNACGLYTKMNG 304
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 29/40 (72%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NC +T LWRR+ G +CNACGLY+K++G+NRP
Sbjct: 375 GLVCSNCDTNNTTLWRRNNQGEPVCNACGLYYKLHGVNRP 414
>gi|335353905|dbj|BAK39709.1| GATA binding protein 2 [Tursiops truncatus]
Length = 83
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/51 (82%), Positives = 44/51 (86%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL R
Sbjct: 1 RECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAAR 51
Score = 79.0 bits (193), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/88 (50%), Positives = 50/88 (56%), Gaps = 6/88 (6%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE-GR 79
RECVNCGA +TPLWRRDGTGHYLCNA G +R L P + G
Sbjct: 1 RECVNCGATATPLWRRDGTGHYLCNACGLYHKMNG---QNRPLIK--PKRRLSAARRAGT 55
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGL 107
C NC +T LWRR+ G +CNACGL
Sbjct: 56 CCANCQTTTTTLWRRNANGDPVCNACGL 83
>gi|344250001|gb|EGW06105.1| Erythroid transcription factor [Cricetulus griseus]
Length = 330
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/125 (47%), Positives = 66/125 (52%), Gaps = 24/125 (19%)
Query: 5 LFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF 64
L ASLPV Y GA P TG L +A S F G
Sbjct: 69 LPASLPVNSSGY----------GAPDFPGPFFSPTGSPLNSAAYSSPKFHG--------- 109
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
SLP+ E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +R
Sbjct: 110 -SLPLAPCE----ARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKR 164
Query: 125 LRFDR 129
L +
Sbjct: 165 LIVSK 169
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 117 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 172
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 173 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 213
>gi|6679947|ref|NP_032115.1| erythroid transcription factor [Mus musculus]
gi|120957|sp|P17679.1|GATA1_MOUSE RecName: Full=Erythroid transcription factor; AltName: Full=Eryf1;
AltName: Full=GATA-binding factor 1; Short=GATA-1;
Short=GF-1; AltName: Full=NF-E1 DNA-binding protein
gi|51064|emb|CAA33769.1| DNA-binding protein, GF-1 [Mus musculus]
gi|74147266|dbj|BAE27527.1| unnamed protein product [Mus musculus]
gi|148701991|gb|EDL33938.1| GATA binding protein 1 [Mus musculus]
gi|226387|prf||1509337A DNA binding protein
Length = 413
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/125 (45%), Positives = 65/125 (52%), Gaps = 24/125 (19%)
Query: 5 LFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLF 64
L ASLPV Y G P TG L +A S F G
Sbjct: 152 LPASLPVTGSAY----------GGADFPSPFFSPTGSPLSSAAYSSPKFHG--------- 192
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
SLP+ E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +R
Sbjct: 193 -SLPLAPCE----ARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKR 247
Query: 125 LRFDR 129
+ +
Sbjct: 248 MIVSK 252
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 60/101 (59%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRMIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYFKLHQVNRPLT 296
>gi|1289502|gb|AAC52841.1| transcription factor GATA-6, partial [Mus musculus]
Length = 205
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/98 (48%), Positives = 60/98 (61%), Gaps = 6/98 (6%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE-GREC 81
CVNCG+I TPLWRRDGTGHYLCNA ++ +SR L P + + G C
Sbjct: 1 CVNCGSIQTPLWRRDGTGHYLCNACGL---YSKMNALSRPLIK--PQKRVPSSRRLGLSC 55
Query: 82 VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 56 ANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 93
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 38/65 (58%), Positives = 46/65 (70%), Gaps = 4/65 (6%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRGNFNCYE 136
CVNCG+I TPLWRRDGTGHYLCNACGLY KMN ++RPL+K +R+ R NC+
Sbjct: 1 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNALSRPLIKPQKRVPSSRRLGLSCANCHT 60
Query: 137 RSTLF 141
+T
Sbjct: 61 TTTTL 65
>gi|410056444|ref|XP_001139973.3| PREDICTED: erythroid transcription factor [Pan troglodytes]
Length = 242
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 41/65 (63%), Positives = 48/65 (73%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNCYE 136
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL + F Y
Sbjct: 167 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLVRPQTCFTLYT 226
Query: 137 RSTLF 141
+ +
Sbjct: 227 GTPVL 231
Score = 58.5 bits (140), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 24/28 (85%), Positives = 25/28 (89%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
E RECVNCGA +TPLWRRD TGHYLCNA
Sbjct: 167 EARECVNCGATATPLWRRDRTGHYLCNA 194
>gi|301753709|ref|XP_002912716.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Ailuropoda melanoleuca]
Length = 478
Score = 94.4 bits (233), Expect = 1e-17, Method: Composition-based stats.
Identities = 50/106 (47%), Positives = 62/106 (58%), Gaps = 7/106 (6%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIE 73
+E E RECVNCG+I TPLWR G GHYLCNA + G +SR L P + +
Sbjct: 265 LEDLPESRECVNCGSIQTPLWRXGG-GHYLCNACGLYSKMNG---LSRPLIK--PQKRVP 318
Query: 74 YFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 319 SSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPL 364
Score = 86.7 bits (213), Expect = 3e-15, Method: Composition-based stats.
Identities = 40/72 (55%), Positives = 49/72 (68%), Gaps = 5/72 (6%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RF 127
+E E RECVNCG+I TPLWR G GHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 265 LEDLPESRECVNCGSIQTPLWRXGG-GHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRL 323
Query: 128 DRGNFNCYERST 139
NC+ +T
Sbjct: 324 GLSCANCHTTTT 335
>gi|6978877|ref|NP_036896.1| erythroid transcription factor [Rattus norvegicus]
gi|392355488|ref|XP_003752057.1| PREDICTED: erythroid transcription factor-like [Rattus norvegicus]
gi|1169842|sp|P43429.1|GATA1_RAT RecName: Full=Erythroid transcription factor; AltName: Full=Eryf1;
AltName: Full=GATA-binding factor 1; Short=GATA-1;
Short=GF-1; AltName: Full=NF-E1 DNA-binding protein
gi|644896|dbj|BAA02735.1| GATA-1 [Rattus norvegicus]
gi|149028423|gb|EDL83808.1| GATA binding protein 1 [Rattus norvegicus]
Length = 413
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 56/91 (61%), Gaps = 14/91 (15%)
Query: 39 TGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTG 98
TG L +A S F G SLP+ E RECVNCGA +TPLWRRD TG
Sbjct: 176 TGSPLSSAAYSSPKFHG----------SLPLAPCE----ARECVNCGATATPLWRRDRTG 221
Query: 99 HYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
HYLCNACGLYHKMNG NRPL++ +R+ +
Sbjct: 222 HYLCNACGLYHKMNGQNRPLIRPKKRMIVSK 252
Score = 82.0 bits (201), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRMIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|291407452|ref|XP_002719945.1| PREDICTED: GATA binding protein 1 [Oryctolagus cuniculus]
Length = 413
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|213623646|gb|AAI70023.1| GATA binding factor-1 [Xenopus laevis]
gi|213625189|gb|AAI70025.1| GATA binding factor-1 [Xenopus laevis]
Length = 359
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/104 (48%), Positives = 62/104 (59%), Gaps = 4/104 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TE RECVNCGA TPLWRRD +GHYLCNA G +R L I
Sbjct: 173 TEDRECVNCGATVTPLWRRDMSGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRA 228
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G +C NC +T LWRR+ +G +CNACGLY+K++ +NRPL +
Sbjct: 229 GTQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVNRPLTMK 272
Score = 92.4 bits (228), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 39/54 (72%), Positives = 44/54 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TE RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 173 TEDRECVNCGATVTPLWRRDMSGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 226
>gi|42566590|gb|AAS21059.1| GATA factor, partial [Nematostella vectensis]
Length = 208
Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/69 (66%), Positives = 53/69 (76%), Gaps = 5/69 (7%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR----GNFNC 134
RECVNCGA STPLWRRDG+GHYLCNACGLYHKMNG +RPL+K RRL R NC
Sbjct: 1 RECVNCGATSTPLWRRDGSGHYLCNACGLYHKMNGSSRPLIKPKRRLSAARRAGTSCANC 60
Query: 135 Y-ERSTLFR 142
+ ++TL+R
Sbjct: 61 HTTQTTLWR 69
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 57/99 (57%), Gaps = 6/99 (6%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-TEGR 79
RECVNCGA STPLWRRDG+GHYLCNA G SR L P + G
Sbjct: 1 RECVNCGATSTPLWRRDGSGHYLCNACGLYHKMNGS---SRPLIK--PKRRLSAARRAGT 55
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC T LWRR+ G +CNACGLY ++ +NRPL
Sbjct: 56 SCANCHTTQTTLWRRNQNGDPVCNACGLYWMLHAVNRPL 94
>gi|148232497|ref|NP_001079109.1| GATA-binding factor 1-A [Xenopus laevis]
gi|120966|sp|P23767.1|GAT1A_XENLA RecName: Full=GATA-binding factor 1-A; AltName: Full=Transcription
factor xGATA-1A
gi|214167|gb|AAA49721.1| GATA binding factor-1 [Xenopus laevis]
Length = 359
Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/104 (48%), Positives = 62/104 (59%), Gaps = 4/104 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TE RECVNCGA TPLWRRD +GHYLCNA G +R L I
Sbjct: 173 TEDRECVNCGATVTPLWRRDMSGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRA 228
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G +C NC +T LWRR+ +G +CNACGLY+K++ +NRPL +
Sbjct: 229 GTQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVNRPLTMK 272
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 39/54 (72%), Positives = 44/54 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TE RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 173 TEDRECVNCGATVTPLWRRDMSGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 226
>gi|431893570|gb|ELK03433.1| Erythroid transcription factor [Pteropus alecto]
Length = 413
Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|348553638|ref|XP_003462633.1| PREDICTED: erythroid transcription factor-like [Cavia porcellus]
Length = 409
Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/70 (62%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 60 SRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
S L SLP+ E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL+
Sbjct: 184 SPKLRGSLPLPPCE----ARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLI 239
Query: 120 KQPRRLRFDR 129
+ +RL +
Sbjct: 240 RPKKRLIVSK 249
Score = 82.0 bits (201), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/118 (46%), Positives = 67/118 (56%), Gaps = 8/118 (6%)
Query: 2 SRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSR 61
S L SLP+ E RECVNCGA +TPLWRRD TGHYLCNA G +R
Sbjct: 184 SPKLRGSLPLPPCE----ARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNR 236
Query: 62 FLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
L I G +C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 237 PLIRPK-KRLIVSKRAGTQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 293
>gi|301630601|ref|XP_002944405.1| PREDICTED: GATA-binding factor 1-B-like [Xenopus (Silurana)
tropicalis]
Length = 363
Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/104 (48%), Positives = 62/104 (59%), Gaps = 4/104 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TE RECVNCGA TPLWRRD +GHYLCNA G +R L I
Sbjct: 175 TEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRA 230
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G +C NC +T LWRR+ +G +CNACGLY+K++ +NRPL +
Sbjct: 231 GTQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVNRPLTMK 274
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 39/54 (72%), Positives = 44/54 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TE RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 175 TEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 228
>gi|403297486|ref|XP_003939593.1| PREDICTED: erythroid transcription factor [Saimiri boliviensis
boliviensis]
Length = 413
Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.9 bits (198), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|402910093|ref|XP_003917725.1| PREDICTED: erythroid transcription factor [Papio anubis]
Length = 413
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|432110143|gb|ELK33920.1| Erythroid transcription factor [Myotis davidii]
Length = 391
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 178 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 230
Score = 81.6 bits (200), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 178 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 233
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 234 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 274
>gi|395854434|ref|XP_003799696.1| PREDICTED: erythroid transcription factor [Otolemur garnettii]
Length = 413
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|213626857|gb|AAI70262.1| GATA binding factor-1b [Xenopus laevis]
Length = 364
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 50/104 (48%), Positives = 62/104 (59%), Gaps = 4/104 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TE RECVNCGA TPLWRRD +GHYLCNA G +R L I
Sbjct: 175 TEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIISKRA 230
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G +C NC +T LWRR+ +G +CNACGLY+K++ +NRPL +
Sbjct: 231 GTQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVNRPLTMK 274
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 63/114 (55%), Gaps = 15/114 (13%)
Query: 30 STPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-------------- 75
S PL RDG+ +L + TS I+S S + + Y
Sbjct: 116 SAPLAERDGSPKFLETLKTERTSPLTSDILS-LEPRSPSLLQVGYLGGGGQEFSHSLFQS 174
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TE RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 175 TEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRPLIRPKKRLIISK 228
>gi|213623822|gb|AAI70266.1| GATA binding factor-1b [Xenopus laevis]
Length = 364
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 50/104 (48%), Positives = 62/104 (59%), Gaps = 4/104 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TE RECVNCGA TPLWRRD +GHYLCNA G +R L I
Sbjct: 175 TEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIISKRA 230
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G +C NC +T LWRR+ +G +CNACGLY+K++ +NRPL +
Sbjct: 231 GTQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVNRPLTMK 274
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 63/114 (55%), Gaps = 15/114 (13%)
Query: 30 STPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-------------- 75
S PL RDG+ +L + TS I+S S + + Y
Sbjct: 116 SAPLAERDGSPKFLETLKTERTSPLTSDILS-LEPRSPSLLQVGYLGGGGQEFSHSLFQS 174
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TE RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 175 TEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRPLIRPKKRLIISK 228
>gi|355704777|gb|EHH30702.1| Eryf1 [Macaca mulatta]
gi|355757338|gb|EHH60863.1| Eryf1 [Macaca fascicularis]
Length = 413
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|54696410|gb|AAV38577.1| GATA binding protein 1 (globin transcription factor 1) [synthetic
construct]
gi|61368845|gb|AAX43246.1| GATA binding protein 1 [synthetic construct]
Length = 414
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|4503925|ref|NP_002040.1| erythroid transcription factor [Homo sapiens]
gi|120956|sp|P15976.1|GATA1_HUMAN RecName: Full=Erythroid transcription factor; AltName: Full=Eryf1;
AltName: Full=GATA-binding factor 1; Short=GATA-1;
Short=GF-1; AltName: Full=NF-E1 DNA-binding protein
gi|31243|emb|CAA35120.1| unnamed protein product [Homo sapiens]
gi|183072|gb|AAA35885.1| erythroid DNA-binding protein [Homo sapiens]
gi|119571136|gb|EAW50751.1| GATA binding protein 1 (globin transcription factor 1), isoform
CRA_c [Homo sapiens]
gi|208966346|dbj|BAG73187.1| GATA binding protein 1 [synthetic construct]
gi|226698|prf||1603357A transcription factor
Length = 413
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|397471373|ref|XP_003807270.1| PREDICTED: erythroid transcription factor [Pan paniscus]
gi|426395812|ref|XP_004064154.1| PREDICTED: erythroid transcription factor [Gorilla gorilla gorilla]
Length = 413
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|335305975|ref|XP_003360352.1| PREDICTED: erythroid transcription factor [Sus scrofa]
Length = 412
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 199 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 251
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 199 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 254
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 255 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 295
>gi|444509358|gb|ELV09217.1| Erythroid transcription factor [Tupaia chinensis]
Length = 408
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/49 (79%), Positives = 43/49 (87%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL
Sbjct: 185 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRL 233
Score = 81.6 bits (200), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 49/107 (45%), Positives = 62/107 (57%), Gaps = 6/107 (5%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGY---FIMSRFLFASLP---VEDI 72
E RECVNCGA +TPLWRRD TGHYLCNA G I + + + I
Sbjct: 185 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLGIISHHFIPQI 244
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G +C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 245 VSKRAGTQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 291
>gi|332255512|ref|XP_003276877.1| PREDICTED: LOW QUALITY PROTEIN: erythroid transcription factor
[Nomascus leucogenys]
Length = 413
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|301764787|ref|XP_002917807.1| PREDICTED: erythroid transcription factor-like [Ailuropoda
melanoleuca]
gi|281353655|gb|EFB29239.1| hypothetical protein PANDA_006176 [Ailuropoda melanoleuca]
Length = 413
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|148234737|ref|NP_001079244.1| GATA-binding factor 1-B [Xenopus laevis]
gi|120969|sp|P23768.1|GAT1B_XENLA RecName: Full=GATA-binding factor 1-B; AltName: Full=Transcription
factor xGATA-1B
gi|214169|gb|AAA49722.1| GATA binding factor-1 [Xenopus laevis]
Length = 364
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 63/114 (55%), Gaps = 15/114 (13%)
Query: 30 STPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF-------------- 75
S PL RDG+ +L + TS I+S S + + Y
Sbjct: 116 SAPLAERDGSPKFLETLKTERTSPLTSDILS-LEPRSPSLLQVGYLGGGGQEFSHSLFQS 174
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TE RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 175 TEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRPLIRPKKRLIISK 228
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 50/104 (48%), Positives = 61/104 (58%), Gaps = 4/104 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
TE RECVNCGA TPLWRRD +GHYLCNA G +R L I
Sbjct: 175 TEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIISKRA 230
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL +
Sbjct: 231 GTQCSNCHTSTTTLWRRNAGGDPVCNACGLYYKLHNVNRPLTMK 274
>gi|410988529|ref|XP_004000536.1| PREDICTED: erythroid transcription factor [Felis catus]
Length = 413
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|74007379|ref|XP_538026.2| PREDICTED: erythroid transcription factor [Canis lupus familiaris]
Length = 412
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 199 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 251
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 199 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 254
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 255 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 295
>gi|329664512|ref|NP_001193169.1| erythroid transcription factor [Bos taurus]
gi|356582212|ref|NP_001239108.1| erythroid transcription factor [Ovis aries]
gi|296470754|tpg|DAA12869.1| TPA: GATA binding protein 1-like [Bos taurus]
gi|342671949|dbj|BAK57313.1| transcription factor GATA1 [Ovis aries]
gi|440912761|gb|ELR62302.1| Erythroid transcription factor [Bos grunniens mutus]
Length = 413
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|307186220|gb|EFN71904.1| Transcription factor GATA-4 [Camponotus floridanus]
Length = 713
Score = 92.8 bits (229), Expect = 4e-17, Method: Composition-based stats.
Identities = 44/102 (43%), Positives = 57/102 (55%), Gaps = 4/102 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE- 77
E +EC NC + TPLWRRDG GHYLCNA G + + P + +
Sbjct: 486 ETKECANCASQMTPLWRRDGAGHYLCNACGLYGKMNG--VNRPPMRCPKPKQTVPTGVRR 543
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 544 TGVQCANCRTNNTTLWRRNNNGEPVCNACGLYFKLHNVNRPL 585
Score = 84.3 bits (207), Expect = 2e-14, Method: Composition-based stats.
Identities = 32/47 (68%), Positives = 38/47 (80%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR 123
E +EC NC + TPLWRRDG GHYLCNACGLY KMNG+NRP ++ P+
Sbjct: 486 ETKECANCASQMTPLWRRDGAGHYLCNACGLYGKMNGVNRPPMRCPK 532
>gi|297303806|ref|XP_001104486.2| PREDICTED: erythroid transcription factor-like [Macaca mulatta]
Length = 389
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 176 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 228
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 176 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 231
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 232 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 272
>gi|119571135|gb|EAW50750.1| GATA binding protein 1 (globin transcription factor 1), isoform
CRA_b [Homo sapiens]
Length = 183
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 117 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 169
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/28 (85%), Positives = 25/28 (89%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
E RECVNCGA +TPLWRRD TGHYLCNA
Sbjct: 117 EARECVNCGATATPLWRRDRTGHYLCNA 144
>gi|297709913|ref|XP_002831655.1| PREDICTED: erythroid transcription factor [Pongo abelii]
Length = 358
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 296
>gi|119571134|gb|EAW50749.1| GATA binding protein 1 (globin transcription factor 1), isoform
CRA_a [Homo sapiens]
Length = 334
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 56/94 (59%), Gaps = 4/94 (4%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 112
+C NC +T LWRR+ +G +CNACGLY+K++
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLH 289
>gi|449480617|ref|XP_004176564.1| PREDICTED: LOW QUALITY PROTEIN: GATA-binding factor 3-like
[Taeniopygia guttata]
Length = 445
Score = 92.8 bits (229), Expect = 4e-17, Method: Composition-based stats.
Identities = 49/106 (46%), Positives = 59/106 (55%), Gaps = 13/106 (12%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHY--LCNA---RLFSTSFTGYFIMSRFLFASLPVEDI 72
TEGRECVNCG STPLWRRDG H+ LCNA + R L A+
Sbjct: 259 TEGRECVNCGDTSTPLWRRDGXXHWAILCNAWSLHKMNGQNRPXIKPKRRLSAAR----- 313
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 314 ---RAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPL 356
Score = 78.6 bits (192), Expect = 8e-13, Method: Composition-based stats.
Identities = 38/56 (67%), Positives = 40/56 (71%), Gaps = 3/56 (5%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHY--LCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
TEGRECVNCG STPLWRRDG H+ LCNA L HKMNG NRP +K RRL R
Sbjct: 259 TEGRECVNCGDTSTPLWRRDGXXHWAILCNAWSL-HKMNGQNRPXIKPKRRLSAAR 313
>gi|14602571|gb|AAH09797.1| GATA1 protein [Homo sapiens]
Length = 335
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 252
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 56/94 (59%), Gaps = 4/94 (4%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 255
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 112
+C NC +T LWRR+ +G +CNACGLY+K++
Sbjct: 256 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLH 289
>gi|170043916|ref|XP_001849613.1| GATA transcription factor GATAc [Culex quinquefasciatus]
gi|167867188|gb|EDS30571.1| GATA transcription factor GATAc [Culex quinquefasciatus]
Length = 578
Score = 92.4 bits (228), Expect = 5e-17, Method: Composition-based stats.
Identities = 47/114 (41%), Positives = 60/114 (52%), Gaps = 22/114 (19%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA-RLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
TE RECVNCG+ TPLWRRD GH LCNA L++ G + P +
Sbjct: 446 TEHRECVNCGSSDTPLWRRDNVGHTLCNACALYNRQNPG---------TNRPPNRSQKAK 496
Query: 77 E------------GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G C NC +T LWRR+ G +CNACGLYHK++ ++RPL
Sbjct: 497 QAPKAPVPGNRRTGVTCANCQTTTTTLWRRNNRGEPVCNACGLYHKLHNVDRPL 550
>gi|194227895|ref|XP_001493895.2| PREDICTED: erythroid transcription factor-like [Equus caballus]
Length = 378
Score = 92.4 bits (228), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 39/53 (73%), Positives = 44/53 (83%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +RL +
Sbjct: 165 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSK 217
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 165 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRPK-KRLIVSKRAG 220
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 221 TQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLT 261
>gi|74101554|gb|AAZ99709.1| GATA-6, partial [Bos taurus]
Length = 179
Score = 92.4 bits (228), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 48/97 (49%), Positives = 59/97 (60%), Gaps = 6/97 (6%)
Query: 24 VNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE-GRECV 82
VNCG+I TPLWRRDGTGHYLCNA + G +SR L P + + G C
Sbjct: 1 VNCGSIQTPLWRRDGTGHYLCNACGLYSKMNG---LSRPLIK--PQKRVPSSRRLGLSCA 55
Query: 83 NCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 56 NCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 92
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 35/48 (72%), Positives = 41/48 (85%)
Query: 82 VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
VNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R+ R
Sbjct: 1 VNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSR 48
>gi|157125526|ref|XP_001654370.1| transcription factor GATA-4 (GATA binding factor-4) [Aedes aegypti]
gi|108873605|gb|EAT37830.1| AAEL010224-PA [Aedes aegypti]
Length = 719
Score = 91.3 bits (225), Expect = 1e-16, Method: Composition-based stats.
Identities = 48/105 (45%), Positives = 61/105 (58%), Gaps = 4/105 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA-RLFSTSFTGY-FIMSRFLFASLPVEDIEYF 75
+E RECVNCG+ TPLWRRD GH LCNA L++ G +R A P +
Sbjct: 472 SEPRECVNCGSSDTPLWRRDNVGHTLCNACALYNRQNPGTNRPPNRSHKAKNPPKTPVPG 531
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLYHK++ ++RPL
Sbjct: 532 NRRTGVVCANCSTTTTTLWRRNNRGEPVCNACGLYHKLHNVDRPL 576
>gi|56900908|gb|AAW31747.1| GATA transcription factor GATAc [Aedes aegypti]
Length = 719
Score = 91.3 bits (225), Expect = 1e-16, Method: Composition-based stats.
Identities = 48/105 (45%), Positives = 61/105 (58%), Gaps = 4/105 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA-RLFSTSFTGY-FIMSRFLFASLPVEDIEYF 75
+E RECVNCG+ TPLWRRD GH LCNA L++ G +R A P +
Sbjct: 471 SEPRECVNCGSSDTPLWRRDNVGHTLCNACALYNRQNPGTNRPPNRSHKAKNPPKTPVPG 530
Query: 76 TE--GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLYHK++ ++RPL
Sbjct: 531 NRRTGVVCANCSTTTTTLWRRNNRGEPVCNACGLYHKLHNVDRPL 575
>gi|242010040|ref|XP_002425784.1| GATA binding factor-1B, putative [Pediculus humanus corporis]
gi|212509717|gb|EEB13046.1| GATA binding factor-1B, putative [Pediculus humanus corporis]
Length = 907
Score = 90.9 bits (224), Expect = 2e-16, Method: Composition-based stats.
Identities = 51/140 (36%), Positives = 67/140 (47%), Gaps = 25/140 (17%)
Query: 16 YFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYF 75
Y + +ECVNC TPLWRRD GHYLCNA G ++R L + +
Sbjct: 640 YINDVKECVNCATCRTPLWRRDDDGHYLCNACGLYNKVNG---VNRPLIKANSKKYNSVS 696
Query: 76 TE-----------------GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ G EC NC +T LWRR+ TG +CNACGLY K++G+ RPL
Sbjct: 697 SSSNSASPYSESPANNRRTGVECANCRTTNTTLWRRNNTGEPVCNACGLYFKLHGVPRPL 756
Query: 119 VK-----QPRRLRFDRGNFN 133
Q R+ + G+ N
Sbjct: 757 SMKKDGIQSRKRKPKTGSIN 776
>gi|347967441|ref|XP_001230593.3| AGAP002236-PA [Anopheles gambiae str. PEST]
gi|333466294|gb|EAU77738.3| AGAP002236-PA [Anopheles gambiae str. PEST]
Length = 747
Score = 90.9 bits (224), Expect = 2e-16, Method: Composition-based stats.
Identities = 46/107 (42%), Positives = 60/107 (56%), Gaps = 9/107 (8%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA-RLFSTSFTGY------FIMSRFLFASLPVED 71
E RECVNCG+ TPLWRRD GH LCNA L++ G ++ + P +
Sbjct: 492 EHRECVNCGSSDTPLWRRDIVGHTLCNACALYTRQNPGTNRPPNRSQKAKQTVKTPPAQG 551
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 552 NR--RSGVTCANCQTTTTTLWRRNNQGDPVCNACGLYYKLHSVNRPL 596
>gi|313245922|emb|CBY34903.1| unnamed protein product [Oikopleura dioica]
Length = 459
Score = 90.5 bits (223), Expect = 2e-16, Method: Composition-based stats.
Identities = 34/47 (72%), Positives = 41/47 (87%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
R+CVNC A+STPLWRRD G+YLCNACGLYHK+NG NRPL+K +R+
Sbjct: 278 RQCVNCAAVSTPLWRRDSNGNYLCNACGLYHKVNGCNRPLIKPKKRV 324
Score = 84.0 bits (206), Expect = 2e-14, Method: Composition-based stats.
Identities = 47/99 (47%), Positives = 60/99 (60%), Gaps = 6/99 (6%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI-EYFTEGR 79
R+CVNC A+STPLWRRD G+YLCNA G +R L P + + + G
Sbjct: 278 RQCVNCAAVSTPLWRRDSNGNYLCNACGLYHKVNG---CNRPLIK--PKKRVTQSKRTGA 332
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC T LWRR TG +CNACGLY K++G+NRP+
Sbjct: 333 KCTNCNTTQTTLWRRTTTGDAVCNACGLYQKLHGVNRPI 371
>gi|313232258|emb|CBY09367.1| unnamed protein product [Oikopleura dioica]
Length = 459
Score = 90.5 bits (223), Expect = 2e-16, Method: Composition-based stats.
Identities = 34/47 (72%), Positives = 41/47 (87%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
R+CVNC A+STPLWRRD G+YLCNACGLYHK+NG NRPL+K +R+
Sbjct: 278 RQCVNCAAVSTPLWRRDSNGNYLCNACGLYHKVNGCNRPLIKPKKRV 324
Score = 84.0 bits (206), Expect = 2e-14, Method: Composition-based stats.
Identities = 47/99 (47%), Positives = 60/99 (60%), Gaps = 6/99 (6%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDI-EYFTEGR 79
R+CVNC A+STPLWRRD G+YLCNA G +R L P + + + G
Sbjct: 278 RQCVNCAAVSTPLWRRDSNGNYLCNACGLYHKVNG---CNRPLIK--PKKRVTQSKRTGA 332
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC T LWRR TG +CNACGLY K++G+NRP+
Sbjct: 333 KCTNCNTTQTTLWRRTTTGDAVCNACGLYQKLHGVNRPI 371
>gi|442570364|pdb|3VD6|C Chain C, Both Zn Fingers Of Gata1 Bound To Palindromic Dna
Recognition Site, P21 Crystal Form
gi|442570368|pdb|3VEK|C Chain C, Both Zn Fingers Of Gata1 Bound To Palindromic Dna
Recognition Site, P1 Crystal Form
gi|442570371|pdb|3VEK|F Chain F, Both Zn Fingers Of Gata1 Bound To Palindromic Dna
Recognition Site, P1 Crystal Form
Length = 119
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 38/49 (77%), Positives = 43/49 (87%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++ +R+
Sbjct: 1 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRM 49
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 60/101 (59%), Gaps = 4/101 (3%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
E RECVNCGA +TPLWRRD TGHYLCNA G +R L I G
Sbjct: 1 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNG---QNRPLIRP-KKRMIVSKRAG 56
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+C NC +T LWRR+ +G +CNACGLY K++ +NRPL
Sbjct: 57 TQCTNCQTTTTTLWRRNASGDPVCNACGLYFKLHQVNRPLT 97
>gi|357609511|gb|EHJ66491.1| GATA transcription factor GATAc [Danaus plexippus]
Length = 633
Score = 90.5 bits (223), Expect = 2e-16, Method: Composition-based stats.
Identities = 44/102 (43%), Positives = 57/102 (55%), Gaps = 9/102 (8%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE- 77
E +ECVNC A +TPLWRRDGTGHYLCNA T G ++R + +
Sbjct: 448 EVKECVNCAAATTPLWRRDGTGHYLCNACGLYTRING---VNRPPLKGQKTKPQQALPTN 504
Query: 78 -----GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM 114
G C NC +T LWRR+ G +CNACGLY+K++ +
Sbjct: 505 GNRRVGVTCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNI 546
Score = 85.9 bits (211), Expect = 5e-15, Method: Composition-based stats.
Identities = 33/44 (75%), Positives = 39/44 (88%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
E +ECVNC A +TPLWRRDGTGHYLCNACGLY ++NG+NRP +K
Sbjct: 448 EVKECVNCAAATTPLWRRDGTGHYLCNACGLYTRINGVNRPPLK 491
>gi|283468189|emb|CAN84590.1| GATA-3 protein [Melanogrammus aeglefinus]
Length = 251
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 37/40 (92%), Positives = 38/40 (95%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMN 115
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG N
Sbjct: 212 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQN 251
Score = 65.5 bits (158), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/29 (93%), Positives = 28/29 (96%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 212 SEGRECVNCGATSTPLWRRDGTGHYLCNA 240
>gi|340374204|ref|XP_003385628.1| PREDICTED: hypothetical protein LOC100636783 [Amphimedon
queenslandica]
Length = 611
Score = 88.2 bits (217), Expect = 9e-16, Method: Composition-based stats.
Identities = 44/103 (42%), Positives = 60/103 (58%), Gaps = 14/103 (13%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA-----RLFSTSFTGYFIMSRFLFASLPVEDIEY 74
R+C NCG+ +TPLWRR+ GHYLCNA R+ T+ G+ + + A+L
Sbjct: 292 ARQCNNCGSSNTPLWRRNSEGHYLCNACGLYYRVNGTNRQGH--QKKKVRATLK------ 343
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
+ +C NCG T LWRR G +CN CGLY+K+NG+ RP
Sbjct: 344 -SMNNKCSNCGTTKTVLWRRLENGDPVCNPCGLYYKLNGVARP 385
>gi|321467983|gb|EFX78970.1| hypothetical protein DAPPUDRAFT_305027 [Daphnia pulex]
Length = 321
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/108 (42%), Positives = 59/108 (54%), Gaps = 8/108 (7%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEG 78
EGREC NC AI+TPLWRRDG HYLCNA G + ++ P G
Sbjct: 3 EGRECANCSAIATPLWRRDGNNHYLCNA-------CGLYKLTNGTNRP-PSSGSGNRRAG 54
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NC +T LWRR+ G +CNACGLY K++ + RPL + ++
Sbjct: 55 LTCSNCNTSTTTLWRRNANGEPVCNACGLYFKLHNVCRPLAMKKEGIQ 102
>gi|157169241|gb|ABV25955.1| GATA-binding transcription factor B2 [Capitella teleta]
Length = 282
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 56/102 (54%), Gaps = 10/102 (9%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE-- 77
GREC NCG+ PLWR +GTGH LCNA G +MS F +
Sbjct: 71 GRECANCGSTYAPLWRWNGTGHLLCNA-------CGVHVMSGFAKPVMKTSGGRRSVSRR 123
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+NRP+
Sbjct: 124 VGLSCANCHTSTTTLWRRNNEGEPVCNACGLYFKLHGVNRPM 165
>gi|60594093|pdb|1Y0J|A Chain A, Zinc Fingers As Protein Recognition Motifs: Structural
Basis For The Gata-1FRIEND OF GATA INTERACTION
Length = 46
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 37/45 (82%), Positives = 41/45 (91%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++
Sbjct: 2 SEARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIR 46
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/29 (82%), Positives = 26/29 (89%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+E RECVNCGA +TPLWRRD TGHYLCNA
Sbjct: 2 SEARECVNCGATATPLWRRDRTGHYLCNA 30
>gi|443718381|gb|ELU09033.1| hypothetical protein CAPTEDRAFT_121668 [Capitella teleta]
Length = 131
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 56/102 (54%), Gaps = 10/102 (9%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE-- 77
GREC NCG+ PLWR +GTGH LCNA G +MS F +
Sbjct: 6 GRECANCGSTYAPLWRWNGTGHLLCNA-------CGVHVMSGFAKPVMKTSGGRRSVSRR 58
Query: 78 -GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY K++G+NRP+
Sbjct: 59 VGLSCANCHTSTTTLWRRNNEGEPVCNACGLYFKLHGVNRPM 100
>gi|313234089|emb|CBY19666.1| unnamed protein product [Oikopleura dioica]
Length = 553
Score = 87.0 bits (214), Expect = 2e-15, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 55/101 (54%), Gaps = 12/101 (11%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFAS----LPVEDIEYFT 76
REC NC A +TPLWRRD G+YLCNA G+ SR L P + I
Sbjct: 325 RECANCAATTTPLWRRDKCGNYLCNACGLYYKVNGH---SRPLIKPKKRVAPNKRI---- 377
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G CVNC T LWRR G +CNACGLY K++G+ RP
Sbjct: 378 -GTICVNCKTSQTTLWRRSLKGEPVCNACGLYEKLHGVPRP 417
Score = 80.9 bits (198), Expect = 2e-13, Method: Composition-based stats.
Identities = 37/69 (53%), Positives = 48/69 (69%), Gaps = 5/69 (7%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL----RFDRGNFNC 134
REC NC A +TPLWRRD G+YLCNACGLY+K+NG +RPL+K +R+ R NC
Sbjct: 325 RECANCAATTTPLWRRDKCGNYLCNACGLYYKVNGHSRPLIKPKKRVAPNKRIGTICVNC 384
Query: 135 -YERSTLFR 142
++TL+R
Sbjct: 385 KTSQTTLWR 393
>gi|166234101|emb|CAP72497.1| GATA binding protein 6 [Sus scrofa]
Length = 46
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 37/46 (80%), Positives = 42/46 (91%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K +R
Sbjct: 1 RECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKR 46
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/26 (92%), Positives = 25/26 (96%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNA 46
RECVNCG+I TPLWRRDGTGHYLCNA
Sbjct: 1 RECVNCGSIQTPLWRRDGTGHYLCNA 26
>gi|449295695|gb|EMC91716.1| hypothetical protein BAUCODRAFT_52987, partial [Baudoinia
compniacensis UAMH 10762]
Length = 105
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/111 (42%), Positives = 59/111 (53%), Gaps = 11/111 (9%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLP---VEDIEYFTEGR 79
C NCG TPLWRR TG +CNA G + +R + V+ +
Sbjct: 1 CSNCGTTKTPLWRRSPTGSIICNA-------CGLYYKARNQMRPVGLKRVQQAQITNVVV 53
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRLRFDR 129
C NCG TPLWRRD GH LCNACGLY K++G +RP+ +K+P R R
Sbjct: 54 ACQNCGTTITPLWRRDEAGHTLCNACGLYQKLHGAHRPVQMKKPEIKRRKR 104
>gi|159162363|pdb|1GNF|A Chain A, Solution Structure Of The N-Terminal Zinc Finger Of Murine
Gata-1, Nmr, 25 Structures
Length = 46
Score = 86.3 bits (212), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 37/44 (84%), Positives = 40/44 (90%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL+
Sbjct: 2 SEARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLI 45
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/29 (82%), Positives = 26/29 (89%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+E RECVNCGA +TPLWRRD TGHYLCNA
Sbjct: 2 SEARECVNCGATATPLWRRDRTGHYLCNA 30
>gi|313219632|emb|CBY30553.1| unnamed protein product [Oikopleura dioica]
Length = 368
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 54/102 (52%), Gaps = 4/102 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+E REC NC A +TPLWRRD G+YLCNA G SR L + E
Sbjct: 188 SESRECANCKATTTPLWRRDDAGNYLCNACGLYYKVNG---KSRPLVKPK-RRSVPNKRE 243
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC T LWR+ +CNACGLY K++G+ RPL
Sbjct: 244 GTICDNCKTTETSLWRKSNEQKAVCNACGLYEKLHGVARPLT 285
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 34/49 (69%), Positives = 40/49 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
+E REC NC A +TPLWRRD G+YLCNACGLY+K+NG +RPLVK RR
Sbjct: 188 SESRECANCKATTTPLWRRDDAGNYLCNACGLYYKVNGKSRPLVKPKRR 236
>gi|313232905|emb|CBY09588.1| unnamed protein product [Oikopleura dioica]
Length = 368
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 54/102 (52%), Gaps = 4/102 (3%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
+E REC NC A +TPLWRRD G+YLCNA G SR L + E
Sbjct: 188 SESRECANCKATTTPLWRRDDAGNYLCNACGLYYKVNG---KSRPLVKPK-RRSVPNKRE 243
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC T LWR+ +CNACGLY K++G+ RPL
Sbjct: 244 GTICDNCKTTETSLWRKSNEQKAVCNACGLYEKLHGVARPLT 285
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 34/49 (69%), Positives = 40/49 (81%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
+E REC NC A +TPLWRRD G+YLCNACGLY+K+NG +RPLVK RR
Sbjct: 188 SESRECANCKATTTPLWRRDDAGNYLCNACGLYYKVNGKSRPLVKPKRR 236
>gi|57547700|gb|AAW52540.1| GATA transcription factor pannier-like protein [Schistocerca
americana]
Length = 88
Score = 85.5 bits (210), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 58/96 (60%), Gaps = 12/96 (12%)
Query: 28 AISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRE----CVN 83
AISTPLWRRDGTGHYLCNA G M+R L V+ + T R C N
Sbjct: 1 AISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----VKPSKRLTATRRLGLCCTN 52
Query: 84 CGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
CG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 53 CGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPLA 88
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 38/44 (86%), Positives = 39/44 (88%)
Query: 86 AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL R
Sbjct: 1 AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATR 44
>gi|23094291|emb|CAD45643.1| GATAa protein [Ciona intestinalis]
Length = 431
Score = 84.3 bits (207), Expect = 1e-14, Method: Composition-based stats.
Identities = 43/103 (41%), Positives = 57/103 (55%), Gaps = 11/103 (10%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
+ E RECVNCGAIS WRRD +GH+LC+ +G ++ + PV+
Sbjct: 91 YVEPRECVNCGAISATSWRRDASGHFLCST--CGACRSGSYMRA-------PVKSKGKLA 141
Query: 77 EGRE--CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
R C NC T LWRR G+ +CNACGLY K++G+ RP
Sbjct: 142 TCRRQVCSNCSTTVTTLWRRSPDGNPVCNACGLYQKLHGVPRP 184
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/44 (52%), Positives = 29/44 (65%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ E RECVNCGAIS WRRD +GH+LC+ CG + M P+
Sbjct: 91 YVEPRECVNCGAISATSWRRDASGHFLCSTCGACRSGSYMRAPV 134
>gi|384499201|gb|EIE89692.1| hypothetical protein RO3G_14403 [Rhizopus delemar RA 99-880]
Length = 281
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 50/96 (52%), Gaps = 7/96 (7%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRECV 82
C NCGA STPLWRR LCNA G ++ + D + G C
Sbjct: 124 CSNCGASSTPLWRRSANAQILCNA-------CGLYVSPNTNNLTEDSVDEDNRPSGTVCS 176
Query: 83 NCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
NCG TPLWRR+ G LCNACGLY+K++ RPL
Sbjct: 177 NCGTNKTPLWRRNAEGSPLCNACGLYYKLHNEKRPL 212
Score = 39.7 bits (91), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 19/34 (55%)
Query: 13 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
D + G C NCG TPLWRR+ G LCNA
Sbjct: 165 DEDNRPSGTVCSNCGTNKTPLWRRNAEGSPLCNA 198
>gi|193083007|ref|NP_001122335.1| GATAa protein [Ciona intestinalis]
gi|70569756|dbj|BAE06471.1| transcription factor protein [Ciona intestinalis]
Length = 641
Score = 83.6 bits (205), Expect = 2e-14, Method: Composition-based stats.
Identities = 43/103 (41%), Positives = 57/103 (55%), Gaps = 11/103 (10%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
+ E RECVNCGAIS WRRD +GH+LC+ +G ++ + PV+
Sbjct: 301 YGEPRECVNCGAISATSWRRDASGHFLCST--CGACRSGSYMRA-------PVKSKGKLA 351
Query: 77 EGRE--CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
R C NC T LWRR G+ +CNACGLY K++G+ RP
Sbjct: 352 TCRRQVCSNCSTTVTTLWRRSPDGNPVCNACGLYQKLHGVPRP 394
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 23/44 (52%), Positives = 29/44 (65%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ E RECVNCGAIS WRRD +GH+LC+ CG + M P+
Sbjct: 301 YGEPRECVNCGAISATSWRRDASGHFLCSTCGACRSGSYMRAPV 344
>gi|388855917|emb|CCF50492.1| related to gata transcription factor [Ustilago hordei]
Length = 1129
Score = 83.2 bits (204), Expect = 4e-14, Method: Composition-based stats.
Identities = 51/137 (37%), Positives = 64/137 (46%), Gaps = 30/137 (21%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA-----RLFST--------------------SFTGY 56
+C NCGA STPLWRRD LCNA +L T S T
Sbjct: 612 QCTNCGATSTPLWRRDPNDLLLCNACGLYLKLHKTPRPKSLKNHHSHSHSGHVTPSATPG 671
Query: 57 FIMSRFLFASLPVEDIEYFTEGR-ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMN 115
+ A P D E E C NCG +TPLWR+D GH +CNACGLY K++ +
Sbjct: 672 GASAPGSRAGSPSRDGEPGAEDMMSCFNCGTYTTPLWRKDDAGHTVCNACGLYLKLHNEH 731
Query: 116 RPLVKQP----RRLRFD 128
RP+ + +R R+D
Sbjct: 732 RPVTMRADVIKKRSRYD 748
>gi|57116082|gb|AAW33557.1| GATA transcription factor pannier [Schistocerca americana]
Length = 75
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 38/44 (86%), Positives = 39/44 (88%)
Query: 86 AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL R
Sbjct: 1 AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATR 44
Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/75 (48%), Positives = 41/75 (54%), Gaps = 12/75 (16%)
Query: 28 AISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRE----CVN 83
AISTPLWRRDGTGHYLCNA G M+R L V+ + T R C N
Sbjct: 1 AISTPLWRRDGTGHYLCNACGLYHKMNG---MNRPL-----VKPSKRLTATRRLGLCCTN 52
Query: 84 CGAISTPLWRRDGTG 98
CG +T LWRR+ G
Sbjct: 53 CGTRTTTLWRRNNDG 67
>gi|343425496|emb|CBQ69031.1| related to gata transcription factor [Sporisorium reilianum SRZ2]
Length = 1061
Score = 82.8 bits (203), Expect = 4e-14, Method: Composition-based stats.
Identities = 52/143 (36%), Positives = 66/143 (46%), Gaps = 42/143 (29%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA-----RLFST------------SFTGYFIMS---- 60
+C NCGA STPLWRRD LCNA +L T S G+ S
Sbjct: 611 QCTNCGATSTPLWRRDPNDLLLCNACGLYLKLHKTPRPKSLKNHHSHSHAGHMTPSATPG 670
Query: 61 -------RFLFASLP----VEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYH 109
R S P ED+ C NCG +TPLWR+D GH +CNACGLY
Sbjct: 671 GASAPGSRAGSPSRPGSPGAEDM------MSCFNCGTYTTPLWRKDDAGHTVCNACGLYL 724
Query: 110 KMNGMNRPLVKQP----RRLRFD 128
K++ +RP+ + +R R+D
Sbjct: 725 KLHNEHRPVTMRADVIKKRSRYD 747
>gi|242218586|ref|XP_002475082.1| hypothetical iron transporter biosynthesis regulating transcription
factor [Postia placenta Mad-698-R]
gi|220725761|gb|EED79735.1| hypothetical iron transporter biosynthesis regulating transcription
factor [Postia placenta Mad-698-R]
Length = 111
Score = 82.4 bits (202), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 52/95 (54%), Gaps = 2/95 (2%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRECV 82
C NCGA TPLWRRDG G +CNA + + +S + C
Sbjct: 1 CANCGAEVTPLWRRDGVGKTVCNA--CGLYYKAHGSSRPGTISSGGRSRMRAAVGALSCA 58
Query: 83 NCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
NCG +TPLWRRD G+ +CNACGLY K++G +RP
Sbjct: 59 NCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRP 93
>gi|348517021|ref|XP_003446034.1| PREDICTED: transcription factor GATA-3-like [Oreochromis niloticus]
Length = 170
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 60/116 (51%), Gaps = 9/116 (7%)
Query: 4 FLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFL 63
+L L D++ + RECV+CG S PLWRRD G LCN ++
Sbjct: 17 YLTVCLGGRDVD---QQRECVSCGMQSAPLWRRDAAGRLLCNTCGLRQEANNTPLLRPKR 73
Query: 64 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
AS +G +CVNC T LWRR+ G +CNACGLY++++ +NRPL
Sbjct: 74 RAS------RIQRKGTQCVNCLTERTTLWRRNSAGEAVCNACGLYYRLHRVNRPLA 123
>gi|443687408|gb|ELT90399.1| hypothetical protein CAPTEDRAFT_175842 [Capitella teleta]
Length = 341
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 34/38 (89%), Positives = 36/38 (94%)
Query: 87 ISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLVK RR
Sbjct: 1 MSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVKPQRR 38
Score = 78.6 bits (192), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 56/102 (54%), Gaps = 15/102 (14%)
Query: 29 ISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFAS------------LPVEDIEYFT 76
+STPLWRRDGTGHYLCNA G ++R L PV ++
Sbjct: 1 MSTPLWRRDGTGHYLCNACGLYHKMNG---VNRPLVKPQRRSTGGGSSTTSPVPNLGSRR 57
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY+K++ + RP+
Sbjct: 58 LGLQCANCSTTTTTLWRRNNEGEPVCNACGLYYKLHQVARPI 99
Score = 36.2 bits (82), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 21/37 (56%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
PV ++ G +C NC +T LWRR+ G +CNA
Sbjct: 49 PVPNLGSRRLGLQCANCSTTTTTLWRRNNEGEPVCNA 85
>gi|71019249|ref|XP_759855.1| hypothetical protein UM03708.1 [Ustilago maydis 521]
gi|46099653|gb|EAK84886.1| hypothetical protein UM03708.1 [Ustilago maydis 521]
Length = 1104
Score = 80.1 bits (196), Expect = 3e-13, Method: Composition-based stats.
Identities = 50/143 (34%), Positives = 63/143 (44%), Gaps = 42/143 (29%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA-----RLFST------------------------- 51
+C NCGA STPLWRRD LCNA +L T
Sbjct: 546 QCTNCGATSTPLWRRDPNDLLLCNACGLYLKLHKTPRPRSLKSHHSHSHSGHITPSATPG 605
Query: 52 --SFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYH 109
S G S S ED+ C NCG +TPLWR+D GH +CNACGLY
Sbjct: 606 GASAPGSRAGSPSRSGSPSGEDMT------SCFNCGTYTTPLWRKDDAGHTVCNACGLYL 659
Query: 110 KMNGMNRPLVKQP----RRLRFD 128
K++ +RP+ + +R R+D
Sbjct: 660 KLHNEHRPVTMRADVIKKRSRYD 682
>gi|342320721|gb|EGU12660.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
Length = 1471
Score = 79.0 bits (193), Expect = 7e-13, Method: Composition-based stats.
Identities = 43/106 (40%), Positives = 55/106 (51%), Gaps = 7/106 (6%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA-----RLFSTSFTGYFIMSRFLFASLPVEDIE 73
+G C +CG+I+TPLWRR LCNA +L S F S S +
Sbjct: 1152 DGPVCSHCGSITTPLWRRGPDDELLCNACGLYLKLHSKPRPKTFGKSNASKRSSNGAAAQ 1211
Query: 74 YFTEG--RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCGA STP+WR+D G CNAC LY+K++ +NRP
Sbjct: 1212 AAASGVPPSCSNCGATSTPMWRKDQEGRLCCNACSLYYKLHKVNRP 1257
>gi|452838991|gb|EME40931.1| hypothetical protein DOTSEDRAFT_106841, partial [Dothistroma
septosporum NZE10]
Length = 106
Score = 78.6 bits (192), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 53/99 (53%), Gaps = 12/99 (12%)
Query: 25 NCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR----- 79
NCG TPLWRR G +CNA G + +R + ++ T+
Sbjct: 2 NCGTTKTPLWRRSPAGAVICNA-------CGLYYKARNQMRPVGLKRGTPTTQAPTNVVI 54
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY+K++G +RP+
Sbjct: 55 ACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGAHRPV 93
Score = 40.8 bits (94), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 56 CQNCGTTITPLWRRDEAGHTICNA 79
>gi|380861645|gb|AFF18488.1| GATA456 [Schmidtea mediterranea]
Length = 529
Score = 78.6 bits (192), Expect = 9e-13, Method: Composition-based stats.
Identities = 47/117 (40%), Positives = 58/117 (49%), Gaps = 26/117 (22%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCN-ARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR 79
RECVNCGA +T LW RD +G+YLC+ FS + SR L LP + TE
Sbjct: 258 RECVNCGASNTQLWSRDNSGYYLCDECDRFSQN------NSRNL-EKLPTNQVSSSTENE 310
Query: 80 ------------------ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
EC NC T LWRR+ G +CNACGLY+K++ RPL
Sbjct: 311 FMKKSNANFQQYGKRSDLECSNCKITKTSLWRRNNEGEPVCNACGLYYKLHKSLRPL 367
>gi|323319541|gb|ADX36141.1| GATA456a [Schmidtea polychroa]
Length = 528
Score = 78.2 bits (191), Expect = 9e-13, Method: Composition-based stats.
Identities = 47/117 (40%), Positives = 58/117 (49%), Gaps = 26/117 (22%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCN-ARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR 79
RECVNCGA +T LW RD TG+YLC+ FS + SR + LP + TE
Sbjct: 258 RECVNCGASNTQLWSRDNTGYYLCDECDRFSQN------NSRNI-EKLPTNQVSSSTENE 310
Query: 80 ------------------ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
EC NC T LWRR+ G +CNACGLY+K++ RPL
Sbjct: 311 FMKKSNANFQQYGKRSDLECSNCKITKTSLWRRNNEGEPVCNACGLYYKLHKSLRPL 367
>gi|345110699|pdb|2L6Y|A Chain A, Haddock Model Of Gata1nf:lmo2lim2-Ldb1lid
gi|345110701|pdb|2L6Z|A Chain A, Haddock Model Of Gata1nf:lmo2lim2-Ldb1lid With Fog
Length = 39
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 34/39 (87%), Positives = 35/39 (89%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMN 115
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG N
Sbjct: 1 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQN 39
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 24/28 (85%), Positives = 25/28 (89%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
E RECVNCGA +TPLWRRD TGHYLCNA
Sbjct: 1 EARECVNCGATATPLWRRDRTGHYLCNA 28
>gi|384500433|gb|EIE90924.1| hypothetical protein RO3G_15635 [Rhizopus delemar RA 99-880]
Length = 379
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 58/125 (46%), Gaps = 14/125 (11%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA-----RLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
C NCGA STPLWRR LCNA +L + S E +
Sbjct: 187 CTNCGATSTPLWRRSAEDELLCNACGLYQKLHNAPRPKTLKPHNARKESKDDEGSQLV-- 244
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ----PRRLRFDRGNFN 133
C NC +TPLWRRD G LCNACGLY K++ RPL + +R R++ N N
Sbjct: 245 ---CSNCSTTTTPLWRRDDEGAPLCNACGLYLKLHHERRPLSMKTDIIKKRQRYESNNVN 301
Query: 134 CYERS 138
+S
Sbjct: 302 QPRKS 306
>gi|196049284|dbj|BAG68611.1| GATA-binding protein 2 [Cyprinus carpio]
Length = 244
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 32/34 (94%), Positives = 33/34 (97%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYH 109
+EGRECVNCGA STPLWRRDGTGHYLCNACGLYH
Sbjct: 211 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYH 244
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/29 (93%), Positives = 28/29 (96%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+EGRECVNCGA STPLWRRDGTGHYLCNA
Sbjct: 211 SEGRECVNCGATSTPLWRRDGTGHYLCNA 239
>gi|213409195|ref|XP_002175368.1| GATA zinc finger domain-containing protein [Schizosaccharomyces
japonicus yFS275]
gi|212003415|gb|EEB09075.1| GATA zinc finger domain-containing protein [Schizosaccharomyces
japonicus yFS275]
Length = 585
Score = 77.8 bits (190), Expect = 2e-12, Method: Composition-based stats.
Identities = 45/111 (40%), Positives = 57/111 (51%), Gaps = 11/111 (9%)
Query: 23 CVNCGAISTPLWRRD-GTGHYLCNA-RLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRE 80
C NC TPLWRR H+LCNA L++ Y IM L + P + + G
Sbjct: 393 CFNCKVTHTPLWRRTPDRKHFLCNACGLYAKQ---YGIMRPLLPRTKPAHNKD--NAGLV 447
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL----VKQPRRLRF 127
C NC T LWR+ G +CNACGLY +++G NRP+ K RR RF
Sbjct: 448 CTNCQTKKTSLWRKSPQGQTVCNACGLYARLHGQNRPVNLRKEKISRRRRF 498
>gi|393236397|gb|EJD43946.1| glucocorticoid receptor-like (DNA-binding domain), partial
[Auricularia delicata TFB-10046 SS5]
Length = 131
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/111 (42%), Positives = 57/111 (51%), Gaps = 7/111 (6%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA-----RL-FSTSFTGYFIMSRFLFASLPVEDIEYF 75
EC NCGA TPLWRR CNA +L F + G SR A D
Sbjct: 18 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLNFRSGQDGVRAGSRS-GAGFADGDGVGP 76
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
TE C NC +TPLWR+D G +CNACGLY K++G +RPL + +R
Sbjct: 77 TEPVSCHNCHTTATPLWRKDEEGRTVCNACGLYSKLHGASRPLSMKSESIR 127
>gi|384498142|gb|EIE88633.1| hypothetical protein RO3G_13344 [Rhizopus delemar RA 99-880]
Length = 228
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 42/105 (40%), Positives = 54/105 (51%), Gaps = 17/105 (16%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR-- 79
+C CGA TPLWRR G LCNA G M R A + D+ G
Sbjct: 20 KCSLCGATKTPLWRRGPHGEALCNA-------CGLKHMHRP--AKVKKNDVSSHQAGSFT 70
Query: 80 ------ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G+ +CNACGLY+K++ ++RP+
Sbjct: 71 VFNKLMMCANCRTTTTPLWRRDEAGNTICNACGLYYKLHQVHRPV 115
>gi|170097681|ref|XP_001880060.1| siderophore biosynthesis regulatory protein [Laccaria bicolor
S238N-H82]
gi|164645463|gb|EDR09711.1| siderophore biosynthesis regulatory protein [Laccaria bicolor
S238N-H82]
Length = 928
Score = 75.9 bits (185), Expect = 5e-12, Method: Composition-based stats.
Identities = 42/114 (36%), Positives = 54/114 (47%), Gaps = 11/114 (9%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA---------RLFSTSFTGYFIMSRFLFASLPVEDI 72
EC NCGA TPLWRR CNA R S R A P ++
Sbjct: 532 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNTHGEGRAQAA--PRQET 589
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+C NC +TPLWR+D G +CNACGLY+K++G RP+ + +R
Sbjct: 590 VDVMTAAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHGSARPISMKSDVIR 643
>gi|384489810|gb|EIE81032.1| hypothetical protein RO3G_05737 [Rhizopus delemar RA 99-880]
Length = 393
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 45/96 (46%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRECV 82
C NCGA TPLWRR LCNA + E E C
Sbjct: 234 CTNCGATKTPLWRRSIEDDLLCNACGLYQKLHNAPRPKSLKPHNSKKELKEVEGPKLVCS 293
Query: 83 NCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
NC I TPLWRRD G LCNACGLY+K++ RPL
Sbjct: 294 NCSTIKTPLWRRDDEGAPLCNACGLYYKLHHERRPL 329
Score = 55.1 bits (131), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/42 (54%), Positives = 26/42 (61%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP 122
C NCGA TPLWRR LCNACGLY K++ RP +P
Sbjct: 234 CTNCGATKTPLWRRSIEDDLLCNACGLYQKLHNAPRPKSLKP 275
>gi|323319539|gb|ADX36140.1| GATA123b [Schmidtea polychroa]
Length = 239
Score = 75.5 bits (184), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 32/44 (72%), Positives = 36/44 (81%)
Query: 86 AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
A++TPLWRRDG G+YLCNACGLY KMNG NRPL+K RRL R
Sbjct: 1 AVNTPLWRRDGQGNYLCNACGLYQKMNGQNRPLIKPKRRLSASR 44
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/94 (42%), Positives = 50/94 (53%), Gaps = 12/94 (12%)
Query: 28 AISTPLWRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIEYFTEGRECVN 83
A++TPLWRRDG G+YLCNA G R L AS I C N
Sbjct: 1 AVNTPLWRRDGQGNYLCNACGLYQKMNGQNRPLIKPKRRLSASRRTGTI--------CSN 52
Query: 84 CGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C +T LWRR+ G +CNACGLY K++ ++RP
Sbjct: 53 CNTSTTTLWRRNSNGEPVCNACGLYFKLHSVSRP 86
>gi|341575748|gb|AEK80449.1| gata4 [Bubalus bubalis]
Length = 215
Score = 75.1 bits (183), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 31/38 (81%), Positives = 33/38 (86%)
Query: 92 WRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
WRRDGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 1 WRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 38
Score = 73.2 bits (178), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 47/90 (52%), Gaps = 12/90 (13%)
Query: 34 WRRDGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIEYFTEGRECVNCGAIST 89
WRRDGTGHYLCNA G R L AS V G C NC +T
Sbjct: 1 WRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--------GLSCANCQTTTT 52
Query: 90 PLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 53 TLWRRNAEGEPVCNACGLYMKLHGVPRPLA 82
>gi|312384257|gb|EFR29024.1| hypothetical protein AND_02351 [Anopheles darlingi]
Length = 806
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 47/106 (44%), Positives = 58/106 (54%), Gaps = 6/106 (5%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA-RLFSTSFTGY-FIMSRFLFASLPVEDIEYFT 76
E RECVNCG+ TPLWRRD GH LCNA L++ G +R A V+
Sbjct: 585 EHRECVNCGSSDTPLWRRDVVGHTLCNACALYTRQNPGTNRPPTRSHKAKQTVQKAPPAQ 644
Query: 77 EGREC----VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
R NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 645 GNRRSGVTCANCNTTTTTLWRRNNQGDPVCNACGLYFKLHNVNRPL 690
>gi|449543979|gb|EMD34953.1| hypothetical protein CERSUDRAFT_116480 [Ceriporiopsis subvermispora
B]
Length = 830
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 42/114 (36%), Positives = 56/114 (49%), Gaps = 13/114 (11%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA---------RLFSTSFTGYFIMSRFLFASLPVEDI 72
EC NCGA TPLWRR CNA R S + R S P ++
Sbjct: 432 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNHSGEGRT--QSAPRQES 489
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ +C NC +TPLWR+D G +CNACGLY+K++G RP+ + +R
Sbjct: 490 QEVIA--QCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHGSARPISMKSDVIR 541
>gi|169854435|ref|XP_001833892.1| GATA-binding factor 6-B [Coprinopsis cinerea okayama7#130]
gi|116505027|gb|EAU87922.1| GATA-binding factor 6-B [Coprinopsis cinerea okayama7#130]
Length = 877
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 47/130 (36%), Positives = 59/130 (45%), Gaps = 29/130 (22%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRE- 80
EC NCGA TPLWRR CNA G + + P EGR+
Sbjct: 440 ECSNCGATHTPLWRRGLNDELNCNA-------CGLY--CKLHKRPRPKSMRNNHGEGRQQ 490
Query: 81 -------------CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP----R 123
C NC +TPLWR+D G +CNACGLY+K++G RP+ + +
Sbjct: 491 VQPRQETVDVMAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHGSARPISMKSDVIRK 550
Query: 124 RLRFD--RGN 131
R R D RGN
Sbjct: 551 RSRHDARRGN 560
>gi|390595728|gb|EIN05132.1| hypothetical protein PUNSTDRAFT_122479 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 839
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 43/115 (37%), Positives = 57/115 (49%), Gaps = 14/115 (12%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA---------RLFSTSFTGYFIMSRFLFASLPVED- 71
EC NCGA TPLWRR CNA R + R A P +D
Sbjct: 414 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKTMRNSHGEGRTQVA--PRQDT 471
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
++ T +C NC +TPLWR+D G +CNACGLY+K++G RP+ + +R
Sbjct: 472 VDVMTA--QCYNCHTTTTPLWRKDDEGKTVCNACGLYYKLHGAARPISMKSDVIR 524
>gi|383863207|ref|XP_003707073.1| PREDICTED: uncharacterized protein LOC100875315 [Megachile
rotundata]
Length = 703
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 38/102 (37%), Positives = 54/102 (52%), Gaps = 2/102 (1%)
Query: 18 TEGRECVNCGAISTPLWRRD-GTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
T+ +EC C ++T +WRRD TGH C+ L++ R V
Sbjct: 438 TDPKECSGCANLTT-IWRRDDATGHCYCHTCLYNKMNGTNRPSMRLGKPKQAVAPTGVRR 496
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 497 TGVQCANCRTTNTTLWRRNNNGEPVCNACGLYYKLHNVNRPL 538
Score = 50.4 bits (119), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/46 (50%), Positives = 32/46 (69%), Gaps = 3/46 (6%)
Query: 76 TEGRECVNCGAISTPLWRRD-GTGHYLCNACGLYHKMNGMNRPLVK 120
T+ +EC C ++T +WRRD TGH C+ C LY+KMNG NRP ++
Sbjct: 438 TDPKECSGCANLTT-IWRRDDATGHCYCHTC-LYNKMNGTNRPSMR 481
>gi|380017690|ref|XP_003692780.1| PREDICTED: uncharacterized protein LOC100866028 [Apis florea]
Length = 809
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 41/105 (39%), Positives = 58/105 (55%), Gaps = 13/105 (12%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASL-------PVEDIE 73
+EC NC AI T + RRD G+YLC + ++T+ M+ AS+ V
Sbjct: 543 KECSNC-AILTTVLRRDEAGNYLCQSCAYTTN-----KMNGINRASIKCGKPKQAVATAG 596
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 597 VRRTGVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVNRPL 641
Score = 46.2 bits (108), Expect = 0.004, Method: Composition-based stats.
Identities = 22/43 (51%), Positives = 29/43 (67%), Gaps = 2/43 (4%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLY-HKMNGMNRPLVK 120
+EC NC AI T + RRD G+YLC +C +KMNG+NR +K
Sbjct: 543 KECSNC-AILTTVLRRDEAGNYLCQSCAYTTNKMNGINRASIK 584
>gi|19075466|ref|NP_587966.1| transcription factor, zf-GATA type (predicted) [Schizosaccharomyces
pombe 972h-]
gi|74627051|sp|O94720.1|YCF8_SCHPO RecName: Full=GATA zinc finger domain-containing protein C1393.08
gi|4468727|emb|CAB38164.1| transcription factor, zf-GATA type (predicted) [Schizosaccharomyces
pombe]
Length = 557
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 39/99 (39%), Positives = 56/99 (56%), Gaps = 10/99 (10%)
Query: 23 CVNCGAISTPLWRRDGTG-HYLCNA-RLFSTSFTGYFIMS-RFLFASLPVEDIEYFTEGR 79
C NCG TPLWRR ++LCNA L++ + +S R +S +E++
Sbjct: 365 CFNCGVTETPLWRRTSDKLNFLCNACGLYNKQYGVMRPLSPRNKGSSKALENLV------ 418
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC + T LWR+D G +CNACGLY +++G NRP+
Sbjct: 419 -CANCSSTKTSLWRKDRHGQTVCNACGLYARLHGHNRPI 456
>gi|409042948|gb|EKM52431.1| hypothetical protein PHACADRAFT_260812 [Phanerochaete carnosa
HHB-10118-sp]
Length = 828
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 42/119 (35%), Positives = 54/119 (45%), Gaps = 23/119 (19%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR-- 79
EC NCGA TPLWRR CNA G + + P EGR
Sbjct: 410 ECSNCGATHTPLWRRGLNDELNCNA-------CGLY--CKLHKRPRPKSMRNNHGEGRSQ 460
Query: 80 ------------ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+C NC +TPLWR+D G +CNACGLY+K++G RP+ + +R
Sbjct: 461 TAPRQESQEVVAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHGSARPISMKSDVIR 519
>gi|393219358|gb|EJD04845.1| hypothetical protein FOMMEDRAFT_27143 [Fomitiporia mediterranea
MF3/22]
Length = 930
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 44/117 (37%), Positives = 55/117 (47%), Gaps = 22/117 (18%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA---------RLFSTSFTGYFIMSRFLFASLPVED- 71
EC NCGA TPLWRR CNA + S R S P D
Sbjct: 417 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKKPRPKSMRNQHGEGRQ--QSAPRNDN 474
Query: 72 ---------IEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
I +FT+ +C NC +TPLWR+D G +CNACGLY K++G +RP+
Sbjct: 475 SDAMGEPVLINWFTKISAQCYNCHTTTTPLWRKDDEGKTVCNACGLYFKLHGSSRPI 531
Score = 40.0 bits (92), Expect = 0.30, Method: Composition-based stats.
Identities = 16/34 (47%), Positives = 22/34 (64%), Gaps = 1/34 (2%)
Query: 14 IEYFTE-GRECVNCGAISTPLWRRDGTGHYLCNA 46
I +FT+ +C NC +TPLWR+D G +CNA
Sbjct: 484 INWFTKISAQCYNCHTTTTPLWRKDDEGKTVCNA 517
>gi|392563238|gb|EIW56417.1| hypothetical protein TRAVEDRAFT_73009 [Trametes versicolor
FP-101664 SS1]
Length = 820
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 42/119 (35%), Positives = 54/119 (45%), Gaps = 23/119 (19%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR-- 79
EC NCGA TPLWRR CNA G + + P EGR
Sbjct: 420 ECSNCGATHTPLWRRGLNDELNCNA-------CGLY--CKLHKRPRPKSMRNNHGEGRAQ 470
Query: 80 ------------ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+C NC +TPLWR+D G +CNACGLY+K++G RP+ + +R
Sbjct: 471 AAPRQESQEVVAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHGSARPISMKSDVIR 529
>gi|388580872|gb|EIM21184.1| iron transporter biosynthesis regulating transcription factor,
partial [Wallemia sebi CBS 633.66]
Length = 113
Score = 72.8 bits (177), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 47/97 (48%), Gaps = 1/97 (1%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFT-EGREC 81
C NCG +TPLWRR LCNA + ++ DI + +C
Sbjct: 1 CSNCGTSNTPLWRRGLNDQTLCNACGLYEKNRNTPRPTTLQSTTINQSDINKTSGSSLQC 60
Query: 82 VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
NCG +TPLWRR G CNACGLY K++ RP+
Sbjct: 61 ANCGTTTTPLWRRTDDGKPQCNACGLYQKLHNAPRPV 97
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 23/41 (56%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
C NCG +TPLWRR LCNACGLY K RP Q
Sbjct: 1 CSNCGTSNTPLWRRGLNDQTLCNACGLYEKNRNTPRPTTLQ 41
Score = 37.0 bits (84), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NCG +TPLWRR G CNA
Sbjct: 59 QCANCGTTTTPLWRRTDDGKPQCNA 83
>gi|302677612|ref|XP_003028489.1| hypothetical protein SCHCODRAFT_86018 [Schizophyllum commune H4-8]
gi|300102177|gb|EFI93586.1| hypothetical protein SCHCODRAFT_86018 [Schizophyllum commune H4-8]
Length = 469
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 41/106 (38%), Positives = 54/106 (50%), Gaps = 13/106 (12%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA---------RLFSTSFTGYFIMSRFLFASLPVEDI 72
EC NCGA TPLWRR CNA R S SR + P +++
Sbjct: 58 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNSNDGSRN--NNQPRQEV 115
Query: 73 EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC +TPLWR+D G +CNACGLY+K++G +RP+
Sbjct: 116 ADVMA--QCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHGTSRPI 159
>gi|336363936|gb|EGN92304.1| hypothetical protein SERLA73DRAFT_65990 [Serpula lacrymans var.
lacrymans S7.3]
Length = 772
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 42/120 (35%), Positives = 54/120 (45%), Gaps = 24/120 (20%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR-- 79
EC NCGA TPLWRR CNA G + + P EGR
Sbjct: 373 ECSNCGATHTPLWRRGLNDELNCNA-------CGLY--CKLHKRPRPKSMRSNHGEGRTQ 423
Query: 80 -------------ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+C NC +TPLWR+D G +CNACGLY+K++G RP+ + +R
Sbjct: 424 AAPRQETVDVMVAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHGSARPISMKSDVIR 483
>gi|336379100|gb|EGO20256.1| hypothetical protein SERLADRAFT_452950 [Serpula lacrymans var.
lacrymans S7.9]
Length = 784
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 42/119 (35%), Positives = 54/119 (45%), Gaps = 23/119 (19%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR-- 79
EC NCGA TPLWRR CNA G + + P EGR
Sbjct: 410 ECSNCGATHTPLWRRGLNDELNCNA-------CGLY--CKLHKRPRPKSMRSNHGEGRTQ 460
Query: 80 ------------ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+C NC +TPLWR+D G +CNACGLY+K++G RP+ + +R
Sbjct: 461 AAPRQETVDVMAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHGSARPISMKSDVIR 519
>gi|124111290|gb|ABM92043.1| GATA3 [Pan troglodytes]
Length = 35
Score = 72.0 bits (175), Expect = 7e-11, Method: Composition-based stats.
Identities = 31/35 (88%), Positives = 32/35 (91%)
Query: 91 LWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRL 125
LWRRDGTGHYLCNACGLYHKMNG NRPL+K RRL
Sbjct: 1 LWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRL 35
Score = 36.2 bits (82), Expect = 4.8, Method: Composition-based stats.
Identities = 14/14 (100%), Positives = 14/14 (100%)
Query: 33 LWRRDGTGHYLCNA 46
LWRRDGTGHYLCNA
Sbjct: 1 LWRRDGTGHYLCNA 14
>gi|328786511|ref|XP_001121244.2| PREDICTED: hypothetical protein LOC725389 [Apis mellifera]
Length = 808
Score = 72.0 bits (175), Expect = 7e-11, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 58/102 (56%), Gaps = 7/102 (6%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNARLFSTS-FTGYFIMSRFLFASLPVEDIEYF---T 76
+EC NC AI T + RRD TG+Y+C +++ + G I + P + +
Sbjct: 543 KECPNC-AILTNVLRRDETGNYVCQNCIYAANKING--INRSSIKCGKPKQAVATAGVRR 599
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 600 TGVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVNRPL 641
Score = 42.7 bits (99), Expect = 0.049, Method: Composition-based stats.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 4/44 (9%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLY--HKMNGMNRPLVK 120
+EC NC AI T + RRD TG+Y+C C +Y +K+NG+NR +K
Sbjct: 543 KECPNC-AILTNVLRRDETGNYVCQNC-IYAANKINGINRSSIK 584
>gi|384499461|gb|EIE89952.1| hypothetical protein RO3G_14663 [Rhizopus delemar RA 99-880]
Length = 374
Score = 71.6 bits (174), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 70/170 (41%), Gaps = 41/170 (24%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA-----RLFSTS----FTGYF 57
AS+ DI T C NCG +TPLWRR G +CNA + +TS +
Sbjct: 72 ASVATSDIGSTT-ATSCSNCGTTTTPLWRRSPLGETICNACGLYYKARNTSRPVWLKRNY 130
Query: 58 IMSRFLFASL-PVEDIEYFTEGRE--------------------------CVNCGAISTP 90
+ R L P + + C NC +TP
Sbjct: 131 LKQRQQQKHLAPRQQPPLLAPATQKPIDPPPLPLVLPTAPAKVEHTTDFVCANCSTETTP 190
Query: 91 LWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ----PRRLRFDRGNFNCYE 136
LWRRD +G +CNACGLY+K++ ++RP+ + RR R N ++
Sbjct: 191 LWRRDESGQPICNACGLYYKLHHVHRPVTMKRSTIKRRKRVTAANLTTHK 240
>gi|340719407|ref|XP_003398145.1| PREDICTED: hypothetical protein LOC100650330 [Bombus terrestris]
Length = 813
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 36/98 (36%), Positives = 52/98 (53%), Gaps = 2/98 (2%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRE 80
+EC CG T W+RD TG + C++ ++ + M R V G +
Sbjct: 539 KECFGCGN-PTSSWKRDETGRFYCHSCIYKMNGINRSSM-RCGKPKQTVATAGVRRTGVQ 596
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 597 CANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVNRPL 634
>gi|384484811|gb|EIE76991.1| hypothetical protein RO3G_01695 [Rhizopus delemar RA 99-880]
Length = 326
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 61/135 (45%), Gaps = 23/135 (17%)
Query: 22 ECVNCGAISTPLWRRDGTGHY-LCNA-----------RLFSTSFTGYFIMSRFLFASLPV 69
EC NC TPLWRR Y LCNA R + + + SLP+
Sbjct: 150 ECFNCKVTKTPLWRRTPDRKYSLCNACGLYYKQYNHHRPLHVRNKTHTVRAHPYDRSLPL 209
Query: 70 E----DIEYFTE-----GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
++ F E +EC NC TPLWR++ G LCNACGLY K+ +RP+
Sbjct: 210 SITKPELASFAEQSVEISQECANCHQTQTPLWRKNERGEPLCNACGLYAKLRQRDRPV-- 267
Query: 121 QPRRLRFDRGNFNCY 135
+ R+ R +C+
Sbjct: 268 EMRKTTIQRRRRDCW 282
Score = 38.9 bits (89), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 22/41 (53%), Gaps = 3/41 (7%)
Query: 6 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
AS + +E +EC NC TPLWR++ G LCNA
Sbjct: 216 LASFAEQSVEI---SQECANCHQTQTPLWRKNERGEPLCNA 253
>gi|384489627|gb|EIE80849.1| hypothetical protein RO3G_05554 [Rhizopus delemar RA 99-880]
Length = 221
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 57/129 (44%), Gaps = 27/129 (20%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA-----RLFSTSFTGYF---IMSRFLFASLPV 69
T C NCG +TPLWRR G +CNA + +T + + + L P
Sbjct: 84 TTVTSCSNCGTTTTPLWRRSPLGETICNACGLYYKARNTCRPVWLKRNLSKKNLVRQQPP 143
Query: 70 EDIEYFTEGRE-------------------CVNCGAISTPLWRRDGTGHYLCNACGLYHK 110
TE + C NC +TPLWRRD +G +CNACGLY K
Sbjct: 144 LLAPALTEKKSHQVIVIQPPNRTEENNNLVCANCSTKTTPLWRRDESGQPICNACGLYFK 203
Query: 111 MNGMNRPLV 119
++ ++RP+
Sbjct: 204 LHNVHRPMT 212
>gi|164662389|ref|XP_001732316.1| hypothetical protein MGL_0091 [Malassezia globosa CBS 7966]
gi|159106219|gb|EDP45102.1| hypothetical protein MGL_0091 [Malassezia globosa CBS 7966]
Length = 435
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 47/145 (32%), Positives = 63/145 (43%), Gaps = 25/145 (17%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV 69
P + E C NCG +TPLWRR+ LCNA G ++ L +
Sbjct: 159 PSSNTTSSEEAPICSNCGTDNTPLWRRNHNMLLLCNA-------CGLYLKIHKTHRPLLL 211
Query: 70 EDIEYFTEGRE-------------CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNR 116
+ R C NCG TPLWR+ +G LCNACGLY K++ NR
Sbjct: 212 RKRQQLNSTRASQSQDPCSGPSSGCTNCGTKVTPLWRKGISGAVLCNACGLYLKLHQSNR 271
Query: 117 PLVKQP----RRLRFD-RGNFNCYE 136
P+ + +R R+D RG + E
Sbjct: 272 PVRYRADVIRKRSRYDNRGRISQSE 296
>gi|389743019|gb|EIM84204.1| hypothetical protein STEHIDRAFT_123062 [Stereum hirsutum FP-91666
SS1]
Length = 939
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 43/113 (38%), Positives = 54/113 (47%), Gaps = 12/113 (10%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA-----RLFSTSFTGYFIMS---RFLFASLPVEDIE 73
EC NCGA TPLWRR CNA +L + R A P E E
Sbjct: 440 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKTMRNNHGERSQAAPRP-ESNE 498
Query: 74 YFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
+ C NC +TPLWR+D G +CNACGLY K++G RP+ + +R
Sbjct: 499 VMAK---CYNCNTTATPLWRKDDEGKTVCNACGLYFKLHGSPRPISMKSDVIR 548
>gi|384488088|gb|EIE80268.1| hypothetical protein RO3G_04973 [Rhizopus delemar RA 99-880]
Length = 336
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 48/122 (39%), Gaps = 26/122 (21%)
Query: 22 ECVNCGAISTPLWRRDGT-GHYLCNA-------------------------RLFSTSFTG 55
EC NC TPLWRR H LCNA +
Sbjct: 151 ECFNCKVTKTPLWRRTPDRKHSLCNACGLYYKQYNHHRPLHVRNKTHTVRAHPYDRGLPV 210
Query: 56 YFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMN 115
+ + S P + +EC NC TPLWR++ G LCNACGLY K++ +
Sbjct: 211 TIVKPELSYPSSPPSTEQTVEMNQECANCHQTQTPLWRKNERGEPLCNACGLYAKLHNRD 270
Query: 116 RP 117
RP
Sbjct: 271 RP 272
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/40 (57%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 80 ECVNCGAISTPLWRRDGT-GHYLCNACGLYHKMNGMNRPL 118
EC NC TPLWRR H LCNACGLY+K +RPL
Sbjct: 151 ECFNCKVTKTPLWRRTPDRKHSLCNACGLYYKQYNHHRPL 190
Score = 39.3 bits (90), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 21/41 (51%)
Query: 6 FASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+ S P + +EC NC TPLWR++ G LCNA
Sbjct: 219 YPSSPPSTEQTVEMNQECANCHQTQTPLWRKNERGEPLCNA 259
>gi|338728062|ref|XP_003365610.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Equus caballus]
Length = 333
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 50/110 (45%), Gaps = 14/110 (12%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPV 69
P E +E + R+CVNC + T LWR D +C SR L
Sbjct: 124 PDELLEDLPKSRQCVNCASSQTRLWREDRPRSIICAEAWRRLQAKRKDPSSRRL------ 177
Query: 70 EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 178 --------GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 219
>gi|392587614|gb|EIW76948.1| hypothetical protein CONPUDRAFT_110709 [Coniophora puteana
RWD-64-598 SS2]
Length = 807
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 44/136 (32%), Positives = 57/136 (41%), Gaps = 34/136 (25%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA-----------------------------RLFSTS 52
ECVNCGA TPLWRR CNA R +
Sbjct: 399 ECVNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKTMRSNHGEGRTQAAPRQETVD 458
Query: 53 FTGYFIMSRFLFASLPVEDIEYFTEG--RECVNCGAISTPLWRRDGTGHYLCNACGLYHK 110
G + S + PV+ T +C NC +TPLWR+D G +CNACGLY K
Sbjct: 459 VVGGYKASPSM---TPVQTRVASTPAGTAQCYNCHTTATPLWRKDDEGKTVCNACGLYFK 515
Query: 111 MNGMNRPLVKQPRRLR 126
++G RP+ + +R
Sbjct: 516 LHGSARPISMKSDVIR 531
>gi|345570715|gb|EGX53536.1| hypothetical protein AOL_s00006g402 [Arthrobotrys oligospora ATCC
24927]
Length = 984
Score = 69.7 bits (169), Expect = 3e-10, Method: Composition-based stats.
Identities = 25/37 (67%), Positives = 31/37 (83%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD +GH +CNACGLYHK++G++RP
Sbjct: 333 CQNCGTTITPLWRRDESGHTICNACGLYHKLHGVHRP 369
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 24/48 (50%), Positives = 30/48 (62%), Gaps = 1/48 (2%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP-LVKQP 122
++G+ C NCG TPLWRR G +CNACGLY K RP +K+P
Sbjct: 147 SKGQVCSNCGTTRTPLWRRAPNGLTICNACGLYLKARNAARPSTLKRP 194
Score = 45.1 bits (105), Expect = 0.009, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 333 CQNCGTTITPLWRRDESGHTICNA 356
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
++G+ C NCG TPLWRR G +CNA
Sbjct: 147 SKGQVCSNCGTTRTPLWRRAPNGLTICNA 175
>gi|296424766|ref|XP_002841917.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295638169|emb|CAZ86108.1| unnamed protein product [Tuber melanosporum]
Length = 658
Score = 69.7 bits (169), Expect = 3e-10, Method: Composition-based stats.
Identities = 25/37 (67%), Positives = 31/37 (83%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD +GH +CNACGLYHK++G++RP
Sbjct: 353 CQNCGTTITPLWRRDESGHTICNACGLYHKLHGVHRP 389
Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats.
Identities = 23/46 (50%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NC TPLWRR G +CNACGLY K +RP +K+P
Sbjct: 201 GQVCSNCRTTRTPLWRRAPDGQTICNACGLYLKARNQSRPTNLKRP 246
Score = 45.1 bits (105), Expect = 0.009, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 353 CQNCGTTITPLWRRDESGHTICNA 376
Score = 37.4 bits (85), Expect = 2.1, Method: Composition-based stats.
Identities = 14/27 (51%), Positives = 16/27 (59%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NC TPLWRR G +CNA
Sbjct: 201 GQVCSNCRTTRTPLWRRAPDGQTICNA 227
>gi|393234217|gb|EJD41782.1| glucocorticoid receptor-like (DNA-binding domain), partial
[Auricularia delicata TFB-10046 SS5]
Length = 157
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 49/125 (39%), Gaps = 35/125 (28%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA-----------------RLFSTSFTGY--------- 56
C NC TPLWRR G +CNA R S +
Sbjct: 25 CSNCRTTDTPLWRRGADGKSICNACGLYQKSRRMARPTNLQRTPPPSASAQSPQQQNGNG 84
Query: 57 ----FIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 112
M AS P C NCG +TPLWRRD G+ +CNACGLY K++
Sbjct: 85 NGTSLAMPSHSGASTPASPPSL-----SCHNCGTSTTPLWRRDDAGNNICNACGLYLKLH 139
Query: 113 GMNRP 117
G RP
Sbjct: 140 GTQRP 144
>gi|409079370|gb|EKM79731.1| hypothetical protein AGABI1DRAFT_56893 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 386
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 62/126 (49%), Gaps = 22/126 (17%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTG-HYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
++ G++C +C A STPLWRR+ T LCNA G ++ R P E I
Sbjct: 256 VDASAGGKKCSHCNATSTPLWRREPTTLKPLCNA-------CGLYLQQRHRH--RPRELI 306
Query: 73 EYFT------------EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+ G EC +C T +WRR TG LCNACG+Y ++ G +RPL
Sbjct: 307 DADQEDEESEEEDQNYNGPECSHCHTHRTSVWRRSKTGAQLCNACGVYARLRGKDRPLTL 366
Query: 121 QPRRLR 126
+ ++++
Sbjct: 367 KRKKIK 372
>gi|322694341|gb|EFY86173.1| GATA transcription factor [Metarhizium acridum CQMa 102]
Length = 523
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 29/50 (58%), Positives = 39/50 (78%), Gaps = 4/50 (8%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP---RRLR 126
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ +K+P RR R
Sbjct: 230 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKPTIKRRKR 279
Score = 45.1 bits (105), Expect = 0.011, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 230 CQNCGTTITPLWRRDESGHTICNA 253
>gi|170086289|ref|XP_001874368.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164651920|gb|EDR16160.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 169
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 56/113 (49%), Gaps = 23/113 (20%)
Query: 20 GRECVNCGAISTPLWRRDGTG-HYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE- 77
++C +C A STPLWRRD T LCNA G ++ R F P E I+ T+
Sbjct: 49 AKKCSHCQATSTPLWRRDPTTFKTLCNA-------CGLYLQQRNKF--RPQELIDADTDD 99
Query: 78 ------------GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G EC +C T +WRR TG LCNACG+Y ++ G RPL
Sbjct: 100 GDTTDSSDGNYIGPECSHCRTHHTSVWRRSKTGAQLCNACGVYARLRGKPRPL 152
>gi|345320418|ref|XP_001516151.2| PREDICTED: erythroid transcription factor-like [Ornithorhynchus
anatinus]
Length = 223
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 28/31 (90%), Positives = 29/31 (93%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGL 107
E RECVNCGA +TPLWRRDGTGHYLCNACGL
Sbjct: 193 EARECVNCGATATPLWRRDGTGHYLCNACGL 223
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 25/28 (89%), Positives = 26/28 (92%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
E RECVNCGA +TPLWRRDGTGHYLCNA
Sbjct: 193 EARECVNCGATATPLWRRDGTGHYLCNA 220
>gi|302422454|ref|XP_003009057.1| siderophore regulation protein [Verticillium albo-atrum VaMs.102]
gi|261352203|gb|EEY14631.1| siderophore regulation protein [Verticillium albo-atrum VaMs.102]
Length = 581
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 33/38 (86%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 286 CQNCGTTTTPLWRRDESGHTICNACGLYYKLHGVHRPV 323
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 26/50 (52%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKQPRRL 125
G+ C NCG TPLWRR G +CNACGLY K +RP L K+P +L
Sbjct: 120 GQVCSNCGTTQTPLWRRSPQGTTICNACGLYQKARNASRPTSLKKKPPQL 169
Score = 46.6 bits (109), Expect = 0.004, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 19/24 (79%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD +GH +CNA
Sbjct: 286 CQNCGTTTTPLWRRDESGHTICNA 309
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 120 GQVCSNCGTTQTPLWRRSPQGTTICNA 146
>gi|426192692|gb|EKV42628.1| hypothetical protein AGABI2DRAFT_122836 [Agaricus bisporus var.
bisporus H97]
Length = 386
Score = 68.9 bits (167), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 62/126 (49%), Gaps = 22/126 (17%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTG-HYLCNARLFSTSFTGYFIMSRFLFASLPVEDI 72
++ G++C +C A STPLWRR+ T LCNA G ++ R P E I
Sbjct: 256 VDASAGGKKCSHCNATSTPLWRREPTTLKPLCNA-------CGLYLQQRHRH--RPRELI 306
Query: 73 EYFT------------EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 120
+ G EC +C T +WRR TG LCNACG+Y ++ G +RPL
Sbjct: 307 DADQEDEESEEEDQNYNGPECSHCHTHRTSVWRRSKTGAQLCNACGVYARLRGKDRPLTL 366
Query: 121 QPRRLR 126
+ ++++
Sbjct: 367 KRKKIK 372
>gi|346970218|gb|EGY13670.1| siderophore regulation protein [Verticillium dahliae VdLs.17]
Length = 581
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 33/38 (86%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 286 CQNCGTTTTPLWRRDESGHTICNACGLYYKLHGVHRPV 323
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 26/50 (52%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKQPRRL 125
G+ C NCG TPLWRR G +CNACGLY K +RP L K+P +L
Sbjct: 120 GQVCSNCGTTQTPLWRRSPQGTTICNACGLYQKARNASRPTSLKKKPPQL 169
Score = 46.2 bits (108), Expect = 0.004, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 19/24 (79%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD +GH +CNA
Sbjct: 286 CQNCGTTTTPLWRRDESGHTICNA 309
Score = 40.4 bits (93), Expect = 0.23, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 120 GQVCSNCGTTQTPLWRRSPQGTTICNA 146
>gi|238607357|ref|XP_002396957.1| hypothetical protein MPER_02700 [Moniliophthora perniciosa FA553]
gi|215470472|gb|EEB97887.1| hypothetical protein MPER_02700 [Moniliophthora perniciosa FA553]
Length = 198
Score = 68.6 bits (166), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 53/120 (44%), Gaps = 23/120 (19%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA---------RLFSTSFTGYFIMSRFLFASL----- 67
EC NCGA TPLWRR CNA R + R A
Sbjct: 12 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKTMRNSHGEGRSQTAPRQETVD 71
Query: 68 --------PVED-IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
P+ D + F +C NC +TPLWR+D G +CNACGLY+K++G RP+
Sbjct: 72 VVGKYSFRPIHDGVSNFFPLAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHGSARPI 131
Score = 37.0 bits (84), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 1/38 (2%)
Query: 10 PVED-IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
P+ D + F +C NC +TPLWR+D G +CNA
Sbjct: 80 PIHDGVSNFFPLAQCYNCHTTATPLWRKDDEGKTVCNA 117
>gi|396465058|ref|XP_003837137.1| hypothetical protein LEMA_P033710.1 [Leptosphaeria maculans JN3]
gi|312213695|emb|CBX93697.1| hypothetical protein LEMA_P033710.1 [Leptosphaeria maculans JN3]
Length = 511
Score = 68.6 bits (166), Expect = 9e-10, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 34/46 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NCG TPLWRRD GH +CNACGLY+K++G +RP+ + + ++
Sbjct: 271 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTHRPIAMKKQEIK 316
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G+ C NC TPLWRR TG LCNACGLY K +RP+
Sbjct: 109 GQICSNCRTTQTPLWRRSPTGETLCNACGLYMKARNQSRPV 149
Score = 44.7 bits (104), Expect = 0.011, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 271 CQNCGTTITPLWRRDDAGHIICNA 294
Score = 40.8 bits (94), Expect = 0.18, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NC TPLWRR TG LCNA
Sbjct: 109 GQICSNCRTTQTPLWRRSPTGETLCNA 135
>gi|322712581|gb|EFZ04154.1| GATA transcription factor [Metarhizium anisopliae ARSEF 23]
Length = 554
Score = 68.6 bits (166), Expect = 9e-10, Method: Composition-based stats.
Identities = 29/50 (58%), Positives = 39/50 (78%), Gaps = 4/50 (8%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP---RRLR 126
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ +K+P RR R
Sbjct: 261 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKPTIKRRKR 310
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NCG TPLWRR G +CNACGLY K RP +K+P
Sbjct: 100 GQVCSNCGTTRTPLWRRSPQGATICNACGLYQKARNTARPTSLKKP 145
Score = 44.7 bits (104), Expect = 0.014, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 261 CQNCGTTITPLWRRDESGHTICNA 284
Score = 40.0 bits (92), Expect = 0.32, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 100 GQVCSNCGTTRTPLWRRSPQGATICNA 126
>gi|302677711|ref|XP_003028538.1| hypothetical protein SCHCODRAFT_11907 [Schizophyllum commune H4-8]
gi|300102227|gb|EFI93635.1| hypothetical protein SCHCODRAFT_11907 [Schizophyllum commune H4-8]
Length = 279
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 60/127 (47%), Gaps = 15/127 (11%)
Query: 8 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTG-HYLCNARLFSTSFTGYFIMSRFLF-A 65
SLP + ++C +CG TPLWRRD + LCNA G F R +
Sbjct: 146 SLPNASQKPQQTEKQCSHCGVKQTPLWRRDPSNFQLLCNA-------CGLFYKQRHMHRP 198
Query: 66 SLPVEDIEYFTEGRE------CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
+ +E + G + C +CG T +WRR G +CNACG+Y ++ G RPL
Sbjct: 199 KVLIEADQEDDTGEDDPNAPTCSHCGTHRTSVWRRGKDGTQVCNACGVYSRLRGKERPLA 258
Query: 120 KQPRRLR 126
+ ++R
Sbjct: 259 LKKNKIR 265
>gi|402220446|gb|EJU00517.1| glucocorticoid receptor-like protein, partial [Dacryopinax sp.
DJM-731 SS1]
Length = 138
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 55/105 (52%), Gaps = 7/105 (6%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYL-CNA-RLFSTSFTGYFIMSRFLFASLPVEDIEYFT 76
E +C NCGA TPLWRR G G L CNA L+ ++ A+ ++ +
Sbjct: 22 EEAQCANCGAKRTPLWRR-GLGDELNCNACGLYVKQHKKPRPLNLQRDANAAYKNGAGGS 80
Query: 77 ----EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
E C NC +TPLWR+D G LCNACGLY K++G RP
Sbjct: 81 GQPQEPTRCSNCDTTNTPLWRKDADGSTLCNACGLYMKLHGAPRP 125
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYL-CNACGLYHKMNGMNRPLVKQ 121
E +C NCGA TPLWRR G G L CNACGLY K + RPL Q
Sbjct: 22 EEAQCANCGAKRTPLWRR-GLGDELNCNACGLYVKQHKKPRPLNLQ 66
>gi|346323174|gb|EGX92772.1| GATA factor SREP [Cordyceps militaris CM01]
Length = 533
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 32/38 (84%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRDG GH +CNACGLY++++G++RP+
Sbjct: 242 CQNCGTTITPLWRRDGNGHVICNACGLYYRLHGVHRPV 279
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRL 125
C NCG TPLWRR G +CNACGLY + RP +K+P ++
Sbjct: 87 CSNCGTTETPLWRRSPQGATICNACGLYLRARNSARPTNLKKPPKV 132
Score = 47.0 bits (110), Expect = 0.003, Method: Composition-based stats.
Identities = 17/24 (70%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRDG GH +CNA
Sbjct: 242 CQNCGTTITPLWRRDGNGHVICNA 265
Score = 38.1 bits (87), Expect = 1.1, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRR G +CNA
Sbjct: 87 CSNCGTTETPLWRRSPQGATICNA 110
>gi|330921388|ref|XP_003299406.1| hypothetical protein PTT_10383 [Pyrenophora teres f. teres 0-1]
gi|311326936|gb|EFQ92491.1| hypothetical protein PTT_10383 [Pyrenophora teres f. teres 0-1]
Length = 465
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 34/46 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NCG TPLWRRD GH +CNACGLY+K++G +RP+ + + ++
Sbjct: 232 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTHRPVAMKKQEIK 277
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 5/58 (8%)
Query: 66 SLPVEDIEYFTE-----GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
SL +++I Y G+ C NC TPLWRR +G +CNACGLY K +RP+
Sbjct: 62 SLQIQNINYEASNAPVTGQVCSNCKTTQTPLWRRSPSGETVCNACGLYMKARNQSRPV 119
Score = 44.7 bits (104), Expect = 0.012, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 232 CQNCGTTITPLWRRDDAGHIICNA 255
Score = 40.4 bits (93), Expect = 0.21, Method: Composition-based stats.
Identities = 18/44 (40%), Positives = 24/44 (54%), Gaps = 5/44 (11%)
Query: 8 SLPVEDIEYFTE-----GRECVNCGAISTPLWRRDGTGHYLCNA 46
SL +++I Y G+ C NC TPLWRR +G +CNA
Sbjct: 62 SLQIQNINYEASNAPVTGQVCSNCKTTQTPLWRRSPSGETVCNA 105
>gi|189206315|ref|XP_001939492.1| GATA factor SREP [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187975585|gb|EDU42211.1| GATA factor SREP [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 443
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 34/46 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NCG TPLWRRD GH +CNACGLY+K++G +RP+ + + ++
Sbjct: 210 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTHRPVAMKKQEIK 255
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 5/58 (8%)
Query: 66 SLPVEDIEY-----FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
SL +++I Y G+ C NC TPLWRR +G +CNACGLY K +RP+
Sbjct: 59 SLQIQNINYDASNATVTGQVCSNCKTTQTPLWRRSPSGETVCNACGLYMKARNQSRPV 116
Score = 44.7 bits (104), Expect = 0.013, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 210 CQNCGTTITPLWRRDDAGHIICNA 233
Score = 40.4 bits (93), Expect = 0.21, Method: Composition-based stats.
Identities = 18/44 (40%), Positives = 24/44 (54%), Gaps = 5/44 (11%)
Query: 8 SLPVEDIEY-----FTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
SL +++I Y G+ C NC TPLWRR +G +CNA
Sbjct: 59 SLQIQNINYDASNATVTGQVCSNCKTTQTPLWRRSPSGETVCNA 102
>gi|417079326|gb|AFX60070.1| GATA binding protein 1, partial [Mesocricetus auratus]
Length = 212
Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/32 (87%), Positives = 29/32 (90%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLY 108
E RECVNCGA +TPLWRRD TGHYLCNACGLY
Sbjct: 181 EARECVNCGATATPLWRRDRTGHYLCNACGLY 212
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 24/28 (85%), Positives = 25/28 (89%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
E RECVNCGA +TPLWRRD TGHYLCNA
Sbjct: 181 EARECVNCGATATPLWRRDRTGHYLCNA 208
>gi|453081543|gb|EMF09592.1| hypothetical protein SEPMUDRAFT_72436 [Mycosphaerella populorum
SO2202]
Length = 489
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 25/38 (65%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLYHK++G +RP+
Sbjct: 289 CQNCGTTITPLWRRDENGHTICNACGLYHKLHGAHRPV 326
Score = 55.5 bits (132), Expect = 8e-06, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 24/38 (63%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRR G +CNACGLY+K RP+
Sbjct: 132 CSNCGTTKTPLWRRSPAGAVICNACGLYYKARNQMRPV 169
Score = 43.9 bits (102), Expect = 0.023, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 289 CQNCGTTITPLWRRDENGHTICNA 312
Score = 38.9 bits (89), Expect = 0.73, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRR G +CNA
Sbjct: 132 CSNCGTTKTPLWRRSPAGAVICNA 155
>gi|451856759|gb|EMD70050.1| hypothetical protein COCSADRAFT_216437 [Cochliobolus sativus
ND90Pr]
Length = 507
Score = 67.8 bits (164), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 34/46 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NCG TPLWRRD GH +CNACGLY+K++G +RP+ + + ++
Sbjct: 274 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTHRPVAMKKQEIK 319
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 33/59 (55%), Gaps = 5/59 (8%)
Query: 65 ASLPVEDIEYFT-----EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
ASL + ++ Y T G+ C NC TPLWRR G +CNACGLY K +RP+
Sbjct: 98 ASLQIPNMNYETPNASITGQICSNCKTTQTPLWRRSPAGETVCNACGLYMKARNQSRPV 156
Score = 44.7 bits (104), Expect = 0.014, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 274 CQNCGTTITPLWRRDDAGHIICNA 297
Score = 40.4 bits (93), Expect = 0.26, Method: Composition-based stats.
Identities = 19/45 (42%), Positives = 24/45 (53%), Gaps = 5/45 (11%)
Query: 7 ASLPVEDIEYFT-----EGRECVNCGAISTPLWRRDGTGHYLCNA 46
ASL + ++ Y T G+ C NC TPLWRR G +CNA
Sbjct: 98 ASLQIPNMNYETPNASITGQICSNCKTTQTPLWRRSPAGETVCNA 142
>gi|451993853|gb|EMD86325.1| hypothetical protein COCHEDRAFT_109473 [Cochliobolus heterostrophus
C5]
Length = 520
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 34/46 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NCG TPLWRRD GH +CNACGLY+K++G +RP+ + + ++
Sbjct: 287 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTHRPVAMKKQEIK 332
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 33/59 (55%), Gaps = 5/59 (8%)
Query: 65 ASLPVEDIEYFT-----EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
ASL + ++ Y T G+ C NC TPLWRR G +CNACGLY K +RP+
Sbjct: 110 ASLQIPNMNYETPNASITGQVCSNCKTTQTPLWRRSPAGETVCNACGLYMKARNQSRPV 168
Score = 44.3 bits (103), Expect = 0.015, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 287 CQNCGTTITPLWRRDDAGHIICNA 310
Score = 40.4 bits (93), Expect = 0.23, Method: Composition-based stats.
Identities = 19/45 (42%), Positives = 24/45 (53%), Gaps = 5/45 (11%)
Query: 7 ASLPVEDIEYFT-----EGRECVNCGAISTPLWRRDGTGHYLCNA 46
ASL + ++ Y T G+ C NC TPLWRR G +CNA
Sbjct: 110 ASLQIPNMNYETPNASITGQVCSNCKTTQTPLWRRSPAGETVCNA 154
>gi|408399802|gb|EKJ78894.1| hypothetical protein FPSE_00936 [Fusarium pseudograminearum CS3096]
Length = 557
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 35/46 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ + ++
Sbjct: 260 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTMKKSTIK 305
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/48 (50%), Positives = 31/48 (64%), Gaps = 1/48 (2%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
++G+ C NCG TPLWRR G +CNACGLY K +RP +K+P
Sbjct: 99 SQGQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKRP 146
Score = 44.7 bits (104), Expect = 0.015, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 260 CQNCGTTITPLWRRDESGHTICNA 283
Score = 41.2 bits (95), Expect = 0.13, Method: Composition-based stats.
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
++G+ C NCG TPLWRR G +CNA
Sbjct: 99 SQGQVCSNCGTTRTPLWRRSPQGATICNA 127
>gi|302693230|ref|XP_003036294.1| hypothetical protein SCHCODRAFT_66326 [Schizophyllum commune H4-8]
gi|300109990|gb|EFJ01392.1| hypothetical protein SCHCODRAFT_66326 [Schizophyllum commune H4-8]
Length = 428
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 40/108 (37%), Positives = 53/108 (49%), Gaps = 12/108 (11%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA--------RLFSTSFTGYFIMSRFLFASLPVE--- 70
EC NCGA TPLWRR CNA R+ T ++ +S+ E
Sbjct: 250 ECANCGATHTPLWRRGLNDELNCNACGLYYKQHRVPRPKTTARNGANKPAPSSVKQEAEP 309
Query: 71 DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ + +C NC TPLWR+D G +CNACGLY K++G RP+
Sbjct: 310 PVGIICDA-QCHNCQTTVTPLWRKDPDGRTVCNACGLYFKLHGSPRPI 356
>gi|452979166|gb|EME78929.1| hypothetical protein MYCFIDRAFT_216796 [Pseudocercospora fijiensis
CIRAD86]
Length = 434
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 25/38 (65%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLYHK++G +RP+
Sbjct: 264 CQNCGTTITPLWRRDEQGHTICNACGLYHKLHGSHRPV 301
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 24/38 (63%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRR G +CNACGLY+K RP+
Sbjct: 118 CSNCGTTKTPLWRRSPAGAVICNACGLYYKARNQMRPV 155
Score = 43.5 bits (101), Expect = 0.026, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 264 CQNCGTTITPLWRRDEQGHTICNA 287
Score = 38.9 bits (89), Expect = 0.70, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRR G +CNA
Sbjct: 118 CSNCGTTKTPLWRRSPAGAVICNA 141
>gi|440639377|gb|ELR09296.1| hypothetical protein GMDG_03864 [Geomyces destructans 20631-21]
Length = 555
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 32/38 (84%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 270 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPV 307
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 26/40 (65%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G+ C NCG STPLWRR G +CNACGLY K +RP
Sbjct: 111 GQICSNCGTTSTPLWRRSPQGATICNACGLYQKARNASRP 150
Score = 45.1 bits (105), Expect = 0.011, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 270 CQNCGTTITPLWRRDESGHTICNA 293
Score = 42.0 bits (97), Expect = 0.084, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG STPLWRR G +CNA
Sbjct: 111 GQICSNCGTTSTPLWRRSPQGATICNA 137
>gi|390516511|emb|CCE73649.1| GATA-type transcription factor, iron regulation [Fusarium
fujikuroi]
Length = 555
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 35/46 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ + ++
Sbjct: 257 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTMKKSTIK 302
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/48 (50%), Positives = 31/48 (64%), Gaps = 1/48 (2%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
++G+ C NCG TPLWRR G +CNACGLY K +RP +K+P
Sbjct: 97 SQGQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKRP 144
Score = 44.3 bits (103), Expect = 0.015, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 257 CQNCGTTITPLWRRDESGHTICNA 280
Score = 41.2 bits (95), Expect = 0.14, Method: Composition-based stats.
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
++G+ C NCG TPLWRR G +CNA
Sbjct: 97 SQGQVCSNCGTTRTPLWRRSPQGATICNA 125
>gi|46136099|ref|XP_389741.1| hypothetical protein FG09565.1 [Gibberella zeae PH-1]
Length = 557
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 35/46 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ + ++
Sbjct: 260 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTMKKSTIK 305
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/48 (50%), Positives = 31/48 (64%), Gaps = 1/48 (2%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
++G+ C NCG TPLWRR G +CNACGLY K +RP +K+P
Sbjct: 99 SQGQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKRP 146
Score = 44.3 bits (103), Expect = 0.015, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 260 CQNCGTTITPLWRRDESGHTICNA 283
Score = 41.2 bits (95), Expect = 0.14, Method: Composition-based stats.
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
++G+ C NCG TPLWRR G +CNA
Sbjct: 99 SQGQVCSNCGTTRTPLWRRSPQGATICNA 127
>gi|443924595|gb|ELU43592.1| Opi1 domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 1201
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 30/37 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +TPLWRRD G+ +CNACGLYHK++G +RP
Sbjct: 125 CTNCGTSTTPLWRRDDAGNNICNACGLYHKLHGTHRP 161
Score = 43.5 bits (101), Expect = 0.032, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G+ +CNA
Sbjct: 125 CTNCGTSTTPLWRRDDAGNNICNA 148
>gi|367045274|ref|XP_003653017.1| hypothetical protein THITE_2114956 [Thielavia terrestris NRRL 8126]
gi|347000279|gb|AEO66681.1| hypothetical protein THITE_2114956 [Thielavia terrestris NRRL 8126]
Length = 564
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 264 CQNCGTTVTPLWRRDEAGHTICNACGLYYKLHGVHRPV 301
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NCG TPLWRR G +CNACGLY K RP+ +K+P
Sbjct: 104 GQICSNCGTTQTPLWRRSPQGETICNACGLYLKARNAARPINLKRP 149
Score = 44.7 bits (104), Expect = 0.012, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 264 CQNCGTTVTPLWRRDEAGHTICNA 287
Score = 41.2 bits (95), Expect = 0.16, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 104 GQICSNCGTTQTPLWRRSPQGETICNA 130
>gi|342881901|gb|EGU82684.1| hypothetical protein FOXB_06796 [Fusarium oxysporum Fo5176]
Length = 555
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 35/46 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ + ++
Sbjct: 257 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTMKKSTIK 302
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 24/48 (50%), Positives = 31/48 (64%), Gaps = 1/48 (2%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
++G+ C NCG TPLWRR G +CNACGLY K +RP +K+P
Sbjct: 97 SQGQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKRP 144
Score = 44.3 bits (103), Expect = 0.016, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 257 CQNCGTTITPLWRRDESGHTICNA 280
Score = 41.2 bits (95), Expect = 0.15, Method: Composition-based stats.
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
++G+ C NCG TPLWRR G +CNA
Sbjct: 97 SQGQVCSNCGTTRTPLWRRSPQGATICNA 125
>gi|336470856|gb|EGO59017.1| hypothetical protein NEUTE1DRAFT_59972 [Neurospora tetrasperma FGSC
2508]
gi|350291924|gb|EGZ73119.1| siderophore regulation protein [Neurospora tetrasperma FGSC 2509]
Length = 587
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 282 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPV 319
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 23/46 (50%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NCG TPLWRR G +CNACGLY K RP +++P
Sbjct: 115 GQVCSNCGTTHTPLWRRSPQGAIICNACGLYLKARNAARPANIRRP 160
Score = 44.7 bits (104), Expect = 0.014, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 282 CQNCGTTITPLWRRDEAGHTICNA 305
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 115 GQVCSNCGTTHTPLWRRSPQGAIICNA 141
>gi|336263677|ref|XP_003346618.1| hypothetical protein SMAC_04791 [Sordaria macrospora k-hell]
Length = 542
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 236 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPV 273
Score = 44.7 bits (104), Expect = 0.014, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 236 CQNCGTTITPLWRRDEAGHTICNA 259
>gi|85105503|ref|XP_961978.1| hypothetical protein NCU07728 [Neurospora crassa OR74A]
gi|3552028|gb|AAC64946.1| siderophore regulation protein [Neurospora crassa]
gi|28923567|gb|EAA32742.1| predicted protein [Neurospora crassa OR74A]
Length = 587
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 282 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPV 319
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 23/46 (50%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NCG TPLWRR G +CNACGLY K RP +++P
Sbjct: 115 GQVCSNCGTTHTPLWRRSPQGAIICNACGLYLKARNAARPANIRRP 160
Score = 44.7 bits (104), Expect = 0.014, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 282 CQNCGTTITPLWRRDEAGHTICNA 305
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 115 GQVCSNCGTTHTPLWRRSPQGAIICNA 141
>gi|171690816|ref|XP_001910333.1| hypothetical protein [Podospora anserina S mat+]
gi|170945356|emb|CAP71468.1| unnamed protein product [Podospora anserina S mat+]
Length = 536
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/68 (45%), Positives = 43/68 (63%), Gaps = 6/68 (8%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKQPRRLRFDRGNFNC 134
C NCG TPLWRRD GH +CNACGLY+K++G++RP ++K+ +R+ G
Sbjct: 257 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKSVIKRRKRVIPAAGGSPE 316
Query: 135 YERSTLFR 142
E +TL R
Sbjct: 317 MESATLDR 324
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NCG TPLWRR G +CNACGLY K +RP+ +K+P
Sbjct: 98 GQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNTHRPVNLKRP 143
Score = 44.3 bits (103), Expect = 0.016, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 257 CQNCGTTITPLWRRDEAGHTICNA 280
Score = 40.8 bits (94), Expect = 0.21, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 98 GQVCSNCGTTQTPLWRRSPQGATICNA 124
>gi|367022150|ref|XP_003660360.1| hypothetical protein MYCTH_2298578 [Myceliophthora thermophila ATCC
42464]
gi|347007627|gb|AEO55115.1| hypothetical protein MYCTH_2298578 [Myceliophthora thermophila ATCC
42464]
Length = 553
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 256 CQNCGTTVTPLWRRDEAGHTICNACGLYYKLHGVHRPV 293
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NCG TPLWRR G +CNACGLY K RP+ +K+P
Sbjct: 95 GQVCSNCGTTFTPLWRRSPQGEIICNACGLYLKTRNAPRPINLKRP 140
Score = 44.7 bits (104), Expect = 0.013, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 256 CQNCGTTVTPLWRRDEAGHTICNA 279
Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 95 GQVCSNCGTTFTPLWRRSPQGEIICNA 121
>gi|302894399|ref|XP_003046080.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256727007|gb|EEU40367.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 559
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 35/46 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ + ++
Sbjct: 258 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTMKKSTIK 303
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 24/48 (50%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
+ G+ C NCG TPLWRR G +CNACGLY K RP +K+P
Sbjct: 96 SHGQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNAARPTSLKRP 143
Score = 44.3 bits (103), Expect = 0.017, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 258 CQNCGTTITPLWRRDESGHTICNA 281
Score = 40.4 bits (93), Expect = 0.27, Method: Composition-based stats.
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+ G+ C NCG TPLWRR G +CNA
Sbjct: 96 SHGQVCSNCGTTRTPLWRRSPQGATICNA 124
>gi|402079663|gb|EJT74928.1| GATA factor SREP [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 581
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 32/38 (84%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 267 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPV 304
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NCG TPLWRR G +CNACGLY K RP +K+P
Sbjct: 109 GQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARPTNLKRP 154
Score = 44.7 bits (104), Expect = 0.012, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 267 CQNCGTTITPLWRRDESGHTICNA 290
Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 109 GQVCSNCGTTQTPLWRRSPQGATICNA 135
>gi|380090512|emb|CCC11809.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 554
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 248 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPV 285
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 23/46 (50%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NCG TPLWRR G +CNACGLY K RP +++P
Sbjct: 82 GQVCSNCGTTHTPLWRRSPLGAIICNACGLYLKARNAARPANIRRP 127
Score = 44.3 bits (103), Expect = 0.015, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 248 CQNCGTTITPLWRRDEAGHTICNA 271
Score = 40.4 bits (93), Expect = 0.26, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 82 GQVCSNCGTTHTPLWRRSPLGAIICNA 108
>gi|154292043|ref|XP_001546599.1| hypothetical protein BC1G_14396 [Botryotinia fuckeliana B05.10]
gi|347838109|emb|CCD52681.1| bir1, transcription factor Zn, GATA [Botryotinia fuckeliana]
Length = 604
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 32/38 (84%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 278 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPV 315
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 27/61 (44%), Positives = 34/61 (55%), Gaps = 3/61 (4%)
Query: 65 ASLPVED--IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQ 121
AS+P +E + C NCG TPLWRR G +CNACGLY K +RP +K+
Sbjct: 101 ASVPASSPGVEQIQSNQICSNCGTTRTPLWRRSPQGATICNACGLYQKARNASRPTNLKR 160
Query: 122 P 122
P
Sbjct: 161 P 161
Score = 44.7 bits (104), Expect = 0.013, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 278 CQNCGTTITPLWRRDESGHTICNA 301
Score = 40.8 bits (94), Expect = 0.19, Method: Composition-based stats.
Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 2/42 (4%)
Query: 7 ASLPVED--IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
AS+P +E + C NCG TPLWRR G +CNA
Sbjct: 101 ASVPASSPGVEQIQSNQICSNCGTTRTPLWRRSPQGATICNA 142
>gi|384490311|gb|EIE81533.1| hypothetical protein RO3G_06238 [Rhizopus delemar RA 99-880]
Length = 462
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 37/90 (41%), Positives = 46/90 (51%), Gaps = 9/90 (10%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA-----RLFSTSFTGYFIMSRFLFASLPVEDIEYFTE 77
C NC +TPLWRR LCNA +L +T + + + +D E FT+
Sbjct: 377 CSNCSTTTTPLWRRSAEDELLCNACGLYLKLHNTPRPKHL---KPQSSRKDAKDEESFTQ 433
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGL 107
C NC +TPLWRRD G LCNACGL
Sbjct: 434 PI-CSNCATSTTPLWRRDVDGSPLCNACGL 462
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 23/48 (47%), Positives = 28/48 (58%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
C NC +TPLWRR LCNACGLY K++ RP +P+ R D
Sbjct: 377 CSNCSTTTTPLWRRSAEDELLCNACGLYLKLHNTPRPKHLKPQSSRKD 424
>gi|380489847|emb|CCF36425.1| GATA zinc finger [Colletotrichum higginsianum]
Length = 591
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 32/38 (84%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 289 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPV 326
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 25/49 (51%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRL 125
G+ C NCG TPLWRR G +CNACGLY K +RP +K+P L
Sbjct: 119 GQVCSNCGTTRTPLWRRSPQGTTICNACGLYQKARNASRPTNLKKPPHL 167
Score = 44.7 bits (104), Expect = 0.013, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 289 CQNCGTTITPLWRRDESGHTICNA 312
Score = 40.0 bits (92), Expect = 0.31, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 119 GQVCSNCGTTRTPLWRRSPQGTTICNA 145
>gi|156049633|ref|XP_001590783.1| hypothetical protein SS1G_08523 [Sclerotinia sclerotiorum 1980]
gi|154692922|gb|EDN92660.1| hypothetical protein SS1G_08523 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 749
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/54 (53%), Positives = 40/54 (74%), Gaps = 1/54 (1%)
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
ASLP++ + T C NCG TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 260 ASLPLQ-TQNTTVVVACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPV 312
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 26/52 (50%), Positives = 31/52 (59%), Gaps = 1/52 (1%)
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
IE G+ C NCG TPLWRR G +CNACGLY K +RP +K+P
Sbjct: 107 IEQIQTGQICSNCGTTRTPLWRRSPQGATICNACGLYQKARNASRPTNLKRP 158
Score = 45.1 bits (105), Expect = 0.010, Method: Composition-based stats.
Identities = 21/40 (52%), Positives = 26/40 (65%), Gaps = 1/40 (2%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
ASLP++ + T C NCG TPLWRRD +GH +CNA
Sbjct: 260 ASLPLQ-TQNTTVVVACQNCGTTITPLWRRDESGHTICNA 298
Score = 41.6 bits (96), Expect = 0.12, Method: Composition-based stats.
Identities = 17/33 (51%), Positives = 19/33 (57%)
Query: 14 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
IE G+ C NCG TPLWRR G +CNA
Sbjct: 107 IEQIQTGQICSNCGTTRTPLWRRSPQGATICNA 139
>gi|440467530|gb|ELQ36746.1| GATA factor SREP [Magnaporthe oryzae Y34]
gi|440488667|gb|ELQ68382.1| GATA factor SREP [Magnaporthe oryzae P131]
Length = 539
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 267 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPV 304
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NCG TPLWRR G +CNACGLY K RP +K+P
Sbjct: 111 GQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARPTNLKRP 156
Score = 44.3 bits (103), Expect = 0.016, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 267 CQNCGTTITPLWRRDEAGHTICNA 290
Score = 40.4 bits (93), Expect = 0.21, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 111 GQVCSNCGTTQTPLWRRSPQGATICNA 137
>gi|340960405|gb|EGS21586.1| putative transcription factor [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 447
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/52 (51%), Positives = 37/52 (71%), Gaps = 6/52 (11%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKQPRRLR 126
C NCG TPLWRRD GH +CNACGLY+K++G++RP ++K+ +R R
Sbjct: 219 CQNCGTTVTPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKSIIKRRKRNR 270
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NCG TPLWRR G +CNACGLY K RP+ +K+P
Sbjct: 64 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNAARPINLKRP 109
Score = 44.3 bits (103), Expect = 0.017, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 219 CQNCGTTVTPLWRRDEAGHTICNA 242
Score = 40.0 bits (92), Expect = 0.31, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 64 GQVCSNCGTTRTPLWRRSPQGATICNA 90
>gi|389634415|ref|XP_003714860.1| GATA factor SREP [Magnaporthe oryzae 70-15]
gi|351647193|gb|EHA55053.1| GATA factor SREP [Magnaporthe oryzae 70-15]
Length = 539
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 267 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPV 304
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G+ C NCG TPLWRR G +CNACGLY K RP +K+P
Sbjct: 111 GQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARPTNLKRP 156
Score = 44.3 bits (103), Expect = 0.016, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 267 CQNCGTTITPLWRRDEAGHTICNA 290
Score = 40.4 bits (93), Expect = 0.21, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 111 GQVCSNCGTTQTPLWRRSPQGATICNA 137
>gi|19113848|ref|NP_592936.1| iron-sensing transcription factor Fep1 [Schizosaccharomyces pombe
972h-]
gi|1346083|sp|Q10134.1|FEP1_SCHPO RecName: Full=Iron-sensing transcription factor 1; AltName:
Full=Transcription factor gaf2; Short=Gaf-2
gi|1167862|emb|CAA93113.1| iron-sensing transcription factor Fep1 [Schizosaccharomyces pombe]
gi|20302469|emb|CAD30004.1| iron-sensing transcription factor [Schizosaccharomyces pombe]
gi|21064951|gb|AAM29187.1| transcription factor Fep1 [Schizosaccharomyces pombe]
Length = 564
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/59 (49%), Positives = 43/59 (72%), Gaps = 7/59 (11%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKQPRRLRFDRGNFN 133
C NC +TPLWRRD +G+ +CNACGLY+K++G++RP ++K+ +RL F+ GN N
Sbjct: 172 CQNCATTNTPLWRRDESGNPICNACGLYYKIHGVHRPVTMKKAIIKRRKRLVFN-GNAN 229
Score = 48.9 bits (115), Expect = 8e-04, Method: Composition-based stats.
Identities = 20/41 (48%), Positives = 23/41 (56%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G+ C NC +T LWRR LCNACGLY K RP+
Sbjct: 9 GQSCSNCHKTTTSLWRRGPDNSLLCNACGLYQKHRKHARPV 49
Score = 40.4 bits (93), Expect = 0.27, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD +G+ +CNA
Sbjct: 172 CQNCATTNTPLWRRDESGNPICNA 195
>gi|340517662|gb|EGR47905.1| Zn-finger gata type domain-containing protein [Trichoderma reesei
QM6a]
Length = 554
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 32/38 (84%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 253 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPV 290
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRL 125
C NCG TPLWRR G +CNACGLY K RP +K+P L
Sbjct: 95 CSNCGTTRTPLWRRSPQGATICNACGLYLKARNAARPTSLKKPPNL 140
Score = 44.3 bits (103), Expect = 0.015, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 253 CQNCGTTITPLWRRDESGHTICNA 276
Score = 38.1 bits (87), Expect = 1.1, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRR G +CNA
Sbjct: 95 CSNCGTTRTPLWRRSPQGATICNA 118
>gi|310792185|gb|EFQ27712.1| GATA zinc finger [Glomerella graminicola M1.001]
Length = 592
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 32/38 (84%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 289 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPV 326
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 25/49 (51%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRL 125
G+ C NCG TPLWRR G +CNACGLY K +RP +K+P L
Sbjct: 119 GQVCSNCGTTRTPLWRRSPQGSTICNACGLYQKARNASRPTNLKKPPHL 167
Score = 44.3 bits (103), Expect = 0.015, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 289 CQNCGTTITPLWRRDESGHTICNA 312
Score = 40.0 bits (92), Expect = 0.28, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 119 GQVCSNCGTTRTPLWRRSPQGSTICNA 145
>gi|47198749|emb|CAF88195.1| unnamed protein product [Tetraodon nigroviridis]
Length = 151
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/43 (72%), Positives = 35/43 (81%), Gaps = 1/43 (2%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNARLF 49
ASL + D + TEGRECVNCG++STPLWRRDGTGHYLCN R
Sbjct: 93 ASLDLMD-DGSTEGRECVNCGSVSTPLWRRDGTGHYLCNLRPL 134
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 30/39 (76%), Positives = 34/39 (87%), Gaps = 1/39 (2%)
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCN 103
ASL + D + TEGRECVNCG++STPLWRRDGTGHYLCN
Sbjct: 93 ASLDLMD-DGSTEGRECVNCGSVSTPLWRRDGTGHYLCN 130
>gi|108744055|gb|ABG02399.1| GATA binding protein 4 [Bubalus bubalis]
Length = 172
Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 28/35 (80%), Positives = 30/35 (85%)
Query: 95 DGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDR 129
DGTGHYLCNACGLYHKMNG+NRPL+K RRL R
Sbjct: 1 DGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASR 35
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 44/87 (50%), Gaps = 12/87 (13%)
Query: 37 DGTGHYLCNARLFSTSFTGY----FIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLW 92
DGTGHYLCNA G R L AS V G C NC +T LW
Sbjct: 1 DGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRV--------GLSCANCQTTTTTLW 52
Query: 93 RRDGTGHYLCNACGLYHKMNGMNRPLV 119
RR+ G +CNACGLY K++G+ RPL
Sbjct: 53 RRNAEGEPVCNACGLYMKLHGVPRPLA 79
>gi|320582061|gb|EFW96279.1| hypothetical protein HPODL_1936 [Ogataea parapolymorpha DL-1]
Length = 459
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 28/65 (43%), Positives = 41/65 (63%), Gaps = 3/65 (4%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP---RRLRFDRGNFNCYE 136
+C NCG TPLWRRD G+ LCNACGL+ K++G RPL + ++ R +F+ +
Sbjct: 119 KCFNCGTTKTPLWRRDSQGNTLCNACGLFQKLHGTMRPLSLKTDIIKKRNSKRQSFSQIQ 178
Query: 137 RSTLF 141
+ T+F
Sbjct: 179 QGTIF 183
Score = 44.7 bits (104), Expect = 0.012, Method: Composition-based stats.
Identities = 16/25 (64%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NCG TPLWRRD G+ LCNA
Sbjct: 119 KCFNCGTTKTPLWRRDSQGNTLCNA 143
>gi|378726228|gb|EHY52687.1| GATA-binding protein [Exophiala dermatitidis NIH/UT8656]
Length = 487
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 25/38 (65%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLYHK++G +RP+
Sbjct: 270 CKNCGTTVTPLWRRDEHGHPICNACGLYHKLHGSHRPV 307
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G+ C NCG TPLWRR TG +CNACGLY K RP
Sbjct: 97 GQVCSNCGTTKTPLWRRSPTGTTICNACGLYQKTRNAPRP 136
Score = 42.4 bits (98), Expect = 0.057, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 270 CKNCGTTVTPLWRRDEHGHPICNA 293
Score = 42.0 bits (97), Expect = 0.089, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR TG +CNA
Sbjct: 97 GQVCSNCGTTKTPLWRRSPTGTTICNA 123
>gi|328773371|gb|EGF83408.1| hypothetical protein BATDEDRAFT_34165 [Batrachochytrium
dendrobatidis JAM81]
Length = 947
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 31/74 (41%), Positives = 42/74 (56%), Gaps = 4/74 (5%)
Query: 45 NARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 104
+ RL + + FI + P DI + CVNCG STPLWRRD +CNA
Sbjct: 306 SVRLSTATNGATFINASSASTIPPARDISHTI----CVNCGTTSTPLWRRDEMSRSICNA 361
Query: 105 CGLYHKMNGMNRPL 118
CGLY++++G +RPL
Sbjct: 362 CGLYYRLHGKHRPL 375
Score = 43.9 bits (102), Expect = 0.023, Method: Composition-based stats.
Identities = 19/37 (51%), Positives = 21/37 (56%), Gaps = 4/37 (10%)
Query: 10 PVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
P DI + CVNCG STPLWRRD +CNA
Sbjct: 329 PARDISHTI----CVNCGTTSTPLWRRDEMSRSICNA 361
>gi|358058831|dbj|GAA95229.1| hypothetical protein E5Q_01885 [Mixia osmundae IAM 14324]
Length = 416
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 32/61 (52%), Positives = 41/61 (67%), Gaps = 4/61 (6%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP-LVKQP---RRLRFDRGNFNCY 135
+C NCG +TPLWRRDG G+ CNACGLY+K++G +RP L+K+P RR R N
Sbjct: 198 QCGNCGTRTTPLWRRDGDGNVACNACGLYYKLHGHHRPGLLKKPNIKRRKRIPTARANHT 257
Query: 136 E 136
E
Sbjct: 258 E 258
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 20/30 (66%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHK 110
C CG TPLWRRD G +CNACGL HK
Sbjct: 72 CSQCGKTETPLWRRDPQGRTICNACGLAHK 101
Score = 42.4 bits (98), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 19/25 (76%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NCG +TPLWRRDG G+ CNA
Sbjct: 198 QCGNCGTRTTPLWRRDGDGNVACNA 222
Score = 38.1 bits (87), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C CG TPLWRRD G +CNA
Sbjct: 72 CSQCGKTETPLWRRDPQGRTICNA 95
>gi|358394095|gb|EHK43496.1| hypothetical protein TRIATDRAFT_174504, partial [Trichoderma
atroviride IMI 206040]
Length = 567
Score = 65.9 bits (159), Expect = 6e-09, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 269 CQNCGTTITPLWRRDEGGHTICNACGLYYKLHGVHRPV 306
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRL 125
G+ C NC TPLWRR G +CNACGLY K +RP +K+P L
Sbjct: 109 GQICSNCNTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKKPPNL 157
Score = 43.5 bits (101), Expect = 0.030, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 269 CQNCGTTITPLWRRDEGGHTICNA 292
Score = 37.4 bits (85), Expect = 2.1, Method: Composition-based stats.
Identities = 14/27 (51%), Positives = 16/27 (59%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NC TPLWRR G +CNA
Sbjct: 109 GQICSNCNTTRTPLWRRSPQGATICNA 135
>gi|213402573|ref|XP_002172059.1| iron-sensing transcription factor 1 [Schizosaccharomyces japonicus
yFS275]
gi|212000106|gb|EEB05766.1| iron-sensing transcription factor 1 [Schizosaccharomyces japonicus
yFS275]
Length = 519
Score = 65.5 bits (158), Expect = 7e-09, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 34/41 (82%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRRD +G+ +CNACGLY+K++G++RP+
Sbjct: 177 GTYCQNCGTTTTPLWRRDESGNPICNACGLYYKIHGVHRPV 217
Score = 43.5 bits (101), Expect = 0.026, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 20/27 (74%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRRD +G+ +CNA
Sbjct: 177 GTYCQNCGTTTTPLWRRDESGNPICNA 203
Score = 42.7 bits (99), Expect = 0.045, Method: Composition-based stats.
Identities = 19/42 (45%), Positives = 22/42 (52%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
E + C NC + T WRR LCNACGLY K RP+
Sbjct: 8 EPQVCANCKSQITAQWRRGPDNSILCNACGLYFKTRDAPRPV 49
>gi|407920283|gb|EKG13498.1| Zinc finger GATA-type protein [Macrophomina phaseolina MS6]
Length = 588
Score = 65.5 bits (158), Expect = 7e-09, Method: Composition-based stats.
Identities = 23/46 (50%), Positives = 34/46 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NC TPLWRRD GH +CNACGLY+K++G++RP+ + + ++
Sbjct: 299 CANCRTTVTPLWRRDEAGHTICNACGLYYKLHGVHRPVQMKKQEIK 344
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 26/40 (65%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G+ C NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 132 GQTCSNCGTTRTPLWRRSPTGEPICNACGLYLKARNQSRP 171
Score = 43.5 bits (101), Expect = 0.030, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR TG +CNA
Sbjct: 132 GQTCSNCGTTRTPLWRRSPTGEPICNA 158
Score = 42.4 bits (98), Expect = 0.062, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 299 CANCRTTVTPLWRRDEAGHTICNA 322
>gi|429862999|gb|ELA37584.1| siderophore regulation protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 592
Score = 65.1 bits (157), Expect = 8e-09, Method: Composition-based stats.
Identities = 23/46 (50%), Positives = 34/46 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NC TPLWRRD +GH +CNACGLY+K++G++RP+ + ++
Sbjct: 289 CQNCSTTVTPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKSTIK 334
Score = 42.7 bits (99), Expect = 0.053, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD +GH +CNA
Sbjct: 289 CQNCSTTVTPLWRRDESGHTICNA 312
>gi|406602734|emb|CCH45692.1| Zinc finger protein [Wickerhamomyces ciferrii]
Length = 487
Score = 65.1 bits (157), Expect = 8e-09, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD G +CNACGLY+K++G++RP+
Sbjct: 180 CTNCGTTVTPLWRRDDNGDTICNACGLYYKLHGLHRPI 217
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 22/43 (51%), Positives = 29/43 (67%), Gaps = 1/43 (2%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
C NC TPLWRR G +CNACGLY++ N +RP+ +K+P
Sbjct: 31 CSNCSTTKTPLWRRAPDGSLICNACGLYYRANNSHRPINLKRP 73
Score = 42.0 bits (97), Expect = 0.080, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G +CNA
Sbjct: 180 CTNCGTTVTPLWRRDDNGDTICNA 203
>gi|398392283|ref|XP_003849601.1| hypothetical protein MYCGRDRAFT_110870 [Zymoseptoria tritici
IPO323]
gi|339469478|gb|EGP84577.1| hypothetical protein MYCGRDRAFT_110870 [Zymoseptoria tritici
IPO323]
Length = 352
Score = 65.1 bits (157), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 37/57 (64%), Gaps = 6/57 (10%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL------VKQPRRLRFDRGN 131
C NCG TPLWRRD GH +CNACGLYHK++G RP+ +K+ +R+ GN
Sbjct: 267 CQNCGTTITPLWRRDEAGHTICNACGLYHKLHGAARPVQMKKAEIKRRKRVVPAAGN 323
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 26/41 (63%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G+ C NCG TPLWRR G +CNACGLY+K RP+
Sbjct: 116 GQMCSNCGTTKTPLWRRSPAGAVICNACGLYYKARNQMRPV 156
Score = 42.4 bits (98), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 267 CQNCGTTITPLWRRDEAGHTICNA 290
Score = 39.7 bits (91), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 116 GQMCSNCGTTKTPLWRRSPAGAVICNA 142
>gi|358382685|gb|EHK20356.1| hypothetical protein TRIVIDRAFT_192933 [Trichoderma virens Gv29-8]
Length = 567
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 269 CQNCATTITPLWRRDESGHTICNACGLYYKLHGVHRPV 306
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 24/49 (48%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRL 125
G+ C NC TPLWRR G +CNACGLY K RP +K+P L
Sbjct: 108 GQVCSNCNTTRTPLWRRSPQGATICNACGLYLKARNAARPTSLKKPPNL 156
Score = 42.7 bits (99), Expect = 0.051, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD +GH +CNA
Sbjct: 269 CQNCATTITPLWRRDESGHTICNA 292
Score = 37.7 bits (86), Expect = 1.5, Method: Composition-based stats.
Identities = 14/27 (51%), Positives = 16/27 (59%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NC TPLWRR G +CNA
Sbjct: 108 GQVCSNCNTTRTPLWRRSPQGATICNA 134
>gi|400602130|gb|EJP69755.1| siderophore regulation protein [Beauveria bassiana ARSEF 2860]
Length = 532
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 32/38 (84%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRDG G+ +CNACGLY++++G++RP+
Sbjct: 240 CSNCGTTITPLWRRDGEGNMICNACGLYYRLHGVHRPV 277
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 23/51 (45%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRL 125
+ G+ C NCG TPLWRR G +CNACGLY + RP +K+P ++
Sbjct: 82 SHGQICSNCGTTETPLWRRSPQGATICNACGLYLRARNSARPTNLKKPPKV 132
Score = 43.9 bits (102), Expect = 0.021, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRDG G+ +CNA
Sbjct: 240 CSNCGTTITPLWRRDGEGNMICNA 263
Score = 40.0 bits (92), Expect = 0.32, Method: Composition-based stats.
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+ G+ C NCG TPLWRR G +CNA
Sbjct: 82 SHGQICSNCGTTETPLWRRSPQGATICNA 110
>gi|154286086|ref|XP_001543838.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150407479|gb|EDN03020.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 630
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 6/51 (11%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG------MNRPLVKQPRRL 125
C NC TPLWRRD GH +CNACGLYHK++G M +P++K+ +R+
Sbjct: 291 CQNCQTTVTPLWRRDENGHPICNACGLYHKLHGAYRPPTMKKPIIKRRKRV 341
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 125 GHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 164
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG TPLWRR TG +CNA
Sbjct: 125 GHSCSNCGTKRTPLWRRSPTGATICNA 151
Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 291 CQNCQTTVTPLWRRDENGHPICNA 314
>gi|169621604|ref|XP_001804212.1| hypothetical protein SNOG_14013 [Phaeosphaeria nodorum SN15]
gi|111057518|gb|EAT78638.1| hypothetical protein SNOG_14013 [Phaeosphaeria nodorum SN15]
Length = 511
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 23/46 (50%), Positives = 33/46 (71%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
C NCG TPLWRRD GH +CNACGLY+K++ +RP+ + + ++
Sbjct: 273 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHNKHRPVAMKKQEIK 318
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 26/41 (63%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G+ C NC TPLWRR TG +CNACGLY K +RP+
Sbjct: 110 GQVCSNCKTTQTPLWRRSPTGETICNACGLYFKARNQHRPV 150
Score = 44.7 bits (104), Expect = 0.014, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 273 CQNCGTTITPLWRRDDAGHIICNA 296
Score = 40.4 bits (93), Expect = 0.27, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NC TPLWRR TG +CNA
Sbjct: 110 GQVCSNCKTTQTPLWRRSPTGETICNA 136
>gi|254564713|ref|XP_002489467.1| Transcriptional activator of genes regulated by nitrogen catabolite
repression (NCR) [Komagataella pastoris GS115]
gi|238029263|emb|CAY67186.1| Transcriptional activator of genes regulated by nitrogen catabolite
repression (NCR) [Komagataella pastoris GS115]
gi|328349896|emb|CCA36296.1| Histone-lysine N-methyltransferase MLL3 [Komagataella pastoris CBS
7435]
Length = 566
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/54 (50%), Positives = 35/54 (64%)
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+S P D + +C NC TPLWRRDG+G+ LCNACGL+ K++G RPL
Sbjct: 205 SSSPKADNDESITRTQCSNCKTEKTPLWRRDGSGNTLCNACGLFQKLHGTVRPL 258
Score = 45.1 bits (105), Expect = 0.010, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 24/40 (60%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+S P D + +C NC TPLWRRDG+G+ LCNA
Sbjct: 205 SSSPKADNDESITRTQCSNCKTEKTPLWRRDGSGNTLCNA 244
>gi|344300755|gb|EGW31076.1| hypothetical protein SPAPADRAFT_156732 [Spathaspora passalidarum
NRRL Y-27907]
Length = 667
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 30/39 (76%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
EC NC + TPLWR+D G+ LCNACGL++K++G RPL
Sbjct: 233 ECYNCKTLKTPLWRKDAEGNTLCNACGLFYKLHGTTRPL 271
Score = 43.1 bits (100), Expect = 0.036, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
EC NC + TPLWR+D G+ LCNA
Sbjct: 233 ECYNCKTLKTPLWRKDAEGNTLCNA 257
>gi|448097680|ref|XP_004198732.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
gi|359380154|emb|CCE82395.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
Length = 539
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/47 (55%), Positives = 33/47 (70%), Gaps = 1/47 (2%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
EG++C NCG TPLWRR G +CNACGLY + N NRP+ +K+P
Sbjct: 21 EGQQCSNCGTTKTPLWRRAPDGTMICNACGLYLRSNSTNRPVNLKRP 67
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG+ TPLWRRD G+ +CNACGLY++++G +RP+
Sbjct: 138 CYNCGSTITPLWRRDDAGNTICNACGLYYRLHGSHRPI 175
Score = 43.9 bits (102), Expect = 0.019, Method: Composition-based stats.
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
EG++C NCG TPLWRR G +CNA
Sbjct: 21 EGQQCSNCGTTKTPLWRRAPDGTMICNA 48
Score = 42.7 bits (99), Expect = 0.044, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG+ TPLWRRD G+ +CNA
Sbjct: 138 CYNCGSTITPLWRRDDAGNTICNA 161
>gi|448101527|ref|XP_004199582.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
gi|359381004|emb|CCE81463.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
Length = 546
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/47 (55%), Positives = 33/47 (70%), Gaps = 1/47 (2%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
EG++C NCG TPLWRR G +CNACGLY + N NRP+ +K+P
Sbjct: 25 EGQQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNSTNRPVNLKRP 71
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG+ TPLWRRD G+ +CNACGLY++++G +RP+
Sbjct: 142 CFNCGSTITPLWRRDDAGNTICNACGLYYRLHGSHRPI 179
Score = 43.9 bits (102), Expect = 0.020, Method: Composition-based stats.
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
EG++C NCG TPLWRR G +CNA
Sbjct: 25 EGQQCSNCGTTKTPLWRRAPDGTLICNA 52
Score = 43.1 bits (100), Expect = 0.040, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG+ TPLWRRD G+ +CNA
Sbjct: 142 CFNCGSTITPLWRRDDAGNTICNA 165
>gi|328858702|gb|EGG07814.1| hypothetical protein MELLADRAFT_55799 [Melampsora larici-populina
98AG31]
Length = 186
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 48/119 (40%), Gaps = 26/119 (21%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA----------------RLFSTSFTGYFIMSRF 62
E C NC TPLWRR LCNA +F G F R
Sbjct: 12 ESPTCSNCRGTQTPLWRRGPDDELLCNACGVFYKVHKKHRPHNLAKVKNTFRGKFGSRRH 71
Query: 63 LFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQ 121
+ P++ C NC A +TP+WR+ G LCNAC LY K + RP+ Q
Sbjct: 72 PRDTQPIK----------CTNCNATATPMWRKAPDGALLCNACALYFKCHKRPRPMPPQ 120
Score = 37.0 bits (84), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 14/29 (48%), Positives = 19/29 (65%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
T+ +C NC A +TP+WR+ G LCNA
Sbjct: 75 TQPIKCTNCNATATPMWRKAPDGALLCNA 103
>gi|392565133|gb|EIW58310.1| hypothetical protein TRAVEDRAFT_167792 [Trametes versicolor
FP-101664 SS1]
Length = 715
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/65 (43%), Positives = 43/65 (66%), Gaps = 6/65 (9%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKQPRRLRFDRGNFNC 134
C NCG +TPLWRRD G+ +CNACGLY K++G +RP ++K+ +R+ G+ +
Sbjct: 295 CANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRPNSMKKTVIKRRKRVPAAPGSPSQ 354
Query: 135 YERST 139
++R T
Sbjct: 355 HDRMT 359
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 20/38 (52%), Positives = 26/38 (68%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG ++TP WRR G +CNACGLY++ + RPL
Sbjct: 86 CTNCGTVTTPQWRRGDDGKSICNACGLYYRTKHVPRPL 123
Score = 43.1 bits (100), Expect = 0.040, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G+ +CNA
Sbjct: 295 CANCGTSTTPLWRRDDVGNNICNA 318
Score = 38.9 bits (89), Expect = 0.69, Method: Composition-based stats.
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++TP WRR G +CNA
Sbjct: 86 CTNCGTVTTPQWRRGDDGKSICNA 109
>gi|410083315|ref|XP_003959235.1| hypothetical protein KAFR_0J00320 [Kazachstania africana CBS 2517]
gi|372465826|emb|CCF60100.1| hypothetical protein KAFR_0J00320 [Kazachstania africana CBS 2517]
Length = 657
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 19/78 (24%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV-------------------K 120
+C NC + TPLWRRD G+ LCNACGL+ K++G RPL +
Sbjct: 258 QCFNCKTLKTPLWRRDPNGNALCNACGLFQKLHGTMRPLSLKTDVIKKRNSKKRAKKLQE 317
Query: 121 QPRRLRFDRGNFNCYERS 138
QPR+ F+ ++ Y ++
Sbjct: 318 QPRQRHFEHSEYDRYTQT 335
Score = 42.7 bits (99), Expect = 0.049, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC + TPLWRRD G+ LCNA
Sbjct: 258 QCFNCKTLKTPLWRRDPNGNALCNA 282
>gi|384495933|gb|EIE86424.1| global nitrogen regulator protein [Rhizopus delemar RA 99-880]
Length = 617
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 24/39 (61%), Positives = 31/39 (79%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NCG +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 466 KCTNCGTTTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 504
Score = 45.4 bits (106), Expect = 0.007, Method: Composition-based stats.
Identities = 16/25 (64%), Positives = 19/25 (76%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NCG +TPLWRR+ GH LCNA
Sbjct: 466 KCTNCGTTTTPLWRRNPEGHPLCNA 490
>gi|443895398|dbj|GAC72744.1| GATA-4/5/6 transcription factors [Pseudozyma antarctica T-34]
Length = 1025
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G+ +CNACGLYHK++G +RP+
Sbjct: 475 CTNCQTTTTPLWRRDEDGNNICNACGLYHKLHGTHRPI 512
Score = 56.6 bits (135), Expect = 4e-06, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 25/39 (64%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNR 116
G C NCG STPLWRR G +CNACGLY K + NR
Sbjct: 326 GLRCSNCGVTSTPLWRRAPDGSTICNACGLYMKSHSTNR 364
Score = 41.2 bits (95), Expect = 0.14, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR G +CNA
Sbjct: 326 GLRCSNCGVTSTPLWRRAPDGSTICNA 352
Score = 40.0 bits (92), Expect = 0.33, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G+ +CNA
Sbjct: 475 CTNCQTTTTPLWRRDEDGNNICNA 498
>gi|321263187|ref|XP_003196312.1| zinc finger protein regulator of siderophore biosynthesis
[Cryptococcus gattii WM276]
gi|317462787|gb|ADV24525.1| Zinc finger protein regulator of siderophore biosynthesis, putative
[Cryptococcus gattii WM276]
Length = 1015
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/41 (60%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPV 345
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRRD G CNA
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNA 331
>gi|150864214|ref|XP_001382948.2| regulates glutamine-repressible gene products [Scheffersomyces
stipitis CBS 6054]
gi|149385470|gb|ABN64919.2| regulates glutamine-repressible gene products [Scheffersomyces
stipitis CBS 6054]
Length = 782
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 24/39 (61%), Positives = 31/39 (79%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
EC NC ++TPLWR+D TG+ LCNACGL+ K++G RPL
Sbjct: 277 ECYNCHTLNTPLWRKDPTGNTLCNACGLFLKLHGTTRPL 315
Score = 44.7 bits (104), Expect = 0.013, Method: Composition-based stats.
Identities = 16/25 (64%), Positives = 20/25 (80%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
EC NC ++TPLWR+D TG+ LCNA
Sbjct: 277 ECYNCHTLNTPLWRKDPTGNTLCNA 301
>gi|108802600|gb|ABG21303.1| iron regulator 1 [Cryptococcus neoformans var. neoformans]
Length = 963
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/41 (60%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPV 345
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRRD G CNA
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNA 331
>gi|409040728|gb|EKM50215.1| hypothetical protein PHACADRAFT_264823 [Phanerochaete carnosa
HHB-10118-sp]
Length = 661
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 26/37 (70%), Positives = 27/37 (72%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCGA STPLWRRDG G +CNACGLY K M RP
Sbjct: 92 CANCGAHSTPLWRRDGEGKAVCNACGLYWKHKNMPRP 128
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 23/37 (62%), Positives = 29/37 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +TPLWRRD G+ +CNACGLY K++G +RP
Sbjct: 294 CANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRP 330
Score = 47.4 bits (111), Expect = 0.002, Method: Composition-based stats.
Identities = 18/24 (75%), Positives = 19/24 (79%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCGA STPLWRRDG G +CNA
Sbjct: 92 CANCGAHSTPLWRRDGEGKAVCNA 115
Score = 43.1 bits (100), Expect = 0.037, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G+ +CNA
Sbjct: 294 CANCGTSTTPLWRRDDVGNNICNA 317
>gi|108802602|gb|ABG21304.1| iron regulator 1 [Cryptococcus neoformans var. grubii]
Length = 952
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/41 (60%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPV 345
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRRD G CNA
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNA 331
>gi|405122586|gb|AFR97352.1| iron regulator 1 [Cryptococcus neoformans var. grubii H99]
Length = 1019
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/41 (60%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPV 345
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRRD G CNA
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNA 331
>gi|343426925|emb|CBQ70453.1| Siderophore biosynthesis regulatory protein URBS1 [Sporisorium
reilianum SRZ2]
Length = 1080
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/49 (51%), Positives = 33/49 (67%)
Query: 70 EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
ED + C NC +TPLWRRD G+ +CNACGLYHK++G +RP+
Sbjct: 466 EDDKSAVGALRCTNCQTTTTPLWRRDEDGNNICNACGLYHKLHGTHRPI 514
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 24/45 (53%), Positives = 28/45 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP 122
G C NCG STPLWRR G +CNACGLY K + M+R +P
Sbjct: 329 GMRCSNCGVTSTPLWRRAPDGSTICNACGLYIKSHSMHRATPSRP 373
Score = 41.6 bits (96), Expect = 0.12, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 24/49 (48%), Gaps = 7/49 (14%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLP 68
G C NCG STPLWRR G +CNA G +I S + + P
Sbjct: 329 GMRCSNCGVTSTPLWRRAPDGSTICNA-------CGLYIKSHSMHRATP 370
Score = 39.7 bits (91), Expect = 0.36, Method: Composition-based stats.
Identities = 16/35 (45%), Positives = 20/35 (57%)
Query: 12 EDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
ED + C NC +TPLWRRD G+ +CNA
Sbjct: 466 EDDKSAVGALRCTNCQTTTTPLWRRDEDGNNICNA 500
>gi|58260324|ref|XP_567572.1| gata factor srep [Cryptococcus neoformans var. neoformans JEC21]
gi|134116222|ref|XP_773282.1| hypothetical protein CNBJ0600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50255904|gb|EAL18635.1| hypothetical protein CNBJ0600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229622|gb|AAW46055.1| gata factor srep, putative [Cryptococcus neoformans var. neoformans
JEC21]
Length = 1060
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/41 (60%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPV 345
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRRD G CNA
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNA 331
>gi|71005062|ref|XP_757197.1| siderophore biosynthesis regulatory protein URBS1 [Ustilago maydis
521]
gi|119370539|sp|P40349.2|URB1_USTMA RecName: Full=Siderophore biosynthesis regulatory protein URBS1
gi|46096559|gb|EAK81792.1| URB1_USTMA Siderophore biosynthesis regulatory protein URBS1
[Ustilago maydis 521]
Length = 1084
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G+ +CNACGLYHK++G +RP+
Sbjct: 482 CTNCQTTTTPLWRRDEDGNNICNACGLYHKLHGTHRPI 519
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 25/39 (64%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNR 116
G C NCG STPLWRR G +CNACGLY K + +R
Sbjct: 335 GMRCSNCGVTSTPLWRRAPDGSTICNACGLYIKSHSTHR 373
Score = 41.2 bits (95), Expect = 0.13, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR G +CNA
Sbjct: 335 GMRCSNCGVTSTPLWRRAPDGSTICNA 361
Score = 39.7 bits (91), Expect = 0.37, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G+ +CNA
Sbjct: 482 CTNCQTTTTPLWRRDEDGNNICNA 505
>gi|703468|gb|AAB38022.1| homologous to GATA-binding transcription factor
[Schizosaccharomyces pombe]
Length = 564
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 42/59 (71%), Gaps = 7/59 (11%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKQPRRLRFDRGNFN 133
C NC +TP WRRD +G+ +CNACGLY+K++G++RP ++K+ +RL F+ GN N
Sbjct: 172 CQNCATTNTPKWRRDESGNPICNACGLYYKIHGVHRPVTMKKAIIKRRKRLVFN-GNAN 229
Score = 48.9 bits (115), Expect = 7e-04, Method: Composition-based stats.
Identities = 20/41 (48%), Positives = 23/41 (56%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G+ C NC +T LWRR LCNACGLY K RP+
Sbjct: 9 GQSCSNCHKTTTSLWRRGPDNSLLCNACGLYQKHRKHARPV 49
Score = 37.7 bits (86), Expect = 1.6, Method: Composition-based stats.
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TP WRRD +G+ +CNA
Sbjct: 172 CQNCATTNTPKWRRDESGNPICNA 195
>gi|384491758|gb|EIE82954.1| hypothetical protein RO3G_07659 [Rhizopus delemar RA 99-880]
Length = 383
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 55/134 (41%), Gaps = 44/134 (32%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE---- 77
EC NCG +TPLWRR G +CNA G ++ +R ++ +
Sbjct: 8 ECANCGTTTTPLWRRGPNGETICNA-------CGLYLKARNTLRPPTLKKASKKEKNDCG 60
Query: 78 ---------------GRECVNCGA------------------ISTPLWRRDGTGHYLCNA 104
C C A +TPLWRRD G+ +CNA
Sbjct: 61 GGTCPGGGQCNGTGGSTSCAGCPAFNQHQVNRQALICANCRTTTTPLWRRDEAGNTICNA 120
Query: 105 CGLYHKMNGMNRPL 118
CGLY+K++ ++RP+
Sbjct: 121 CGLYYKLHHVHRPV 134
>gi|13620173|emb|CAC36427.1| GATA transcription factor [Botryotinia fuckeliana]
Length = 388
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 32/39 (82%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 119
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 277 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVT 315
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 34/61 (55%), Gaps = 4/61 (6%)
Query: 65 ASLPVED--IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQ 121
AS+P +E + C NCG TPLWRR G +CNACGLY K +RP +K+
Sbjct: 101 ASVPASSPGVEQIQSNQIC-NCGTTRTPLWRRSPQGATICNACGLYQKARNASRPTNLKR 159
Query: 122 P 122
P
Sbjct: 160 P 160
Score = 42.7 bits (99), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +GH +CNA
Sbjct: 277 CQNCGTTITPLWRRDESGHTICNA 300
Score = 36.2 bits (82), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 7 ASLPVED--IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
AS+P +E + C NCG TPLWRR G +CNA
Sbjct: 101 ASVPASSPGVEQIQSNQIC-NCGTTRTPLWRRSPQGATICNA 141
>gi|388851597|emb|CCF54787.1| probable siderophore biosynthesis regulatory protein URBS1
[Ustilago hordei]
Length = 1055
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G+ +CNACGLYHK++G +RP+
Sbjct: 486 CTNCQTTTTPLWRRDEDGNNICNACGLYHKLHGTHRPI 523
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 23/48 (47%), Positives = 28/48 (58%)
Query: 75 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP 122
+ G C NCG STPLWRR G +CNACGLY K + +R +P
Sbjct: 334 YHAGMRCSNCGVTSTPLWRRAPDGSTICNACGLYMKPHSTHRTPSSRP 381
Score = 41.6 bits (96), Expect = 0.12, Method: Composition-based stats.
Identities = 16/30 (53%), Positives = 18/30 (60%)
Query: 17 FTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+ G C NCG STPLWRR G +CNA
Sbjct: 334 YHAGMRCSNCGVTSTPLWRRAPDGSTICNA 363
Score = 39.7 bits (91), Expect = 0.38, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G+ +CNA
Sbjct: 486 CTNCQTTTTPLWRRDEDGNNICNA 509
>gi|393222104|gb|EJD07588.1| hypothetical protein FOMMEDRAFT_100924 [Fomitiporia mediterranea
MF3/22]
Length = 676
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 23/37 (62%), Positives = 30/37 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +TPLWRRD G+ +CNACGLY+K++G +RP
Sbjct: 290 CANCGTSTTPLWRRDDAGNNICNACGLYYKLHGTHRP 326
Score = 43.9 bits (102), Expect = 0.025, Method: Composition-based stats.
Identities = 16/31 (51%), Positives = 19/31 (61%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHK 110
+C +C A T WRRD G +CNACGL K
Sbjct: 96 QCASCNATKTSQWRRDADGRQICNACGLSQK 126
Score = 42.7 bits (99), Expect = 0.047, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G+ +CNA
Sbjct: 290 CANCGTSTTPLWRRDDAGNNICNA 313
>gi|170596|gb|AAB05617.1| URBS1 [Ustilago maydis]
Length = 950
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G+ +CNACGLYHK++G +RP+
Sbjct: 482 CTNCQTTTTPLWRRDEDGNNICNACGLYHKLHGTHRPI 519
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 25/39 (64%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNR 116
G C NCG STPLWRR G +CNACGLY K + +R
Sbjct: 335 GMRCSNCGVTSTPLWRRAPDGSTICNACGLYIKSHSTHR 373
Score = 41.2 bits (95), Expect = 0.14, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR G +CNA
Sbjct: 335 GMRCSNCGVTSTPLWRRAPDGSTICNA 361
Score = 39.7 bits (91), Expect = 0.39, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G+ +CNA
Sbjct: 482 CTNCQTTTTPLWRRDEDGNNICNA 505
>gi|294656337|ref|XP_458597.2| DEHA2D02992p [Debaryomyces hansenii CBS767]
gi|199431394|emb|CAG86732.2| DEHA2D02992p [Debaryomyces hansenii CBS767]
Length = 574
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 38/57 (66%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNCYER 137
C NCG+ TPLWRRD G+ +CNACGLY++++G +RP+ + ++ + N +R
Sbjct: 199 CFNCGSTITPLWRRDDVGNTICNACGLYYRLHGSHRPIKMKRNTIKRRKRNMTLIKR 255
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 33/47 (70%), Gaps = 1/47 (2%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
+G++C NCG TPLWRR G +CNACGLY + N +RP+ +K+P
Sbjct: 70 DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNNSHRPVNLKRP 116
Score = 42.4 bits (98), Expect = 0.061, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG+ TPLWRRD G+ +CNA
Sbjct: 199 CFNCGSTITPLWRRDDVGNTICNA 222
Score = 42.0 bits (97), Expect = 0.081, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
+G++C NCG TPLWRR G +CNA
Sbjct: 70 DGQQCSNCGTTKTPLWRRAPDGTLICNA 97
>gi|294659953|ref|XP_462408.2| DEHA2G19888p [Debaryomyces hansenii CBS767]
gi|199434358|emb|CAG90917.2| DEHA2G19888p [Debaryomyces hansenii CBS767]
Length = 702
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/43 (58%), Positives = 31/43 (72%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
T+ EC NC A+ TPLWR+D G LCNACGL+ K++G RPL
Sbjct: 214 TKQTECFNCHALKTPLWRKDPQGKTLCNACGLFLKLHGTTRPL 256
Score = 44.3 bits (103), Expect = 0.016, Method: Composition-based stats.
Identities = 17/29 (58%), Positives = 20/29 (68%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
T+ EC NC A+ TPLWR+D G LCNA
Sbjct: 214 TKQTECFNCHALKTPLWRKDPQGKTLCNA 242
>gi|395331932|gb|EJF64312.1| hypothetical protein DICSQDRAFT_101190 [Dichomitus squalens
LYAD-421 SS1]
Length = 534
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 23/37 (62%), Positives = 29/37 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +TPLWRRD G+ +CNACGLY K++G +RP
Sbjct: 119 CANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRP 155
Score = 43.5 bits (101), Expect = 0.029, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G+ +CNA
Sbjct: 119 CANCGTSTTPLWRRDDVGNNICNA 142
>gi|403415920|emb|CCM02620.1| predicted protein [Fibroporia radiculosa]
Length = 746
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 23/37 (62%), Positives = 29/37 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +TPLWRRD G+ +CNACGLY K++G +RP
Sbjct: 289 CANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRP 325
Score = 43.5 bits (101), Expect = 0.030, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G+ +CNA
Sbjct: 289 CANCGTSTTPLWRRDDVGNNICNA 312
>gi|121713620|ref|XP_001274421.1| siderophore transcription factor SreA [Aspergillus clavatus NRRL 1]
gi|119402574|gb|EAW12995.1| siderophore transcription factor SreA [Aspergillus clavatus NRRL 1]
Length = 561
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 6/51 (11%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKQPRRL 125
C NCG TPLWRRD GH +CNACGLY+K++G RP ++K+ +R+
Sbjct: 261 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPTTMKKTIIKRRKRV 311
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 27/53 (50%), Positives = 31/53 (58%), Gaps = 4/53 (7%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRG 130
G C NCG STPLWRR TG +CNACGLY K + RP +R R + G
Sbjct: 110 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP----TKRNRLEPG 158
Score = 43.9 bits (102), Expect = 0.019, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 110 GHSCSNCGTKSTPLWRRSPTGAMICNA 136
Score = 43.9 bits (102), Expect = 0.023, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 261 CQNCGTTVTPLWRRDENGHPICNA 284
>gi|299745564|ref|XP_001831801.2| hypothetical protein CC1G_05900 [Coprinopsis cinerea okayama7#130]
gi|298406644|gb|EAU89984.2| hypothetical protein CC1G_05900 [Coprinopsis cinerea okayama7#130]
Length = 500
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 23/37 (62%), Positives = 29/37 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +TPLWRRD G+ +CNACGLY K++G +RP
Sbjct: 119 CANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRP 155
Score = 43.5 bits (101), Expect = 0.031, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G+ +CNA
Sbjct: 119 CANCGTSTTPLWRRDDVGNNICNA 142
>gi|156844572|ref|XP_001645348.1| hypothetical protein Kpol_1058p27 [Vanderwaltozyma polyspora DSM
70294]
gi|156116009|gb|EDO17490.1| hypothetical protein Kpol_1058p27 [Vanderwaltozyma polyspora DSM
70294]
Length = 484
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/54 (51%), Positives = 38/54 (70%), Gaps = 1/54 (1%)
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
ASL + I+Y + C NC +TPLWRRD G+ LCNACGL++K++G+ RPL
Sbjct: 385 ASLSTDKIQY-NDDMICDNCKTTNTPLWRRDPIGNPLCNACGLFYKLHGVMRPL 437
Score = 42.7 bits (99), Expect = 0.047, Method: Composition-based stats.
Identities = 20/40 (50%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
ASL + I+Y + C NC +TPLWRRD G+ LCNA
Sbjct: 385 ASLSTDKIQY-NDDMICDNCKTTNTPLWRRDPIGNPLCNA 423
>gi|385304401|gb|EIF48420.1| gln3p [Dekkera bruxellensis AWRI1499]
Length = 732
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 29/39 (74%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NCG TPLWR+D G+ LCNACGL+ K++G RPL
Sbjct: 360 QCSNCGTTKTPLWRKDSQGNTLCNACGLFQKLHGTMRPL 398
Score = 43.5 bits (101), Expect = 0.026, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NCG TPLWR+D G+ LCNA
Sbjct: 360 QCSNCGTTKTPLWRKDSQGNTLCNA 384
>gi|119479079|ref|XP_001259568.1| siderophore transcription factor SreA [Neosartorya fischeri NRRL
181]
gi|119407722|gb|EAW17671.1| siderophore transcription factor SreA [Neosartorya fischeri NRRL
181]
Length = 546
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 253 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRP 289
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 27/54 (50%), Positives = 32/54 (59%), Gaps = 4/54 (7%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGN 131
G C NCG STPLWRR TG +CNACGLY K + RP +R R + G+
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP----TKRNRMEPGS 149
Score = 43.9 bits (102), Expect = 0.020, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGAMICNA 126
Score = 43.9 bits (102), Expect = 0.025, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 253 CQNCGTTVTPLWRRDENGHPICNA 276
>gi|70997557|ref|XP_753523.1| siderophore transcription factor SreA [Aspergillus fumigatus Af293]
gi|66851159|gb|EAL91485.1| siderophore transcription factor SreA [Aspergillus fumigatus Af293]
gi|159126746|gb|EDP51862.1| siderophore transcription factor SreA [Aspergillus fumigatus A1163]
Length = 546
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 253 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRP 289
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 27/54 (50%), Positives = 32/54 (59%), Gaps = 4/54 (7%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGN 131
G C NCG STPLWRR TG +CNACGLY K + RP +R R + G+
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP----TKRNRMEPGS 149
Score = 43.9 bits (102), Expect = 0.021, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGAMICNA 126
Score = 43.5 bits (101), Expect = 0.026, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 253 CQNCGTTVTPLWRRDENGHPICNA 276
>gi|50552360|ref|XP_503590.1| YALI0E05555p [Yarrowia lipolytica]
gi|49649459|emb|CAG79171.1| YALI0E05555p [Yarrowia lipolytica CLIB122]
Length = 805
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD +G +CNACGLY++++G++RP+
Sbjct: 397 CQNCGTTITPLWRRDDSGSTICNACGLYYRLHGVHRPV 434
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 24/44 (54%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKQP 122
C NCG TPLWRR G +CNACGLY K ++RP L K P
Sbjct: 228 CSNCGTTRTPLWRRAPNGATICNACGLYLKARNISRPTHLEKPP 271
Score = 42.0 bits (97), Expect = 0.084, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD +G +CNA
Sbjct: 397 CQNCGTTITPLWRRDDSGSTICNA 420
Score = 38.5 bits (88), Expect = 0.97, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRR G +CNA
Sbjct: 228 CSNCGTTRTPLWRRAPNGATICNA 251
>gi|406699318|gb|EKD02524.1| iron regulator 1 [Trichosporon asahii var. asahii CBS 8904]
Length = 977
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/41 (60%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+
Sbjct: 347 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPV 387
Score = 40.4 bits (93), Expect = 0.27, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRRD G CNA
Sbjct: 347 GMSCRNCGTSTTPLWRRDEEGRPQCNA 373
>gi|169775323|ref|XP_001822129.1| GATA factor SREP [Aspergillus oryzae RIB40]
gi|83769992|dbj|BAE60127.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 566
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 27/45 (60%), Positives = 31/45 (68%), Gaps = 3/45 (6%)
Query: 76 TEGR---ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
TEG C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 258 TEGNMLVSCQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRP 302
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 106 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 145
Score = 43.9 bits (102), Expect = 0.024, Method: Composition-based stats.
Identities = 19/32 (59%), Positives = 20/32 (62%), Gaps = 3/32 (9%)
Query: 18 TEGR---ECVNCGAISTPLWRRDGTGHYLCNA 46
TEG C NCG TPLWRRD GH +CNA
Sbjct: 258 TEGNMLVSCQNCGTTVTPLWRRDENGHPICNA 289
Score = 43.5 bits (101), Expect = 0.029, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 106 GHSCSNCGTKSTPLWRRSPTGAMICNA 132
>gi|391873006|gb|EIT82081.1| GATA transcription factor [Aspergillus oryzae 3.042]
Length = 566
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 27/45 (60%), Positives = 31/45 (68%), Gaps = 3/45 (6%)
Query: 76 TEGR---ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
TEG C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 258 TEGNMLVSCQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRP 302
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 106 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 145
Score = 43.9 bits (102), Expect = 0.024, Method: Composition-based stats.
Identities = 19/32 (59%), Positives = 20/32 (62%), Gaps = 3/32 (9%)
Query: 18 TEGR---ECVNCGAISTPLWRRDGTGHYLCNA 46
TEG C NCG TPLWRRD GH +CNA
Sbjct: 258 TEGNMLVSCQNCGTTVTPLWRRDENGHPICNA 289
Score = 43.5 bits (101), Expect = 0.029, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 106 GHSCSNCGTKSTPLWRRSPTGAMICNA 132
>gi|401888062|gb|EJT52030.1| iron regulator 1 [Trichosporon asahii var. asahii CBS 2479]
Length = 977
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/41 (60%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+
Sbjct: 347 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPV 387
Score = 40.0 bits (92), Expect = 0.28, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRRD G CNA
Sbjct: 347 GMSCRNCGTSTTPLWRRDEEGRPQCNA 373
>gi|241949727|ref|XP_002417586.1| negative regulator of iron uptake genes, putative [Candida
dubliniensis CD36]
gi|223640924|emb|CAX45241.1| negative regulator of iron uptake genes, putative [Candida
dubliniensis CD36]
Length = 523
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/48 (52%), Positives = 35/48 (72%), Gaps = 1/48 (2%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
T+G++C NCG TPLWRR G +CNACGLY++ N +RP+ +K+P
Sbjct: 51 TDGQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRP 98
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 25/50 (50%), Positives = 36/50 (72%), Gaps = 1/50 (2%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRLRFDR 129
C NCG TPLWRRD G+ +CNACGL+++++G +RP+ +K+P R R
Sbjct: 179 CFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMKRPTIKRRKR 228
Score = 43.5 bits (101), Expect = 0.026, Method: Composition-based stats.
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
T+G++C NCG TPLWRR G +CNA
Sbjct: 51 TDGQQCSNCGTTKTPLWRRAPDGTLICNA 79
Score = 41.6 bits (96), Expect = 0.100, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G+ +CNA
Sbjct: 179 CFNCGTTITPLWRRDDAGNTICNA 202
>gi|449548468|gb|EMD39435.1| URBS1-like transcription factor [Ceriporiopsis subvermispora B]
Length = 728
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 23/37 (62%), Positives = 29/37 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +TPLWRRD G+ +CNACGLY K++G +RP
Sbjct: 299 CANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRP 335
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 25/37 (67%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG STPLWRRD G +CNACGLY K M RP
Sbjct: 98 CANCGTSSTPLWRRDADGKSVCNACGLYLKNRNMARP 134
Score = 43.5 bits (101), Expect = 0.025, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG STPLWRRD G +CNA
Sbjct: 98 CANCGTSSTPLWRRDADGKSVCNA 121
Score = 43.1 bits (100), Expect = 0.037, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G+ +CNA
Sbjct: 299 CANCGTSTTPLWRRDDVGNNICNA 322
>gi|238496045|ref|XP_002379258.1| siderophore transcription factor SreA [Aspergillus flavus NRRL3357]
gi|220694138|gb|EED50482.1| siderophore transcription factor SreA [Aspergillus flavus NRRL3357]
Length = 559
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 27/45 (60%), Positives = 31/45 (68%), Gaps = 3/45 (6%)
Query: 76 TEGR---ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
TEG C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 251 TEGNMLVSCQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRP 295
Score = 58.5 bits (140), Expect = 1e-06, Method: Composition-based stats.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 99 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 138
Score = 43.5 bits (101), Expect = 0.025, Method: Composition-based stats.
Identities = 19/32 (59%), Positives = 20/32 (62%), Gaps = 3/32 (9%)
Query: 18 TEGR---ECVNCGAISTPLWRRDGTGHYLCNA 46
TEG C NCG TPLWRRD GH +CNA
Sbjct: 251 TEGNMLVSCQNCGTTVTPLWRRDENGHPICNA 282
Score = 43.5 bits (101), Expect = 0.030, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 99 GHSCSNCGTKSTPLWRRSPTGAMICNA 125
>gi|366998729|ref|XP_003684101.1| hypothetical protein TPHA_0A05930 [Tetrapisispora phaffii CBS 4417]
gi|357522396|emb|CCE61667.1| hypothetical protein TPHA_0A05930 [Tetrapisispora phaffii CBS 4417]
Length = 812
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 28/39 (71%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 330 QCANCKTFKTPLWRRDAQGNILCNACGLFQKLHGTMRPL 368
Score = 43.1 bits (100), Expect = 0.035, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRRD G+ LCNA
Sbjct: 330 QCANCKTFKTPLWRRDAQGNILCNA 354
>gi|410074657|ref|XP_003954911.1| hypothetical protein KAFR_0A03410 [Kazachstania africana CBS 2517]
gi|372461493|emb|CCF55776.1| hypothetical protein KAFR_0A03410 [Kazachstania africana CBS 2517]
Length = 336
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 43/70 (61%), Gaps = 3/70 (4%)
Query: 52 SFTGYFIMSRFLFASLPVEDIEYFTEGRE---CVNCGAISTPLWRRDGTGHYLCNACGLY 108
SFT + I+ + AS + TE ++ C NC +TPLWRRD G+ LCNACGL+
Sbjct: 86 SFTDHSIVETDIIASSTQQQNVRLTESKDAHHCSNCLTTTTPLWRRDLDGNILCNACGLF 145
Query: 109 HKMNGMNRPL 118
K++G +RP+
Sbjct: 146 LKLHGNSRPI 155
Score = 39.7 bits (91), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 21/33 (63%), Gaps = 3/33 (9%)
Query: 17 FTEGRE---CVNCGAISTPLWRRDGTGHYLCNA 46
TE ++ C NC +TPLWRRD G+ LCNA
Sbjct: 109 LTESKDAHHCSNCLTTTTPLWRRDLDGNILCNA 141
>gi|242762467|ref|XP_002340383.1| siderophore transcription factor SreA [Talaromyces stipitatus ATCC
10500]
gi|218723579|gb|EED22996.1| siderophore transcription factor SreA [Talaromyces stipitatus ATCC
10500]
Length = 493
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 256 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRP 292
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/47 (53%), Positives = 29/47 (61%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
G C NCG STPLWRR TG +CNACGLY K ++RP + R
Sbjct: 103 GHSCSNCGTKSTPLWRRSPTGATICNACGLYLKARNVHRPTKRNRNR 149
Score = 43.5 bits (101), Expect = 0.030, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 103 GHSCSNCGTKSTPLWRRSPTGATICNA 129
Score = 43.5 bits (101), Expect = 0.030, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 256 CQNCGTTVTPLWRRDENGHPICNA 279
>gi|145228733|ref|XP_001388675.1| GATA factor SREP [Aspergillus niger CBS 513.88]
gi|134054767|emb|CAK43607.1| unnamed protein product [Aspergillus niger]
gi|350637901|gb|EHA26257.1| hypothetical protein ASPNIDRAFT_52040 [Aspergillus niger ATCC 1015]
Length = 569
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 267 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRP 303
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 108 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 147
Score = 43.5 bits (101), Expect = 0.025, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 108 GHSCSNCGTKSTPLWRRSPTGAMICNA 134
Score = 43.5 bits (101), Expect = 0.030, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 267 CQNCGTTVTPLWRRDENGHPICNA 290
>gi|255953715|ref|XP_002567610.1| GATA transcription factor sreP-Penicillium chrysogenum [Penicillium
chrysogenum Wisconsin 54-1255]
gi|2494691|sp|Q92259.1|SREP_PENCH RecName: Full=GATA factor SREP
gi|1517916|gb|AAC49628.1| SreP [Penicillium chrysogenum]
gi|211589321|emb|CAP95461.1| GATA transcription factor sreP-Penicillium chrysogenum [Penicillium
chrysogenum Wisconsin 54-1255]
Length = 532
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 238 CQNCGTTVTPLWRRDEQGHPICNACGLYYKLHGCYRP 274
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 91 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 130
Score = 43.9 bits (102), Expect = 0.023, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 91 GHSCSNCGTKSTPLWRRSPTGAMICNA 117
Score = 43.5 bits (101), Expect = 0.032, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 238 CQNCGTTVTPLWRRDEQGHPICNA 261
>gi|303322645|ref|XP_003071314.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
gi|240111016|gb|EER29169.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
Length = 429
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/38 (65%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRDG G LCNACGL+ K++G RPL
Sbjct: 149 CQNCGTSKTPLWRRDGMGAVLCNACGLFLKLHGRPRPL 186
Score = 44.7 bits (104), Expect = 0.011, Method: Composition-based stats.
Identities = 17/24 (70%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRDG G LCNA
Sbjct: 149 CQNCGTSKTPLWRRDGMGAVLCNA 172
>gi|67515789|ref|XP_657780.1| hypothetical protein AN0176.2 [Aspergillus nidulans FGSC A4]
gi|4585213|gb|AAD25328.1|AF095898_1 siderophore biosynthesis repressor SREA [Emericella nidulans]
gi|40746893|gb|EAA66049.1| hypothetical protein AN0176.2 [Aspergillus nidulans FGSC A4]
gi|259489617|tpe|CBF90036.1| TPA: Putative uncharacterized proteinSiderophore biosynthesis
repressor SREA ; [Source:UniProtKB/TrEMBL;Acc:Q9Y754]
[Aspergillus nidulans FGSC A4]
Length = 549
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 251 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGSYRP 287
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 103 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 142
Score = 43.5 bits (101), Expect = 0.025, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 103 GHSCSNCGTKSTPLWRRSPTGAMICNA 129
Score = 43.5 bits (101), Expect = 0.030, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 251 CQNCGTTVTPLWRRDENGHPICNA 274
>gi|260944592|ref|XP_002616594.1| hypothetical protein CLUG_03835 [Clavispora lusitaniae ATCC 42720]
gi|238850243|gb|EEQ39707.1| hypothetical protein CLUG_03835 [Clavispora lusitaniae ATCC 42720]
Length = 473
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 36/53 (67%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFN 133
C NC + TPLWRRD G+ +CNACGLY++++G +RP+ + ++ + N N
Sbjct: 241 CFNCSSTITPLWRRDDAGNTICNACGLYYRLHGSHRPIKMKRNTIKRRKRNLN 293
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 21/40 (52%), Positives = 27/40 (67%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
++C NCG TPLWRR G +CNACGLY + N +RP+
Sbjct: 127 QQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNNHHRPV 166
Score = 40.4 bits (93), Expect = 0.25, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC + TPLWRRD G+ +CNA
Sbjct: 241 CFNCSSTITPLWRRDDAGNTICNA 264
Score = 39.3 bits (90), Expect = 0.47, Method: Composition-based stats.
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNA 46
++C NCG TPLWRR G +CNA
Sbjct: 127 QQCSNCGTTKTPLWRRAPDGTLICNA 152
>gi|115401494|ref|XP_001216335.1| GATA factor SREP [Aspergillus terreus NIH2624]
gi|114190276|gb|EAU31976.1| GATA factor SREP [Aspergillus terreus NIH2624]
Length = 536
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 247 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRP 283
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 24/40 (60%), Positives = 27/40 (67%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG STPLWRR TG +CNACGLY K ++RP
Sbjct: 93 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVSRP 132
Score = 43.5 bits (101), Expect = 0.026, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 93 GHSCSNCGTKSTPLWRRSPTGAMICNA 119
Score = 43.5 bits (101), Expect = 0.031, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 247 CQNCGTTVTPLWRRDENGHPICNA 270
>gi|225558357|gb|EEH06641.1| siderophore transcription factor SreA [Ajellomyces capsulatus
G186AR]
Length = 630
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 27/37 (72%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NC TPLWRRD GH +CNACGLYHK++G RP
Sbjct: 291 CQNCQTTVTPLWRRDENGHPICNACGLYHKLHGAYRP 327
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 125 GHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 164
Score = 41.6 bits (96), Expect = 0.098, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG TPLWRR TG +CNA
Sbjct: 125 GHSCSNCGTKRTPLWRRSPTGATICNA 151
Score = 40.4 bits (93), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 291 CQNCQTTVTPLWRRDENGHPICNA 314
>gi|164685140|gb|ABY66603.1| siderophore uptake regulator [Ajellomyces capsulatus]
Length = 630
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 27/37 (72%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NC TPLWRRD GH +CNACGLYHK++G RP
Sbjct: 291 CQNCQTTVTPLWRRDENGHPICNACGLYHKLHGAYRP 327
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 125 GHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 164
Score = 41.6 bits (96), Expect = 0.098, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG TPLWRR TG +CNA
Sbjct: 125 GHSCSNCGTKRTPLWRRSPTGATICNA 151
Score = 40.4 bits (93), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 291 CQNCQTTVTPLWRRDENGHPICNA 314
>gi|358372181|dbj|GAA88786.1| siderophore transcription factor SreA [Aspergillus kawachii IFO
4308]
Length = 568
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 266 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRP 302
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 107 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 146
Score = 43.5 bits (101), Expect = 0.027, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 107 GHSCSNCGTKSTPLWRRSPTGAMICNA 133
Score = 43.5 bits (101), Expect = 0.032, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 266 CQNCGTTVTPLWRRDENGHPICNA 289
>gi|240274786|gb|EER38301.1| siderophore biosynthesis repressor SREA [Ajellomyces capsulatus
H143]
Length = 630
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 27/37 (72%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NC TPLWRRD GH +CNACGLYHK++G RP
Sbjct: 291 CQNCQTTVTPLWRRDENGHPICNACGLYHKLHGAYRP 327
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 125 GHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 164
Score = 41.6 bits (96), Expect = 0.10, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG TPLWRR TG +CNA
Sbjct: 125 GHSCSNCGTKRTPLWRRSPTGATICNA 151
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 291 CQNCQTTVTPLWRRDENGHPICNA 314
>gi|342319729|gb|EGU11676.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
Length = 3850
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 30/38 (78%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
+C+NCG +TPLWRRD G LCNACGL+ ++G++RP
Sbjct: 3733 KCLNCGTTNTPLWRRDSEGKPLCNACGLFRNLHGVDRP 3770
Score = 44.7 bits (104), Expect = 0.012, Method: Composition-based stats.
Identities = 16/25 (64%), Positives = 19/25 (76%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C+NCG +TPLWRRD G LCNA
Sbjct: 3733 KCLNCGTTNTPLWRRDSEGKPLCNA 3757
>gi|425771811|gb|EKV10244.1| GATA factor SREP [Penicillium digitatum Pd1]
gi|425777158|gb|EKV15342.1| GATA factor SREP [Penicillium digitatum PHI26]
Length = 528
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 231 CQNCGTTVTPLWRRDEQGHPICNACGLYYKLHGCYRP 267
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 84 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 123
Score = 43.5 bits (101), Expect = 0.025, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 84 GHSCSNCGTKSTPLWRRSPTGAMICNA 110
Score = 43.1 bits (100), Expect = 0.036, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 231 CQNCGTTVTPLWRRDEQGHPICNA 254
>gi|255723273|ref|XP_002546570.1| predicted protein [Candida tropicalis MYA-3404]
gi|240130701|gb|EER30264.1| predicted protein [Candida tropicalis MYA-3404]
Length = 674
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD +G+ LCNACGL+ K++G +RPL
Sbjct: 215 CYNCKTTATPLWRRDASGNTLCNACGLFLKLHGTSRPL 252
Score = 43.1 bits (100), Expect = 0.039, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD +G+ LCNA
Sbjct: 215 CYNCKTTATPLWRRDASGNTLCNA 238
>gi|212529674|ref|XP_002144994.1| siderophore transcription factor SreA [Talaromyces marneffei ATCC
18224]
gi|210074392|gb|EEA28479.1| siderophore transcription factor SreA [Talaromyces marneffei ATCC
18224]
Length = 485
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 250 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRP 286
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 25/47 (53%), Positives = 29/47 (61%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
G C NCG STPLWRR TG +CNACGLY K ++RP + R
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGATICNACGLYLKARNVHRPTKRNRNR 146
Score = 43.1 bits (100), Expect = 0.034, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG STPLWRR TG +CNA
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGATICNA 126
Score = 43.1 bits (100), Expect = 0.034, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 250 CQNCGTTVTPLWRRDENGHPICNA 273
>gi|406602548|emb|CCH45864.1| Nitrogen catabolic enzyme regulatory protein [Wickerhamomyces
ciferrii]
Length = 519
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 24/39 (61%), Positives = 28/39 (71%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
EC NC TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 199 ECFNCHTQKTPLWRRDSNGNTLCNACGLFQKLHGTMRPL 237
Score = 42.7 bits (99), Expect = 0.045, Method: Composition-based stats.
Identities = 16/25 (64%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
EC NC TPLWRRD G+ LCNA
Sbjct: 199 ECFNCHTQKTPLWRRDSNGNTLCNA 223
>gi|344234670|gb|EGV66538.1| hypothetical protein CANTEDRAFT_91703 [Candida tenuis ATCC 10573]
Length = 499
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 26/49 (53%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKQP 122
++G+ C NCG +TPLWRR G +CNACGLY + N NRP L K P
Sbjct: 87 SDGQSCSNCGTKTTPLWRRSTNGTLICNACGLYLRSNNTNRPVNLKKTP 135
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC TPLWRRD G+ +CNACGLY+K++G +RP+
Sbjct: 198 CFNCNTTITPLWRRDDGGNTICNACGLYYKLHGSHRPV 235
Score = 43.5 bits (101), Expect = 0.029, Method: Composition-based stats.
Identities = 15/29 (51%), Positives = 20/29 (68%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
++G+ C NCG +TPLWRR G +CNA
Sbjct: 87 SDGQSCSNCGTKTTPLWRRSTNGTLICNA 115
Score = 39.3 bits (90), Expect = 0.50, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD G+ +CNA
Sbjct: 198 CFNCNTTITPLWRRDDGGNTICNA 221
>gi|390600081|gb|EIN09476.1| hypothetical protein PUNSTDRAFT_143032 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 697
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 23/37 (62%), Positives = 29/37 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +TPLWRRD G+ +CNACGLY K++G +RP
Sbjct: 285 CANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRP 321
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 22/37 (59%), Positives = 25/37 (67%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +TPLWRRD G +CNACGLY K + RP
Sbjct: 96 CTNCGTTTTPLWRRDAEGKSICNACGLYLKSRRVPRP 132
Score = 43.5 bits (101), Expect = 0.031, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G +CNA
Sbjct: 96 CTNCGTTTTPLWRRDAEGKSICNA 119
Score = 42.7 bits (99), Expect = 0.047, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G+ +CNA
Sbjct: 285 CANCGTSTTPLWRRDDVGNNICNA 308
>gi|325094138|gb|EGC47448.1| siderophore transcription factor SreA [Ajellomyces capsulatus H88]
Length = 541
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 27/37 (72%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NC TPLWRRD GH +CNACGLYHK++G RP
Sbjct: 202 CQNCQTTVTPLWRRDENGHPICNACGLYHKLHGAYRP 238
Score = 52.0 bits (123), Expect = 7e-05, Method: Composition-based stats.
Identities = 21/35 (60%), Positives = 23/35 (65%)
Query: 83 NCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 41 NCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 75
Score = 40.4 bits (93), Expect = 0.25, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 202 CQNCQTTVTPLWRRDENGHPICNA 225
Score = 36.6 bits (83), Expect = 3.6, Method: Composition-based stats.
Identities = 14/22 (63%), Positives = 15/22 (68%)
Query: 25 NCGAISTPLWRRDGTGHYLCNA 46
NCG TPLWRR TG +CNA
Sbjct: 41 NCGTKRTPLWRRSPTGATICNA 62
>gi|327353473|gb|EGE82330.1| siderophore transcription factor SreA [Ajellomyces dermatitidis
ATCC 18188]
Length = 634
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 27/37 (72%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NC TPLWRRD GH +CNACGLYHK++G RP
Sbjct: 290 CQNCQTTVTPLWRRDENGHPICNACGLYHKLHGSYRP 326
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 126 GHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 165
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG TPLWRR TG +CNA
Sbjct: 126 GHSCSNCGTKRTPLWRRSPTGATICNA 152
Score = 40.4 bits (93), Expect = 0.23, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 290 CQNCQTTVTPLWRRDENGHPICNA 313
>gi|239611963|gb|EEQ88950.1| siderophore transcription factor SreA [Ajellomyces dermatitidis
ER-3]
Length = 634
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 27/37 (72%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NC TPLWRRD GH +CNACGLYHK++G RP
Sbjct: 290 CQNCQTTVTPLWRRDENGHPICNACGLYHKLHGSYRP 326
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 126 GHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 165
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG TPLWRR TG +CNA
Sbjct: 126 GHSCSNCGTKRTPLWRRSPTGATICNA 152
Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 290 CQNCQTTVTPLWRRDENGHPICNA 313
>gi|261201874|ref|XP_002628151.1| siderophore transcription factor SreA [Ajellomyces dermatitidis
SLH14081]
gi|239590248|gb|EEQ72829.1| siderophore transcription factor SreA [Ajellomyces dermatitidis
SLH14081]
Length = 633
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 27/37 (72%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NC TPLWRRD GH +CNACGLYHK++G RP
Sbjct: 289 CQNCQTTVTPLWRRDENGHPICNACGLYHKLHGSYRP 325
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 125 GHSCSNCGTKRTPLWRRSPTGATICNACGLYLKARNTDRP 164
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG TPLWRR TG +CNA
Sbjct: 125 GHSCSNCGTKRTPLWRRSPTGATICNA 151
Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 289 CQNCQTTVTPLWRRDENGHPICNA 312
>gi|164657933|ref|XP_001730092.1| hypothetical protein MGL_2474 [Malassezia globosa CBS 7966]
gi|159103987|gb|EDP42878.1| hypothetical protein MGL_2474 [Malassezia globosa CBS 7966]
Length = 865
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 37/59 (62%), Gaps = 3/59 (5%)
Query: 63 LFASLPVEDI---EYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
L A+L E++ T C NC TPLWRRD G+ LCNACGL+H+++G+ RPL
Sbjct: 754 LHATLSAENLGSASSPTSTTRCFNCHTSMTPLWRRDPDGNVLCNACGLFHRLHGVMRPL 812
Score = 40.0 bits (92), Expect = 0.31, Method: Composition-based stats.
Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 3/45 (6%)
Query: 5 LFASLPVEDI---EYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
L A+L E++ T C NC TPLWRRD G+ LCNA
Sbjct: 754 LHATLSAENLGSASSPTSTTRCFNCHTSMTPLWRRDPDGNVLCNA 798
>gi|358056188|dbj|GAA97928.1| hypothetical protein E5Q_04608 [Mixia osmundae IAM 14324]
Length = 1047
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C+NC +TPLWRRD G LCN+CGL+ K++G+NRPL
Sbjct: 946 CMNCKTSNTPLWRRDELGQPLCNSCGLFKKLHGVNRPL 983
Score = 39.3 bits (90), Expect = 0.59, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C+NC +TPLWRRD G LCN+
Sbjct: 946 CMNCKTSNTPLWRRDELGQPLCNS 969
>gi|302652995|ref|XP_003018335.1| siderophore transcription factor SreA [Trichophyton verrucosum HKI
0517]
gi|291181965|gb|EFE37690.1| siderophore transcription factor SreA [Trichophyton verrucosum HKI
0517]
Length = 545
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 31/40 (77%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ C NCG TPLWRRD G+ +CNACGLY++++G +RP+
Sbjct: 251 KACQNCGTTLTPLWRRDDQGNTICNACGLYYRLHGSHRPV 290
Score = 42.0 bits (97), Expect = 0.080, Method: Composition-based stats.
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNA 46
+ C NCG TPLWRRD G+ +CNA
Sbjct: 251 KACQNCGTTLTPLWRRDDQGNTICNA 276
>gi|448121576|ref|XP_004204242.1| Piso0_000073 [Millerozyma farinosa CBS 7064]
gi|358349781|emb|CCE73060.1| Piso0_000073 [Millerozyma farinosa CBS 7064]
Length = 621
Score = 62.4 bits (150), Expect = 5e-08, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 36/56 (64%), Gaps = 2/56 (3%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR--RLRFDRGNFN 133
EC NC I TPLWR+D G+ LCNACGL+ K++G RPL + + R R +FN
Sbjct: 213 ECFNCHTIKTPLWRKDPQGNTLCNACGLFLKLHGTMRPLSLKTDVIKKRSSRKSFN 268
Score = 43.1 bits (100), Expect = 0.034, Method: Composition-based stats.
Identities = 16/25 (64%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
EC NC I TPLWR+D G+ LCNA
Sbjct: 213 ECFNCHTIKTPLWRKDPQGNTLCNA 237
>gi|448123947|ref|XP_004204795.1| Piso0_000073 [Millerozyma farinosa CBS 7064]
gi|358249428|emb|CCE72494.1| Piso0_000073 [Millerozyma farinosa CBS 7064]
Length = 620
Score = 62.4 bits (150), Expect = 5e-08, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 36/56 (64%), Gaps = 2/56 (3%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR--RLRFDRGNFN 133
EC NC I TPLWR+D G+ LCNACGL+ K++G RPL + + R R +FN
Sbjct: 213 ECFNCHTIKTPLWRKDPQGNTLCNACGLFLKLHGTMRPLSLKTDVIKKRSSRKSFN 268
Score = 43.1 bits (100), Expect = 0.034, Method: Composition-based stats.
Identities = 16/25 (64%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
EC NC I TPLWR+D G+ LCNA
Sbjct: 213 ECFNCHTIKTPLWRKDPQGNTLCNA 237
>gi|390602192|gb|EIN11585.1| hypothetical protein PUNSTDRAFT_83261 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1041
Score = 62.4 bits (150), Expect = 5e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD +G LCNACGL++K++G+ RPL
Sbjct: 919 CTNCQTTNTPLWRRDPSGQPLCNACGLFYKLHGVVRPL 956
Score = 41.6 bits (96), Expect = 0.10, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD +G LCNA
Sbjct: 919 CTNCQTTNTPLWRRDPSGQPLCNA 942
>gi|164658295|ref|XP_001730273.1| hypothetical protein MGL_2655 [Malassezia globosa CBS 7966]
gi|159104168|gb|EDP43059.1| hypothetical protein MGL_2655 [Malassezia globosa CBS 7966]
Length = 380
Score = 62.4 bits (150), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 24/42 (57%), Positives = 31/42 (73%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
E C NC +TPLWRRD G+ +CNACGLYHK++G +RP+
Sbjct: 117 EALRCTNCQTTTTPLWRRDEDGNNICNACGLYHKLHGTHRPI 158
Score = 38.9 bits (89), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 18/28 (64%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
E C NC +TPLWRRD G+ +CNA
Sbjct: 117 EALRCTNCQTTTTPLWRRDEDGNNICNA 144
>gi|302496671|ref|XP_003010336.1| siderophore transcription factor SreA [Arthroderma benhamiae CBS
112371]
gi|291173879|gb|EFE29696.1| siderophore transcription factor SreA [Arthroderma benhamiae CBS
112371]
Length = 542
Score = 62.4 bits (150), Expect = 5e-08, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 31/40 (77%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ C NCG TPLWRRD G+ +CNACGLY++++G +RP+
Sbjct: 248 KACQNCGTTLTPLWRRDDQGNTICNACGLYYRLHGSHRPV 287
Score = 42.0 bits (97), Expect = 0.084, Method: Composition-based stats.
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNA 46
+ C NCG TPLWRRD G+ +CNA
Sbjct: 248 KACQNCGTTLTPLWRRDDQGNTICNA 273
>gi|295670209|ref|XP_002795652.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226284737|gb|EEH40303.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 650
Score = 62.4 bits (150), Expect = 5e-08, Method: Composition-based stats.
Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 6/51 (11%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKQPRRL 125
C NC TPLWRRD GH +CNACGLYHK++G RP ++K+ +R+
Sbjct: 354 CQNCQTTVTPLWRRDEHGHPICNACGLYHKLHGSYRPPTMKKSIIKRRKRV 404
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 187 GHSCSNCGTKQTPLWRRSPTGATICNACGLYLKARNADRP 226
Score = 42.4 bits (98), Expect = 0.071, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG TPLWRR TG +CNA
Sbjct: 187 GHSCSNCGTKQTPLWRRSPTGATICNA 213
Score = 40.0 bits (92), Expect = 0.31, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 354 CQNCQTTVTPLWRRDEHGHPICNA 377
>gi|150864605|ref|XP_001383491.2| GATA type transcriptional activator of nitrogen-regulated genes
[Scheffersomyces stipitis CBS 6054]
gi|149385861|gb|ABN65462.2| GATA type transcriptional activator of nitrogen-regulated genes
[Scheffersomyces stipitis CBS 6054]
Length = 486
Score = 62.4 bits (150), Expect = 5e-08, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD G+ +CNACGLY++++G +RP+
Sbjct: 192 CFNCGTTITPLWRRDDAGNTICNACGLYYRLHGSHRPI 229
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 33/47 (70%), Gaps = 1/47 (2%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
+G++C NCG TPLWRR G +CNACGLY + N +RP+ +K+P
Sbjct: 66 DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNNTHRPVNLKRP 112
Score = 42.0 bits (97), Expect = 0.082, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
+G++C NCG TPLWRR G +CNA
Sbjct: 66 DGQQCSNCGTTKTPLWRRAPDGTLICNA 93
Score = 42.0 bits (97), Expect = 0.089, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G+ +CNA
Sbjct: 192 CFNCGTTITPLWRRDDAGNTICNA 215
>gi|444323739|ref|XP_004182510.1| hypothetical protein TBLA_0I03360 [Tetrapisispora blattae CBS 6284]
gi|387515557|emb|CCH62991.1| hypothetical protein TBLA_0I03360 [Tetrapisispora blattae CBS 6284]
Length = 1061
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC TPLWRRD G+ LCNACGL+ K++G++RPL
Sbjct: 423 CGNCKTTKTPLWRRDNDGNTLCNACGLFQKLHGISRPL 460
Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD G+ LCNA
Sbjct: 423 CGNCKTTKTPLWRRDNDGNTLCNA 446
>gi|238878590|gb|EEQ42228.1| hypothetical protein CAWG_00430 [Candida albicans WO-1]
Length = 517
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 26/50 (52%), Positives = 37/50 (74%), Gaps = 4/50 (8%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP---RRLR 126
C NCG TPLWRRD G+ +CNACGL+++++G +RP+ +K+P RR R
Sbjct: 184 CFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMKRPTIKRRKR 233
Score = 62.0 bits (149), Expect = 9e-08, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 34/47 (72%), Gaps = 1/47 (2%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
+G++C NCG TPLWRR G +CNACGLY++ N +RP+ +K+P
Sbjct: 56 DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRP 102
Score = 42.0 bits (97), Expect = 0.082, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
+G++C NCG TPLWRR G +CNA
Sbjct: 56 DGQQCSNCGTTKTPLWRRAPDGTLICNA 83
Score = 42.0 bits (97), Expect = 0.089, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G+ +CNA
Sbjct: 184 CFNCGTTITPLWRRDDAGNTICNA 207
>gi|50302249|ref|XP_451058.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640189|emb|CAH02646.1| KLLA0A01342p [Kluyveromyces lactis]
Length = 781
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 29/39 (74%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC + TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 348 QCYNCKTLKTPLWRRDPDGNTLCNACGLFQKLHGTMRPL 386
Score = 42.4 bits (98), Expect = 0.069, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC + TPLWRRD G+ LCNA
Sbjct: 348 QCYNCKTLKTPLWRRDPDGNTLCNA 372
>gi|345486907|ref|XP_003425585.1| PREDICTED: hypothetical protein LOC100678172 [Nasonia vitripennis]
Length = 647
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+
Sbjct: 466 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPV 503
Score = 37.7 bits (86), Expect = 1.7, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++T +WRR+ G +CNA
Sbjct: 466 CTNCGTMTTTIWRRNMKGEMVCNA 489
>gi|332023110|gb|EGI63371.1| GATA-binding factor A [Acromyrmex echinatior]
Length = 595
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+
Sbjct: 425 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPV 462
Score = 37.7 bits (86), Expect = 1.7, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++T +WRR+ G +CNA
Sbjct: 425 CTNCGTMTTTIWRRNMKGEMVCNA 448
>gi|322789591|gb|EFZ14817.1| hypothetical protein SINV_01115 [Solenopsis invicta]
Length = 547
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+
Sbjct: 379 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPV 416
Score = 37.7 bits (86), Expect = 1.7, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++T +WRR+ G +CNA
Sbjct: 379 CTNCGTMTTTIWRRNMKGEMVCNA 402
>gi|307172112|gb|EFN63680.1| GATA-binding factor A [Camponotus floridanus]
Length = 595
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+
Sbjct: 424 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPV 461
Score = 37.7 bits (86), Expect = 1.7, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++T +WRR+ G +CNA
Sbjct: 424 CTNCGTMTTTIWRRNMKGEMVCNA 447
>gi|366991961|ref|XP_003675746.1| hypothetical protein NCAS_0C03920 [Naumovozyma castellii CBS 4309]
gi|342301611|emb|CCC69382.1| hypothetical protein NCAS_0C03920 [Naumovozyma castellii CBS 4309]
Length = 459
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 29/65 (44%), Positives = 39/65 (60%), Gaps = 7/65 (10%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV-------KQPRRLRFDRGNFN 133
C NCG +TPLWR+D G+ LCNACGL+ K++G+ RPL K+ R + N N
Sbjct: 346 CSNCGTGTTPLWRKDANGNSLCNACGLFLKLHGVMRPLSLKTDVIKKRQRNKKTSTANNN 405
Query: 134 CYERS 138
+RS
Sbjct: 406 GKKRS 410
Score = 42.4 bits (98), Expect = 0.059, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWR+D G+ LCNA
Sbjct: 346 CSNCGTGTTPLWRKDANGNSLCNA 369
>gi|226293998|gb|EEH49418.1| GATA factor SREP [Paracoccidioides brasiliensis Pb18]
Length = 583
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 6/51 (11%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKQPRRL 125
C NC TPLWRRD GH +CNACGLYHK++G RP ++K+ +R+
Sbjct: 290 CQNCQTTVTPLWRRDEHGHPICNACGLYHKLHGSYRPPTMKKSIIKRRKRV 340
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 123 GHSCSNCGTKQTPLWRRSPTGATICNACGLYLKARNADRP 162
Score = 42.0 bits (97), Expect = 0.081, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG TPLWRR TG +CNA
Sbjct: 123 GHSCSNCGTKQTPLWRRSPTGATICNA 149
Score = 40.0 bits (92), Expect = 0.34, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 290 CQNCQTTVTPLWRRDEHGHPICNA 313
>gi|225684362|gb|EEH22646.1| GATA factor SREP [Paracoccidioides brasiliensis Pb03]
Length = 448
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 27/37 (72%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NC TPLWRRD GH +CNACGLYHK++G RP
Sbjct: 290 CQNCQTTVTPLWRRDEHGHPICNACGLYHKLHGSYRP 326
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G C NCG TPLWRR TG +CNACGLY K +RP
Sbjct: 123 GHSCSNCGTKQTPLWRRSPTGATICNACGLYLKARNADRP 162
Score = 42.0 bits (97), Expect = 0.081, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG TPLWRR TG +CNA
Sbjct: 123 GHSCSNCGTKQTPLWRRSPTGATICNA 149
Score = 40.0 bits (92), Expect = 0.34, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 290 CQNCQTTVTPLWRRDEHGHPICNA 313
>gi|328785124|ref|XP_001120276.2| PREDICTED: hypothetical protein LOC724420 [Apis mellifera]
Length = 588
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPV 454
Score = 37.7 bits (86), Expect = 1.8, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++T +WRR+ G +CNA
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNA 440
>gi|383865617|ref|XP_003708269.1| PREDICTED: uncharacterized protein LOC100877935 [Megachile
rotundata]
Length = 586
Score = 62.4 bits (150), Expect = 7e-08, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+
Sbjct: 415 CTNCGTMTTTIWRRNMKGEIVCNACGLYYKLHGVNRPV 452
Score = 37.4 bits (85), Expect = 1.9, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++T +WRR+ G +CNA
Sbjct: 415 CTNCGTMTTTIWRRNMKGEIVCNA 438
>gi|405974156|gb|EKC38824.1| Transcription factor GATA-4 [Crassostrea gigas]
Length = 443
Score = 62.4 bits (150), Expect = 7e-08, Method: Composition-based stats.
Identities = 23/41 (56%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 268 GLSCANCSTTTTTLWRRNNEGEPVCNACGLYYKLHGVNRPL 308
Score = 36.6 bits (83), Expect = 3.9, Method: Composition-based stats.
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NC +T LWRR+ G +CNA
Sbjct: 268 GLSCANCSTTTTTLWRRNNEGEPVCNA 294
>gi|380022377|ref|XP_003695026.1| PREDICTED: uncharacterized protein LOC100864759 [Apis florea]
Length = 588
Score = 62.4 bits (150), Expect = 7e-08, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPV 454
Score = 37.4 bits (85), Expect = 1.8, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++T +WRR+ G +CNA
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNA 440
>gi|340960837|gb|EGS22018.1| putative GATA zinc finger protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 651
Score = 62.4 bits (150), Expect = 7e-08, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 341 CQNCGTSTTPLWRRDETGAVLCNACGLFLKLHGRPRPI 378
Score = 44.7 bits (104), Expect = 0.013, Method: Composition-based stats.
Identities = 17/24 (70%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD TG LCNA
Sbjct: 341 CQNCGTSTTPLWRRDETGAVLCNA 364
>gi|255728267|ref|XP_002549059.1| predicted protein [Candida tropicalis MYA-3404]
gi|240133375|gb|EER32931.1| predicted protein [Candida tropicalis MYA-3404]
Length = 516
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 25/50 (50%), Positives = 36/50 (72%), Gaps = 1/50 (2%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRLRFDR 129
C NCG TPLWRRD G+ +CNACGL+++++G +RP+ +K+P R R
Sbjct: 214 CFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMKRPTIKRRKR 263
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 34/47 (72%), Gaps = 1/47 (2%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
+G++C NCG TPLWRR G +CNACGLY++ N +RP+ +K+P
Sbjct: 70 DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRANNTHRPVNLKRP 116
Score = 42.0 bits (97), Expect = 0.091, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
+G++C NCG TPLWRR G +CNA
Sbjct: 70 DGQQCSNCGTTKTPLWRRAPDGTLICNA 97
Score = 41.6 bits (96), Expect = 0.096, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G+ +CNA
Sbjct: 214 CFNCGTTITPLWRRDDAGNTICNA 237
>gi|350398203|ref|XP_003485118.1| PREDICTED: hypothetical protein LOC100747914 [Bombus impatiens]
Length = 588
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPV 454
Score = 37.4 bits (85), Expect = 1.9, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++T +WRR+ G +CNA
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNA 440
>gi|21745321|gb|AAM77345.1|AF520973_1 transcription factor SFU1 [Candida albicans]
Length = 518
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 25/50 (50%), Positives = 36/50 (72%), Gaps = 1/50 (2%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRLRFDR 129
C NCG TPLWRRD G+ +CNACGL+++++G +RP+ +K+P R R
Sbjct: 185 CFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMKRPTIKRRKR 234
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 34/47 (72%), Gaps = 1/47 (2%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
+G++C NCG TPLWRR G +CNACGLY++ N +RP+ +K+P
Sbjct: 57 DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRP 103
Score = 42.0 bits (97), Expect = 0.093, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
+G++C NCG TPLWRR G +CNA
Sbjct: 57 DGQQCSNCGTTKTPLWRRAPDGTLICNA 84
Score = 41.6 bits (96), Expect = 0.099, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G+ +CNA
Sbjct: 185 CFNCGTTITPLWRRDDAGNTICNA 208
>gi|68464951|ref|XP_723553.1| negative regulator of iron uptake genes [Candida albicans SC5314]
gi|68465330|ref|XP_723364.1| negative regulator of iron uptake genes [Candida albicans SC5314]
gi|46445393|gb|EAL04662.1| negative regulator of iron uptake genes [Candida albicans SC5314]
gi|46445589|gb|EAL04857.1| negative regulator of iron uptake genes [Candida albicans SC5314]
Length = 517
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 25/50 (50%), Positives = 36/50 (72%), Gaps = 1/50 (2%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRLRFDR 129
C NCG TPLWRRD G+ +CNACGL+++++G +RP+ +K+P R R
Sbjct: 184 CFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMKRPTIKRRKR 233
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 34/47 (72%), Gaps = 1/47 (2%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
+G++C NCG TPLWRR G +CNACGLY++ N +RP+ +K+P
Sbjct: 56 DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRP 102
Score = 42.0 bits (97), Expect = 0.093, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
+G++C NCG TPLWRR G +CNA
Sbjct: 56 DGQQCSNCGTTKTPLWRRAPDGTLICNA 83
Score = 41.6 bits (96), Expect = 0.099, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G+ +CNA
Sbjct: 184 CFNCGTTITPLWRRDDAGNTICNA 207
>gi|340724648|ref|XP_003400693.1| PREDICTED: hypothetical protein LOC100646437 [Bombus terrestris]
Length = 588
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPV 454
Score = 37.4 bits (85), Expect = 1.9, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++T +WRR+ G +CNA
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNA 440
>gi|195452168|ref|XP_002073242.1| GK13254 [Drosophila willistoni]
gi|194169327|gb|EDW84228.1| GK13254 [Drosophila willistoni]
Length = 746
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ST LWRR+ G+ +CNACGLY+K++ MNRPL
Sbjct: 522 GVTCANCSTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRPL 562
Score = 36.6 bits (83), Expect = 3.5, Method: Composition-based stats.
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NC ST LWRR+ G+ +CNA
Sbjct: 522 GVTCANCSTNSTTLWRRNNEGNPVCNA 548
>gi|403215533|emb|CCK70032.1| hypothetical protein KNAG_0D02830 [Kazachstania naganishii CBS
8797]
Length = 755
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 28/39 (71%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 335 QCFNCKTFKTPLWRRDTDGNTLCNACGLFQKLHGTMRPL 373
Score = 42.0 bits (97), Expect = 0.086, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRRD G+ LCNA
Sbjct: 335 QCFNCKTFKTPLWRRDTDGNTLCNA 359
>gi|260948204|ref|XP_002618399.1| hypothetical protein CLUG_01858 [Clavispora lusitaniae ATCC 42720]
gi|238848271|gb|EEQ37735.1| hypothetical protein CLUG_01858 [Clavispora lusitaniae ATCC 42720]
Length = 563
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 29/39 (74%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
EC NC + TPLWR+D G+ LCNACGL+ K++G RPL
Sbjct: 180 ECFNCHTLKTPLWRKDRVGNTLCNACGLFLKLHGTTRPL 218
Score = 42.4 bits (98), Expect = 0.062, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
EC NC + TPLWR+D G+ LCNA
Sbjct: 180 ECFNCHTLKTPLWRKDRVGNTLCNA 204
>gi|302684915|ref|XP_003032138.1| hypothetical protein SCHCODRAFT_56035 [Schizophyllum commune H4-8]
gi|300105831|gb|EFI97235.1| hypothetical protein SCHCODRAFT_56035 [Schizophyllum commune H4-8]
Length = 190
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 50/101 (49%), Gaps = 8/101 (7%)
Query: 23 CVNCGAISTPLWRRDG-TGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE---- 77
C +C +TPLWRRD LCNA S G + A +D + E
Sbjct: 78 CWHCRTRTTPLWRRDTRIPGLLCNACGLYLSQRGKLRPRELIDAD---DDSDVVREANYS 134
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G EC +C +T +WRR+ G +CNACG+Y ++ G RPL
Sbjct: 135 GPECTHCHTRTTSVWRRNKVGAQVCNACGVYERLKGKERPL 175
>gi|255717635|ref|XP_002555098.1| KLTH0G01320p [Lachancea thermotolerans]
gi|238936482|emb|CAR24661.1| KLTH0G01320p [Lachancea thermotolerans CBS 6340]
Length = 744
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 28/39 (71%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRRD G+ +CNACGL+ K++G RPL
Sbjct: 256 QCFNCKTFKTPLWRRDAQGNTMCNACGLFQKLHGTMRPL 294
Score = 42.0 bits (97), Expect = 0.086, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRRD G+ +CNA
Sbjct: 256 QCFNCKTFKTPLWRRDAQGNTMCNA 280
>gi|195050653|ref|XP_001992938.1| GH13551 [Drosophila grimshawi]
gi|193899997|gb|EDV98863.1| GH13551 [Drosophila grimshawi]
Length = 911
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 29/37 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG ++T +WRR G +CNACGLY+K++G+NRP
Sbjct: 747 CSNCGTLTTTIWRRSARGEMVCNACGLYYKLHGVNRP 783
Score = 37.4 bits (85), Expect = 1.9, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++T +WRR G +CNA
Sbjct: 747 CSNCGTLTTTIWRRSARGEMVCNA 770
>gi|327299572|ref|XP_003234479.1| siderophore transcription factor SreA [Trichophyton rubrum CBS
118892]
gi|326463373|gb|EGD88826.1| siderophore transcription factor SreA [Trichophyton rubrum CBS
118892]
Length = 572
Score = 62.0 bits (149), Expect = 9e-08, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 31/40 (77%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ C NCG TPLWRRD G+ +CNACGLY++++G +RP+
Sbjct: 278 KACQNCGTTLTPLWRRDDQGNTICNACGLYYRLHGSHRPV 317
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 27/40 (67%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G +CVNCG TPLWRR +G +CNACGLY K +RP
Sbjct: 121 GHQCVNCGTKRTPLWRRSLSGSTICNACGLYLKARNTDRP 160
Score = 43.1 bits (100), Expect = 0.037, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 19/27 (70%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G +CVNCG TPLWRR +G +CNA
Sbjct: 121 GHQCVNCGTKRTPLWRRSLSGSTICNA 147
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNA 46
+ C NCG TPLWRRD G+ +CNA
Sbjct: 278 KACQNCGTTLTPLWRRDDQGNTICNA 303
>gi|296815622|ref|XP_002848148.1| GATA factor SREP [Arthroderma otae CBS 113480]
gi|238841173|gb|EEQ30835.1| GATA factor SREP [Arthroderma otae CBS 113480]
Length = 592
Score = 62.0 bits (149), Expect = 9e-08, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 31/40 (77%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ C NCG TPLWRRD G+ +CNACGLY++++G +RP+
Sbjct: 281 KACQNCGTTLTPLWRRDDQGNTICNACGLYYRLHGSHRPV 320
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 27/40 (67%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G +CVNCG TPLWRR +G +CNACGLY K +RP
Sbjct: 125 GHQCVNCGTKRTPLWRRSPSGSTICNACGLYLKARNTDRP 164
Score = 43.9 bits (102), Expect = 0.024, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 19/27 (70%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G +CVNCG TPLWRR +G +CNA
Sbjct: 125 GHQCVNCGTKRTPLWRRSPSGSTICNA 151
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNA 46
+ C NCG TPLWRRD G+ +CNA
Sbjct: 281 KACQNCGTTLTPLWRRDDQGNTICNA 306
>gi|403214757|emb|CCK69257.1| hypothetical protein KNAG_0C01440 [Kazachstania naganishii CBS
8797]
Length = 460
Score = 62.0 bits (149), Expect = 9e-08, Method: Composition-based stats.
Identities = 23/41 (56%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ++TPLWRRD G+ LCNACGL+ K++G RP+
Sbjct: 176 GSICKNCATVNTPLWRRDNEGNTLCNACGLFLKLHGTPRPI 216
Score = 43.9 bits (102), Expect = 0.021, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 19/27 (70%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NC ++TPLWRRD G+ LCNA
Sbjct: 176 GSICKNCATVNTPLWRRDNEGNTLCNA 202
>gi|344302057|gb|EGW32362.1| hypothetical protein SPAPADRAFT_55830 [Spathaspora passalidarum
NRRL Y-27907]
Length = 413
Score = 62.0 bits (149), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 65/167 (38%), Gaps = 66/167 (39%)
Query: 18 TEGRECVNCGAISTPLWRR--DGT------GHYL-------------------------- 43
++G++C NCG TPLWRR DG+ G YL
Sbjct: 37 SDGQQCSNCGTTKTPLWRRAPDGSLICNACGLYLRSNNTHRPVNLKRPPNIIPIHKEEEG 96
Query: 44 -------CNARLFSTSFTG------YFIMSRFLFASLPVEDIEYFTEGRE---------- 80
CN S + G ++++ P + G E
Sbjct: 97 SCKGDGRCNGTGGSAACKGCPAYNNRVVVAKKTLDKSPKSETTTTNNGGENKRKSPDTAN 156
Query: 81 ---------CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD +G+ +CNACGL+++++G +RP+
Sbjct: 157 AGEDSLAIACFNCGTTITPLWRRDDSGNTICNACGLFYRLHGSHRPI 203
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 35/51 (68%), Gaps = 1/51 (1%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRL 125
++G++C NCG TPLWRR G +CNACGLY + N +RP+ +K+P +
Sbjct: 37 SDGQQCSNCGTTKTPLWRRAPDGSLICNACGLYLRSNNTHRPVNLKRPPNI 87
>gi|326474110|gb|EGD98119.1| siderophore transcription factor SreA [Trichophyton tonsurans CBS
112818]
Length = 571
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 31/40 (77%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ C NCG TPLWRRD G+ +CNACGLY++++G +RP+
Sbjct: 277 KACQNCGTTLTPLWRRDDQGNTICNACGLYYRLHGSHRPV 316
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 27/56 (48%), Positives = 32/56 (57%), Gaps = 4/56 (7%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFN 133
G +CVNCG TPLWRR +G +CNACGLY K +RP R+R N N
Sbjct: 120 GHQCVNCGTKRTPLWRRSPSGSTICNACGLYLKARNTDRP----TNRIRDSGANAN 171
Score = 43.9 bits (102), Expect = 0.025, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 19/27 (70%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G +CVNCG TPLWRR +G +CNA
Sbjct: 120 GHQCVNCGTKRTPLWRRSPSGSTICNA 146
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNA 46
+ C NCG TPLWRRD G+ +CNA
Sbjct: 277 KACQNCGTTLTPLWRRDDQGNTICNA 302
>gi|303312759|ref|XP_003066391.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
gi|240106053|gb|EER24246.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
gi|320032224|gb|EFW14179.1| GATA factor SREP [Coccidioides posadasii str. Silveira]
Length = 584
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY +++G RP+
Sbjct: 268 CQNCGTTVTPLWRRDDQGHPICNACGLYFRLHGCARPV 305
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G CVNCG TPLWRR G +CNACGLY K +RP
Sbjct: 106 GHSCVNCGTKRTPLWRRAPNGSTICNACGLYLKARNADRP 145
Score = 43.5 bits (101), Expect = 0.027, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 268 CQNCGTTVTPLWRRDDQGHPICNA 291
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G CVNCG TPLWRR G +CNA
Sbjct: 106 GHSCVNCGTKRTPLWRRAPNGSTICNA 132
>gi|119192532|ref|XP_001246872.1| hypothetical protein CIMG_00643 [Coccidioides immitis RS]
gi|392863886|gb|EAS35338.2| SreP protein [Coccidioides immitis RS]
Length = 584
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD GH +CNACGLY +++G RP+
Sbjct: 268 CQNCGTTVTPLWRRDDQGHPICNACGLYFRLHGCARPV 305
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G CVNCG TPLWRR G +CNACGLY K +RP
Sbjct: 106 GHSCVNCGTKRTPLWRRAPNGSTICNACGLYLKARNADRP 145
Score = 43.5 bits (101), Expect = 0.027, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD GH +CNA
Sbjct: 268 CQNCGTTVTPLWRRDDQGHPICNA 291
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G CVNCG TPLWRR G +CNA
Sbjct: 106 GHSCVNCGTKRTPLWRRAPNGSTICNA 132
>gi|326478307|gb|EGE02317.1| GATA factor SREP [Trichophyton equinum CBS 127.97]
Length = 573
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 31/40 (77%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ C NCG TPLWRRD G+ +CNACGLY++++G +RP+
Sbjct: 279 KACQNCGTTLTPLWRRDDQGNTICNACGLYYRLHGSHRPV 318
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 32/56 (57%), Gaps = 4/56 (7%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFN 133
G +CVNCG TPLWRR +G +CNACGLY K NRP R+R N N
Sbjct: 122 GHQCVNCGTKRTPLWRRSPSGSTICNACGLYLKARNTNRP----TNRIRDSGANAN 173
Score = 43.5 bits (101), Expect = 0.026, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 19/27 (70%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G +CVNCG TPLWRR +G +CNA
Sbjct: 122 GHQCVNCGTKRTPLWRRSPSGSTICNA 148
Score = 41.6 bits (96), Expect = 0.12, Method: Composition-based stats.
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNA 46
+ C NCG TPLWRRD G+ +CNA
Sbjct: 279 KACQNCGTTLTPLWRRDDQGNTICNA 304
>gi|241954936|ref|XP_002420189.1| nitrogen regulatory GATA-factor, putative; transcriptional
activator with GATA-1-type Zn finger DNA-binding motif,
putative [Candida dubliniensis CD36]
gi|223643530|emb|CAX42412.1| nitrogen regulatory GATA-factor, putative [Candida dubliniensis
CD36]
Length = 753
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 509 GVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 549
Score = 42.7 bits (99), Expect = 0.050, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRR+ G LCNA
Sbjct: 509 GVSCTNCGTKTTPLWRRNPQGQPLCNA 535
>gi|213406309|ref|XP_002173926.1| transcription factor gaf1 [Schizosaccharomyces japonicus yFS275]
gi|212001973|gb|EEB07633.1| transcription factor gaf1 [Schizosaccharomyces japonicus yFS275]
Length = 867
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR GH LCNACGL+ K+NG+ RPL
Sbjct: 658 CTNCQTRTTPLWRRSPDGHPLCNACGLFMKINGVVRPL 695
Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR GH LCNA
Sbjct: 658 CTNCQTRTTPLWRRSPDGHPLCNA 681
>gi|315052696|ref|XP_003175722.1| GATA factor SREP [Arthroderma gypseum CBS 118893]
gi|311341037|gb|EFR00240.1| GATA factor SREP [Arthroderma gypseum CBS 118893]
Length = 577
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 31/40 (77%)
Query: 79 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+ C NCG TPLWRRD G+ +CNACGLY++++G +RP+
Sbjct: 282 KACQNCGTTLTPLWRRDDQGNTICNACGLYYRLHGSHRPV 321
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 27/40 (67%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G +CVNCG TPLWRR +G +CNACGLY K +RP
Sbjct: 123 GHQCVNCGTKRTPLWRRSPSGSTICNACGLYLKARNTDRP 162
Score = 43.5 bits (101), Expect = 0.026, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 19/27 (70%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G +CVNCG TPLWRR +G +CNA
Sbjct: 123 GHQCVNCGTKRTPLWRRSPSGSTICNA 149
Score = 41.6 bits (96), Expect = 0.12, Method: Composition-based stats.
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 21 RECVNCGAISTPLWRRDGTGHYLCNA 46
+ C NCG TPLWRRD G+ +CNA
Sbjct: 282 KACQNCGTTLTPLWRRDDQGNTICNA 307
>gi|366994914|ref|XP_003677221.1| hypothetical protein NCAS_0F03840 [Naumovozyma castellii CBS 4309]
gi|342303089|emb|CCC70868.1| hypothetical protein NCAS_0F03840 [Naumovozyma castellii CBS 4309]
Length = 831
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 28/39 (71%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 306 QCFNCKTFKTPLWRRDPQGNTLCNACGLFQKLHGTMRPL 344
Score = 41.6 bits (96), Expect = 0.10, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRRD G+ LCNA
Sbjct: 306 QCFNCKTFKTPLWRRDPQGNTLCNA 330
>gi|31559064|gb|AAP50501.1| nitrogen regulatory GATA-factor [Candida albicans]
Length = 755
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 502 GVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 542
Score = 42.7 bits (99), Expect = 0.052, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRR+ G LCNA
Sbjct: 502 GVSCTNCGTKTTPLWRRNPQGQPLCNA 528
>gi|401889257|gb|EJT53195.1| transcriptional activator [Trichosporon asahii var. asahii CBS 2479]
Length = 1273
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1148 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1185
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 1148 CTNCQTTNTPLWRRDPEGQPLCNA 1171
>gi|336371932|gb|EGO00272.1| hypothetical protein SERLA73DRAFT_107315 [Serpula lacrymans var.
lacrymans S7.3]
Length = 2013
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 743 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 780
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 743 CTNCQTTNTPLWRRDPEGQPLCNA 766
>gi|392593705|gb|EIW83030.1| hypothetical protein CONPUDRAFT_81125 [Coniophora puteana RWD-64-598
SS2]
Length = 1103
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 973 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1010
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 973 CTNCQTTNTPLWRRDPEGQPLCNA 996
>gi|392577630|gb|EIW70759.1| hypothetical protein TREMEDRAFT_73557 [Tremella mesenterica DSM 1558]
Length = 1334
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1175 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1212
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 1175 CTNCQTTNTPLWRRDPEGQPLCNA 1198
>gi|336384681|gb|EGO25829.1| hypothetical protein SERLADRAFT_448749 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1944
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 713 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 750
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 713 CTNCQTTNTPLWRRDPEGQPLCNA 736
>gi|307197634|gb|EFN78813.1| GATA-binding factor A [Harpegnathos saltator]
Length = 594
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+
Sbjct: 422 CSNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGINRPV 459
>gi|406698929|gb|EKD02150.1| transcriptional activator [Trichosporon asahii var. asahii CBS 8904]
Length = 1273
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1148 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1185
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 1148 CTNCQTTNTPLWRRDPEGQPLCNA 1171
>gi|149239598|ref|XP_001525675.1| hypothetical protein LELG_03603 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146451168|gb|EDK45424.1| hypothetical protein LELG_03603 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 515
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 25/58 (43%), Positives = 38/58 (65%), Gaps = 2/58 (3%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFDRGNFNCYERS 138
C NCG TPLWRRD G+ +CNACGL+++++G +RP+ + +R R N ++S
Sbjct: 229 CFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPI--KMKRATIKRRKRNVPDKS 284
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 33/47 (70%), Gaps = 1/47 (2%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
+G++C NCG TPLWRR G +CNACGLY + N +RP+ +K+P
Sbjct: 66 DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNNSHRPVNLKRP 112
Score = 42.4 bits (98), Expect = 0.071, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
+G++C NCG TPLWRR G +CNA
Sbjct: 66 DGQQCSNCGTTKTPLWRRAPDGTLICNA 93
Score = 42.0 bits (97), Expect = 0.075, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G+ +CNA
Sbjct: 229 CFNCGTTITPLWRRDDAGNTICNA 252
>gi|58258269|ref|XP_566547.1| transcriptional activator [Cryptococcus neoformans var. neoformans
JEC21]
gi|57222684|gb|AAW40728.1| transcriptional activator, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1290
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1186 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGVVRPL 1223
Score = 40.8 bits (94), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 1186 CTNCQTTNTPLWRRDPDGQPLCNA 1209
>gi|405117554|gb|AFR92329.1| transcriptional activator [Cryptococcus neoformans var. grubii H99]
Length = 1277
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1173 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGVVRPL 1210
Score = 40.8 bits (94), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 1173 CTNCQTTNTPLWRRDPDGQPLCNA 1196
>gi|393245125|gb|EJD52636.1| hypothetical protein AURDEDRAFT_158369 [Auricularia delicata
TFB-10046 SS5]
Length = 1111
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1033 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGVVRPL 1070
Score = 40.8 bits (94), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 1033 CTNCQTTNTPLWRRDPDGQPLCNA 1056
>gi|321251473|ref|XP_003192077.1| transcriptional activator [Cryptococcus gattii WM276]
gi|317458545|gb|ADV20290.1| Transcriptional activator, putative [Cryptococcus gattii WM276]
Length = 1290
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1183 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGVVRPL 1220
Score = 40.8 bits (94), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 1183 CTNCQTTNTPLWRRDPDGQPLCNA 1206
>gi|134106329|ref|XP_778175.1| hypothetical protein CNBA1750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260878|gb|EAL23528.1| hypothetical protein CNBA1750 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1290
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1186 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGVVRPL 1223
Score = 40.8 bits (94), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 1186 CTNCQTTNTPLWRRDPDGQPLCNA 1209
>gi|294658476|ref|XP_002770789.1| DEHA2F10384p [Debaryomyces hansenii CBS767]
gi|202953159|emb|CAR66314.1| DEHA2F10384p [Debaryomyces hansenii CBS767]
Length = 475
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 25/49 (51%), Positives = 34/49 (69%), Gaps = 1/49 (2%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRR 124
T+ +C NC +TPLWR+ G LCNACGL++K++G+ RPL QP R
Sbjct: 222 TKFLQCTNCNTKTTPLWRKSNKGDLLCNACGLFYKLHGVLRPL-SQPDR 269
Score = 38.5 bits (88), Expect = 0.87, Method: Composition-based stats.
Identities = 14/29 (48%), Positives = 18/29 (62%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
T+ +C NC +TPLWR+ G LCNA
Sbjct: 222 TKFLQCTNCNTKTTPLWRKSNKGDLLCNA 250
>gi|363749771|ref|XP_003645103.1| hypothetical protein Ecym_2569 [Eremothecium cymbalariae
DBVPG#7215]
gi|356888736|gb|AET38286.1| Hypothetical protein Ecym_2569 [Eremothecium cymbalariae
DBVPG#7215]
Length = 847
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 28/39 (71%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 339 QCFNCKTYKTPLWRRDPQGNTLCNACGLFQKLHGTMRPL 377
Score = 41.2 bits (95), Expect = 0.13, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRRD G+ LCNA
Sbjct: 339 QCFNCKTYKTPLWRRDPQGNTLCNA 363
>gi|238881297|gb|EEQ44935.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 582
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 329 GVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 369
Score = 42.4 bits (98), Expect = 0.056, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRR+ G LCNA
Sbjct: 329 GVSCTNCGTKTTPLWRRNPQGQPLCNA 355
>gi|389740007|gb|EIM81199.1| hypothetical protein STEHIDRAFT_142426 [Stereum hirsutum FP-91666
SS1]
Length = 1696
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1553 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1590
Score = 40.4 bits (93), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 1553 CTNCQTTNTPLWRRDPEGQPLCNA 1576
>gi|12641859|dbj|BAB21550.1| GATA factor e [Drosophila melanogaster]
Length = 704
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ST LWRR+ G+ +CNACGLY+K++ MNRPL
Sbjct: 494 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRPL 534
>gi|24647326|ref|NP_650516.2| GATAe, isoform A [Drosophila melanogaster]
gi|23171431|gb|AAF55262.2| GATAe, isoform A [Drosophila melanogaster]
gi|28603696|gb|AAO47880.1| LD08432p [Drosophila melanogaster]
gi|37951320|dbj|BAD00020.1| transcription factor dGATAe [Drosophila melanogaster]
gi|220942528|gb|ACL83807.1| GATAe-PA [synthetic construct]
gi|220952748|gb|ACL88917.1| GATAe-PA [synthetic construct]
Length = 746
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ST LWRR+ G+ +CNACGLY+K++ MNRPL
Sbjct: 536 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRPL 576
>gi|320542891|ref|NP_001189231.1| GATAe, isoform B [Drosophila melanogaster]
gi|320542893|ref|NP_001189232.1| GATAe, isoform C [Drosophila melanogaster]
gi|318068787|gb|ADV37322.1| GATAe, isoform B [Drosophila melanogaster]
gi|318068788|gb|ADV37323.1| GATAe, isoform C [Drosophila melanogaster]
Length = 731
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ST LWRR+ G+ +CNACGLY+K++ MNRPL
Sbjct: 521 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRPL 561
>gi|402217264|gb|EJT97345.1| hypothetical protein DACRYDRAFT_102444 [Dacryopinax sp. DJM-731 SS1]
Length = 1326
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G+ LCNACGL+ K++G+ RPL
Sbjct: 1199 CTNCQTTNTPLWRRDAEGNPLCNACGLFFKLHGVVRPL 1236
Score = 41.6 bits (96), Expect = 0.097, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G+ LCNA
Sbjct: 1199 CTNCQTTNTPLWRRDAEGNPLCNA 1222
>gi|384497544|gb|EIE88035.1| hypothetical protein RO3G_12746 [Rhizopus delemar RA 99-880]
Length = 556
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 37/96 (38%), Positives = 44/96 (45%), Gaps = 21/96 (21%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLP-----------VED 71
C NC +TPLWRR LCNA G ++ + A P +D
Sbjct: 471 CSNCNTTTTPLWRRSVDDELLCNA-------CGLYL--KLHNAPRPKHLKPQSSRKDTKD 521
Query: 72 IEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGL 107
E + C NCG +TPLWRRD G LCNACGL
Sbjct: 522 EENMIQ-PVCSNCGTSTTPLWRRDVDGSPLCNACGL 556
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 23/48 (47%), Positives = 28/48 (58%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLRFD 128
C NC +TPLWRR LCNACGLY K++ RP +P+ R D
Sbjct: 471 CSNCNTTTTPLWRRSVDDELLCNACGLYLKLHNAPRPKHLKPQSSRKD 518
>gi|403419010|emb|CCM05710.1| predicted protein [Fibroporia radiculosa]
Length = 1271
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1137 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1174
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 1137 CTNCQTTNTPLWRRDPEGQPLCNA 1160
>gi|68478842|ref|XP_716583.1| hypothetical protein CaO19.1275 [Candida albicans SC5314]
gi|68478951|ref|XP_716530.1| hypothetical protein CaO19.8862 [Candida albicans SC5314]
gi|46438200|gb|EAK97535.1| hypothetical protein CaO19.8862 [Candida albicans SC5314]
gi|46438254|gb|EAK97588.1| hypothetical protein CaO19.1275 [Candida albicans SC5314]
Length = 688
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 435 GVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 475
Score = 42.4 bits (98), Expect = 0.058, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NCG +TPLWRR+ G LCNA
Sbjct: 435 GVSCTNCGTKTTPLWRRNPQGQPLCNA 461
>gi|374109015|gb|AEY97921.1| FAFR237Wp [Ashbya gossypii FDAG1]
Length = 807
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 28/39 (71%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 329 QCFNCKTFKTPLWRRDLQGNTLCNACGLFQKLHGTMRPL 367
Score = 41.2 bits (95), Expect = 0.14, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRRD G+ LCNA
Sbjct: 329 QCFNCKTFKTPLWRRDLQGNTLCNA 353
>gi|312379466|gb|EFR25730.1| hypothetical protein AND_08684 [Anopheles darlingi]
Length = 566
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 185 CTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPL 222
Score = 36.6 bits (83), Expect = 3.3, Method: Composition-based stats.
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +T LWRR+ G +CNA
Sbjct: 185 CTNCGTRTTTLWRRNNDGEPVCNA 208
>gi|302308753|ref|NP_985784.2| AFR237Wp [Ashbya gossypii ATCC 10895]
gi|299790778|gb|AAS53608.2| AFR237Wp [Ashbya gossypii ATCC 10895]
Length = 807
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 28/39 (71%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 329 QCFNCKTFKTPLWRRDLQGNTLCNACGLFQKLHGTMRPL 367
Score = 41.2 bits (95), Expect = 0.14, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRRD G+ LCNA
Sbjct: 329 QCFNCKTFKTPLWRRDLQGNTLCNA 353
>gi|242220478|ref|XP_002476005.1| hypothetical transcription factor [Postia placenta Mad-698-R]
gi|220724794|gb|EED78814.1| hypothetical transcription factor [Postia placenta Mad-698-R]
Length = 1062
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 931 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 968
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 931 CTNCQTTNTPLWRRDPEGQPLCNA 954
>gi|347840710|emb|CCD55282.1| similar to transcription factor Zn, GATA [Botryotinia fuckeliana]
Length = 451
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 51 TSFTGYFIMSRFLFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHK 110
+ F Y ++ + A++ D E T+ C NC +TPLWRRD G LCNACGL+ K
Sbjct: 144 SDFANYHTINTTIMATMANNDRE--TQQPTCQNCQTSTTPLWRRDEIGSVLCNACGLFLK 201
Query: 111 MNGMNRPL 118
++G RP+
Sbjct: 202 LHGRPRPI 209
Score = 40.0 bits (92), Expect = 0.34, Method: Composition-based stats.
Identities = 19/42 (45%), Positives = 24/42 (57%), Gaps = 2/42 (4%)
Query: 5 LFASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+ A++ D E T+ C NC +TPLWRRD G LCNA
Sbjct: 156 IMATMANNDRE--TQQPTCQNCQTSTTPLWRRDEIGSVLCNA 195
>gi|392589458|gb|EIW78788.1| hypothetical protein CONPUDRAFT_106827 [Coniophora puteana
RWD-64-598 SS2]
Length = 763
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 22/37 (59%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NC +TPLWRRD G+ +CNACGLY K++G +RP
Sbjct: 321 CANCSTSTTPLWRRDDVGNNICNACGLYFKLHGTHRP 357
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 24/44 (54%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKQP 122
C NCG TPLWRRD G +CNACGLY K + RP L + P
Sbjct: 146 CNNCGTAETPLWRRDSEGKTICNACGLYLKSRKVARPPSLARTP 189
Score = 41.6 bits (96), Expect = 0.097, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G +CNA
Sbjct: 146 CNNCGTAETPLWRRDSEGKTICNA 169
Score = 40.8 bits (94), Expect = 0.18, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G+ +CNA
Sbjct: 321 CANCSTSTTPLWRRDDVGNNICNA 344
>gi|299751251|ref|XP_002911612.1| transcription factor ScGATA-6 [Coprinopsis cinerea okayama7#130]
gi|298409288|gb|EFI28118.1| transcription factor ScGATA-6 [Coprinopsis cinerea okayama7#130]
Length = 1080
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 29/39 (74%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC +TPLWRRD G LCNACGL+ K++G+ RPL
Sbjct: 951 QCTNCQTTNTPLWRRDPEGQPLCNACGLFFKLHGVVRPL 989
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC +TPLWRRD G LCNA
Sbjct: 951 QCTNCQTTNTPLWRRDPEGQPLCNA 975
>gi|384489884|gb|EIE81106.1| hypothetical protein RO3G_05811 [Rhizopus delemar RA 99-880]
Length = 234
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/38 (60%), Positives = 31/38 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC A +TPLWRRD G+ +CNACGLY+K++ ++RPL
Sbjct: 19 CYNCDATTTPLWRRDDDGNTICNACGLYYKLHHVHRPL 56
Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC A +TPLWRRD G+ +CNA
Sbjct: 19 CYNCDATTTPLWRRDDDGNTICNA 42
>gi|320032947|gb|EFW14897.1| GATA transcription factor [Coccidioides posadasii str. Silveira]
Length = 396
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/38 (65%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRDG G LCNACGL+ K++G RPL
Sbjct: 115 CQNCGTSKTPLWRRDGMGAVLCNACGLFLKLHGRPRPL 152
Score = 43.9 bits (102), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRDG G LCNA
Sbjct: 115 CQNCGTSKTPLWRRDGMGAVLCNA 138
>gi|68467645|ref|XP_721981.1| hypothetical protein CaO19.11393 [Candida albicans SC5314]
gi|68467966|ref|XP_721822.1| hypothetical protein CaO19.3912 [Candida albicans SC5314]
gi|46443763|gb|EAL03042.1| hypothetical protein CaO19.3912 [Candida albicans SC5314]
gi|46443927|gb|EAL03205.1| hypothetical protein CaO19.11393 [Candida albicans SC5314]
Length = 682
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 35/54 (64%), Gaps = 6/54 (11%)
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+SLP + + +C NC +TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 205 SSLPSKKVT------KCYNCNTTATPLWRRDAEGNTLCNACGLFLKLHGTCRPL 252
Score = 42.7 bits (99), Expect = 0.051, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 6/40 (15%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+SLP + + +C NC +TPLWRRD G+ LCNA
Sbjct: 205 SSLPSKKVT------KCYNCNTTATPLWRRDAEGNTLCNA 238
>gi|444319927|ref|XP_004180620.1| hypothetical protein TBLA_0E00400 [Tetrapisispora blattae CBS 6284]
gi|387513663|emb|CCH61101.1| hypothetical protein TBLA_0E00400 [Tetrapisispora blattae CBS 6284]
Length = 969
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 29/39 (74%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC ++TPLWRRD G LCNACGL+ K++G RP+
Sbjct: 266 QCENCKTVTTPLWRRDPRGKTLCNACGLFQKLHGSMRPI 304
Score = 42.4 bits (98), Expect = 0.071, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC ++TPLWRRD G LCNA
Sbjct: 266 QCENCKTVTTPLWRRDPRGKTLCNA 290
>gi|238882809|gb|EEQ46447.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 682
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 35/54 (64%), Gaps = 6/54 (11%)
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+SLP + + +C NC +TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 205 SSLPSKKVT------KCYNCNTTATPLWRRDAEGNTLCNACGLFLKLHGTCRPL 252
Score = 42.7 bits (99), Expect = 0.052, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 6/40 (15%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+SLP + + +C NC +TPLWRRD G+ LCNA
Sbjct: 205 SSLPSKKVT------KCYNCNTTATPLWRRDAEGNTLCNA 238
>gi|365984106|ref|XP_003668886.1| hypothetical protein NDAI_0B06110 [Naumovozyma dairenensis CBS 421]
gi|343767653|emb|CCD23643.1| hypothetical protein NDAI_0B06110 [Naumovozyma dairenensis CBS 421]
Length = 1040
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 27/38 (71%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 377 CFNCKTTKTPLWRRDPQGNTLCNACGLFQKLHGTMRPL 414
Score = 40.8 bits (94), Expect = 0.18, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD G+ LCNA
Sbjct: 377 CFNCKTTKTPLWRRDPQGNTLCNA 400
>gi|426197491|gb|EKV47418.1| hypothetical protein AGABI2DRAFT_221510 [Agaricus bisporus var.
bisporus H97]
Length = 894
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 763 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 800
Score = 40.4 bits (93), Expect = 0.27, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 763 CTNCQTTNTPLWRRDPEGQPLCNA 786
>gi|241955939|ref|XP_002420690.1| transcriptional activator of nitrogen-regulated genes, putative;
transcriptional activator, nitrogen catabolite
repression, putative [Candida dubliniensis CD36]
gi|223644032|emb|CAX41773.1| transcriptional activator of nitrogen-regulated genes, putative
[Candida dubliniensis CD36]
Length = 670
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 205 CFNCNTTATPLWRRDADGNTLCNACGLFLKLHGTCRPL 242
Score = 42.4 bits (98), Expect = 0.059, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G+ LCNA
Sbjct: 205 CFNCNTTATPLWRRDADGNTLCNA 228
>gi|392868504|gb|EAS34301.2| protein GZF3 [Coccidioides immitis RS]
Length = 425
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/38 (65%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRDG G LCNACGL+ K++G RPL
Sbjct: 144 CQNCGTSKTPLWRRDGMGAVLCNACGLFLKLHGRPRPL 181
Score = 43.9 bits (102), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRDG G LCNA
Sbjct: 144 CQNCGTSKTPLWRRDGMGAVLCNA 167
>gi|126002036|ref|XP_001352245.1| GA18611 [Drosophila pseudoobscura pseudoobscura]
gi|54640243|gb|EAL29293.1| GA18611 [Drosophila pseudoobscura pseudoobscura]
Length = 803
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 656 CSNCGTLTTTIWRRSARGEMVCNACGLYFKLHGINRP 692
Score = 37.4 bits (85), Expect = 1.9, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG ++T +WRR G +CNA
Sbjct: 656 CSNCGTLTTTIWRRSARGEMVCNA 679
>gi|258565607|ref|XP_002583548.1| GATA type zinc finger protein Asd4 [Uncinocarpus reesii 1704]
gi|237907249|gb|EEP81650.1| GATA type zinc finger protein Asd4 [Uncinocarpus reesii 1704]
Length = 298
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/38 (65%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRDG G LCNACGL+ K++G RPL
Sbjct: 12 CQNCGTSKTPLWRRDGMGAVLCNACGLFLKLHGRPRPL 49
Score = 43.9 bits (102), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRDG G LCNA
Sbjct: 12 CQNCGTSKTPLWRRDGMGAVLCNA 35
>gi|195036112|ref|XP_001989515.1| GH18758 [Drosophila grimshawi]
gi|193893711|gb|EDV92577.1| GH18758 [Drosophila grimshawi]
Length = 749
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL
Sbjct: 522 GVTCANCSTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPL 562
Score = 36.6 bits (83), Expect = 3.5, Method: Composition-based stats.
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NC ST LWRR+ G+ +CNA
Sbjct: 522 GVTCANCSTNSTTLWRRNNEGNPVCNA 548
>gi|119189993|ref|XP_001245603.1| hypothetical protein CIMG_05044 [Coccidioides immitis RS]
Length = 303
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/38 (65%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRDG G LCNACGL+ K++G RPL
Sbjct: 22 CQNCGTSKTPLWRRDGMGAVLCNACGLFLKLHGRPRPL 59
Score = 43.9 bits (102), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRDG G LCNA
Sbjct: 22 CQNCGTSKTPLWRRDGMGAVLCNA 45
>gi|8096353|dbj|BAA96108.1| transcription factor ScGATA-6 [Schizophyllum commune]
Length = 1075
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 957 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 994
Score = 40.0 bits (92), Expect = 0.29, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 957 CTNCQTTNTPLWRRDPEGQPLCNA 980
>gi|254583165|ref|XP_002499314.1| ZYRO0E08910p [Zygosaccharomyces rouxii]
gi|238942888|emb|CAR31059.1| ZYRO0E08910p [Zygosaccharomyces rouxii]
Length = 853
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 28/39 (71%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRRD G+ +CNACGL+ K++G RPL
Sbjct: 324 QCYNCKTNKTPLWRRDAQGNTMCNACGLFQKLHGTMRPL 362
Score = 40.4 bits (93), Expect = 0.21, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRRD G+ +CNA
Sbjct: 324 QCYNCKTNKTPLWRRDAQGNTMCNA 348
>gi|157125534|ref|XP_001654374.1| transcription factor GATA-4 (GATA binding factor-4) [Aedes aegypti]
gi|108873609|gb|EAT37834.1| AAEL010222-PB [Aedes aegypti]
Length = 1003
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 31/41 (75%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC ++T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 642 GLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRPL 682
Score = 37.4 bits (85), Expect = 2.2, Method: Composition-based stats.
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G +C NC ++T LWRR+ G +CNA
Sbjct: 642 GLQCSNCNTMNTSLWRRNQVGEPVCNA 668
>gi|41059166|gb|AAR99325.1| GATA transcription factor [Aedes aegypti]
Length = 1003
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 31/41 (75%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G +C NC ++T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 642 GLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRPL 682
Score = 37.4 bits (85), Expect = 2.2, Method: Composition-based stats.
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G +C NC ++T LWRR+ G +CNA
Sbjct: 642 GLQCSNCNTMNTSLWRRNQVGEPVCNA 668
>gi|302680356|ref|XP_003029860.1| transcription factor scgata-6 [Schizophyllum commune H4-8]
gi|300103550|gb|EFI94957.1| transcription factor scgata-6 [Schizophyllum commune H4-8]
Length = 1075
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 957 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 994
Score = 40.0 bits (92), Expect = 0.30, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 957 CTNCQTTNTPLWRRDPEGQPLCNA 980
>gi|156839542|ref|XP_001643461.1| hypothetical protein Kpol_483p22 [Vanderwaltozyma polyspora DSM
70294]
gi|156114072|gb|EDO15603.1| hypothetical protein Kpol_483p22 [Vanderwaltozyma polyspora DSM
70294]
Length = 777
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 27/38 (71%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 320 CANCKTSKTPLWRRDPQGNVLCNACGLFQKLHGTMRPL 357
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD G+ LCNA
Sbjct: 320 CANCKTSKTPLWRRDPQGNVLCNA 343
>gi|353242644|emb|CCA74270.1| hypothetical protein PIIN_08223 [Piriformospora indica DSM 11827]
Length = 1358
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 1177 CTNCSTTTTPLWRRNPEGQPLCNACGLFFKLHGVTRPL 1214
Score = 39.7 bits (91), Expect = 0.38, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ G LCNA
Sbjct: 1177 CTNCSTTTTPLWRRNPEGQPLCNA 1200
>gi|146414375|ref|XP_001483158.1| hypothetical protein PGUG_05113 [Meyerozyma guilliermondii ATCC
6260]
Length = 429
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 33/46 (71%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G++C NCG TPLWRR G +CNACGLY++ N +RP+ +K+P
Sbjct: 8 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNSHRPVNLKRP 53
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/43 (58%), Positives = 31/43 (72%), Gaps = 1/43 (2%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR 123
C NCG TPLWRRD G+ +CNACGLY+K++ +RP VK R
Sbjct: 138 CYNCGTTITPLWRRDDAGNTICNACGLYYKLHKSHRP-VKMKR 179
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G+ +CNA
Sbjct: 138 CYNCGTTITPLWRRDDAGNTICNA 161
Score = 41.2 bits (95), Expect = 0.14, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G++C NCG TPLWRR G +CNA
Sbjct: 8 GQQCSNCGTTKTPLWRRAPDGTLICNA 34
>gi|190348546|gb|EDK41015.2| hypothetical protein PGUG_05113 [Meyerozyma guilliermondii ATCC
6260]
Length = 429
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 33/46 (71%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
G++C NCG TPLWRR G +CNACGLY++ N +RP+ +K+P
Sbjct: 8 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNSHRPVNLKRP 53
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/43 (58%), Positives = 31/43 (72%), Gaps = 1/43 (2%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPR 123
C NCG TPLWRRD G+ +CNACGLY+K++ +RP VK R
Sbjct: 138 CYNCGTTITPLWRRDDAGNTICNACGLYYKLHKSHRP-VKMKR 179
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G+ +CNA
Sbjct: 138 CYNCGTTITPLWRRDDAGNTICNA 161
Score = 41.2 bits (95), Expect = 0.15, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G++C NCG TPLWRR G +CNA
Sbjct: 8 GQQCSNCGTTKTPLWRRAPDGTLICNA 34
>gi|354548341|emb|CCE45077.1| hypothetical protein CPAR2_700810 [Candida parapsilosis]
Length = 642
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 27/58 (46%), Positives = 37/58 (63%), Gaps = 6/58 (10%)
Query: 66 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
SLP + +G++C NCG TPLWRR G +CNACGLY + N +RP+ +K+P
Sbjct: 54 SLPTPE-----DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNHTHRPVNLKRP 106
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 20/38 (52%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC TPLWRRD G+ +CNACGL+++++G +RP+
Sbjct: 234 CFNCDTTITPLWRRDDAGNTICNACGLFYRLHGSHRPI 271
Score = 42.7 bits (99), Expect = 0.045, Method: Composition-based stats.
Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 5/39 (12%)
Query: 8 SLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
SLP + +G++C NCG TPLWRR G +CNA
Sbjct: 54 SLPTPE-----DGQQCSNCGTTKTPLWRRAPDGTLICNA 87
Score = 39.7 bits (91), Expect = 0.44, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD G+ +CNA
Sbjct: 234 CFNCDTTITPLWRRDDAGNTICNA 257
>gi|255729218|ref|XP_002549534.1| hypothetical protein CTRG_03831 [Candida tropicalis MYA-3404]
gi|240132603|gb|EER32160.1| hypothetical protein CTRG_03831 [Candida tropicalis MYA-3404]
Length = 589
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 474 CTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 511
Score = 41.6 bits (96), Expect = 0.12, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRR+ G LCNA
Sbjct: 474 CTNCGTKTTPLWRRNPQGQPLCNA 497
>gi|409075785|gb|EKM76161.1| hypothetical protein AGABI1DRAFT_131482 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1114
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 22/51 (43%), Positives = 34/51 (66%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
++ +C NC +TPLWR+D G +CNACGLY+K++G RP+ + +R
Sbjct: 489 SQTAQCYNCNTTATPLWRKDDEGKTVCNACGLYYKLHGSPRPISMKSDVIR 539
Score = 52.8 bits (125), Expect = 5e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 24/38 (63%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
EC NCGA TPLWRR CNACGLY K++ RP
Sbjct: 393 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRP 430
Score = 39.3 bits (90), Expect = 0.50, Method: Composition-based stats.
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
++ +C NC +TPLWR+D G +CNA
Sbjct: 489 SQTAQCYNCNTTATPLWRKDDEGKTVCNA 517
Score = 37.4 bits (85), Expect = 2.0, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 15/25 (60%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
EC NCGA TPLWRR CNA
Sbjct: 393 ECSNCGATHTPLWRRGLNDELNCNA 417
>gi|426193772|gb|EKV43705.1| hypothetical protein AGABI2DRAFT_121841 [Agaricus bisporus var.
bisporus H97]
Length = 1116
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 22/51 (43%), Positives = 34/51 (66%)
Query: 76 TEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLR 126
++ +C NC +TPLWR+D G +CNACGLY+K++G RP+ + +R
Sbjct: 489 SQTAQCYNCNTTATPLWRKDDEGKTVCNACGLYYKLHGSPRPISMKSDVIR 539
Score = 52.8 bits (125), Expect = 5e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 24/38 (63%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
EC NCGA TPLWRR CNACGLY K++ RP
Sbjct: 393 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRP 430
Score = 39.3 bits (90), Expect = 0.50, Method: Composition-based stats.
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 18 TEGRECVNCGAISTPLWRRDGTGHYLCNA 46
++ +C NC +TPLWR+D G +CNA
Sbjct: 489 SQTAQCYNCNTTATPLWRKDDEGKTVCNA 517
Score = 37.4 bits (85), Expect = 2.0, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 15/25 (60%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
EC NCGA TPLWRR CNA
Sbjct: 393 ECSNCGATHTPLWRRGLNDELNCNA 417
>gi|358054029|dbj|GAA99828.1| hypothetical protein E5Q_06531 [Mixia osmundae IAM 14324]
Length = 903
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 23/37 (62%), Positives = 26/37 (70%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C+NC +TPLWRRD G+ LCNAC LY KM G RP
Sbjct: 229 CMNCKTQTTPLWRRDEAGNVLCNACSLYQKMKGAPRP 265
Score = 41.6 bits (96), Expect = 0.099, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C+NC +TPLWRRD G+ LCNA
Sbjct: 229 CMNCKTQTTPLWRRDEAGNVLCNA 252
>gi|154275034|ref|XP_001538368.1| hypothetical protein HCAG_05973 [Ajellomyces capsulatus NAm1]
gi|150414808|gb|EDN10170.1| hypothetical protein HCAG_05973 [Ajellomyces capsulatus NAm1]
Length = 510
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRRD +G LCNACGL+ K++G RP+
Sbjct: 313 CQNCGTSTTPLWRRDESGSVLCNACGLFLKLHGRPRPI 350
Score = 43.1 bits (100), Expect = 0.039, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD +G LCNA
Sbjct: 313 CQNCGTSTTPLWRRDESGSVLCNA 336
>gi|409080576|gb|EKM80936.1| hypothetical protein AGABI1DRAFT_119477 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 680
Score = 60.5 bits (145), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 549 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 586
Score = 40.0 bits (92), Expect = 0.36, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 549 CTNCQTTNTPLWRRDPEGQPLCNA 572
>gi|170587074|ref|XP_001898304.1| GATA zinc finger family protein [Brugia malayi]
gi|158594699|gb|EDP33283.1| GATA zinc finger family protein [Brugia malayi]
Length = 567
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 22/42 (52%), Positives = 30/42 (71%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+G C NC +T LWRRD GH +CNACGLY+K++ + RP+
Sbjct: 356 QGLVCANCRGTNTTLWRRDADGHPVCNACGLYYKLHQVQRPI 397
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 18/28 (64%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
+G C NC +T LWRRD GH +CNA
Sbjct: 356 QGLVCANCRGTNTTLWRRDADGHPVCNA 383
>gi|195570482|ref|XP_002103236.1| GD19066 [Drosophila simulans]
gi|194199163|gb|EDX12739.1| GD19066 [Drosophila simulans]
Length = 736
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL
Sbjct: 526 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPL 566
>gi|195501114|ref|XP_002097664.1| GE24358 [Drosophila yakuba]
gi|194183765|gb|EDW97376.1| GE24358 [Drosophila yakuba]
Length = 738
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL
Sbjct: 532 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPL 572
>gi|194901184|ref|XP_001980132.1| GG16970 [Drosophila erecta]
gi|190651835|gb|EDV49090.1| GG16970 [Drosophila erecta]
Length = 749
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL
Sbjct: 543 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPL 583
>gi|194767721|ref|XP_001965963.1| GF11880 [Drosophila ananassae]
gi|190619806|gb|EDV35330.1| GF11880 [Drosophila ananassae]
Length = 710
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL
Sbjct: 506 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPL 546
>gi|410730747|ref|XP_003980194.1| hypothetical protein NDAI_0G05350 [Naumovozyma dairenensis CBS 421]
gi|401780371|emb|CCK73518.1| hypothetical protein NDAI_0G05350 [Naumovozyma dairenensis CBS 421]
Length = 437
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD +G LCNACGL+ K++G NRP+
Sbjct: 117 CKNCLTSTTPLWRRDESGAMLCNACGLFLKLHGRNRPI 154
Score = 39.3 bits (90), Expect = 0.47, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD +G LCNA
Sbjct: 117 CKNCLTSTTPLWRRDESGAMLCNA 140
>gi|260947878|ref|XP_002618236.1| hypothetical protein CLUG_01695 [Clavispora lusitaniae ATCC 42720]
gi|238848108|gb|EEQ37572.1| hypothetical protein CLUG_01695 [Clavispora lusitaniae ATCC 42720]
Length = 480
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC ++TPLWR+ G+ LCNACGL++K++G+ RPL
Sbjct: 247 CTNCHTVTTPLWRKTNEGNVLCNACGLFYKLHGILRPL 284
Score = 39.3 bits (90), Expect = 0.51, Method: Composition-based stats.
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC ++TPLWR+ G+ LCNA
Sbjct: 247 CTNCHTVTTPLWRKTNEGNVLCNA 270
>gi|449546862|gb|EMD37831.1| hypothetical protein CERSUDRAFT_114478 [Ceriporiopsis subvermispora
B]
Length = 1137
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL+ K++G+ RPL
Sbjct: 1004 CTNCQTTNTPLWRRDPEGQPLCNACGLFFKLHGVVRPL 1041
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 1004 CTNCQTTNTPLWRRDPEGQPLCNA 1027
>gi|409044810|gb|EKM54291.1| hypothetical protein PHACADRAFT_258054 [Phanerochaete carnosa
HHB-10118-sp]
Length = 872
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL+ K++G+ RPL
Sbjct: 736 CTNCQTTNTPLWRRDPEGQPLCNACGLFFKLHGVVRPL 773
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 736 CTNCQTTNTPLWRRDPEGQPLCNA 759
>gi|406606254|emb|CCH42361.1| DNA-directed RNA polymerase II subunit [Wickerhamomyces ciferrii]
Length = 481
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 95 CKNCKTSTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 132
Score = 42.4 bits (98), Expect = 0.056, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 95 CKNCKTSTTPLWRRDETGQVLCNA 118
>gi|395332895|gb|EJF65273.1| hypothetical protein DICSQDRAFT_132832 [Dichomitus squalens LYAD-421
SS1]
Length = 1227
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1100 CTNCQTTITPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1137
Score = 39.3 bits (90), Expect = 0.53, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 15/24 (62%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD G LCNA
Sbjct: 1100 CTNCQTTITPLWRRDPEGQPLCNA 1123
>gi|448535115|ref|XP_003870912.1| Sfu1 transcriptional regulator of iron-responsive genes [Candida
orthopsilosis Co 90-125]
gi|380355268|emb|CCG24785.1| Sfu1 transcriptional regulator of iron-responsive genes [Candida
orthopsilosis]
Length = 604
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 27/59 (45%), Positives = 38/59 (64%), Gaps = 6/59 (10%)
Query: 65 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQP 122
+SLP + +G++C NCG TPLWRR G +CNACGLY + N +RP+ +K+P
Sbjct: 55 SSLPTPE-----DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNHTHRPVNLKRP 108
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 20/38 (52%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC TPLWRRD G+ +CNACGL+++++G +RP+
Sbjct: 228 CYNCDTTITPLWRRDDAGNTICNACGLFYRLHGSHRPI 265
Score = 42.7 bits (99), Expect = 0.050, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 5/40 (12%)
Query: 7 ASLPVEDIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
+SLP + +G++C NCG TPLWRR G +CNA
Sbjct: 55 SSLPTPE-----DGQQCSNCGTTKTPLWRRAPDGTLICNA 89
Score = 38.9 bits (89), Expect = 0.68, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD G+ +CNA
Sbjct: 228 CYNCDTTITPLWRRDDAGNTICNA 251
>gi|258573903|ref|XP_002541133.1| predicted protein [Uncinocarpus reesii 1704]
gi|237901399|gb|EEP75800.1| predicted protein [Uncinocarpus reesii 1704]
Length = 573
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC TPLWRRD GH +CNACGLY +++G RP+
Sbjct: 262 CQNCSTTVTPLWRRDDQGHPICNACGLYFRLHGCARPV 299
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 25/40 (62%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
G CVNCG TPLWRR G +CNACGLY K +RP
Sbjct: 99 GHSCVNCGTKRTPLWRRAPNGSTICNACGLYLKARNADRP 138
Score = 42.0 bits (97), Expect = 0.076, Method: Composition-based stats.
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G CVNCG TPLWRR G +CNA
Sbjct: 99 GHSCVNCGTKRTPLWRRAPNGSTICNA 125
Score = 41.6 bits (96), Expect = 0.10, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD GH +CNA
Sbjct: 262 CQNCSTTVTPLWRRDDQGHPICNA 285
>gi|367013804|ref|XP_003681402.1| hypothetical protein TDEL_0D06070 [Torulaspora delbrueckii]
gi|359749062|emb|CCE92191.1| hypothetical protein TDEL_0D06070 [Torulaspora delbrueckii]
Length = 811
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 27/38 (71%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC TPLWRRD G+ LCNACGL+ K++G RPL
Sbjct: 327 CFNCKTQKTPLWRRDSHGNTLCNACGLFQKLHGTMRPL 364
Score = 40.0 bits (92), Expect = 0.31, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD G+ LCNA
Sbjct: 327 CFNCKTQKTPLWRRDSHGNTLCNA 350
>gi|241958258|ref|XP_002421848.1| GATA zinc finger protein, putative; protein gzf3 homologue,
putative [Candida dubliniensis CD36]
gi|223645193|emb|CAX39792.1| GATA zinc finger protein, putative [Candida dubliniensis CD36]
Length = 662
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 109 GPVCGNCQTQTTPLWRRDETGQVLCNACGLFLKLHGRARPI 149
Score = 42.4 bits (98), Expect = 0.066, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NC +TPLWRRD TG LCNA
Sbjct: 109 GPVCGNCQTQTTPLWRRDETGQVLCNA 135
>gi|134056270|emb|CAK96398.1| unnamed protein product [Aspergillus niger]
Length = 541
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRRD G LCNACGL+ K++G RP+
Sbjct: 242 CQNCGTSTTPLWRRDELGSVLCNACGLFLKLHGRPRPI 279
Score = 42.4 bits (98), Expect = 0.058, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G LCNA
Sbjct: 242 CQNCGTSTTPLWRRDELGSVLCNA 265
>gi|353238152|emb|CCA70107.1| related to SRE siderophore regulation protein (GATA factor)
[Piriformospora indica DSM 11827]
Length = 414
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 23/37 (62%), Positives = 30/37 (81%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +TPLWRRD G +CNACGLY+K++G++RP
Sbjct: 237 CANCGTSATPLWRRDDMGSTICNACGLYYKLHGVHRP 273
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 33/63 (52%), Gaps = 4/63 (6%)
Query: 60 SRFLFASLPVE----DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMN 115
SR + A L + D E C NCG +TPLWRR+ G +CNACGLY K
Sbjct: 15 SRSVLAPLSMSRERSDKAAGAEPTVCSNCGTNTTPLWRRNSGGDTICNACGLYEKARRTA 74
Query: 116 RPL 118
RP+
Sbjct: 75 RPV 77
Score = 40.0 bits (92), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G +CNA
Sbjct: 237 CANCGTSATPLWRRDDMGSTICNA 260
Score = 38.9 bits (89), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 25/49 (51%), Gaps = 4/49 (8%)
Query: 2 SRFLFASLPVE----DIEYFTEGRECVNCGAISTPLWRRDGTGHYLCNA 46
SR + A L + D E C NCG +TPLWRR+ G +CNA
Sbjct: 15 SRSVLAPLSMSRERSDKAAGAEPTVCSNCGTNTTPLWRRNSGGDTICNA 63
>gi|443726960|gb|ELU13928.1| hypothetical protein CAPTEDRAFT_116469, partial [Capitella teleta]
Length = 75
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 43/71 (60%), Gaps = 7/71 (9%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQP--RRLRFDRGNFNCY 135
GREC NCG+ PLWR +GTGH LCNACG+ H M+G +P++K RR R +C
Sbjct: 6 GRECANCGSTYAPLWRWNGTGHLLCNACGV-HVMSGFAKPVMKTSGGRRSVSRRVGLSCA 64
Query: 136 ----ERSTLFR 142
+TL+R
Sbjct: 65 NCHTSTTTLWR 75
Score = 55.1 bits (131), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 36/77 (46%), Gaps = 10/77 (12%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNARLFSTSFTGYFIMSRFLFASLPVEDIEYFTE-- 77
GREC NCG+ PLWR +GTGH LCNA G +MS F +
Sbjct: 6 GRECANCGSTYAPLWRWNGTGHLLCNA-------CGVHVMSGFAKPVMKTSGGRRSVSRR 58
Query: 78 -GRECVNCGAISTPLWR 93
G C NC +T LWR
Sbjct: 59 VGLSCANCHTSTTTLWR 75
>gi|320580218|gb|EFW94441.1| hypothetical protein HPODL_3941 [Ogataea parapolymorpha DL-1]
Length = 527
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 75 CKNCKTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 112
Score = 42.4 bits (98), Expect = 0.063, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 75 CKNCKTQTTPLWRRDETGQVLCNA 98
>gi|195328567|ref|XP_002030986.1| GM24276 [Drosophila sechellia]
gi|194119929|gb|EDW41972.1| GM24276 [Drosophila sechellia]
Length = 733
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL
Sbjct: 527 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPL 567
>gi|410730331|ref|XP_003671345.2| hypothetical protein NDAI_0G03250 [Naumovozyma dairenensis CBS 421]
gi|401780163|emb|CCD26102.2| hypothetical protein NDAI_0G03250 [Naumovozyma dairenensis CBS 421]
Length = 497
Score = 59.7 bits (143), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWR+D G+ LCNACGL+ K++G+ RPL
Sbjct: 367 CNNCGTGTTPLWRKDPNGNSLCNACGLFLKLHGVMRPL 404
Score = 40.8 bits (94), Expect = 0.19, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWR+D G+ LCNA
Sbjct: 367 CNNCGTGTTPLWRKDPNGNSLCNA 390
>gi|238879747|gb|EEQ43385.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 701
Score = 59.7 bits (143), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 129 GPVCGNCQTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 169
Score = 42.4 bits (98), Expect = 0.066, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NC +TPLWRRD TG LCNA
Sbjct: 129 GPVCGNCQTQTTPLWRRDETGQVLCNA 155
>gi|68480451|ref|XP_715824.1| hypothetical protein CaO19.10361 [Candida albicans SC5314]
gi|46437465|gb|EAK96811.1| hypothetical protein CaO19.10361 [Candida albicans SC5314]
Length = 715
Score = 59.7 bits (143), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 134 GPVCGNCQTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 174
Score = 42.4 bits (98), Expect = 0.066, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NC +TPLWRRD TG LCNA
Sbjct: 134 GPVCGNCQTQTTPLWRRDETGQVLCNA 160
>gi|68480557|ref|XP_715773.1| hypothetical protein CaO19.2842 [Candida albicans SC5314]
gi|46437412|gb|EAK96759.1| hypothetical protein CaO19.2842 [Candida albicans SC5314]
Length = 710
Score = 59.7 bits (143), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 131 GPVCGNCQTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 171
Score = 42.4 bits (98), Expect = 0.066, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NC +TPLWRRD TG LCNA
Sbjct: 131 GPVCGNCQTQTTPLWRRDETGQVLCNA 157
>gi|89363524|emb|CAJ78410.1| GATA zinc finger protein 3 [Debaryomyces hansenii]
Length = 543
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 45 CRNCKTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 82
Score = 42.4 bits (98), Expect = 0.067, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 45 CRNCKTQTTPLWRRDETGQVLCNA 68
>gi|448507977|ref|XP_003865868.1| Gzf3 transcription factor [Candida orthopsilosis Co 90-125]
gi|380350206|emb|CCG20426.1| Gzf3 transcription factor [Candida orthopsilosis Co 90-125]
Length = 830
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 224 CRNCKTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 261
Score = 42.4 bits (98), Expect = 0.067, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 224 CRNCKTQTTPLWRRDETGQVLCNA 247
>gi|354544744|emb|CCE41469.1| hypothetical protein CPAR2_800210 [Candida parapsilosis]
Length = 880
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 222 CRNCKTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 259
Score = 42.4 bits (98), Expect = 0.067, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 222 CRNCKTQTTPLWRRDETGQVLCNA 245
>gi|260949937|ref|XP_002619265.1| hypothetical protein CLUG_00424 [Clavispora lusitaniae ATCC 42720]
gi|238846837|gb|EEQ36301.1| hypothetical protein CLUG_00424 [Clavispora lusitaniae ATCC 42720]
Length = 527
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 42 CRNCKTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 79
Score = 42.4 bits (98), Expect = 0.067, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 42 CRNCKTQTTPLWRRDETGQVLCNA 65
>gi|254566337|ref|XP_002490279.1| hypothetical protein [Komagataella pastoris GS115]
gi|238030075|emb|CAY67998.1| hypothetical protein PAS_chr1-4_0658 [Komagataella pastoris GS115]
gi|328350672|emb|CCA37072.1| Abnormal spindle-like microcephaly-associated protein homolog
[Komagataella pastoris CBS 7435]
Length = 465
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 46 CRNCKTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 83
Score = 42.4 bits (98), Expect = 0.067, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 46 CRNCKTQTTPLWRRDETGQVLCNA 69
>gi|294657093|ref|XP_459410.2| DEHA2E01826p [Debaryomyces hansenii CBS767]
gi|199432441|emb|CAG87621.2| DEHA2E01826p [Debaryomyces hansenii CBS767]
Length = 636
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL+ K++G+ RPL
Sbjct: 457 CTNCNTTATPLWRRDPKGKPLCNACGLFLKLHGVVRPL 494
Score = 40.8 bits (94), Expect = 0.17, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 457 CTNCNTTATPLWRRDPKGKPLCNA 480
>gi|294655727|ref|XP_457916.2| DEHA2C05280p [Debaryomyces hansenii CBS767]
gi|199430559|emb|CAG85967.2| DEHA2C05280p [Debaryomyces hansenii CBS767]
Length = 543
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 45 CRNCKTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 82
Score = 42.4 bits (98), Expect = 0.067, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 45 CRNCKTQTTPLWRRDETGQVLCNA 68
>gi|190348693|gb|EDK41198.2| hypothetical protein PGUG_05296 [Meyerozyma guilliermondii ATCC
6260]
Length = 512
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 43 CRNCKTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 80
Score = 42.4 bits (98), Expect = 0.067, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 43 CRNCKTQTTPLWRRDETGQVLCNA 66
>gi|146412610|ref|XP_001482276.1| hypothetical protein PGUG_05296 [Meyerozyma guilliermondii ATCC
6260]
Length = 512
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 43 CRNCKTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 80
Score = 42.4 bits (98), Expect = 0.067, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 43 CRNCKTQTTPLWRRDETGQVLCNA 66
>gi|320590477|gb|EFX02920.1| siderophore transcription factor [Grosmannia clavigera kw1407]
Length = 597
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 22/37 (59%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NC TPLWRRD G +CNACGLY+K++G++RP
Sbjct: 283 CQNCSTTITPLWRRDENGRTICNACGLYYKLHGVHRP 319
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP-LVKQP 122
G+ C NCG TPLWRR G +CNACGLY K RP +K+P
Sbjct: 108 GQVCSNCGITRTPLWRRSPQGAIICNACGLYQKARNTARPSYLKRP 153
Score = 39.3 bits (90), Expect = 0.53, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NCG TPLWRR G +CNA
Sbjct: 108 GQVCSNCGITRTPLWRRSPQGAIICNA 134
Score = 38.1 bits (87), Expect = 1.3, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC TPLWRRD G +CNA
Sbjct: 283 CQNCSTTITPLWRRDENGRTICNA 306
>gi|193683287|ref|XP_001943242.1| PREDICTED: hypothetical protein LOC100168211 [Acyrthosiphon pisum]
Length = 888
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 22/42 (52%), Positives = 30/42 (71%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+G +C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 719 QGLQCTNCQTATTSLWRRNQVGEPVCNACGLYFKLHGVKRPL 760
Score = 37.4 bits (85), Expect = 1.9, Method: Composition-based stats.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
+G +C NC +T LWRR+ G +CNA
Sbjct: 719 QGLQCTNCQTATTSLWRRNQVGEPVCNA 746
>gi|402083985|gb|EJT79003.1| GATA type zinc finger protein Asd4 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 448
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 193 CQNCSTSTTPLWRRDETGAVLCNACGLFLKLHGRPRPI 230
Score = 42.4 bits (98), Expect = 0.070, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 193 CQNCSTSTTPLWRRDETGAVLCNA 216
>gi|344229029|gb|EGV60915.1| hypothetical protein CANTEDRAFT_137382 [Candida tenuis ATCC 10573]
Length = 519
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 28/39 (71%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
EC NC + TPLWR+D G LCNACGL+ K++G RPL
Sbjct: 205 ECHNCHTVKTPLWRKDPDGKTLCNACGLFLKLHGTMRPL 243
Score = 40.8 bits (94), Expect = 0.16, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
EC NC + TPLWR+D G LCNA
Sbjct: 205 ECHNCHTVKTPLWRKDPDGKTLCNA 229
>gi|358393838|gb|EHK43239.1| hypothetical protein TRIATDRAFT_320532 [Trichoderma atroviride IMI
206040]
Length = 547
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRRD G LCNACGL+ K++G RP+
Sbjct: 81 CQNCGTSTTPLWRRDEFGSVLCNACGLFLKLHGRPRPI 118
Score = 42.4 bits (98), Expect = 0.072, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G LCNA
Sbjct: 81 CQNCGTSTTPLWRRDEFGSVLCNA 104
>gi|347827511|emb|CCD43208.1| similar to transcription factor Zn, GATA [Botryotinia fuckeliana]
Length = 989
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 703 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 740
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 703 CTNCFTQTTPLWRRNPEGHPLCNA 726
>gi|340521468|gb|EGR51702.1| zinc finger protein [Trichoderma reesei QM6a]
Length = 429
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRRD G LCNACGL+ K++G RP+
Sbjct: 11 CQNCGTSTTPLWRRDEFGSVLCNACGLFLKLHGRPRPI 48
Score = 42.4 bits (98), Expect = 0.072, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G LCNA
Sbjct: 11 CQNCGTSTTPLWRRDEFGSVLCNA 34
>gi|396463571|ref|XP_003836396.1| hypothetical protein LEMA_P039320.1 [Leptosphaeria maculans JN3]
gi|312212949|emb|CBX93031.1| hypothetical protein LEMA_P039320.1 [Leptosphaeria maculans JN3]
Length = 925
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 700 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 737
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 700 CTNCFTQTTPLWRRNPEGHPLCNA 723
>gi|154302551|ref|XP_001551685.1| hypothetical protein BC1G_09852 [Botryotinia fuckeliana B05.10]
Length = 989
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 703 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 740
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 703 CTNCFTQTTPLWRRNPEGHPLCNA 726
>gi|367053717|ref|XP_003657237.1| hypothetical protein THITE_2171447 [Thielavia terrestris NRRL 8126]
gi|347004502|gb|AEO70901.1| hypothetical protein THITE_2171447 [Thielavia terrestris NRRL 8126]
Length = 306
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/38 (63%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 13 CQNCGTSTTPLWRRDETGAVLCNACGLFLKLHGRPRPI 50
Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD TG LCNA
Sbjct: 13 CQNCGTSTTPLWRRDETGAVLCNA 36
>gi|149246049|ref|XP_001527494.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146447448|gb|EDK41836.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 816
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 242 CRNCQTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 279
Score = 42.0 bits (97), Expect = 0.073, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD TG LCNA
Sbjct: 242 CRNCQTQTTPLWRRDETGQVLCNA 265
>gi|37547258|gb|AAP30890.1| AreA protein [Phaeosphaeria nodorum]
Length = 851
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 631 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 668
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 631 CTNCFTQTTPLWRRNPEGHPLCNA 654
>gi|453088528|gb|EMF16568.1| hypothetical protein SEPMUDRAFT_34698 [Mycosphaerella populorum
SO2202]
Length = 957
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 714 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 751
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 714 CTNCFTQTTPLWRRNPEGHPLCNA 737
>gi|452987950|gb|EME87705.1| GLN3p transcription factor [Pseudocercospora fijiensis CIRAD86]
Length = 876
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 630 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 667
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 630 CTNCFTQTTPLWRRNPEGHPLCNA 653
>gi|451996461|gb|EMD88928.1| hypothetical protein COCHEDRAFT_1110693 [Cochliobolus
heterostrophus C5]
Length = 824
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 606 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 643
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 606 CTNCFTQTTPLWRRNPEGHPLCNA 629
>gi|451850876|gb|EMD64177.1| hypothetical protein COCSADRAFT_89695 [Cochliobolus sativus ND90Pr]
Length = 843
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 625 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 662
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 625 CTNCFTQTTPLWRRNPEGHPLCNA 648
>gi|449298668|gb|EMC94683.1| hypothetical protein BAUCODRAFT_35909 [Baudoinia compniacensis UAMH
10762]
Length = 936
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 677 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 714
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 677 CTNCFTQTTPLWRRNPEGHPLCNA 700
>gi|440634869|gb|ELR04788.1| hypothetical protein GMDG_07015 [Geomyces destructans 20631-21]
Length = 996
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 698 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 735
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 698 CTNCFTQTTPLWRRNPEGHPLCNA 721
>gi|407924861|gb|EKG17886.1| Zinc finger GATA-type protein [Macrophomina phaseolina MS6]
Length = 923
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 690 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 727
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 690 CTNCFTQTTPLWRRNPEGHPLCNA 713
>gi|398397269|ref|XP_003852092.1| hypothetical protein MYCGRDRAFT_109392 [Zymoseptoria tritici
IPO323]
gi|339471973|gb|EGP87068.1| hypothetical protein MYCGRDRAFT_109392 [Zymoseptoria tritici
IPO323]
Length = 932
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 691 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 728
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 691 CTNCFTQTTPLWRRNPEGHPLCNA 714
>gi|189200320|ref|XP_001936497.1| nitrogen regulatory protein areA [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187983596|gb|EDU49084.1| nitrogen regulatory protein areA [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 912
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 659 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 696
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 659 CTNCFTQTTPLWRRNPEGHPLCNA 682
>gi|169608630|ref|XP_001797734.1| hypothetical protein SNOG_07400 [Phaeosphaeria nodorum SN15]
gi|111063746|gb|EAT84866.1| hypothetical protein SNOG_07400 [Phaeosphaeria nodorum SN15]
Length = 854
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 634 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 671
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 634 CTNCFTQTTPLWRRNPEGHPLCNA 657
>gi|330928054|ref|XP_003302108.1| hypothetical protein PTT_13807 [Pyrenophora teres f. teres 0-1]
gi|311322720|gb|EFQ89799.1| hypothetical protein PTT_13807 [Pyrenophora teres f. teres 0-1]
Length = 889
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 671 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 708
Score = 40.8 bits (94), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 671 CTNCFTQTTPLWRRNPEGHPLCNA 694
>gi|296413330|ref|XP_002836367.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295630184|emb|CAZ80558.1| unnamed protein product [Tuber melanosporum]
Length = 930
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 699 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 736
Score = 40.8 bits (94), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 699 CTNCFTQTTPLWRRNPEGHPLCNA 722
>gi|50556296|ref|XP_505556.1| YALI0F17886p [Yarrowia lipolytica]
gi|49651426|emb|CAG78365.1| YALI0F17886p [Yarrowia lipolytica CLIB122]
Length = 697
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/42 (54%), Positives = 29/42 (69%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
E +C NC +TPLWRR G LCNACGL+ K++G+ RPL
Sbjct: 532 EALQCSNCNTTTTPLWRRSPEGESLCNACGLFLKLHGVVRPL 573
Score = 40.8 bits (94), Expect = 0.21, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 17/28 (60%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
E +C NC +TPLWRR G LCNA
Sbjct: 532 EALQCSNCNTTTTPLWRRSPEGESLCNA 559
>gi|156055378|ref|XP_001593613.1| hypothetical protein SS1G_05040 [Sclerotinia sclerotiorum 1980]
gi|154702825|gb|EDO02564.1| hypothetical protein SS1G_05040 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 901
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 611 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 648
Score = 40.8 bits (94), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 611 CTNCFTQTTPLWRRNPEGHPLCNA 634
>gi|217337439|gb|ACK43087.1| major nitrogen regulator element [Tuber borchii]
Length = 931
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 700 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 737
Score = 40.8 bits (94), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 700 CTNCFTQTTPLWRRNPEGHPLCNA 723
>gi|354547100|emb|CCE43833.1| hypothetical protein CPAR2_500590 [Candida parapsilosis]
Length = 769
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/41 (56%), Positives = 30/41 (73%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G+ C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 596 GQTCSNCQTKTTPLWRRNPEGQPLCNACGLFLKLHGVTRPL 636
Score = 39.7 bits (91), Expect = 0.42, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G+ C NC +TPLWRR+ G LCNA
Sbjct: 596 GQTCSNCQTKTTPLWRRNPEGQPLCNA 622
>gi|59802873|gb|AAX07650.1| GATA type zinc finger protein-like protein [Magnaporthe grisea]
Length = 512
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRRD G LCNACGL+ K++G RP+
Sbjct: 187 CQNCGTSTTPLWRRDEHGAVLCNACGLFLKLHGRPRPI 224
Score = 42.0 bits (97), Expect = 0.079, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G LCNA
Sbjct: 187 CQNCGTSTTPLWRRDEHGAVLCNA 210
>gi|363748562|ref|XP_003644499.1| hypothetical protein Ecym_1456 [Eremothecium cymbalariae
DBVPG#7215]
gi|356888131|gb|AET37682.1| hypothetical protein Ecym_1456 [Eremothecium cymbalariae
DBVPG#7215]
Length = 657
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 30/39 (76%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC +TPLWRRD G+ LCNACGL+ K++G+ RPL
Sbjct: 444 KCTNCHTKTTPLWRRDPQGNPLCNACGLFLKLHGVVRPL 482
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC +TPLWRRD G+ LCNA
Sbjct: 444 KCTNCHTKTTPLWRRDPQGNPLCNA 468
>gi|255710611|ref|XP_002551589.1| KLTH0A03036p [Lachancea thermotolerans]
gi|238932966|emb|CAR21147.1| KLTH0A03036p [Lachancea thermotolerans CBS 6340]
Length = 720
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 30/39 (76%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC +TPLWRRD G+ LCNACGL+ K++G+ RPL
Sbjct: 460 KCTNCHTKTTPLWRRDPQGNPLCNACGLFLKLHGVVRPL 498
Score = 40.4 bits (93), Expect = 0.22, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC +TPLWRRD G+ LCNA
Sbjct: 460 KCTNCHTKTTPLWRRDPQGNPLCNA 484
>gi|389628496|ref|XP_003711901.1| GATA type zinc finger protein Asd4 [Magnaporthe oryzae 70-15]
gi|351644233|gb|EHA52094.1| GATA type zinc finger protein Asd4 [Magnaporthe oryzae 70-15]
gi|440470940|gb|ELQ39979.1| GATA type zinc finger protein Asd4 [Magnaporthe oryzae Y34]
gi|440488291|gb|ELQ68022.1| GATA type zinc finger protein Asd4 [Magnaporthe oryzae P131]
Length = 512
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRRD G LCNACGL+ K++G RP+
Sbjct: 187 CQNCGTSTTPLWRRDEHGAVLCNACGLFLKLHGRPRPI 224
Score = 42.0 bits (97), Expect = 0.079, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G LCNA
Sbjct: 187 CQNCGTSTTPLWRRDEHGAVLCNA 210
>gi|402589823|gb|EJW83754.1| GATA zinc finger family protein [Wuchereria bancrofti]
Length = 553
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 22/42 (52%), Positives = 30/42 (71%)
Query: 77 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+G C NC +T LWRRD GH +CNACGLY+K++ + RP+
Sbjct: 346 QGLICANCRGTNTTLWRRDADGHPVCNACGLYYKLHQVQRPI 387
Score = 40.0 bits (92), Expect = 0.30, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 18/28 (64%)
Query: 19 EGRECVNCGAISTPLWRRDGTGHYLCNA 46
+G C NC +T LWRRD GH +CNA
Sbjct: 346 QGLICANCRGTNTTLWRRDADGHPVCNA 373
>gi|195111292|ref|XP_002000213.1| GI22648 [Drosophila mojavensis]
gi|193916807|gb|EDW15674.1| GI22648 [Drosophila mojavensis]
Length = 750
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL
Sbjct: 524 GVVCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPL 564
>gi|255722205|ref|XP_002546037.1| hypothetical protein CTRG_00818 [Candida tropicalis MYA-3404]
gi|240136526|gb|EER36079.1| hypothetical protein CTRG_00818 [Candida tropicalis MYA-3404]
Length = 667
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 78 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
G C NC +TPLWRRD TG LCNACGL+ K++G RP+
Sbjct: 120 GPICGNCQTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPI 160
Score = 42.0 bits (97), Expect = 0.082, Method: Composition-based stats.
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 20 GRECVNCGAISTPLWRRDGTGHYLCNA 46
G C NC +TPLWRRD TG LCNA
Sbjct: 120 GPICGNCQTQTTPLWRRDETGQVLCNA 146
>gi|389738594|gb|EIM79791.1| glucocorticoid receptor-like protein, partial [Stereum hirsutum
FP-91666 SS1]
Length = 163
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 48/94 (51%), Gaps = 7/94 (7%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNAR-LFSTSFTGYFIMSRFLFASLPVEDIEYFTEGR-- 79
C+NCG TPLWR+ CNA L+ S ++ R + ++ + +F
Sbjct: 1 CLNCGVTHTPLWRKGLNNELNCNAGGLYCQSA---YLSPRLHHVTNRLQFLHFFISPAVT 57
Query: 80 -ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 112
+C NC +TP WR+D G +CN CGLY K++
Sbjct: 58 AKCYNCNTTTTPPWRKDDEGKTVCNVCGLYFKLH 91
>gi|327309432|ref|XP_003239407.1| hypothetical protein TERG_08692 [Trichophyton rubrum CBS 118892]
gi|326459663|gb|EGD85116.1| hypothetical protein TERG_08692 [Trichophyton rubrum CBS 118892]
Length = 704
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ G +LCNACGL+ K++G+ RPL
Sbjct: 476 CSNCFTQTTPLWRRNAEGQFLCNACGLFLKLHGVVRPL 513
Score = 40.4 bits (93), Expect = 0.23, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ G +LCNA
Sbjct: 476 CSNCFTQTTPLWRRNAEGQFLCNA 499
>gi|315053355|ref|XP_003176051.1| nitrogen regulatory protein areA [Arthroderma gypseum CBS 118893]
gi|311337897|gb|EFQ97099.1| nitrogen regulatory protein areA [Arthroderma gypseum CBS 118893]
Length = 801
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ G +LCNACGL+ K++G+ RPL
Sbjct: 586 CSNCFTQTTPLWRRNAEGQFLCNACGLFLKLHGVVRPL 623
Score = 40.4 bits (93), Expect = 0.23, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ G +LCNA
Sbjct: 586 CSNCFTQTTPLWRRNAEGQFLCNA 609
>gi|426197006|gb|EKV46934.1| hypothetical protein AGABI2DRAFT_186255 [Agaricus bisporus var.
bisporus H97]
Length = 439
Score = 59.3 bits (142), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 23/37 (62%), Positives = 29/37 (78%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +TPLWRRD G+ +CNACGLY K++G +RP
Sbjct: 87 CANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRP 123
Score = 41.2 bits (95), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRRD G+ +CNA
Sbjct: 87 CANCGTSTTPLWRRDDVGNNICNA 110
>gi|448123846|ref|XP_004204769.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
gi|358249402|emb|CCE72468.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
Length = 697
Score = 59.3 bits (142), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 530 CTNCGTRTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 567
Score = 40.8 bits (94), Expect = 0.19, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRR+ G LCNA
Sbjct: 530 CTNCGTRTTPLWRRNPQGQPLCNA 553
>gi|195146794|ref|XP_002014369.1| GL19159 [Drosophila persimilis]
gi|194106322|gb|EDW28365.1| GL19159 [Drosophila persimilis]
Length = 803
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 656 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGINRP 692
>gi|194859800|ref|XP_001969453.1| GG10108 [Drosophila erecta]
gi|190661320|gb|EDV58512.1| GG10108 [Drosophila erecta]
Length = 838
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 690 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGINRP 726
>gi|190346789|gb|EDK38960.2| hypothetical protein PGUG_03058 [Meyerozyma guilliermondii ATCC
6260]
Length = 579
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 29/39 (74%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC +TPLWR+D G+ LCNACGL+ K++G RPL
Sbjct: 189 QCFNCNTSNTPLWRKDPDGNTLCNACGLFLKLHGSTRPL 227
Score = 40.0 bits (92), Expect = 0.29, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC +TPLWR+D G+ LCNA
Sbjct: 189 QCFNCNTSNTPLWRKDPDGNTLCNA 213
>gi|146418727|ref|XP_001485329.1| hypothetical protein PGUG_03058 [Meyerozyma guilliermondii ATCC
6260]
Length = 579
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 29/39 (74%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC +TPLWR+D G+ LCNACGL+ K++G RPL
Sbjct: 189 QCFNCNTSNTPLWRKDPDGNTLCNACGLFLKLHGSTRPL 227
Score = 40.0 bits (92), Expect = 0.29, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC +TPLWR+D G+ LCNA
Sbjct: 189 QCFNCNTSNTPLWRKDPDGNTLCNA 213
>gi|12232017|gb|AAG49351.1|AF320976_1 GATA factor AREB gamma [Emericella nidulans]
Length = 436
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 27/38 (71%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG TPLWRRD G LCNACGL+ K++G RP+
Sbjct: 141 CQNCGTSKTPLWRRDELGSVLCNACGLFLKLHGRPRPI 178
Score = 42.0 bits (97), Expect = 0.087, Method: Composition-based stats.
Identities = 16/24 (66%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G LCNA
Sbjct: 141 CQNCGTSKTPLWRRDELGSVLCNA 164
>gi|345571186|gb|EGX54001.1| hypothetical protein AOL_s00004g660 [Arthrobotrys oligospora ATCC
24927]
Length = 820
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 586 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 623
Score = 40.4 bits (93), Expect = 0.23, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ GH LCNA
Sbjct: 586 CTNCFTQTTPLWRRNPEGHPLCNA 609
>gi|302661634|ref|XP_003022483.1| GATA transcriptional activator AreA [Trichophyton verrucosum HKI
0517]
gi|291186429|gb|EFE41865.1| GATA transcriptional activator AreA [Trichophyton verrucosum HKI
0517]
Length = 905
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ G +LCNACGL+ K++G+ RPL
Sbjct: 677 CSNCFTQTTPLWRRNAEGQFLCNACGLFLKLHGVVRPL 714
Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ G +LCNA
Sbjct: 677 CSNCFTQTTPLWRRNAEGQFLCNA 700
>gi|302499979|ref|XP_003011984.1| GATA transcriptional activator AreA [Arthroderma benhamiae CBS
112371]
gi|291175539|gb|EFE31344.1| GATA transcriptional activator AreA [Arthroderma benhamiae CBS
112371]
Length = 811
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ G +LCNACGL+ K++G+ RPL
Sbjct: 583 CSNCFTQTTPLWRRNAEGQFLCNACGLFLKLHGVVRPL 620
Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ G +LCNA
Sbjct: 583 CSNCFTQTTPLWRRNAEGQFLCNA 606
>gi|296814256|ref|XP_002847465.1| nitrogen regulatory protein areA [Arthroderma otae CBS 113480]
gi|238840490|gb|EEQ30152.1| nitrogen regulatory protein areA [Arthroderma otae CBS 113480]
Length = 774
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ G +LCNACGL+ K++G+ RPL
Sbjct: 545 CSNCFTQTTPLWRRNAEGQFLCNACGLFLKLHGVVRPL 582
Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ G +LCNA
Sbjct: 545 CSNCFTQTTPLWRRNAEGQFLCNA 568
>gi|219687608|dbj|BAH09312.1| GATA transcription factor [Trichophyton interdigitale]
Length = 796
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ G +LCNACGL+ K++G+ RPL
Sbjct: 567 CSNCFTQTTPLWRRNAEGQFLCNACGLFLKLHGVVRPL 604
Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ G +LCNA
Sbjct: 567 CSNCFTQTTPLWRRNAEGQFLCNA 590
>gi|403171937|ref|XP_003331115.2| GATA-binding protein [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375169639|gb|EFP86696.2| GATA-binding protein [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 632
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 25/48 (52%), Positives = 31/48 (64%), Gaps = 5/48 (10%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-----VKQPR 123
C NCG +TPLWRR+ +G LCNAC L+ KM G RP+ V +PR
Sbjct: 86 CANCGTNTTPLWRRNQSGATLCNACALFQKMKGRPRPISLKTNVIKPR 133
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRR+ +G LCNA
Sbjct: 86 CANCGTNTTPLWRRNQSGATLCNA 109
>gi|50312349|ref|XP_456208.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49645344|emb|CAG98916.1| KLLA0F25300p [Kluyveromyces lactis]
Length = 692
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 30/39 (76%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC +TPLWRRD G+ LCNACGL+ K++G+ RPL
Sbjct: 470 QCSNCHTKTTPLWRRDPQGNPLCNACGLFLKLHGVVRPL 508
Score = 40.4 bits (93), Expect = 0.25, Method: Composition-based stats.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC +TPLWRRD G+ LCNA
Sbjct: 470 QCSNCHTKTTPLWRRDPQGNPLCNA 494
>gi|449015268|dbj|BAM78699.1| ranscriptional activator [Saccharomyces pastorianus]
Length = 729
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 304 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 342
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 304 QCFNCKTFKTPLWRRSPEGNTLCNA 328
>gi|171600|gb|AAA34645.1| GLN3 protein [Saccharomyces cerevisiae]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|398364401|ref|NP_010958.3| Gln3p [Saccharomyces cerevisiae S288c]
gi|729591|sp|P18494.2|GLN3_YEAST RecName: Full=Nitrogen regulatory protein GLN3
gi|603273|gb|AAB64575.1| Gln3p: Nitrogen regulatory protein [Saccharomyces cerevisiae]
gi|285811666|tpg|DAA07694.1| TPA: Gln3p [Saccharomyces cerevisiae S288c]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|401626089|gb|EJS44054.1| gln3p [Saccharomyces arboricola H-6]
Length = 747
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 328 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 366
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 328 QCFNCKTFKTPLWRRSPEGNTLCNA 352
>gi|392299731|gb|EIW10823.1| Gln3p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|365766065|gb|EHN07566.1| Gln3p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|365761038|gb|EHN02714.1| Gln3p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 731
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|349577698|dbj|GAA22866.1| K7_Gln3p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|323355248|gb|EGA87073.1| Gln3p [Saccharomyces cerevisiae VL3]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|323348973|gb|EGA83209.1| Gln3p [Saccharomyces cerevisiae Lalvin QA23]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|323337940|gb|EGA79179.1| Gln3p [Saccharomyces cerevisiae Vin13]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|323333924|gb|EGA75313.1| Gln3p [Saccharomyces cerevisiae AWRI796]
Length = 713
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|323305228|gb|EGA58975.1| Gln3p [Saccharomyces cerevisiae FostersB]
Length = 679
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|259145949|emb|CAY79209.1| Gln3p [Saccharomyces cerevisiae EC1118]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|256271152|gb|EEU06245.1| Gln3p [Saccharomyces cerevisiae JAY291]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|190405602|gb|EDV08869.1| transcriptional activator of nitrogen-regulated genes
[Saccharomyces cerevisiae RM11-1a]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|151944750|gb|EDN63009.1| transcriptional activator of nitrogen-regulated genes
[Saccharomyces cerevisiae YJM789]
Length = 730
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC TPLWRR G+ LCNACGL+ K++G RPL
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNACGLFQKLHGTMRPL 343
Score = 39.3 bits (90), Expect = 0.56, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC TPLWRR G+ LCNA
Sbjct: 305 QCFNCKTFKTPLWRRSPEGNTLCNA 329
>gi|384491561|gb|EIE82757.1| hypothetical protein RO3G_07462 [Rhizopus delemar RA 99-880]
Length = 243
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 39/58 (67%), Gaps = 6/58 (10%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKQPRRLRFDRGNF 132
C NC A +TPLWRRD G+ +CNACGLY+K++ ++RP ++K+ +R + NF
Sbjct: 69 CSNCQATNTPLWRRDEAGNTICNACGLYYKLHHVHRPRTMMNSVIKRRKRCPSKKINF 126
Score = 40.8 bits (94), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC A +TPLWRRD G+ +CNA
Sbjct: 69 CSNCQATNTPLWRRDEAGNTICNA 92
>gi|24583358|ref|NP_609383.2| GATAd, isoform A [Drosophila melanogaster]
gi|442627216|ref|NP_001260326.1| GATAd, isoform B [Drosophila melanogaster]
gi|442627218|ref|NP_001260327.1| GATAd, isoform C [Drosophila melanogaster]
gi|22946136|gb|AAF52916.2| GATAd, isoform A [Drosophila melanogaster]
gi|379699074|gb|AFD10759.1| FI19405p1 [Drosophila melanogaster]
gi|440213644|gb|AGB92861.1| GATAd, isoform B [Drosophila melanogaster]
gi|440213645|gb|AGB92862.1| GATAd, isoform C [Drosophila melanogaster]
Length = 842
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 692 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 728
>gi|448121454|ref|XP_004204211.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
gi|358349750|emb|CCE73029.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
Length = 697
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 530 CTNCGTRTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 567
Score = 40.4 bits (93), Expect = 0.21, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +TPLWRR+ G LCNA
Sbjct: 530 CTNCGTRTTPLWRRNPQGQPLCNA 553
>gi|195578139|ref|XP_002078923.1| GD23678 [Drosophila simulans]
gi|194190932|gb|EDX04508.1| GD23678 [Drosophila simulans]
Length = 844
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 696 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 732
>gi|195473609|ref|XP_002089085.1| GE18923 [Drosophila yakuba]
gi|194175186|gb|EDW88797.1| GE18923 [Drosophila yakuba]
Length = 838
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 690 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 726
>gi|195397608|ref|XP_002057420.1| GJ18119 [Drosophila virilis]
gi|194141074|gb|EDW57493.1| GJ18119 [Drosophila virilis]
Length = 860
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 698 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 734
>gi|195339757|ref|XP_002036483.1| GM18170 [Drosophila sechellia]
gi|194130363|gb|EDW52406.1| GM18170 [Drosophila sechellia]
Length = 844
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 696 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 732
>gi|195119257|ref|XP_002004148.1| GI18290 [Drosophila mojavensis]
gi|193914723|gb|EDW13590.1| GI18290 [Drosophila mojavensis]
Length = 469
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 408 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 444
>gi|194761760|ref|XP_001963094.1| GF15767 [Drosophila ananassae]
gi|190616791|gb|EDV32315.1| GF15767 [Drosophila ananassae]
Length = 810
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 667 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 703
>gi|326483661|gb|EGE07671.1| zinc-finger protein [Trichophyton equinum CBS 127.97]
Length = 709
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ G +LCNACGL+ K++G+ RPL
Sbjct: 480 CSNCFTQTTPLWRRNAEGQFLCNACGLFLKLHGVVRPL 517
Score = 40.4 bits (93), Expect = 0.26, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ G +LCNA
Sbjct: 480 CSNCFTQTTPLWRRNAEGQFLCNA 503
>gi|326471709|gb|EGD95718.1| zinc-finger protein [Trichophyton tonsurans CBS 112818]
Length = 745
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ G +LCNACGL+ K++G+ RPL
Sbjct: 524 CSNCFTQTTPLWRRNAEGQFLCNACGLFLKLHGVVRPL 561
Score = 40.4 bits (93), Expect = 0.26, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ G +LCNA
Sbjct: 524 CSNCFTQTTPLWRRNAEGQFLCNA 547
>gi|170092985|ref|XP_001877714.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164647573|gb|EDR11817.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1103
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRRD G LCNACGL+ K++G+ RPL
Sbjct: 979 CTNCHTTNTPLWRRDPEGQPLCNACGLFFKLHGVVRPL 1016
Score = 40.0 bits (92), Expect = 0.34, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRRD G LCNA
Sbjct: 979 CTNCHTTNTPLWRRDPEGQPLCNA 1002
>gi|113374055|dbj|BAF03588.1| zinc-finger protein [Arthroderma otae]
Length = 761
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 29/38 (76%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC +TPLWRR+ G +LCNACGL+ K++G+ RPL
Sbjct: 532 CSNCFTQTTPLWRRNAEGQFLCNACGLFLKLHGVVRPL 569
Score = 40.4 bits (93), Expect = 0.26, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NC +TPLWRR+ G +LCNA
Sbjct: 532 CSNCFTQTTPLWRRNAEGQFLCNA 555
>gi|357625879|gb|EHJ76168.1| hypothetical protein KGM_07907 [Danaus plexippus]
Length = 514
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 117
C NCG +T +WRRD G +CNACGLY+K++G+ RP
Sbjct: 355 CSNCGTHTTTIWRRDARGEMVCNACGLYYKLHGVPRP 391
Score = 38.1 bits (87), Expect = 1.3, Method: Composition-based stats.
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG +T +WRRD G +CNA
Sbjct: 355 CSNCGTHTTTIWRRDARGEMVCNA 378
>gi|195389406|ref|XP_002053368.1| GJ23377 [Drosophila virilis]
gi|194151454|gb|EDW66888.1| GJ23377 [Drosophila virilis]
Length = 771
Score = 59.3 bits (142), Expect = 6e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
C NC ST LWRR+ G+ +CNACGLY K++ MNRPL
Sbjct: 558 CANCHTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPL 595
>gi|254566821|ref|XP_002490521.1| Transcriptional activator of genes regulated by nitrogen catabolite
repression (NCR) [Komagataella pastoris GS115]
gi|238030317|emb|CAY68240.1| Transcriptional activator of genes regulated by nitrogen catabolite
repression (NCR) [Komagataella pastoris GS115]
gi|249692033|emb|CAM59350.2| GATA-type transcription factor Fep1 [Komagataella pastoris]
gi|328350911|emb|CCA37311.1| Zinc finger protein 287 [Komagataella pastoris CBS 7435]
Length = 337
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 4/78 (5%)
Query: 45 NARLFSTSFTGYFIMSRFLFASLPVEDIEYFTEGRE----CVNCGAISTPLWRRDGTGHY 100
N R+ S++F S D E + + C NCG TPLWRRD G+
Sbjct: 124 NNRVISSTFKHKQSNSTVKSPDTSSNDEEESSGLQSVAIACTNCGTTVTPLWRRDNIGNT 183
Query: 101 LCNACGLYHKMNGMNRPL 118
+CNACGLY++++G +RP+
Sbjct: 184 ICNACGLYYRLHGSHRPV 201
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 35/59 (59%), Gaps = 5/59 (8%)
Query: 81 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKQPRRL----RFDRGNFNC 134
C NCG TPLWRR G +CNACGLY++ +RP+ +K+P ++ R D N C
Sbjct: 44 CSNCGTAKTPLWRRAADGSLVCNACGLYYRAKNSHRPVNLKRPPKVVTVSRSDLENGTC 102
Score = 40.8 bits (94), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRRD G+ +CNA
Sbjct: 164 CTNCGTTVTPLWRRDNIGNTICNA 187
Score = 37.0 bits (84), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 23 CVNCGAISTPLWRRDGTGHYLCNA 46
C NCG TPLWRR G +CNA
Sbjct: 44 CSNCGTAKTPLWRRAADGSLVCNA 67
>gi|238595515|ref|XP_002393788.1| hypothetical protein MPER_06424 [Moniliophthora perniciosa FA553]
gi|215461799|gb|EEB94718.1| hypothetical protein MPER_06424 [Moniliophthora perniciosa FA553]
Length = 147
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/39 (58%), Positives = 30/39 (76%)
Query: 80 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 118
+C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 54 QCTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 92
Score = 40.0 bits (92), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 22 ECVNCGAISTPLWRRDGTGHYLCNA 46
+C NC +TPLWRRD G LCNA
Sbjct: 54 QCTNCQTTNTPLWRRDPEGQPLCNA 78
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.329 0.143 0.480
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,458,449,431
Number of Sequences: 23463169
Number of extensions: 98704086
Number of successful extensions: 217669
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1570
Number of HSP's successfully gapped in prelim test: 822
Number of HSP's that attempted gapping in prelim test: 212066
Number of HSP's gapped (non-prelim): 5446
length of query: 142
length of database: 8,064,228,071
effective HSP length: 106
effective length of query: 36
effective length of database: 9,872,099,453
effective search space: 355395580308
effective search space used: 355395580308
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 71 (32.0 bits)