BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy15323
(139 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|442619350|ref|NP_001262620.1| pannier, isoform C [Drosophila melanogaster]
gi|440217479|gb|AGB96000.1| pannier, isoform C [Drosophila melanogaster]
Length = 537
Score = 249 bits (636), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 116/134 (86%), Positives = 118/134 (88%), Gaps = 10/134 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 159 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 218
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKDGIQ
Sbjct: 219 MGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKDGIQ 268
Query: 121 TRKRKPKKQGGGGG 134
TRKRKPKK G G
Sbjct: 269 TRKRKPKKTGSGSA 282
>gi|194767723|ref|XP_001965964.1| GF11891 [Drosophila ananassae]
gi|190619807|gb|EDV35331.1| GF11891 [Drosophila ananassae]
Length = 551
Score = 248 bits (632), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 118/139 (84%), Positives = 119/139 (85%), Gaps = 10/139 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 171 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 230
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGL CTNCGT TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKDGIQ
Sbjct: 231 LGLRCTNCGTHTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKDGIQ 280
Query: 121 TRKRKPKKQGGGGGGGSGG 139
TRKRKPKK G G GSG
Sbjct: 281 TRKRKPKKSGSGTAAGSGA 299
>gi|195452164|ref|XP_002073240.1| GK13255 [Drosophila willistoni]
gi|194169325|gb|EDW84226.1| GK13255 [Drosophila willistoni]
Length = 550
Score = 246 bits (628), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 116/131 (88%), Positives = 117/131 (89%), Gaps = 10/131 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 155 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 214
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKDGIQ
Sbjct: 215 LGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKDGIQ 264
Query: 121 TRKRKPKKQGG 131
TRKRKPKK G
Sbjct: 265 TRKRKPKKTGS 275
>gi|198454786|ref|XP_001359720.2| GA17817, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132953|gb|EAL28872.2| GA17817, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 532
Score = 246 bits (627), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 115/128 (89%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 150 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 209
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKDGIQ
Sbjct: 210 LGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKDGIQ 259
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 260 TRKRKPKK 267
>gi|195157146|ref|XP_002019457.1| GL12215 [Drosophila persimilis]
gi|194116048|gb|EDW38091.1| GL12215 [Drosophila persimilis]
Length = 532
Score = 246 bits (627), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 115/128 (89%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 150 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 209
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKDGIQ
Sbjct: 210 LGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKDGIQ 259
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 260 TRKRKPKK 267
>gi|119112909|ref|XP_307949.3| AGAP002235-PA [Anopheles gambiae str. PEST]
gi|15420800|gb|AAK97462.1|AF395080_1 zinc finger transcription factor pannier [Anopheles gambiae]
gi|116132794|gb|EAA03705.3| AGAP002235-PA [Anopheles gambiae str. PEST]
Length = 537
Score = 246 bits (627), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 118/137 (86%), Positives = 120/137 (87%), Gaps = 13/137 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 110 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSQTA 169
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRRLGLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPLAMRKD
Sbjct: 170 TRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPLAMRKD 219
Query: 118 GIQTRKRKPKKQGGGGG 134
GIQTRKRKPKK GG GG
Sbjct: 220 GIQTRKRKPKKTGGSGG 236
>gi|195328561|ref|XP_002030983.1| GM24279 [Drosophila sechellia]
gi|194119926|gb|EDW41969.1| GM24279 [Drosophila sechellia]
Length = 494
Score = 245 bits (625), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 114/128 (89%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 159 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 218
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKDGIQ
Sbjct: 219 MGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKDGIQ 268
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 269 TRKRKPKK 276
>gi|19554935|gb|AAB29876.2| achaete and scute repressor/zinc finger protein [Drosophila
melanogaster]
Length = 540
Score = 244 bits (623), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 116/137 (84%), Positives = 118/137 (86%), Gaps = 13/137 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 159 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSATA 218
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRR+GLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKD
Sbjct: 219 TRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKD 268
Query: 118 GIQTRKRKPKKQGGGGG 134
GIQTRKRKPKK G G
Sbjct: 269 GIQTRKRKPKKTGSGSA 285
>gi|17136408|ref|NP_476685.1| pannier, isoform A [Drosophila melanogaster]
gi|1709699|sp|P52168.1|PNR_DROME RecName: Full=GATA-binding factor A; AltName: Full=Protein pannier;
AltName: Full=Transcription factor GATA-A; AltName:
Full=dGATA-A
gi|545347|gb|AAB29874.1| C4 zinc finger DNA-binding protein/GATA transcription factor
[Drosophila melanogaster]
gi|23171432|gb|AAN13693.1| pannier, isoform A [Drosophila melanogaster]
gi|159884201|gb|ABX00779.1| RE33290p [Drosophila melanogaster]
Length = 540
Score = 244 bits (623), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 116/137 (84%), Positives = 118/137 (86%), Gaps = 13/137 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 159 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSATA 218
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRR+GLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKD
Sbjct: 219 TRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKD 268
Query: 118 GIQTRKRKPKKQGGGGG 134
GIQTRKRKPKK G G
Sbjct: 269 GIQTRKRKPKKTGSGSA 285
>gi|194901188|ref|XP_001980134.1| GG16973 [Drosophila erecta]
gi|190651837|gb|EDV49092.1| GG16973 [Drosophila erecta]
Length = 535
Score = 244 bits (623), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 114/128 (89%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 161 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 220
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKDGIQ
Sbjct: 221 MGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKDGIQ 270
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 271 TRKRKPKK 278
>gi|157125522|ref|XP_001654368.1| GATA transcription factor (GATAd) [Aedes aegypti]
gi|108873603|gb|EAT37828.1| AAEL010221-PA [Aedes aegypti]
Length = 483
Score = 244 bits (622), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 117/137 (85%), Positives = 119/137 (86%), Gaps = 12/137 (8%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL--TAT 58
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TAT
Sbjct: 110 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSTAT 169
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
RRLGLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPLAMRKDG
Sbjct: 170 RRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPLAMRKDG 219
Query: 119 IQTRKRKPKKQGGGGGG 135
IQTRKRKPKK G GG
Sbjct: 220 IQTRKRKPKKTGTGGAA 236
>gi|56900910|gb|AAW31748.1| GATA transcription factor GATAd [Aedes aegypti]
Length = 483
Score = 244 bits (622), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 117/137 (85%), Positives = 119/137 (86%), Gaps = 12/137 (8%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL--TAT 58
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TAT
Sbjct: 110 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSTAT 169
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
RRLGLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPLAMRKDG
Sbjct: 170 RRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPLAMRKDG 219
Query: 119 IQTRKRKPKKQGGGGGG 135
IQTRKRKPKK G GG
Sbjct: 220 IQTRKRKPKKTGTGGAA 236
>gi|195501108|ref|XP_002097661.1| GE24361 [Drosophila yakuba]
gi|194183762|gb|EDW97373.1| GE24361 [Drosophila yakuba]
Length = 540
Score = 244 bits (622), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 114/128 (89%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 165 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 224
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKDGIQ
Sbjct: 225 MGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKDGIQ 274
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 275 TRKRKPKK 282
>gi|24647330|ref|NP_732102.1| pannier, isoform B [Drosophila melanogaster]
gi|442619352|ref|NP_001262621.1| pannier, isoform D [Drosophila melanogaster]
gi|23171433|gb|AAF55264.3| pannier, isoform B [Drosophila melanogaster]
gi|218506027|gb|ACK77655.1| RE17974p [Drosophila melanogaster]
gi|440217480|gb|AGB96001.1| pannier, isoform D [Drosophila melanogaster]
Length = 488
Score = 243 bits (621), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 116/137 (84%), Positives = 118/137 (86%), Gaps = 13/137 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 107 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSATA 166
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRR+GLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKD
Sbjct: 167 TRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKD 216
Query: 118 GIQTRKRKPKKQGGGGG 134
GIQTRKRKPKK G G
Sbjct: 217 GIQTRKRKPKKTGSGSA 233
>gi|20152127|gb|AAM11423.1| SD02611p [Drosophila melanogaster]
Length = 486
Score = 243 bits (621), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 116/137 (84%), Positives = 118/137 (86%), Gaps = 13/137 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 107 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSATA 166
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRR+GLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKD
Sbjct: 167 TRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKD 216
Query: 118 GIQTRKRKPKKQGGGGG 134
GIQTRKRKPKK G G
Sbjct: 217 GIQTRKRKPKKTGSGSA 233
>gi|195389402|ref|XP_002053366.1| GJ23378 [Drosophila virilis]
gi|194151452|gb|EDW66886.1| GJ23378 [Drosophila virilis]
Length = 520
Score = 243 bits (621), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 114/128 (89%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 152 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 211
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 212 LGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 261
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 262 TRKRKPKK 269
>gi|91094885|ref|XP_973051.1| PREDICTED: similar to AGAP002235-PA [Tribolium castaneum]
Length = 395
Score = 243 bits (620), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 115/132 (87%), Positives = 118/132 (89%), Gaps = 10/132 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M+FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 115 MEFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 174
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 175 LGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 224
Query: 121 TRKRKPKKQGGG 132
TRKRKPKK GG
Sbjct: 225 TRKRKPKKPVGG 236
>gi|195036110|ref|XP_001989514.1| GH18759 [Drosophila grimshawi]
gi|193893710|gb|EDV92576.1| GH18759 [Drosophila grimshawi]
Length = 528
Score = 243 bits (620), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 114/128 (89%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 151 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 210
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 211 LGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 260
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 261 TRKRKPKK 268
>gi|270008083|gb|EFA04531.1| pannier [Tribolium castaneum]
Length = 407
Score = 243 bits (620), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 115/132 (87%), Positives = 118/132 (89%), Gaps = 10/132 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M+FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 115 MEFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 174
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 175 LGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 224
Query: 121 TRKRKPKKQGGG 132
TRKRKPKK GG
Sbjct: 225 TRKRKPKKPVGG 236
>gi|18032000|gb|AAL32068.1| pannier-like protein [Calliphora vicina]
Length = 511
Score = 242 bits (618), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 115/139 (82%), Positives = 119/139 (85%), Gaps = 13/139 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
+DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 113 IDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSATA 172
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRRLGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKD
Sbjct: 173 TRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKD 222
Query: 118 GIQTRKRKPKKQGGGGGGG 136
GIQTRKRKPKK GG G
Sbjct: 223 GIQTRKRKPKKSGGSTENG 241
>gi|170043914|ref|XP_001849612.1| GATA transcription factor GATAd [Culex quinquefasciatus]
gi|167867187|gb|EDS30570.1| GATA transcription factor GATAd [Culex quinquefasciatus]
Length = 520
Score = 241 bits (616), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 114/128 (89%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 107 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 166
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 167 LGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 216
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 217 TRKRKPKK 224
>gi|322782943|gb|EFZ10661.1| hypothetical protein SINV_07216 [Solenopsis invicta]
Length = 407
Score = 241 bits (615), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 113/128 (88%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 153 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 212
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 213 LGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 262
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 263 TRKRKPKK 270
>gi|332016555|gb|EGI57436.1| GATA-binding factor A [Acromyrmex echinatior]
Length = 407
Score = 241 bits (615), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 113/128 (88%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 155 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 214
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 215 LGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 264
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 265 TRKRKPKK 272
>gi|340720269|ref|XP_003398563.1| PREDICTED: hypothetical protein LOC100651088 [Bombus terrestris]
Length = 440
Score = 241 bits (615), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 113/128 (88%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 174 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 233
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 234 LGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 283
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 284 TRKRKPKK 291
>gi|307170386|gb|EFN62698.1| GATA-binding factor A [Camponotus floridanus]
Length = 638
Score = 240 bits (613), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 116/139 (83%), Positives = 118/139 (84%), Gaps = 10/139 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 150 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 209
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 210 LGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 259
Query: 121 TRKRKPKKQGGGGGGGSGG 139
TRKRKPKK S G
Sbjct: 260 TRKRKPKKTPEPNARSSAG 278
>gi|383863209|ref|XP_003707074.1| PREDICTED: GATA-binding factor A-like [Megachile rotundata]
Length = 369
Score = 240 bits (613), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 113/128 (88%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 107 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 166
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 167 LGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 216
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 217 TRKRKPKK 224
>gi|390179286|ref|XP_003736855.1| GA17817, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859788|gb|EIM52928.1| GA17817, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 496
Score = 240 bits (613), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 115/131 (87%), Positives = 116/131 (88%), Gaps = 13/131 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 111 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSATA 170
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRRLGLCCTNCGTR TTLWRRNNDGEPVCNACGLYYKLH VN RPLAMRKD
Sbjct: 171 TRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLAMRKD 220
Query: 118 GIQTRKRKPKK 128
GIQTRKRKPKK
Sbjct: 221 GIQTRKRKPKK 231
>gi|195111290|ref|XP_002000212.1| GI22649 [Drosophila mojavensis]
gi|193916806|gb|EDW15673.1| GI22649 [Drosophila mojavensis]
Length = 517
Score = 239 bits (611), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 114/131 (87%), Positives = 116/131 (88%), Gaps = 13/131 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 152 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSATA 211
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRRLGLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPLAMRKD
Sbjct: 212 TRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPLAMRKD 261
Query: 118 GIQTRKRKPKK 128
GIQTRKRKPKK
Sbjct: 262 GIQTRKRKPKK 272
>gi|225544200|gb|ACN91518.1| pnr [Bactrocera cucurbitae]
Length = 517
Score = 239 bits (609), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 113/134 (84%), Positives = 117/134 (87%), Gaps = 13/134 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
+DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 101 IDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSATA 160
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRRLGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKD
Sbjct: 161 TRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKD 210
Query: 118 GIQTRKRKPKKQGG 131
GIQTRKRKPKK G
Sbjct: 211 GIQTRKRKPKKSNG 224
>gi|328786704|ref|XP_001121210.2| PREDICTED: transcription factor GATA-5 [Apis mellifera]
Length = 444
Score = 239 bits (609), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 113/128 (88%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 178 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 237
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 238 LGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 287
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 288 TRKRKPKK 295
>gi|225544236|gb|ACN91538.1| pnr [Rhagoletis juglandis]
Length = 556
Score = 238 bits (608), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 112/132 (84%), Positives = 116/132 (87%), Gaps = 13/132 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
+DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 101 IDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSTTA 160
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRRLGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKD
Sbjct: 161 TRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKD 210
Query: 118 GIQTRKRKPKKQ 129
GIQTRKRKPKK
Sbjct: 211 GIQTRKRKPKKS 222
>gi|7381420|gb|AAF61481.1|AF184154_1 zinc finger DNA binding protein pannier [Ceratitis capitata]
Length = 324
Score = 238 bits (606), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 112/129 (86%), Positives = 116/129 (89%), Gaps = 10/129 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
+DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 101 IDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 160
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 161 LGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 210
Query: 121 TRKRKPKKQ 129
TRKRKPKK
Sbjct: 211 TRKRKPKKS 219
>gi|225544205|gb|ACN91521.1| pnr [Bactrocera dorsalis]
gi|225544207|gb|ACN91522.1| pnr [Bactrocera dorsalis]
Length = 540
Score = 236 bits (602), Expect = 2e-60, Method: Composition-based stats.
Identities = 114/142 (80%), Positives = 118/142 (83%), Gaps = 13/142 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
+DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 101 IDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSATA 160
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRRLGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKD
Sbjct: 161 TRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKD 210
Query: 118 GIQTRKRKPKKQGGGGGGGSGG 139
GIQTRKRKPKK G G
Sbjct: 211 GIQTRKRKPKKSNSSTDGAKDG 232
>gi|156538733|ref|XP_001607848.1| PREDICTED: GATA-binding factor 5-A-like [Nasonia vitripennis]
Length = 442
Score = 236 bits (601), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 110/128 (85%), Positives = 115/128 (89%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQF EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHK+NGMNRPL+KP+KRLTATRR
Sbjct: 165 MDFQFSEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKINGMNRPLIKPTKRLTATRR 224
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKDGIQ
Sbjct: 225 LGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKDGIQ 274
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 275 TRKRKPKK 282
>gi|307198085|gb|EFN79138.1| GATA-binding factor A [Harpegnathos saltator]
Length = 375
Score = 236 bits (601), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 115/144 (79%), Positives = 118/144 (81%), Gaps = 15/144 (10%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL----- 55
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL
Sbjct: 161 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLMSEFQ 220
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
TATRRLGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMR
Sbjct: 221 TATRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMR 270
Query: 116 KDGIQTRKRKPKKQGGGGGGGSGG 139
KDGIQTRKRKPKK S G
Sbjct: 271 KDGIQTRKRKPKKTPEPNARSSTG 294
>gi|225544221|gb|ACN91531.1| pnr [Ceratitis capitata]
Length = 539
Score = 235 bits (600), Expect = 4e-60, Method: Composition-based stats.
Identities = 112/131 (85%), Positives = 116/131 (88%), Gaps = 13/131 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
+DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 101 IDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSATA 160
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRRLGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VN RPLAMRKD
Sbjct: 161 TRRLGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPLAMRKD 210
Query: 118 GIQTRKRKPKK 128
GIQTRKRKPKK
Sbjct: 211 GIQTRKRKPKK 221
>gi|350423466|ref|XP_003493491.1| PREDICTED: GATA-binding factor 5-A-like, partial [Bombus impatiens]
Length = 317
Score = 235 bits (599), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 111/128 (86%), Positives = 114/128 (89%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 45 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 104
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGT TTLWRRNN+GEPVCNACGLY+KLH VN RP AMRKDGIQ
Sbjct: 105 LGLCCTNCGTLTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPRAMRKDGIQ 154
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 155 TRKRKPKK 162
>gi|242005616|ref|XP_002423660.1| transcription factor GATA-4, putative [Pediculus humanus corporis]
gi|212506820|gb|EEB10922.1| transcription factor GATA-4, putative [Pediculus humanus corporis]
Length = 231
Score = 231 bits (590), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 110/128 (85%), Positives = 116/128 (90%), Gaps = 10/128 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
+DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR+TAT+R
Sbjct: 84 LDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRMTATKR 143
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
LGLCCTNCGTR TTLWRRNN+GEPVCNACGLY+KLH +N RPLAMRKDGIQ
Sbjct: 144 LGLCCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGIN----------RPLAMRKDGIQ 193
Query: 121 TRKRKPKK 128
TRKRKPKK
Sbjct: 194 TRKRKPKK 201
>gi|119220894|gb|ABL61528.1| GATA transcription factor pannier [Drosophila quadrilineata]
Length = 344
Score = 231 bits (590), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 113/131 (86%), Positives = 116/131 (88%), Gaps = 13/131 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TA 57
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRL TA
Sbjct: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLVSATA 60
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
TRR+GLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPLAMRKD
Sbjct: 61 TRRMGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPLAMRKD 110
Query: 118 GIQTRKRKPKK 128
GIQTRKRKPKK
Sbjct: 111 GIQTRKRKPKK 121
>gi|328721865|ref|XP_001943353.2| PREDICTED: endothelial transcription factor GATA-2-like
[Acyrthosiphon pisum]
Length = 386
Score = 228 bits (582), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 111/134 (82%), Positives = 113/134 (84%), Gaps = 15/134 (11%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTA--- 57
MD QFG+GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKP+KRL A
Sbjct: 157 MDLQFGDGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPAKRLVAFET 216
Query: 58 --TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
RRLGL CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLH VN RPL MR
Sbjct: 217 ASNRRLGLSCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHGVN----------RPLTMR 266
Query: 116 KDGIQTRKRKPKKQ 129
KDGIQTRKRKPKKQ
Sbjct: 267 KDGIQTRKRKPKKQ 280
>gi|162416003|gb|ABX89306.1| pannier protein, partial [Megaselia abdita]
Length = 206
Score = 218 bits (556), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 103/119 (86%), Positives = 106/119 (89%), Gaps = 10/119 (8%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KPSKRLTATRR
Sbjct: 98 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPSKRLTATRR 157
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
LGL CTNCGTR TTLWRRNN+GEPVCNACGLY+KLH VNRPLAM RKDGI
Sbjct: 158 LGLSCTNCGTRTTTLWRRNNEGEPVCNACGLYFKLHGVNRPLAM----------RKDGI 206
>gi|238863844|gb|ACR66216.1| transcription factor GATA456a, partial [Branchiostoma floridae]
Length = 424
Score = 210 bits (535), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 99/131 (75%), Positives = 108/131 (82%), Gaps = 10/131 (7%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL C
Sbjct: 289 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLIKPQRRLSASRRVGLQC 348
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRNN+GEPVCNACGLY+KLHNVN RPLAM+KDGIQTRKRK
Sbjct: 349 ANCRTTQTTLWRRNNEGEPVCNACGLYFKLHNVN----------RPLAMKKDGIQTRKRK 398
Query: 126 PKKQGGGGGGG 136
PK G GGG
Sbjct: 399 PKTLNKGKGGG 409
>gi|284005110|ref|NP_001164701.1| gata4 transcription factor [Saccoglossus kowalevskii]
gi|283462246|gb|ADB22417.1| gata4 transcription factor [Saccoglossus kowalevskii]
Length = 516
Score = 208 bits (529), Expect = 6e-52, Method: Composition-based stats.
Identities = 94/130 (72%), Positives = 105/130 (80%), Gaps = 10/130 (7%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KP +RL+A+RR+GL C
Sbjct: 295 AEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPQRRLSASRRVGLQC 354
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRNN+GEPVCNACGLY+KLH+VN RPLA++KD IQTRKRK
Sbjct: 355 ANCHTTQTTLWRRNNEGEPVCNACGLYFKLHSVN----------RPLALKKDTIQTRKRK 404
Query: 126 PKKQGGGGGG 135
PK G
Sbjct: 405 PKNTNSDKNG 414
>gi|357619365|gb|EHJ71973.1| hypothetical protein KGM_02427 [Danaus plexippus]
Length = 355
Score = 206 bits (525), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 98/128 (76%), Positives = 106/128 (82%), Gaps = 10/128 (7%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
+ EGRECVNCGA +TPLWRRD TGHYLCNACGLYHK+NG+NRPLVKPSKRL+A RR G
Sbjct: 76 YEEGRECVNCGANNTPLWRRDSTGHYLCNACGLYHKINGVNRPLVKPSKRLSAARRHGQS 135
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNCG+R TTLWRRNN+GEPVCNACGLYYKLH +N RPLAMRKDGIQTRKR
Sbjct: 136 CTNCGSRNTTLWRRNNEGEPVCNACGLYYKLHGIN----------RPLAMRKDGIQTRKR 185
Query: 125 KPKKQGGG 132
KPKK G
Sbjct: 186 KPKKSANG 193
>gi|332016551|gb|EGI57432.1| Transcription factor GATA-4 [Acromyrmex echinatior]
Length = 833
Score = 206 bits (524), Expect = 3e-51, Method: Composition-based stats.
Identities = 94/123 (76%), Positives = 101/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL+ATRR+GL
Sbjct: 564 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSATRRVGLS 623
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T MT+LWRRN GEPVCNACGLY+KLH VN RP M+KD IQTRKR
Sbjct: 624 CSNCETTMTSLWRRNAQGEPVCNACGLYFKLHGVN----------RPSTMKKDSIQTRKR 673
Query: 125 KPK 127
KPK
Sbjct: 674 KPK 676
>gi|328715574|ref|XP_003245664.1| PREDICTED: GATA-binding factor A-like, partial [Acyrthosiphon
pisum]
Length = 232
Score = 206 bits (523), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 102/134 (76%), Positives = 109/134 (81%), Gaps = 15/134 (11%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL----- 55
MD QFGEGRECVNCGAISTPLWRRDG GHYLCNACGLYHKMNGMNRP +P+KRL
Sbjct: 3 MDLQFGEGRECVNCGAISTPLWRRDGCGHYLCNACGLYHKMNGMNRPSGRPAKRLGAFET 62
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
++RR G+ CTNCGTRMTTLWRR+NDG+PVCNACGLYYKLH N RPL MR
Sbjct: 63 VSSRRYGVSCTNCGTRMTTLWRRDNDGKPVCNACGLYYKLHGFN----------RPLRMR 112
Query: 116 KDGIQTRKRKPKKQ 129
KDGIQTRKRKPKKQ
Sbjct: 113 KDGIQTRKRKPKKQ 126
>gi|347967443|ref|XP_003436067.1| AGAP002238-PB [Anopheles gambiae str. PEST]
gi|333466293|gb|EGK96194.1| AGAP002238-PB [Anopheles gambiae str. PEST]
Length = 1157
Score = 204 bits (520), Expect = 6e-51, Method: Composition-based stats.
Identities = 95/132 (71%), Positives = 103/132 (78%), Gaps = 10/132 (7%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL++ RR+GL
Sbjct: 758 FTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSSARRVGLQ 817
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T T+LWRRN GEPVCNACGLYYKLHNVN RPLAM+KD IQ+RKR
Sbjct: 818 CSNCNTTNTSLWRRNQVGEPVCNACGLYYKLHNVN----------RPLAMKKDNIQSRKR 867
Query: 125 KPKKQGGGGGGG 136
KPK G G
Sbjct: 868 KPKGSKNSDGNG 879
>gi|56849575|gb|AAW31646.1| GATAa2 transcription factor [Aedes aegypti]
Length = 1034
Score = 204 bits (520), Expect = 8e-51, Method: Composition-based stats.
Identities = 94/134 (70%), Positives = 102/134 (76%), Gaps = 14/134 (10%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL----TATRR 60
F EGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL ++ RR
Sbjct: 612 FTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLVKEPSSARR 671
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C+NC T T+LWRRN GEPVCNACGLYYKLHNVN RPLAM+KD IQ
Sbjct: 672 VGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVN----------RPLAMKKDNIQ 721
Query: 121 TRKRKPKKQGGGGG 134
+RKRKPK G
Sbjct: 722 SRKRKPKGSKNADG 735
>gi|157125536|ref|XP_001654375.1| transcription factor GATA-4 (GATA binding factor-4) [Aedes aegypti]
gi|108873610|gb|EAT37835.1| AAEL010222-PA [Aedes aegypti]
Length = 1034
Score = 204 bits (519), Expect = 1e-50, Method: Composition-based stats.
Identities = 93/127 (73%), Positives = 101/127 (79%), Gaps = 14/127 (11%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL----TATRR 60
F EGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL ++ RR
Sbjct: 612 FTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLVKEPSSARR 671
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C+NC T T+LWRRN GEPVCNACGLYYKLHNVN RPLAM+KD IQ
Sbjct: 672 VGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVN----------RPLAMKKDNIQ 721
Query: 121 TRKRKPK 127
+RKRKPK
Sbjct: 722 SRKRKPK 728
>gi|7649393|emb|CAB88873.1| GATA transcription factor [Aedes aegypti]
Length = 868
Score = 204 bits (519), Expect = 1e-50, Method: Composition-based stats.
Identities = 93/127 (73%), Positives = 101/127 (79%), Gaps = 14/127 (11%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL----TATRR 60
F EGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL ++ RR
Sbjct: 613 FTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLVKEPSSARR 672
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C+NC T T+LWRRN GEPVCNACGLYYKLHNVN RPLAM+KD IQ
Sbjct: 673 VGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVN----------RPLAMKKDNIQ 722
Query: 121 TRKRKPK 127
+RKRKPK
Sbjct: 723 SRKRKPK 729
>gi|117276623|gb|ABK32792.1| GATA transcription factor 456 [Platynereis dumerilii]
Length = 364
Score = 204 bits (519), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 93/121 (76%), Positives = 104/121 (85%), Gaps = 10/121 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 254 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGLNRPLIKPQRRLSASRRVGLS 313
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRNN+GEPVCNACGLYYKLH VN RPLAM+K+GIQT+K+
Sbjct: 314 CANCHTTTTTLWRRNNEGEPVCNACGLYYKLHGVN----------RPLAMKKEGIQTKKK 363
Query: 125 K 125
K
Sbjct: 364 K 364
>gi|147906903|ref|NP_001081962.1| GATA-binding factor 5-A [Xenopus laevis]
gi|1169806|sp|P43695.1|GAT5A_XENLA RecName: Full=GATA-binding factor 5-A; AltName: Full=Transcription
factor xGATA-5A
gi|311050|gb|AAA63686.1| transcription factor xGATA-5a [Xenopus laevis]
gi|54648481|gb|AAH84985.1| GATA-5a protein [Xenopus laevis]
Length = 390
Score = 204 bits (518), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 96/127 (75%), Positives = 105/127 (82%), Gaps = 10/127 (7%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KP KRL+++RR GLCC
Sbjct: 178 GEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPQKRLSSSRRAGLCC 237
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 238 TNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 287
Query: 126 PKKQGGG 132
PK G G
Sbjct: 288 PKNIGKG 294
>gi|307186218|gb|EFN71902.1| Transcription factor BCFI [Camponotus floridanus]
Length = 830
Score = 204 bits (518), Expect = 1e-50, Method: Composition-based stats.
Identities = 93/123 (75%), Positives = 100/123 (81%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL+A+RR GL
Sbjct: 561 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASRRAGLS 620
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T MT+LWRRN GEPVCNACGLY+KLH VN RP M+KD IQTRKR
Sbjct: 621 CSNCETTMTSLWRRNASGEPVCNACGLYFKLHGVN----------RPSTMKKDSIQTRKR 670
Query: 125 KPK 127
KPK
Sbjct: 671 KPK 673
>gi|307198088|gb|EFN79141.1| Transcription factor GATA-4 [Harpegnathos saltator]
Length = 785
Score = 204 bits (518), Expect = 1e-50, Method: Composition-based stats.
Identities = 94/123 (76%), Positives = 101/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
+ EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL+ATRR+GL
Sbjct: 561 YTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSATRRMGLA 620
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T MT+LWRRN GEPVCNACGLYYKLH VN RP M+KD IQTRKR
Sbjct: 621 CSNCETTMTSLWRRNTMGEPVCNACGLYYKLHGVN----------RPSTMKKDSIQTRKR 670
Query: 125 KPK 127
KPK
Sbjct: 671 KPK 673
>gi|340720273|ref|XP_003398565.1| PREDICTED: hypothetical protein LOC100651336 [Bombus terrestris]
Length = 985
Score = 203 bits (517), Expect = 1e-50, Method: Composition-based stats.
Identities = 93/123 (75%), Positives = 101/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL+A+RR+G
Sbjct: 690 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASRRVGTS 749
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T MT+LWRRN GEPVCNACGLY+KLH VN RP AM+KD IQTRKR
Sbjct: 750 CSNCQTTMTSLWRRNTLGEPVCNACGLYFKLHGVN----------RPHAMKKDSIQTRKR 799
Query: 125 KPK 127
KPK
Sbjct: 800 KPK 802
>gi|33416800|gb|AAH55963.1| Gata-5b protein [Xenopus laevis]
Length = 388
Score = 202 bits (514), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 95/127 (74%), Positives = 105/127 (82%), Gaps = 10/127 (7%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL+++RR GLCC
Sbjct: 179 GEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSSRRAGLCC 238
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 239 TNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 288
Query: 126 PKKQGGG 132
PK G G
Sbjct: 289 PKNIGKG 295
>gi|147905907|ref|NP_001085355.1| GATA binding protein 4 [Xenopus laevis]
gi|49256044|gb|AAH71122.1| MGC81427 protein [Xenopus laevis]
Length = 394
Score = 202 bits (514), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 93/124 (75%), Positives = 103/124 (83%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 179 DFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGL 238
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CC NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 239 CCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 288
Query: 124 RKPK 127
RKPK
Sbjct: 289 RKPK 292
>gi|148230811|ref|NP_001079831.1| GATA-binding factor 5-B [Xenopus laevis]
gi|1169807|sp|P43696.1|GAT5B_XENLA RecName: Full=GATA-binding factor 5-B; AltName: Full=Transcription
factor xGATA-5B
gi|311052|gb|AAA63687.1| transcription factor xGATA-5b [Xenopus laevis]
Length = 388
Score = 202 bits (514), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 95/127 (74%), Positives = 105/127 (82%), Gaps = 10/127 (7%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL+++RR GLCC
Sbjct: 179 GEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSSRRAGLCC 238
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 239 TNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 288
Query: 126 PKKQGGG 132
PK G G
Sbjct: 289 PKNIGKG 295
>gi|347967445|ref|XP_307946.4| AGAP002238-PA [Anopheles gambiae str. PEST]
gi|333466292|gb|EAA03724.4| AGAP002238-PA [Anopheles gambiae str. PEST]
Length = 542
Score = 202 bits (514), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 95/132 (71%), Positives = 103/132 (78%), Gaps = 10/132 (7%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL++ RR+GL
Sbjct: 143 FTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSSARRVGLQ 202
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T T+LWRRN GEPVCNACGLYYKLHNVN RPLAM+KD IQ+RKR
Sbjct: 203 CSNCNTTNTSLWRRNQVGEPVCNACGLYYKLHNVN----------RPLAMKKDNIQSRKR 252
Query: 125 KPKKQGGGGGGG 136
KPK G G
Sbjct: 253 KPKGSKNSDGNG 264
>gi|291045150|dbj|BAI82362.1| GATA transcription factor [Haemaphysalis longicornis]
Length = 319
Score = 202 bits (514), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 93/135 (68%), Positives = 109/135 (80%), Gaps = 10/135 (7%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
FGEGRECVNCG+ISTPLWRRDGTGHYLCNACGLY+KMNG++RP++K +RL+A+RR+GL
Sbjct: 12 FGEGRECVNCGSISTPLWRRDGTGHYLCNACGLYNKMNGVHRPVIKTPRRLSASRRVGLT 71
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T T+LWRRNN GEPVCNACGLY++LH VN RPLAM+KD IQTRKR
Sbjct: 72 CSNCQTGTTSLWRRNNVGEPVCNACGLYFRLHGVN----------RPLAMKKDSIQTRKR 121
Query: 125 KPKKQGGGGGGGSGG 139
KPK GG G + G
Sbjct: 122 KPKTAASGGSGATNG 136
>gi|327288072|ref|XP_003228752.1| PREDICTED: transcription factor GATA-4-like [Anolis carolinensis]
Length = 411
Score = 201 bits (512), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 102/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 182 YSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRIGLM 241
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQTRKR
Sbjct: 242 CANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQTRKR 291
Query: 125 KPK 127
KPK
Sbjct: 292 KPK 294
>gi|300797798|ref|NP_001179806.1| transcription factor GATA-4 [Bos taurus]
gi|296484958|tpg|DAA27073.1| TPA: GATA binding protein 4 [Bos taurus]
Length = 442
Score = 201 bits (512), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|395842329|ref|XP_003793970.1| PREDICTED: transcription factor GATA-4 [Otolemur garnettii]
Length = 442
Score = 201 bits (511), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 206 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 265
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 266 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 315
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 316 TRKRKPK 322
>gi|291385798|ref|XP_002709484.1| PREDICTED: GATA-4 zinc-finger transcription factor [Oryctolagus
cuniculus]
Length = 443
Score = 201 bits (511), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|297682265|ref|XP_002818845.1| PREDICTED: transcription factor GATA-4 isoform 1 [Pongo abelii]
Length = 443
Score = 201 bits (511), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 208 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 267
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 268 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 317
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 318 TRKRKPK 324
>gi|58332620|ref|NP_001011384.1| GATA binding protein 5 [Xenopus (Silurana) tropicalis]
gi|56789762|gb|AAH88567.1| GATA binding protein 5 [Xenopus (Silurana) tropicalis]
Length = 389
Score = 201 bits (511), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 104/127 (81%), Gaps = 10/127 (7%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL+++RR GLCC
Sbjct: 179 GEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSSRRAGLCC 238
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 239 TNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 288
Query: 126 PKKQGGG 132
PK G
Sbjct: 289 PKNVSKG 295
>gi|508484|gb|AAA58496.1| putative [Homo sapiens]
Length = 442
Score = 201 bits (511), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|114326345|ref|NP_001041577.1| transcription factor GATA-4 [Canis lupus familiaris]
gi|122142221|sp|Q0Q0E4.1|GATA4_CANFA RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|110554951|gb|ABG75570.1| GATA binding protein 4 [Canis lupus familiaris]
Length = 442
Score = 201 bits (511), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|219518827|gb|AAI43480.1| GATA4 protein [Homo sapiens]
Length = 443
Score = 201 bits (511), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 208 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 267
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 268 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 317
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 318 TRKRKPK 324
>gi|395739374|ref|XP_003777251.1| PREDICTED: transcription factor GATA-4 [Pongo abelii]
Length = 442
Score = 201 bits (511), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|403307497|ref|XP_003944229.1| PREDICTED: transcription factor GATA-4 [Saimiri boliviensis
boliviensis]
Length = 442
Score = 201 bits (511), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|189237326|ref|XP_973133.2| PREDICTED: similar to GATA transcription factor [Tribolium
castaneum]
Length = 368
Score = 201 bits (511), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 94/123 (76%), Positives = 101/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL+A+RR+GL
Sbjct: 116 FTEGRECVNCGAIDTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASRRVGLT 175
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T+LWRRN GEPVCNACGLY+KLH VN RPLAM+KD IQTRKR
Sbjct: 176 CTNCHTSTTSLWRRNTVGEPVCNACGLYFKLHGVN----------RPLAMKKDSIQTRKR 225
Query: 125 KPK 127
KPK
Sbjct: 226 KPK 228
>gi|33188461|ref|NP_002043.2| transcription factor GATA-4 [Homo sapiens]
gi|215274105|sp|P43694.2|GATA4_HUMAN RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|75517731|gb|AAI01581.1| GATA binding protein 4 [Homo sapiens]
gi|85397947|gb|AAI05109.1| GATA binding protein 4 [Homo sapiens]
gi|219518812|gb|AAI43435.1| GATA binding protein 4 [Homo sapiens]
Length = 442
Score = 201 bits (511), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|387541612|gb|AFJ71433.1| transcription factor GATA-4 [Macaca mulatta]
Length = 442
Score = 201 bits (511), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|426358838|ref|XP_004046698.1| PREDICTED: transcription factor GATA-4 [Gorilla gorilla gorilla]
Length = 382
Score = 201 bits (511), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 147 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 206
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 207 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 256
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 257 TRKRKPK 263
>gi|332862667|ref|XP_528070.3| PREDICTED: transcription factor GATA-4 [Pan troglodytes]
Length = 408
Score = 201 bits (511), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 173 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 232
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 233 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 282
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 283 TRKRKPK 289
>gi|297298911|ref|XP_001087008.2| PREDICTED: transcription factor GATA-4 isoform 1 [Macaca mulatta]
Length = 443
Score = 201 bits (511), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 208 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 267
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 268 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 317
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 318 TRKRKPK 324
>gi|402877493|ref|XP_003902460.1| PREDICTED: transcription factor GATA-4 [Papio anubis]
Length = 443
Score = 201 bits (511), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 208 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 267
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 268 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 317
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 318 TRKRKPK 324
>gi|332244648|ref|XP_003271486.1| PREDICTED: transcription factor GATA-4 [Nomascus leucogenys]
Length = 442
Score = 201 bits (511), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|119586024|gb|EAW65620.1| GATA binding protein 4, isoform CRA_a [Homo sapiens]
Length = 416
Score = 201 bits (511), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 181 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 240
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 241 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 290
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 291 TRKRKPK 297
>gi|189442167|gb|AAI67280.1| GATA binding protein 4 [Xenopus (Silurana) tropicalis]
Length = 394
Score = 201 bits (510), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 102/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 180 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLS 239
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKR
Sbjct: 240 CANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKR 289
Query: 125 KPK 127
KPK
Sbjct: 290 KPK 292
>gi|119586025|gb|EAW65621.1| GATA binding protein 4, isoform CRA_b [Homo sapiens]
Length = 417
Score = 201 bits (510), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 182 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 241
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 242 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 291
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 292 TRKRKPK 298
>gi|17402599|dbj|BAB78731.1| transcription factor GATA-4 [Mus musculus]
Length = 442
Score = 201 bits (510), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|25282465|ref|NP_653331.1| transcription factor GATA-4 [Rattus norvegicus]
gi|1169847|sp|P46152.1|GATA4_RAT RecName: Full=Transcription factor GATA-4; AltName:
Full=DNA-binding protein GATA-GT2; AltName:
Full=GATA-binding factor 4
gi|437670|gb|AAA16159.1| DNA binding protein [Rattus norvegicus]
gi|149030263|gb|EDL85319.1| GATA binding protein 4 [Rattus norvegicus]
Length = 440
Score = 201 bits (510), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 206 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 265
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 266 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 315
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 316 TRKRKPK 322
>gi|397467354|ref|XP_003805387.1| PREDICTED: transcription factor GATA-4 [Pan paniscus]
Length = 386
Score = 201 bits (510), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 151 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 210
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 211 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 260
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 261 TRKRKPK 267
>gi|354481949|ref|XP_003503163.1| PREDICTED: transcription factor GATA-5 [Cricetulus griseus]
gi|344254951|gb|EGW11055.1| Transcription factor GATA-5 [Cricetulus griseus]
Length = 406
Score = 201 bits (510), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL++TRR GLCC
Sbjct: 192 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSTRRSGLCC 251
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 252 SNCHTATTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 301
Query: 126 PK 127
PK
Sbjct: 302 PK 303
>gi|170043918|ref|XP_001849614.1| GATAa2 transcription factor [Culex quinquefasciatus]
gi|167867189|gb|EDS30572.1| GATAa2 transcription factor [Culex quinquefasciatus]
Length = 496
Score = 201 bits (510), Expect = 1e-49, Method: Composition-based stats.
Identities = 94/130 (72%), Positives = 102/130 (78%), Gaps = 10/130 (7%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAI TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL++ RR+GL
Sbjct: 151 FTEGRECVNCGAIQTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSSARRVGLQ 210
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T T+LWRRN GEPVCNACGLYYKLHNVN RPLAM+KD IQ+RKR
Sbjct: 211 CSNCNTTNTSLWRRNQVGEPVCNACGLYYKLHNVN----------RPLAMKKDNIQSRKR 260
Query: 125 KPKKQGGGGG 134
KPK G
Sbjct: 261 KPKGSKNADG 270
>gi|49256546|gb|AAH71107.1| Gata4 protein [Xenopus laevis]
Length = 392
Score = 201 bits (510), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 102/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 178 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLS 237
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKR
Sbjct: 238 CANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKR 287
Query: 125 KPK 127
KPK
Sbjct: 288 KPK 290
>gi|46909569|ref|NP_032118.2| transcription factor GATA-4 [Mus musculus]
gi|5882286|gb|AAD55266.1|AF179424_1 transcription factor GATA-4 [Mus musculus]
gi|148704120|gb|EDL36067.1| GATA binding protein 4 [Mus musculus]
gi|223460984|gb|AAI37825.1| GATA binding protein 4 [Mus musculus]
Length = 441
Score = 201 bits (510), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 206 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 265
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 266 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 315
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 316 TRKRKPK 322
>gi|148235016|ref|NP_001084098.1| transcription factor GATA-4 [Xenopus laevis]
gi|2494683|sp|Q91677.1|GATA4_XENLA RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4; Short=xGATA-4
gi|1209878|gb|AAB05647.1| transcription factor xGATA-4 [Xenopus laevis]
Length = 392
Score = 201 bits (510), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 102/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 178 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLS 237
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKR
Sbjct: 238 CANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKR 287
Query: 125 KPK 127
KPK
Sbjct: 288 KPK 290
>gi|281347691|gb|EFB23275.1| hypothetical protein PANDA_012893 [Ailuropoda melanoleuca]
Length = 347
Score = 200 bits (509), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 112 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 171
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 172 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 221
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 222 TRKRKPK 228
>gi|301776575|ref|XP_002923708.1| PREDICTED: transcription factor GATA-4-like [Ailuropoda
melanoleuca]
Length = 353
Score = 200 bits (509), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 118 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 177
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 178 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 227
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 228 TRKRKPK 234
>gi|62858609|ref|NP_001016949.1| GATA binding protein 4 [Xenopus (Silurana) tropicalis]
gi|89266736|emb|CAJ83964.1| GATA binding protein 4 [Xenopus (Silurana) tropicalis]
Length = 394
Score = 200 bits (509), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 102/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 180 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLS 239
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKR
Sbjct: 240 CANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKR 289
Query: 125 KPK 127
KPK
Sbjct: 290 KPK 292
>gi|181337117|ref|NP_032119.2| transcription factor GATA-5 [Mus musculus]
gi|341940731|sp|P97489.2|GATA5_MOUSE RecName: Full=Transcription factor GATA-5; AltName:
Full=GATA-binding factor 5
gi|74204896|dbj|BAE20945.1| unnamed protein product [Mus musculus]
gi|74209194|dbj|BAE24979.1| unnamed protein product [Mus musculus]
gi|148675383|gb|EDL07330.1| GATA binding protein 5 [Mus musculus]
Length = 404
Score = 200 bits (509), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 92/122 (75%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 191 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSGLCC 250
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 251 SNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 300
Query: 126 PK 127
PK
Sbjct: 301 PK 302
>gi|37781291|gb|AAP35028.1| GATA transcription factor alpha [Patiria miniata]
Length = 644
Score = 200 bits (509), Expect = 1e-49, Method: Composition-based stats.
Identities = 94/124 (75%), Positives = 103/124 (83%), Gaps = 11/124 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATRRLGL 63
FG GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM+RPL+KP +RL + +RR G+
Sbjct: 354 FGIGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMSRPLIKPQRRLQSGSRREGI 413
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLYYKLH+VN RPLAM+KDGIQTRK
Sbjct: 414 VCANCHTTTTTLWRRNKEGEPVCNACGLYYKLHSVN----------RPLAMKKDGIQTRK 463
Query: 124 RKPK 127
RKPK
Sbjct: 464 RKPK 467
>gi|3183530|sp|Q08369.3|GATA4_MOUSE RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|1815679|gb|AAB42015.1| GATA-4 [Mus musculus]
Length = 440
Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 206 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 265
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 266 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 315
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 316 TRKRKPK 322
>gi|440913584|gb|ELR63022.1| Transcription factor GATA-4, partial [Bos grunniens mutus]
Length = 268
Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 33 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 92
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 93 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 142
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 143 TRKRKPK 149
>gi|444525948|gb|ELV14225.1| Transcription factor GATA-4 [Tupaia chinensis]
Length = 253
Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 18 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 77
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 78 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 127
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 128 TRKRKPK 134
>gi|66730290|ref|NP_001019487.1| transcription factor GATA-5 [Rattus norvegicus]
gi|55250430|gb|AAH85855.1| GATA binding protein 5 [Rattus norvegicus]
gi|149034029|gb|EDL88812.1| similar to Transcription factor GATA-5 (GATA binding factor-5)
[Rattus norvegicus]
Length = 404
Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 92/122 (75%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 191 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSGLCC 250
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 251 SNCHTATTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 300
Query: 126 PK 127
PK
Sbjct: 301 PK 302
>gi|383863205|ref|XP_003707072.1| PREDICTED: uncharacterized protein LOC100875205 [Megachile
rotundata]
Length = 941
Score = 200 bits (508), Expect = 2e-49, Method: Composition-based stats.
Identities = 92/123 (74%), Positives = 100/123 (81%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL+A+RR+G
Sbjct: 625 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASRRVGTS 684
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T MT+LWRRN GEPVCNACGLY+KLH VN RP M+KD IQTRKR
Sbjct: 685 CSNCQTTMTSLWRRNTLGEPVCNACGLYFKLHGVN----------RPQTMKKDSIQTRKR 734
Query: 125 KPK 127
KPK
Sbjct: 735 KPK 737
>gi|410956538|ref|XP_003984899.1| PREDICTED: transcription factor GATA-4 [Felis catus]
Length = 441
Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 206 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 265
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 266 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 315
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 316 TRKRKPK 322
>gi|390473525|ref|XP_003734617.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-4
[Callithrix jacchus]
Length = 379
Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 144 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 203
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 204 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 253
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 254 TRKRKPK 260
>gi|224048927|ref|XP_002186842.1| PREDICTED: transcription factor GATA-4 [Taeniopygia guttata]
Length = 411
Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 101/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL KP +RL+A+RR+GL
Sbjct: 181 YSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLFKPQRRLSASRRVGLS 240
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQTRKR
Sbjct: 241 CANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQTRKR 290
Query: 125 KPK 127
KPK
Sbjct: 291 KPK 293
>gi|126303987|ref|XP_001381638.1| PREDICTED: transcription factor GATA-4 [Monodelphis domestica]
Length = 451
Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 218 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 277
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 278 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 327
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 328 TRKRKPK 334
>gi|326916751|ref|XP_003204668.1| PREDICTED: transcription factor GATA-4-like [Meleagris gallopavo]
Length = 410
Score = 199 bits (507), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 101/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL KP +RL+A+RR+GL
Sbjct: 181 YSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLFKPQRRLSASRRVGLS 240
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQTRKR
Sbjct: 241 CANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQTRKR 290
Query: 125 KPK 127
KPK
Sbjct: 291 KPK 293
>gi|426222407|ref|XP_004005383.1| PREDICTED: transcription factor GATA-4 [Ovis aries]
Length = 353
Score = 199 bits (507), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 102/127 (80%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A+RR
Sbjct: 118 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSASRR 177
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 178 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 227
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 228 TRKRKPK 234
>gi|285157832|gb|ADC35039.1| GATA456a [Themiste lageniformis]
Length = 730
Score = 199 bits (507), Expect = 2e-49, Method: Composition-based stats.
Identities = 89/123 (72%), Positives = 100/123 (81%), Gaps = 16/123 (13%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL------TATR 59
EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KP++R+ + +R
Sbjct: 385 AEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPTRRMSNNVLQSGSR 444
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R+GL C NCGT TTLWRRNN+GEPVCNACGLYYKLH VN RP++M+KDGI
Sbjct: 445 RMGLQCANCGTSTTTLWRRNNEGEPVCNACGLYYKLHQVN----------RPMSMKKDGI 494
Query: 120 QTR 122
QTR
Sbjct: 495 QTR 497
>gi|15593990|emb|CAC69835.1| gata4 protein [Bos taurus]
Length = 266
Score = 199 bits (507), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 31 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 90
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 91 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 140
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 141 TRKRKPK 147
>gi|50745252|ref|XP_420041.1| PREDICTED: transcription factor GATA-4 [Gallus gallus]
Length = 410
Score = 199 bits (507), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 101/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL KP +RL+A+RR+GL
Sbjct: 181 YSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLFKPQRRLSASRRVGLS 240
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQTRKR
Sbjct: 241 CANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQTRKR 290
Query: 125 KPK 127
KPK
Sbjct: 291 KPK 293
>gi|47523656|ref|NP_999458.1| transcription factor GATA-4 [Sus scrofa]
gi|22651761|gb|AAM63546.1| transcription factor GATA-4 [Sus scrofa]
Length = 442
Score = 199 bits (507), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 93/127 (73%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG++RPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGISRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|328786707|ref|XP_001121273.2| PREDICTED: hypothetical protein LOC725422 [Apis mellifera]
Length = 942
Score = 199 bits (507), Expect = 2e-49, Method: Composition-based stats.
Identities = 92/123 (74%), Positives = 100/123 (81%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK +RL+A+RR+G
Sbjct: 612 FTEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLSASRRVGTS 671
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T MT+LWRRN GEPVCNACGLY+KLH VN RP M+KD IQTRKR
Sbjct: 672 CSNCQTTMTSLWRRNTLGEPVCNACGLYFKLHGVN----------RPHTMKKDSIQTRKR 721
Query: 125 KPK 127
KPK
Sbjct: 722 KPK 724
>gi|57546103|gb|AAW51922.1| GATA binding protein 4 [Homo sapiens]
Length = 442
Score = 199 bits (507), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 93/127 (73%), Positives = 102/127 (80%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 207 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 266
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T T LWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 267 VGLSCANCQTTTTALWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 316
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 317 TRKRKPK 323
>gi|338722454|ref|XP_001495866.2| PREDICTED: transcription factor GATA-4-like [Equus caballus]
Length = 315
Score = 199 bits (506), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 80 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 139
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 140 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 189
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 190 TRKRKPK 196
>gi|1845570|dbj|BAA11334.1| GATA-4 transcription factor [Homo sapiens]
gi|122893044|gb|ABM67539.1| Homo sapiens GATA4 [Shuttle vector pUCAG.GATA4]
Length = 439
Score = 199 bits (506), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 204 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 263
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 264 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 313
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 314 TRKRKPK 320
>gi|1169844|sp|P43691.1|GATA4_CHICK RecName: Full=Transcription factor GATA-4; AltName:
Full=GATA-binding factor 4
gi|511480|gb|AAA57503.1| GATA-4 transcription factor, partial [Gallus gallus]
Length = 380
Score = 199 bits (506), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 101/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL KP +RL+A+RR+GL
Sbjct: 151 YSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLFKPQRRLSASRRVGLS 210
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQTRKR
Sbjct: 211 CANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQTRKR 260
Query: 125 KPK 127
KPK
Sbjct: 261 KPK 263
>gi|301630516|ref|XP_002944363.1| PREDICTED: GATA-binding factor 5-B-like, partial [Xenopus
(Silurana) tropicalis]
Length = 217
Score = 199 bits (506), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 104/127 (81%), Gaps = 10/127 (7%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL+++RR GLCC
Sbjct: 7 GEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSSRRAGLCC 66
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 67 TNCHTSTTTLWRRNSEGEPVCNACGLYMKLHGV----------PRPLAMKKESIQTRKRK 116
Query: 126 PKKQGGG 132
PK G
Sbjct: 117 PKNVSKG 123
>gi|351695298|gb|EHA98216.1| Transcription factor GATA-4 [Heterocephalus glaber]
Length = 410
Score = 199 bits (506), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 175 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 234
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 235 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 284
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 285 TRKRKPK 291
>gi|193788522|dbj|BAG53416.1| unnamed protein product [Homo sapiens]
Length = 236
Score = 199 bits (506), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 1 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 60
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 61 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 110
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 111 TRKRKPK 117
>gi|111493951|gb|AAI05654.1| Gata5 protein [Mus musculus]
Length = 350
Score = 199 bits (506), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 92/122 (75%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 137 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSGLCC 196
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 197 SNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 246
Query: 126 PK 127
PK
Sbjct: 247 PK 248
>gi|1841756|gb|AAB47506.1| GATA-5 cardiac transcription factor [Mus musculus]
Length = 404
Score = 199 bits (506), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 91/122 (74%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 191 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSGLCC 250
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 251 SNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 300
Query: 126 PK 127
P+
Sbjct: 301 PE 302
>gi|157169239|gb|ABV25954.1| GATA-binding transcription factor B1 [Capitella teleta]
Length = 750
Score = 199 bits (505), Expect = 4e-49, Method: Composition-based stats.
Identities = 94/137 (68%), Positives = 102/137 (74%), Gaps = 23/137 (16%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT---------- 56
EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLVKP +R T
Sbjct: 400 EGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVKPQRRSTGGGSSTTSPV 459
Query: 57 ---ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLA 113
+RRLGL C NC T TTLWRRNN+GEPVCNACGLYYKLH V RP++
Sbjct: 460 PNLGSRRLGLQCANCSTTTTTLWRRNNEGEPVCNACGLYYKLHQV----------ARPIS 509
Query: 114 MRKDGIQTRKRKPKKQG 130
M+KDGIQTRKRKPK G
Sbjct: 510 MKKDGIQTRKRKPKGSG 526
>gi|344281251|ref|XP_003412393.1| PREDICTED: transcription factor GATA-4-like [Loxodonta africana]
Length = 611
Score = 199 bits (505), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 93/123 (75%), Positives = 102/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 211 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLS 270
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQTRKR
Sbjct: 271 CANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQTRKR 320
Query: 125 KPK 127
KPK
Sbjct: 321 KPK 323
>gi|395507323|ref|XP_003757975.1| PREDICTED: transcription factor GATA-4 isoform 2 [Sarcophilus
harrisii]
Length = 452
Score = 199 bits (505), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 219 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 278
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 279 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 328
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 329 TRKRKPK 335
>gi|395507321|ref|XP_003757974.1| PREDICTED: transcription factor GATA-4 isoform 1 [Sarcophilus
harrisii]
Length = 451
Score = 198 bits (504), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 218 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 277
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 278 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 327
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 328 TRKRKPK 334
>gi|354471588|ref|XP_003498023.1| PREDICTED: transcription factor GATA-4-like [Cricetulus griseus]
Length = 238
Score = 198 bits (504), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 1 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 60
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 61 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 110
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 111 TRKRKPK 117
>gi|242020217|ref|XP_002430552.1| GATA-binding factor, putative [Pediculus humanus corporis]
gi|212515716|gb|EEB17814.1| GATA-binding factor, putative [Pediculus humanus corporis]
Length = 409
Score = 198 bits (504), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 93/125 (74%), Positives = 100/125 (80%), Gaps = 10/125 (8%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
+ EGRECVNCG +STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK KRLT +RLG
Sbjct: 148 VDYSEGRECVNCGTVSTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPKRLTNNKRLG 207
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
L C+NC T T+LWRRN+ GEPVCNACGLYYKLH VN RPLAM+KD IQTR
Sbjct: 208 LQCSNCNTATTSLWRRNSLGEPVCNACGLYYKLHGVN----------RPLAMKKDSIQTR 257
Query: 123 KRKPK 127
KRKPK
Sbjct: 258 KRKPK 262
>gi|74145051|dbj|BAE22221.1| unnamed protein product [Mus musculus]
Length = 236
Score = 198 bits (504), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 1 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 60
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 61 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 110
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 111 TRKRKPK 117
>gi|111494048|gb|AAI05655.1| Gata5 protein [Mus musculus]
Length = 271
Score = 198 bits (504), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 92/122 (75%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 58 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSGLCC 117
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 118 SNCHTATTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 167
Query: 126 PK 127
PK
Sbjct: 168 PK 169
>gi|157169243|gb|ABV25956.1| GATA-binding transcription factor B3 [Capitella teleta]
Length = 372
Score = 198 bits (503), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 103/123 (83%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRP++KP KR+ A+RR+GL
Sbjct: 156 FCEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGLNRPMMKPQKRMCASRRMGLQ 215
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN DGEPVCNACGLY+KLH V+ RP++MRKDGIQTRKR
Sbjct: 216 CANCQTSTTTLWRRNADGEPVCNACGLYFKLHGVH----------RPMSMRKDGIQTRKR 265
Query: 125 KPK 127
KPK
Sbjct: 266 KPK 268
>gi|24647320|ref|NP_732098.1| serpent, isoform B [Drosophila melanogaster]
gi|23171429|gb|AAN13691.1| serpent, isoform B [Drosophila melanogaster]
Length = 1249
Score = 197 bits (500), Expect = 1e-48, Method: Composition-based stats.
Identities = 88/123 (71%), Positives = 97/123 (78%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAISTPLWRRD TGHYLCNACGLY KMNGMNRPL+K +RL+A++R GL
Sbjct: 728 FTEGRECVNCGAISTPLWRRDNTGHYLCNACGLYMKMNGMNRPLIKQPRRLSASKRAGLS 787
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T T+LWRRN GEPVCNACGLYYKLH+V RPL M+KD IQ RKR
Sbjct: 788 CSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVP----------RPLTMKKDTIQKRKR 837
Query: 125 KPK 127
KPK
Sbjct: 838 KPK 840
>gi|293345|gb|AAA37662.1| GATA-binding transcription factor [Mus musculus]
Length = 439
Score = 197 bits (500), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 205 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 264
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 265 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 314
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 315 TRKRKPK 321
>gi|344254252|gb|EGW10356.1| Endonuclease VIII-like 2 [Cricetulus griseus]
Length = 509
Score = 197 bits (500), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 93/123 (75%), Positives = 102/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 26 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLS 85
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQTRKR
Sbjct: 86 CANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQTRKR 135
Query: 125 KPK 127
KPK
Sbjct: 136 KPK 138
>gi|54262117|ref|NP_001005725.1| GATA transcription factor e [Strongylocentrotus purpuratus]
gi|52001519|gb|AAU21562.1| GATA transcription factor [Strongylocentrotus purpuratus]
Length = 567
Score = 197 bits (500), Expect = 1e-48, Method: Composition-based stats.
Identities = 97/143 (67%), Positives = 106/143 (74%), Gaps = 19/143 (13%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATRRLG 62
++G GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG NRPL+K +RL + +RR G
Sbjct: 255 EYGLGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGYNRPLIKNPRRLQSGSRREG 314
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NC T TTLWRRN DGEPVCNACGLY+KLH VN RPLAM+KDGIQTR
Sbjct: 315 ITCANCHTSTTTLWRRNKDGEPVCNACGLYFKLHGVN----------RPLAMKKDGIQTR 364
Query: 123 KRKPKKQGGG--------GGGGS 137
KRKPK G GGG S
Sbjct: 365 KRKPKNPNKGNQQSNARNGGGQS 387
>gi|111493996|gb|AAI05656.1| Gata5 protein [Mus musculus]
Length = 320
Score = 196 bits (499), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 91/122 (74%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 107 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRSGLCC 166
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN++G PVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 167 SNCHTATTTLWRRNSEGGPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 216
Query: 126 PK 127
PK
Sbjct: 217 PK 218
>gi|432946176|ref|XP_004083805.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-4
[Oryzias latipes]
Length = 388
Score = 196 bits (498), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 98/137 (71%), Positives = 107/137 (78%), Gaps = 13/137 (9%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 180 FAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLS 239
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+KDGIQTRKR
Sbjct: 240 CTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKDGIQTRKR 289
Query: 125 KPK---KQGGGGGGGSG 138
KPK K G G G
Sbjct: 290 KPKNLNKTQSGTPGSDG 306
>gi|410953434|ref|XP_003983375.1| PREDICTED: transcription factor GATA-5 [Felis catus]
Length = 391
Score = 196 bits (498), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 94/122 (77%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRLT++RR GLCC
Sbjct: 177 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLTSSRRAGLCC 236
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN DGEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 237 SNCHTTNTTLWRRNADGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 286
Query: 126 PK 127
PK
Sbjct: 287 PK 288
>gi|114319150|gb|ABI63575.1| GATA4 [Danio rerio]
Length = 352
Score = 196 bits (498), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 97/135 (71%), Positives = 106/135 (78%), Gaps = 10/135 (7%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLVKP +RL+A+RR+GL
Sbjct: 164 LGEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLVKPQRRLSASRRVGLS 223
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKR
Sbjct: 224 CTNCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKR 273
Query: 125 KPKKQGGGGGGGSGG 139
KPK G S G
Sbjct: 274 KPKNISKTKPGSSEG 288
>gi|120974910|gb|ABM46761.1| GATA4 [Gorilla gorilla]
Length = 176
Score = 196 bits (497), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 94/127 (74%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 1 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 60
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 61 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 110
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 111 TRKRKPK 117
>gi|54306071|ref|NP_571310.2| transcription factor GATA-5 [Danio rerio]
gi|30519592|emb|CAB43400.2| transcription factor gata5 [Danio rerio]
Length = 383
Score = 196 bits (497), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 95/132 (71%), Positives = 102/132 (77%), Gaps = 11/132 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT-RRLGLCC 65
EGRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL +T RR GLCC
Sbjct: 182 EGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLQSTSRRAGLCC 241
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 242 TNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 291
Query: 126 PKKQGGGGGGGS 137
PK GS
Sbjct: 292 PKMPKTKSSSGS 303
>gi|390179289|ref|XP_003736856.1| GA26230, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859789|gb|EIM52929.1| GA26230, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 1295
Score = 195 bits (496), Expect = 4e-48, Method: Composition-based stats.
Identities = 87/123 (70%), Positives = 97/123 (78%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAI TPLWRRD TGHYLCNACGLY KMNGMNRPL+K +RL+A++R GL
Sbjct: 755 FTEGRECVNCGAIQTPLWRRDNTGHYLCNACGLYMKMNGMNRPLIKQPRRLSASKRNGLS 814
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T T+LWRRN GEPVCNACGLY+KLH+V RPLAM+KD IQ RKR
Sbjct: 815 CSNCLTTYTSLWRRNPAGEPVCNACGLYFKLHSVT----------RPLAMKKDTIQKRKR 864
Query: 125 KPK 127
KPK
Sbjct: 865 KPK 867
>gi|94573490|gb|AAI16538.1| GATA-binding protein 5 [Danio rerio]
Length = 383
Score = 195 bits (496), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 95/132 (71%), Positives = 102/132 (77%), Gaps = 11/132 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT-RRLGLCC 65
EGRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL +T RR GLCC
Sbjct: 182 EGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLQSTSRRAGLCC 241
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 242 TNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 291
Query: 126 PKKQGGGGGGGS 137
PK GS
Sbjct: 292 PKMPKTKSSSGS 303
>gi|117276621|gb|ABK32791.1| GATA transcription factor 123 [Platynereis dumerilii]
Length = 523
Score = 195 bits (495), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 89/119 (74%), Positives = 97/119 (81%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 288 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGANRPLIKPKRRLSAARRAGTSCA 347
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NCGT TTLWRRN++G+PVCNACGLYYKLH+VN RPL M+KDGIQTR RK
Sbjct: 348 NCGTTTTTLWRRNHNGDPVCNACGLYYKLHSVN----------RPLTMKKDGIQTRNRK 396
>gi|348540838|ref|XP_003457894.1| PREDICTED: GATA-binding factor 5-A-like [Oreochromis niloticus]
Length = 383
Score = 194 bits (494), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 93/128 (72%), Positives = 102/128 (79%), Gaps = 11/128 (8%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATR 59
+D EGRECVNCG++STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL T +R
Sbjct: 176 IDDMPTEGRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLQTTSR 235
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R GLCCTNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ I
Sbjct: 236 RAGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESI 285
Query: 120 QTRKRKPK 127
QTRKRKPK
Sbjct: 286 QTRKRKPK 293
>gi|410916493|ref|XP_003971721.1| PREDICTED: transcription factor GATA-4-like [Takifugu rubripes]
Length = 386
Score = 194 bits (493), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 97/138 (70%), Positives = 108/138 (78%), Gaps = 13/138 (9%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 179 DYAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGL 238
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CTNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+ K+GIQTRK
Sbjct: 239 SCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMK----------KEGIQTRK 288
Query: 124 RKPK---KQGGGGGGGSG 138
RKPK K G GG G
Sbjct: 289 RKPKNLNKIQPGTPGGEG 306
>gi|348524793|ref|XP_003449907.1| PREDICTED: transcription factor GATA-4 [Oreochromis niloticus]
Length = 389
Score = 194 bits (492), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 93/124 (75%), Positives = 103/124 (83%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 179 DFAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGL 238
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CTNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 239 SCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 288
Query: 124 RKPK 127
RKPK
Sbjct: 289 RKPK 292
>gi|18858735|ref|NP_571311.1| transcription factor GATA-4 [Danio rerio]
gi|6524988|gb|AAF15275.1|AF191577_1 zinc finger transcription factor Gata4 [Danio rerio]
Length = 338
Score = 194 bits (492), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 95/137 (69%), Positives = 104/137 (75%), Gaps = 10/137 (7%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
F GRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLVKP +RL+A+RR+G
Sbjct: 159 FDDLRGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLVKPQRRLSASRRVG 218
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
L CTNC TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTR
Sbjct: 219 LSCTNCQPTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTR 268
Query: 123 KRKPKKQGGGGGGGSGG 139
KRKPK G S G
Sbjct: 269 KRKPKNISKTKPGSSEG 285
>gi|47206170|emb|CAF89866.1| unnamed protein product [Tetraodon nigroviridis]
Length = 306
Score = 193 bits (490), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 92/127 (72%), Positives = 100/127 (78%), Gaps = 9/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
MD EGRECVNCG++STPLWRRDGTGHYLCNACGLYHKMNG NRPLV+P +R +RR
Sbjct: 188 MDDGSTEGRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGSNRPLVRPQRRPAPSRR 247
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
GLCCTNCGT TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQ
Sbjct: 248 AGLCCTNCGTSTTTLWRRNAEGEPVCNACGLYMKLHGVGH---------RPLAMKKESIQ 298
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 299 TRKRKPK 305
>gi|431894591|gb|ELK04391.1| Transcription factor GATA-5 [Pteropus alecto]
Length = 243
Score = 193 bits (490), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 93/135 (68%), Positives = 105/135 (77%), Gaps = 11/135 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRD TGHYLCNACGLYHKMNG+NRPL++P KRL+++RR GLCC
Sbjct: 30 GEGRECVNCGALSTPLWRRDTTGHYLCNACGLYHKMNGVNRPLLRPQKRLSSSRRAGLCC 89
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRR+ +G+PVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 90 TNCHTATTTLWRRDAEGQPVCNACGLYAKLHGVP----------RPLAMKKESIQTRKRK 139
Query: 126 PKKQG-GGGGGGSGG 139
PK G GS G
Sbjct: 140 PKNAAKSKGSSGSTG 154
>gi|355697729|gb|EHH28277.1| GATA-binding factor 4 [Macaca mulatta]
Length = 304
Score = 193 bits (490), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 93/127 (73%), Positives = 101/127 (79%), Gaps = 12/127 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL +RR
Sbjct: 72 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRL--SRR 129
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 130 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 179
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 180 TRKRKPK 186
>gi|432865839|ref|XP_004070639.1| PREDICTED: GATA-binding factor 5-A-like [Oryzias latipes]
Length = 217
Score = 192 bits (489), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 95/138 (68%), Positives = 104/138 (75%), Gaps = 11/138 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT-R 59
+D EGRECVNCG++STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL +T R
Sbjct: 10 LDDMPTEGRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLQSTSR 69
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R GLCCTNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ I
Sbjct: 70 RAGLCCTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESI 119
Query: 120 QTRKRKPKKQGGGGGGGS 137
QTRKRKPK GS
Sbjct: 120 QTRKRKPKLPKNRTSAGS 137
>gi|285157785|gb|ADC35037.1| GATA123a [Chaetopterus sp. MB-2010a]
Length = 489
Score = 192 bits (489), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 89/118 (75%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 264 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGANRPLIKPKRRLSAARRAGTSCA 323
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NCGT TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+KDGIQTR R
Sbjct: 324 NCGTSTTTLWRRNPNGDPVCNACGLYYKLHNVN----------RPLTMKKDGIQTRNR 371
>gi|156147115|gb|ABU53701.1| GATA4 [Oreochromis niloticus]
Length = 392
Score = 192 bits (489), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 92/124 (74%), Positives = 102/124 (82%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
F EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 179 DFAEGRECVNCGAMPTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGL 238
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CTNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 239 SCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 288
Query: 124 RKPK 127
RKPK
Sbjct: 289 RKPK 292
>gi|444707682|gb|ELW48920.1| Transcription factor GATA-5 [Tupaia chinensis]
Length = 242
Score = 192 bits (488), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 91/127 (71%), Positives = 103/127 (81%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
++ GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHK+NG+NRPL++P KRL ++RR
Sbjct: 17 LEESPGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKVNGVNRPLLRPQKRLPSSRR 76
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
GLCCTNC T TTLWRR+ DGEPVCNACGLY KLH V RPLAM+K+ IQ
Sbjct: 77 AGLCCTNCHTSCTTLWRRSADGEPVCNACGLYMKLHGVP----------RPLAMKKERIQ 126
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 127 TRKRKPK 133
>gi|443718380|gb|ELU09032.1| hypothetical protein CAPTEDRAFT_121659, partial [Capitella teleta]
Length = 144
Score = 192 bits (487), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 103/123 (83%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRP++KP KR+ A+RR+GL
Sbjct: 1 FCEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGLNRPMMKPQKRMCASRRMGLQ 60
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN DGEPVCNACGLY+KLH V+ RP++MRKDGIQTRKR
Sbjct: 61 CANCQTSTTTLWRRNADGEPVCNACGLYFKLHGVH----------RPMSMRKDGIQTRKR 110
Query: 125 KPK 127
KPK
Sbjct: 111 KPK 113
>gi|313224233|emb|CBY20022.1| unnamed protein product [Oikopleura dioica]
Length = 483
Score = 192 bits (487), Expect = 4e-47, Method: Composition-based stats.
Identities = 88/119 (73%), Positives = 96/119 (80%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP KRL+A RR G C+
Sbjct: 256 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRPLIKPKKRLSAARRAGTSCS 315
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN++G+PVCNACGLYYKLH VN RPL M+K+GIQTR RK
Sbjct: 316 NCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVN----------RPLTMKKEGIQTRNRK 364
>gi|345314700|ref|XP_001514983.2| PREDICTED: transcription factor GATA-4-like, partial
[Ornithorhynchus anatinus]
Length = 315
Score = 192 bits (487), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 91/120 (75%), Positives = 100/120 (83%), Gaps = 10/120 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL C N
Sbjct: 22 GRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLSCAN 81
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
C T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQTRKRKPK
Sbjct: 82 CQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQTRKRKPK 131
>gi|355779507|gb|EHH63983.1| GATA-binding factor 4 [Macaca fascicularis]
Length = 253
Score = 192 bits (487), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 93/127 (73%), Positives = 101/127 (79%), Gaps = 12/127 (9%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL +RR
Sbjct: 21 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRL--SRR 78
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+GL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAMRK+GIQ
Sbjct: 79 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMRKEGIQ 128
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 129 TRKRKPK 135
>gi|307775397|ref|NP_001182721.1| transcription factor GATA-3 [Oncorhynchus mykiss]
gi|307342716|emb|CAR95099.1| GATA binding protein 3 [Oncorhynchus mykiss]
Length = 441
Score = 192 bits (487), Expect = 5e-47, Method: Composition-based stats.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 258 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 317
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR RK
Sbjct: 318 NCQTSTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNRK 366
>gi|285157858|gb|ADC35040.1| GATA456b [Themiste lageniformis]
Length = 496
Score = 192 bits (487), Expect = 5e-47, Method: Composition-based stats.
Identities = 91/131 (69%), Positives = 101/131 (77%), Gaps = 23/131 (17%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYL---------CNACGLYHKMNGMNRPLVKPSKRL 55
FGEGRECVNCGAISTPLWRRDGTGHYL CNACGLYHKMNG+NRPL+KP +RL
Sbjct: 224 FGEGRECVNCGAISTPLWRRDGTGHYLRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRL 283
Query: 56 ----TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
+A RR+GL C+NC T TTLWRRNN+GEPVCNACGLY+KLH V+ RP
Sbjct: 284 VNDHSANRRVGLQCSNCHTTTTTLWRRNNEGEPVCNACGLYFKLHGVS----------RP 333
Query: 112 LAMRKDGIQTR 122
L+MRKDGIQTR
Sbjct: 334 LSMRKDGIQTR 344
>gi|284520963|ref|NP_001165271.1| transcription factor GATA-3 [Salmo salar]
gi|180038038|gb|ACB87012.1| transcription factor GATA-3 [Salmo salar]
Length = 441
Score = 191 bits (485), Expect = 8e-47, Method: Composition-based stats.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 258 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 317
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR RK
Sbjct: 318 NCQTSTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNRK 366
>gi|301764531|ref|XP_002917681.1| PREDICTED: endothelial transcription factor GATA-2-like isoform 1
[Ailuropoda melanoleuca]
gi|281347007|gb|EFB22591.1| hypothetical protein PANDA_006028 [Ailuropoda melanoleuca]
Length = 480
Score = 191 bits (485), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 96/120 (80%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 349
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR RK
Sbjct: 350 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNRK 399
>gi|410951840|ref|XP_003982601.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Felis
catus]
Length = 480
Score = 191 bits (485), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 96/120 (80%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 349
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR RK
Sbjct: 350 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNRK 399
>gi|355689765|gb|AER98939.1| GATA binding protein 2 [Mustela putorius furo]
Length = 480
Score = 191 bits (484), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 96/120 (80%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 349
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR RK
Sbjct: 350 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNRK 399
>gi|47218982|emb|CAG02020.1| unnamed protein product [Tetraodon nigroviridis]
Length = 449
Score = 191 bits (484), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 96/120 (80%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 267 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 326
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLY+KLHNVN RPL M+K+GIQTR RK
Sbjct: 327 ANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVN----------RPLTMKKEGIQTRNRK 376
>gi|432858527|ref|XP_004068890.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Oryzias latipes]
Length = 456
Score = 191 bits (484), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 96/120 (80%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 266 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 325
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLY+KLHNVN RPL M+K+GIQTR RK
Sbjct: 326 ANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVN----------RPLTMKKEGIQTRNRK 375
>gi|348510371|ref|XP_003442719.1| PREDICTED: GATA-binding factor 2-like [Oreochromis niloticus]
Length = 458
Score = 191 bits (484), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 96/120 (80%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 268 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 327
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLY+KLHNVN RPL M+K+GIQTR RK
Sbjct: 328 ANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVN----------RPLTMKKEGIQTRNRK 377
>gi|432858529|ref|XP_004068891.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Oryzias latipes]
Length = 447
Score = 191 bits (484), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 96/120 (80%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 316
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLY+KLHNVN RPL M+K+GIQTR RK
Sbjct: 317 ANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVN----------RPLTMKKEGIQTRNRK 366
>gi|410920237|ref|XP_003973590.1| PREDICTED: GATA-binding factor 2-like [Takifugu rubripes]
Length = 457
Score = 190 bits (483), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 96/120 (80%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 267 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 326
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLY+KLHNVN RPL M+K+GIQTR RK
Sbjct: 327 ANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVN----------RPLTMKKEGIQTRNRK 376
>gi|301780742|ref|XP_002925788.1| PREDICTED: transcription factor GATA-5-like, partial [Ailuropoda
melanoleuca]
gi|281352342|gb|EFB27926.1| hypothetical protein PANDA_015338 [Ailuropoda melanoleuca]
Length = 292
Score = 190 bits (483), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 92/122 (75%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 79 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 138
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN DG+PVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 139 SNCRTTTTTLWRRNADGDPVCNACGLYTKLHGVP----------RPLAMKKESIQTRKRK 188
Query: 126 PK 127
PK
Sbjct: 189 PK 190
>gi|300797896|ref|NP_001179043.1| endothelial transcription factor GATA-2 [Bos taurus]
Length = 480
Score = 190 bits (483), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 88/119 (73%), Positives = 96/119 (80%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR RK
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNRK 399
>gi|51242135|ref|NP_001003797.1| GATA-binding factor 2 [Gallus gallus]
gi|326927940|ref|XP_003210145.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Meleagris
gallopavo]
gi|120958|sp|P23824.1|GATA2_CHICK RecName: Full=GATA-binding factor 2; Short=GATA-2; AltName:
Full=Transcription factor NF-E1b
gi|62966|emb|CAA40252.1| NF-E1 [Gallus gallus]
Length = 466
Score = 190 bits (483), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 276 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 335
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 336 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 384
>gi|50540446|ref|NP_001002689.1| GATA-binding protein 2b [Danio rerio]
gi|49900268|gb|AAH76462.1| GATA-binding protein 2b [Danio rerio]
gi|182890930|gb|AAI65802.1| Gata2b protein [Danio rerio]
Length = 427
Score = 190 bits (482), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 89/119 (74%), Positives = 96/119 (80%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL++P +RL+A+RR G CC
Sbjct: 260 SGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKRRLSASRRAGTCCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +GEPVCNACGLYYKLHNVN RPL M+KDGIQTR RK
Sbjct: 320 NCQTGTTTLWRRNANGEPVCNACGLYYKLHNVN----------RPLTMKKDGIQTRNRK 368
>gi|8648977|emb|CAB94842.1| GATA-4 zinc-finger transcription factor [Oryctolagus cuniculus]
Length = 116
Score = 190 bits (482), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 92/123 (74%), Positives = 102/123 (82%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL
Sbjct: 1 FSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLS 60
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +G+PVCNACGLY KLH V RPLAMRK+GIQTRKR
Sbjct: 61 CANCQTTTTTLWRRNAEGKPVCNACGLYMKLHGVP----------RPLAMRKEGIQTRKR 110
Query: 125 KPK 127
KPK
Sbjct: 111 KPK 113
>gi|15559217|ref|NP_254277.1| endothelial transcription factor GATA-2 [Rattus norvegicus]
gi|62286680|sp|Q924Y4.1|GATA2_RAT RecName: Full=Endothelial transcription factor GATA-2; AltName:
Full=GATA-binding protein 2
gi|14029161|gb|AAK51128.1| GATA2 [Rattus norvegicus]
gi|38197660|gb|AAH61745.1| GATA binding protein 2 [Rattus norvegicus]
gi|149036689|gb|EDL91307.1| GATA binding protein 2, isoform CRA_a [Rattus norvegicus]
gi|149036690|gb|EDL91308.1| GATA binding protein 2, isoform CRA_a [Rattus norvegicus]
gi|184185940|dbj|BAG30820.1| GATA-binding protein 2 [Rattus norvegicus]
gi|184185942|dbj|BAG30821.1| GATA-binding protein 2 [Rattus norvegicus]
Length = 480
Score = 189 bits (481), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|395733176|ref|XP_002813217.2| PREDICTED: endothelial transcription factor GATA-2 [Pongo abelii]
Length = 406
Score = 189 bits (481), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 216 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 275
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 276 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 324
>gi|335371115|gb|AEH57086.1| GATA123 [Bugula neritina]
Length = 281
Score = 189 bits (481), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 88/119 (73%), Positives = 94/119 (78%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG +RPL+KP +R +ATRR G CC
Sbjct: 76 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGSSRPLIKPKRRQSATRRAGTCCA 135
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NCGT TTLWRRN G+PVCNACGLYYKLHNVN RP M+KD IQTR RK
Sbjct: 136 NCGTFATTLWRRNQSGDPVCNACGLYYKLHNVN----------RPKTMKKDAIQTRNRK 184
>gi|18858733|ref|NP_571286.1| transcription factor GATA-3 [Danio rerio]
gi|3183010|sp|Q91428.1|GATA3_DANRE RecName: Full=Transcription factor GATA-3; AltName:
Full=GATA-binding factor 3
gi|1245717|gb|AAA93491.1| transcription factor [Danio rerio]
gi|190336965|gb|AAI62389.1| GATA-binding protein 3 [Danio rerio]
gi|190336980|gb|AAI62401.1| GATA-binding protein 3 [Danio rerio]
Length = 438
Score = 189 bits (481), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 87/120 (72%), Positives = 95/120 (79%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 251 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSC 310
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR RK
Sbjct: 311 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNRK 360
>gi|148234322|ref|NP_001084043.1| GATA-binding factor 2 [Xenopus laevis]
gi|80476387|gb|AAI08545.1| XGATA-2 protein [Xenopus laevis]
Length = 453
Score = 189 bits (480), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 263 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 322
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 323 ANCQTSTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 371
>gi|18033511|gb|AAL57180.1| transcription factor GATA-2 [Rattus norvegicus]
Length = 480
Score = 189 bits (480), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGSCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|204306643|gb|ACH99859.1| GATA-3 [Cyprinus carpio]
Length = 442
Score = 189 bits (480), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 87/120 (72%), Positives = 95/120 (79%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 255 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSC 314
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR RK
Sbjct: 315 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNRK 364
>gi|432917241|ref|XP_004079469.1| PREDICTED: transcription factor GATA-6-like [Oryzias latipes]
Length = 501
Score = 189 bits (480), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 98/124 (79%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
GE RECVNCG+ISTPLWRRDGTGH+LCNACGLY KMNG++RPL+KP KR + +RR+GL
Sbjct: 297 DMGESRECVNCGSISTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRTSTSRRIGL 356
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 357 SCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVP----------RPLAMKKEGIQTRK 406
Query: 124 RKPK 127
RKPK
Sbjct: 407 RKPK 410
>gi|12836018|dbj|BAB23463.1| unnamed protein product [Mus musculus]
Length = 480
Score = 189 bits (480), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|402887127|ref|XP_003906956.1| PREDICTED: endothelial transcription factor GATA-2 isoform 4 [Papio
anubis]
Length = 568
Score = 189 bits (480), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 378 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 437
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 438 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 486
>gi|296225992|ref|XP_002758738.1| PREDICTED: endothelial transcription factor GATA-2 [Callithrix
jacchus]
gi|403268240|ref|XP_003926186.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Saimiri boliviensis boliviensis]
gi|403268242|ref|XP_003926187.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Saimiri boliviensis boliviensis]
gi|403268244|ref|XP_003926188.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Saimiri boliviensis boliviensis]
Length = 480
Score = 189 bits (480), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 349
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 350 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|182996|gb|AAA35868.1| GATA-binding protein [Homo sapiens]
Length = 480
Score = 189 bits (480), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|301607037|ref|XP_002933110.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Xenopus (Silurana)
tropicalis]
Length = 451
Score = 189 bits (480), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 261 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 320
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 321 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 369
>gi|126336247|ref|XP_001366872.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Monodelphis
domestica]
Length = 473
Score = 189 bits (480), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 283 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 342
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 343 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 391
>gi|354482833|ref|XP_003503600.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Cricetulus griseus]
gi|344253365|gb|EGW09469.1| Endothelial transcription factor GATA-2 [Cricetulus griseus]
Length = 480
Score = 189 bits (480), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|126321769|ref|XP_001363044.1| PREDICTED: transcription factor GATA-6 [Monodelphis domestica]
Length = 590
Score = 189 bits (480), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 88/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ +TRRLGL
Sbjct: 378 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSTRRLGL 437
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 438 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 487
Query: 124 RKPK 127
RKPK
Sbjct: 488 RKPK 491
>gi|402887121|ref|XP_003906953.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Papio
anubis]
gi|402887123|ref|XP_003906954.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Papio
anubis]
Length = 480
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 349
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 350 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|109098094|ref|XP_001097801.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Macaca mulatta]
Length = 480
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 290 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 349
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 350 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|301607039|ref|XP_002933111.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Xenopus (Silurana)
tropicalis]
Length = 453
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 263 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 322
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 323 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 371
>gi|157278177|ref|NP_001098188.1| transcription factor GATA-3 [Oryzias latipes]
gi|62005802|dbj|BAD91312.1| transcription factor GATA-3 [Oryzias latipes]
Length = 440
Score = 189 bits (479), Expect = 4e-46, Method: Composition-based stats.
Identities = 86/119 (72%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 254 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 313
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLY+KLHN+N RPL M+K+GIQTR RK
Sbjct: 314 NCQTTTTTLWRRNANGDPVCNACGLYFKLHNIN----------RPLTMKKEGIQTRNRK 362
>gi|440913274|gb|ELR62746.1| Endothelial transcription factor GATA-2, partial [Bos grunniens
mutus]
Length = 385
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 88/119 (73%), Positives = 96/119 (80%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 223 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 282
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR RK
Sbjct: 283 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNRK 331
>gi|226530726|ref|NP_032116.4| endothelial transcription factor GATA-2 [Mus musculus]
gi|21264417|sp|O09100.2|GATA2_MOUSE RecName: Full=Endothelial transcription factor GATA-2; AltName:
Full=GATA-binding protein 2
gi|18181872|dbj|BAA19053.2| GATA-2 protein [Mus musculus]
gi|74191748|dbj|BAE32831.1| unnamed protein product [Mus musculus]
gi|74196104|dbj|BAE32970.1| unnamed protein product [Mus musculus]
gi|76827995|gb|AAI07010.1| GATA binding protein 2 [Mus musculus]
gi|148666827|gb|EDK99243.1| GATA binding protein 2 [Mus musculus]
Length = 480
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|133777354|gb|AAI07011.2| Gata2 protein [Mus musculus]
Length = 447
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 258 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 317
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 318 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 365
>gi|12835796|dbj|BAB23365.1| unnamed protein product [Mus musculus]
Length = 480
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|426341992|ref|XP_004036301.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Gorilla gorilla gorilla]
gi|426341996|ref|XP_004036303.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Gorilla gorilla gorilla]
Length = 480
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|149728223|ref|XP_001488214.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Equus
caballus]
Length = 480
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|30584557|gb|AAP36531.1| Homo sapiens hypothetical protein MGC2306 [synthetic construct]
gi|61371794|gb|AAX43732.1| GATA binding protein 2 [synthetic construct]
gi|61371798|gb|AAX43733.1| GATA binding protein 2 [synthetic construct]
Length = 481
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|339247095|ref|XP_003375181.1| GATA-binding factor 2 [Trichinella spiralis]
gi|316971560|gb|EFV55318.1| GATA-binding factor 2 [Trichinella spiralis]
Length = 215
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 88/119 (73%), Positives = 98/119 (82%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP KR +A++R G+ C
Sbjct: 2 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRPLIKPKKRSSASKRTGINCA 61
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NCGT TTLWRRN +G+PVCNACGLYYKLHNVN RPL+M+K+GIQTR RK
Sbjct: 62 NCGTNTTTLWRRNQNGDPVCNACGLYYKLHNVN----------RPLSMKKEGIQTRNRK 110
>gi|395847143|ref|XP_003796243.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Otolemur garnettii]
gi|395847147|ref|XP_003796245.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Otolemur garnettii]
Length = 480
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|53147411|dbj|BAD52301.1| transcription factor Gata-binding protein 2/3 [Eptatretus burgeri]
Length = 491
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 86/121 (71%), Positives = 96/121 (79%), Gaps = 10/121 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G
Sbjct: 295 LAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTS 354
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T +TTLWRRN +G+PVCNACGLY+KLHNVN RP+ M+K+GIQTR R
Sbjct: 355 CANCQTTITTLWRRNANGDPVCNACGLYFKLHNVN----------RPMTMKKEGIQTRNR 404
Query: 125 K 125
K
Sbjct: 405 K 405
>gi|20070352|ref|NP_116027.2| endothelial transcription factor GATA-2 isoform 1 [Homo sapiens]
gi|224611699|ref|NP_001139133.1| endothelial transcription factor GATA-2 isoform 1 [Homo sapiens]
gi|114589043|ref|XP_516728.2| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Pan
troglodytes]
gi|332261773|ref|XP_003279941.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Nomascus leucogenys]
gi|397518553|ref|XP_003829449.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Pan
paniscus]
gi|397518555|ref|XP_003829450.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Pan
paniscus]
gi|397518557|ref|XP_003829451.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3 [Pan
paniscus]
gi|229462971|sp|P23769.3|GATA2_HUMAN RecName: Full=Endothelial transcription factor GATA-2; AltName:
Full=GATA-binding protein 2
gi|15990463|gb|AAH15613.1| GATA binding protein 2 [Homo sapiens]
gi|30410997|gb|AAH51342.1| GATA binding protein 2 [Homo sapiens]
gi|119599719|gb|EAW79313.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|119599721|gb|EAW79315.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|119599722|gb|EAW79316.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|119599723|gb|EAW79317.1| GATA binding protein 2, isoform CRA_a [Homo sapiens]
gi|208966348|dbj|BAG73188.1| GATA binding protein 2 [synthetic construct]
gi|410209098|gb|JAA01768.1| GATA binding protein 2 [Pan troglodytes]
gi|410267030|gb|JAA21481.1| GATA binding protein 2 [Pan troglodytes]
gi|410288416|gb|JAA22808.1| GATA binding protein 2 [Pan troglodytes]
Length = 480
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|238863840|gb|ACR66214.1| transcription factor GATA123 [Branchiostoma floridae]
Length = 473
Score = 189 bits (479), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 88/119 (73%), Positives = 94/119 (78%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 272 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTQCA 331
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLH VN RPL M+KDGIQTR RK
Sbjct: 332 NCKTTTTTLWRRNQNGDPVCNACGLYYKLHAVN----------RPLTMKKDGIQTRNRK 380
>gi|12803465|gb|AAH02557.1| GATA binding protein 2 [Homo sapiens]
gi|30582181|gb|AAP35317.1| hypothetical protein MGC2306 [Homo sapiens]
gi|60655087|gb|AAX32107.1| GATA binding protein 2 [synthetic construct]
gi|123982652|gb|ABM83067.1| GATA binding protein 2 [synthetic construct]
gi|123997319|gb|ABM86261.1| GATA binding protein 2 [synthetic construct]
gi|189054715|dbj|BAG37347.1| unnamed protein product [Homo sapiens]
Length = 480
Score = 189 bits (479), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|348501136|ref|XP_003438126.1| PREDICTED: transcription factor GATA-6-like [Oreochromis niloticus]
Length = 491
Score = 189 bits (479), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 98/124 (79%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
GE RECVNCG+ISTPLWRRDGTGH+LCNACGLY KMNG++RPL+KP KR + +RR+GL
Sbjct: 284 DMGESRECVNCGSISTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPPKRTSTSRRIGL 343
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 344 SCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGV----------PRPLAMKKEGIQTRK 393
Query: 124 RKPK 127
RKPK
Sbjct: 394 RKPK 397
>gi|47523100|ref|NP_999044.1| endothelial transcription factor GATA-2 [Sus scrofa]
gi|29470193|gb|AAO73945.1| transcription factor GATA-2 [Sus scrofa]
Length = 480
Score = 189 bits (479), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|120960|sp|P23770.1|GATA2_XENLA RecName: Full=GATA-binding factor 2; Short=GATA-2; AltName:
Full=Transcription factor xGATA-2
gi|214171|gb|AAA49723.1| GATA binding factor-2 [Xenopus laevis]
Length = 452
Score = 189 bits (479), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 262 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 321
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 322 ANCQTSTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 370
>gi|317419025|emb|CBN81063.1| GATA-binding factor 2 [Dicentrarchus labrax]
Length = 345
Score = 189 bits (479), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 96/120 (80%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 154 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 213
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLY+KLHNVN RPL M+K+GIQTR RK
Sbjct: 214 ANCQTTTTTLWRRNGNGDPVCNACGLYFKLHNVN----------RPLTMKKEGIQTRNRK 263
>gi|449473953|ref|XP_002194063.2| PREDICTED: GATA-binding factor 2-like [Taeniopygia guttata]
Length = 347
Score = 188 bits (478), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 157 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 216
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 217 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 265
>gi|395516754|ref|XP_003762552.1| PREDICTED: endothelial transcription factor GATA-2 [Sarcophilus
harrisii]
Length = 402
Score = 188 bits (478), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 212 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 271
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 272 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 320
>gi|291393346|ref|XP_002713202.1| PREDICTED: GATA binding protein 2 isoform 2 [Oryctolagus cuniculus]
Length = 480
Score = 188 bits (478), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|291461544|dbj|BAI83406.1| GATA-C [Parasteatoda tepidariorum]
Length = 528
Score = 188 bits (478), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 88/118 (74%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 313 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 372
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +GEPVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 373 NCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 420
>gi|345324774|ref|XP_001506274.2| PREDICTED: GATA-binding factor 2-like [Ornithorhynchus anatinus]
Length = 250
Score = 188 bits (478), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 88/119 (73%), Positives = 96/119 (80%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 61 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 120
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR RK
Sbjct: 121 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNRK 169
>gi|165292335|dbj|BAF98873.1| GATA binding protein 3 [Carassius auratus langsdorfii]
Length = 441
Score = 188 bits (478), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 87/120 (72%), Positives = 95/120 (79%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 254 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSC 313
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR RK
Sbjct: 314 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNRK 363
>gi|157278509|ref|NP_001098356.1| transcription factor GATA-2 [Oryzias latipes]
gi|57157777|dbj|BAD83861.1| transcription factor GATA-2 [Oryzias latipes]
Length = 455
Score = 188 bits (478), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 93/118 (78%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 271 EGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 330
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 331 NCQTTTTTLWRRNASGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 378
>gi|410923801|ref|XP_003975370.1| PREDICTED: GATA-binding factor 6-A-like [Takifugu rubripes]
Length = 494
Score = 188 bits (478), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 89/133 (66%), Positives = 100/133 (75%), Gaps = 10/133 (7%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
GE RECVNCG+ISTPLWRRDGTGH+LCNACGLY KMNG++RPL+KP KR + +RR+GL
Sbjct: 288 DMGESRECVNCGSISTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRTSTSRRIGL 347
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 348 SCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGV----------PRPLAMKKEGIQTRK 397
Query: 124 RKPKKQGGGGGGG 136
RKPK G G
Sbjct: 398 RKPKTLNKAKGSG 410
>gi|40288197|ref|NP_005248.2| transcription factor GATA-6 [Homo sapiens]
gi|215273987|sp|Q92908.2|GATA6_HUMAN RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
gi|119621542|gb|EAX01137.1| GATA binding protein 6 [Homo sapiens]
gi|167887577|gb|ACA05995.1| transcription factor GATA-6 [Homo sapiens]
Length = 595
Score = 188 bits (478), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 383 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 442
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 443 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 492
Query: 124 RKPK 127
RKPK
Sbjct: 493 RKPK 496
>gi|426385574|ref|XP_004059281.1| PREDICTED: transcription factor GATA-6 [Gorilla gorilla gorilla]
Length = 596
Score = 188 bits (478), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 384 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 443
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 444 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 493
Query: 124 RKPK 127
RKPK
Sbjct: 494 RKPK 497
>gi|351706178|gb|EHB09097.1| Endothelial transcription factor GATA-2 [Heterocephalus glaber]
Length = 480
Score = 188 bits (478), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|348507926|ref|XP_003441506.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Oreochromis
niloticus]
Length = 452
Score = 188 bits (478), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 93/118 (78%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 269 EGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 328
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 329 NCQTTTTTLWRRNAHGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 376
>gi|348507924|ref|XP_003441505.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Oreochromis
niloticus]
Length = 455
Score = 188 bits (478), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 93/118 (78%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 272 EGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 331
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 332 NCQTTTTTLWRRNAHGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 379
>gi|387540446|gb|AFJ70850.1| transcription factor GATA-6 [Macaca mulatta]
Length = 595
Score = 188 bits (477), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 383 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 442
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 443 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 492
Query: 124 RKPK 127
RKPK
Sbjct: 493 RKPK 496
>gi|62898826|dbj|BAD97267.1| GATA binding protein 6 variant [Homo sapiens]
Length = 595
Score = 188 bits (477), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 383 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 442
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 443 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 492
Query: 124 RKPK 127
RKPK
Sbjct: 493 RKPK 496
>gi|297702367|ref|XP_002828153.1| PREDICTED: transcription factor GATA-6 [Pongo abelii]
Length = 595
Score = 188 bits (477), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 383 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 442
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 443 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 492
Query: 124 RKPK 127
RKPK
Sbjct: 493 RKPK 496
>gi|402902790|ref|XP_003914278.1| PREDICTED: transcription factor GATA-6 [Papio anubis]
Length = 595
Score = 188 bits (477), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 383 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 442
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 443 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 492
Query: 124 RKPK 127
RKPK
Sbjct: 493 RKPK 496
>gi|285157761|gb|ADC35036.1| GATA456a [Chaetopterus sp. MB-2010a]
Length = 597
Score = 188 bits (477), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 92/145 (63%), Positives = 100/145 (68%), Gaps = 37/145 (25%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL--------- 55
F EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL+KP +RL
Sbjct: 331 FAEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLIKPQRRLVGQPELNSY 390
Query: 56 ------------------TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHN 97
+A+RR+GL C NC T TTLWRRNN+GEPVCNACGLYYKLH
Sbjct: 391 NQYSQFNQSVCIGSSIKKSASRRVGLSCANCQTTTTTLWRRNNEGEPVCNACGLYYKLHG 450
Query: 98 VNRPLAMRHLAMRPLAMRKDGIQTR 122
VN RPLAM+K+GIQTR
Sbjct: 451 VN----------RPLAMKKEGIQTR 465
>gi|391331261|ref|XP_003740068.1| PREDICTED: GATA-binding factor 2-like [Metaseiulus occidentalis]
Length = 480
Score = 188 bits (477), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 95/120 (79%), Gaps = 11/120 (9%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATRRLGLC 64
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL +A RR G
Sbjct: 244 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLQSAARRAGTS 303
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN++GEPVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 304 CANCKTTTTTLWRRNHNGEPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 353
>gi|37781177|gb|AAP34298.1| GATA transcription factor [Patiria miniata]
Length = 676
Score = 188 bits (477), Expect = 7e-46, Method: Composition-based stats.
Identities = 94/156 (60%), Positives = 103/156 (66%), Gaps = 43/156 (27%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL--------- 55
FG GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM+RPL+KP +RL
Sbjct: 354 FGIGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMSRPLIKPQRRLDELGELCEK 413
Query: 56 ------------------------TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGL 91
+ +RR G+ C NC T TTLWRRN +GEPVCNACGL
Sbjct: 414 APDMTIEGLDPNLLRFKDRRWKPQSGSRREGIVCANCHTTTTTLWRRNKEGEPVCNACGL 473
Query: 92 YYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
YYKLH+VN RPLAM+KDGIQTRKRKPK
Sbjct: 474 YYKLHSVN----------RPLAMKKDGIQTRKRKPK 499
>gi|11875205|ref|NP_062058.1| transcription factor GATA-6 [Rattus norvegicus]
gi|1235614|gb|AAA92577.1| DNA binding protein [Rattus norvegicus]
Length = 441
Score = 188 bits (477), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 231 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 290
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 291 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 340
Query: 124 RKPK 127
RKPK
Sbjct: 341 RKPK 344
>gi|238863842|gb|ACR66215.1| transcription factor GATA123s [Branchiostoma floridae]
Length = 337
Score = 188 bits (477), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 88/119 (73%), Positives = 94/119 (78%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 136 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTQCA 195
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLH VN RPL M+KDGIQTR RK
Sbjct: 196 NCKTTTTTLWRRNQNGDPVCNACGLYYKLHAVN----------RPLTMKKDGIQTRNRK 244
>gi|386765869|ref|NP_001247128.1| serpent, isoform E [Drosophila melanogaster]
gi|383292736|gb|AFH06446.1| serpent, isoform E [Drosophila melanogaster]
Length = 731
Score = 187 bits (476), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 88/123 (71%), Positives = 97/123 (78%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAISTPLWRRD TGHYLCNACGLY KMNGMNRPL+K +RL+A++R GL
Sbjct: 210 FTEGRECVNCGAISTPLWRRDNTGHYLCNACGLYMKMNGMNRPLIKQPRRLSASKRAGLS 269
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T T+LWRRN GEPVCNACGLYYKLH+V RPL M+KD IQ RKR
Sbjct: 270 CSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVP----------RPLTMKKDTIQKRKR 319
Query: 125 KPK 127
KPK
Sbjct: 320 KPK 322
>gi|215274094|sp|P46153.2|GATA6_RAT RecName: Full=Transcription factor GATA-6; AltName:
Full=DNA-binding protein GATA-GT2; AltName:
Full=GATA-binding factor 6
Length = 587
Score = 187 bits (476), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 377 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 436
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 437 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 486
Query: 124 RKPK 127
RKPK
Sbjct: 487 RKPK 490
>gi|5882288|gb|AAD55267.1|AF179425_1 transcription factor GATA-6 [Mus musculus]
Length = 443
Score = 187 bits (476), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 231 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 290
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 291 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 340
Query: 124 RKPK 127
RKPK
Sbjct: 341 RKPK 344
>gi|40538758|ref|NP_571308.1| GATA-binding protein 2a [Danio rerio]
gi|31544961|gb|AAH53131.1| GATA-binding protein 2a [Danio rerio]
Length = 456
Score = 187 bits (476), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 87/117 (74%), Positives = 94/117 (80%), Gaps = 10/117 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 266 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 325
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 326 ANCQTTTTTLWRRNGNGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTR 372
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 38/70 (54%), Gaps = 3/70 (4%)
Query: 36 GLYHK---MNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLY 92
GL+H ++G K T + G C NCG T LWRR+ G +CNACGLY
Sbjct: 239 GLFHPGTLLSGSASSFTPKCKSKTRSCSEGRECVNCGATSTPLWRRDGTGHYLCNACGLY 298
Query: 93 YKLHNVNRPL 102
+K++ NRPL
Sbjct: 299 HKMNGQNRPL 308
>gi|387538427|gb|AFJ79490.1| GATA binding protein 1/2/3, partial [Branchiostoma lanceolatum]
Length = 278
Score = 187 bits (476), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 88/119 (73%), Positives = 94/119 (78%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 116 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTQCA 175
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLH VN RPL M+KDGIQTR RK
Sbjct: 176 NCKTTTTTLWRRNQNGDPVCNACGLYYKLHAVN----------RPLTMKKDGIQTRNRK 224
>gi|46909571|ref|NP_034388.2| transcription factor GATA-6 [Mus musculus]
gi|215274120|sp|Q61169.3|GATA6_MOUSE RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
gi|74210838|dbj|BAE25049.1| unnamed protein product [Mus musculus]
gi|148691038|gb|EDL22985.1| GATA binding protein 6 [Mus musculus]
Length = 589
Score = 187 bits (476), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 377 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 436
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 437 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 486
Query: 124 RKPK 127
RKPK
Sbjct: 487 RKPK 490
>gi|56605890|ref|NP_001008444.1| GATA-binding factor 3 [Gallus gallus]
gi|120961|sp|P23825.1|GATA3_CHICK RecName: Full=GATA-binding factor 3; Short=GATA-3; AltName:
Full=Transcription factor NF-E1c
gi|62968|emb|CAA40253.1| NF-E1 [Gallus gallus]
Length = 444
Score = 187 bits (476), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|383417433|gb|AFH31930.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Macaca mulatta]
gi|383417435|gb|AFH31931.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Macaca mulatta]
gi|383417437|gb|AFH31932.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Macaca mulatta]
Length = 444
Score = 187 bits (476), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|374671149|gb|AEZ56247.1| transcription factor GATA-3 [Gadus morhua]
Length = 440
Score = 187 bits (476), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 86/120 (71%), Positives = 95/120 (79%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 257 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSC 316
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLY+KLHN+N RPL M+K+GIQTR RK
Sbjct: 317 ANCQTTTTTLWRRNANGDPVCNACGLYFKLHNIN----------RPLTMKKEGIQTRNRK 366
>gi|402879607|ref|XP_003903424.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Papio anubis]
Length = 444
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|359322060|ref|XP_541740.3| PREDICTED: LOW QUALITY PROTEIN: endothelial transcription factor
GATA-2 isoform 1 [Canis lupus familiaris]
Length = 480
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWR N +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR RK
Sbjct: 351 NCQTTTTTLWRGNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNRK 399
>gi|351708641|gb|EHB11560.1| Trans-acting T-cell-specific transcription factor GATA-3
[Heterocephalus glaber]
Length = 444
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|55633439|ref|XP_507651.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
isoform 3 [Pan troglodytes]
gi|397515219|ref|XP_003827854.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Pan paniscus]
Length = 444
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|60827672|gb|AAX36808.1| GATA binding protein 3 [synthetic construct]
gi|61368371|gb|AAX43165.1| GATA binding protein 3 [synthetic construct]
Length = 445
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|432103480|gb|ELK30584.1| Endothelial transcription factor GATA-2 [Myotis davidii]
Length = 355
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 95/118 (80%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 166 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 225
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 226 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 273
>gi|50541959|ref|NP_001002295.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 1
[Homo sapiens]
gi|31664|emb|CAA38916.1| hGATA-3 factor [Homo sapiens]
gi|13905020|gb|AAH06793.1| GATA binding protein 3 [Homo sapiens]
gi|61358245|gb|AAX41534.1| GATA binding protein 3 [synthetic construct]
gi|119606774|gb|EAW86368.1| GATA binding protein 3, isoform CRA_b [Homo sapiens]
Length = 444
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|426363955|ref|XP_004049093.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Gorilla gorilla gorilla]
Length = 444
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|410918522|ref|XP_003972734.1| PREDICTED: transcription factor GATA-3-like isoform 1 [Takifugu
rubripes]
Length = 444
Score = 187 bits (476), Expect = 9e-46, Method: Composition-based stats.
Identities = 85/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 258 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 317
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLY+KLHN+N RPL M+K+GIQTR R
Sbjct: 318 NCQTTTTTLWRRNANGDPVCNACGLYFKLHNIN----------RPLTMKKEGIQTRNR 365
>gi|1655915|gb|AAC50941.1| hGATA-6 [Homo sapiens]
gi|2506076|dbj|BAA22621.1| GATA-6 [Homo sapiens]
Length = 449
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 237 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 296
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 297 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 346
Query: 124 RKPK 127
RKPK
Sbjct: 347 RKPK 350
>gi|1877210|emb|CAA64997.1| GATA-6 DNA binding protein [Homo sapiens]
Length = 449
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 237 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 296
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 297 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 346
Query: 124 RKPK 127
RKPK
Sbjct: 347 RKPK 350
>gi|6679951|ref|NP_032117.1| trans-acting T-cell-specific transcription factor GATA-3 [Mus
musculus]
gi|120963|sp|P23772.1|GATA3_MOUSE RecName: Full=Trans-acting T-cell-specific transcription factor
GATA-3; AltName: Full=GATA-binding factor 3
gi|51053|emb|CAA38917.1| GATA-3 factor [Mus musculus]
gi|38566046|gb|AAH62915.1| GATA binding protein 3 [Mus musculus]
gi|74140120|dbj|BAE33786.1| unnamed protein product [Mus musculus]
gi|89275361|gb|ABD66080.1| GATA binding protein 3 [Mus musculus]
gi|89275371|gb|ABD66081.1| GATA binding protein 3 [Mus musculus]
gi|148676049|gb|EDL07996.1| GATA binding protein 3 [Mus musculus]
Length = 443
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 259 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 318
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 319 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 366
>gi|355782626|gb|EHH64547.1| hypothetical protein EGM_17792 [Macaca fascicularis]
Length = 444
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|109088183|ref|XP_001108337.1| PREDICTED: trans-acting T-cell-specific transcription factor
GATA-3-like isoform 3 [Macaca mulatta]
gi|355562284|gb|EHH18878.1| hypothetical protein EGK_19445 [Macaca mulatta]
Length = 444
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|297686007|ref|XP_002820561.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Pongo abelii]
Length = 444
Score = 187 bits (476), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|74185986|dbj|BAE34137.1| unnamed protein product [Mus musculus]
Length = 408
Score = 187 bits (476), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 224 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 283
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 284 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 331
>gi|395827355|ref|XP_003786870.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Otolemur garnettii]
Length = 445
Score = 187 bits (476), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 261 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 320
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 321 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 368
>gi|165292343|dbj|BAF98877.1| GATA binding protein 3 isoform 4 [Carassius auratus langsdorfii]
Length = 277
Score = 187 bits (476), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 103 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 162
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR RK
Sbjct: 163 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNRK 211
>gi|354465002|ref|XP_003494969.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Cricetulus griseus]
gi|344238968|gb|EGV95071.1| Trans-acting T-cell-specific transcription factor GATA-3
[Cricetulus griseus]
Length = 443
Score = 187 bits (476), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 259 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 318
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 319 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 366
>gi|52345846|ref|NP_001004967.1| GATA binding protein 3 [Xenopus (Silurana) tropicalis]
gi|49522444|gb|AAH75479.1| GATA binding protein 3 [Xenopus (Silurana) tropicalis]
Length = 437
Score = 187 bits (476), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 253 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 312
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 313 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 360
>gi|120964|sp|P23773.1|GATA3_XENLA RecName: Full=GATA-binding factor 3; AltName: Full=Transcription
factor xGATA-3
gi|214173|gb|AAA49724.1| GATA binding factor-3 [Xenopus laevis]
Length = 435
Score = 187 bits (476), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 252 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 311
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 312 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 359
>gi|344276401|ref|XP_003409997.1| PREDICTED: endothelial transcription factor GATA-2-like [Loxodonta
africana]
Length = 404
Score = 187 bits (476), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 87/121 (71%), Positives = 96/121 (79%), Gaps = 10/121 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G
Sbjct: 212 SWSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGT 271
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CC NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 272 CCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRN 321
Query: 124 R 124
R
Sbjct: 322 R 322
>gi|449272886|gb|EMC82589.1| GATA-binding factor 3 [Columba livia]
Length = 444
Score = 187 bits (476), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|148222826|ref|NP_001084335.1| GATA-binding factor 3 [Xenopus laevis]
gi|83406000|gb|AAI10755.1| XGATA-3 protein [Xenopus laevis]
Length = 435
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 252 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 311
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 312 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 359
>gi|410919499|ref|XP_003973222.1| PREDICTED: GATA-binding factor 2-like isoform 1 [Takifugu rubripes]
Length = 456
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 272 EGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 331
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 332 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 379
>gi|18959228|ref|NP_579827.1| GATA binding protein 3 [Rattus norvegicus]
gi|12704536|gb|AAK00586.1| GATA-3 [Rattus norvegicus]
gi|149021023|gb|EDL78630.1| GATA binding protein 3 [Rattus norvegicus]
Length = 444
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|332225943|ref|XP_003262148.1| PREDICTED: transcription factor GATA-6-like [Nomascus leucogenys]
Length = 476
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 264 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 323
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 324 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 373
Query: 124 RKPK 127
RKPK
Sbjct: 374 RKPK 377
>gi|332217060|ref|XP_003257671.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Nomascus leucogenys]
Length = 444
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|371501187|emb|CBN81009.2| GATA-binding factor 3 [Dicentrarchus labrax]
Length = 460
Score = 187 bits (475), Expect = 1e-45, Method: Composition-based stats.
Identities = 84/118 (71%), Positives = 92/118 (77%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R G C
Sbjct: 241 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTLCA 300
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +GEPVCNACGLY+KLHNVN RPL M+KDGIQTR R
Sbjct: 301 NCHTSTTTLWRRNANGEPVCNACGLYFKLHNVN----------RPLTMKKDGIQTRNR 348
>gi|348575369|ref|XP_003473462.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3-like [Cavia porcellus]
Length = 444
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|327266039|ref|XP_003217814.1| PREDICTED: LOW QUALITY PROTEIN: endothelial transcription factor
GATA-2-like [Anolis carolinensis]
Length = 387
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 197 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 256
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 257 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 305
>gi|126340357|ref|XP_001363328.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Monodelphis domestica]
Length = 444
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|395538986|ref|XP_003771455.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Sarcophilus harrisii]
Length = 444
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|403296353|ref|XP_003939076.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Saimiri boliviensis boliviensis]
Length = 444
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|162316932|gb|ABX84143.1| GATA-binding protein 6 [Microtus levis]
Length = 248
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 43 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 102
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 103 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 152
Query: 124 RKPK 127
RKPK
Sbjct: 153 RKPK 156
>gi|1699253|gb|AAB37426.1| zinc finger transcription factor [Mus sp.]
Length = 444
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 232 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 291
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 292 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 341
Query: 124 RKPK 127
RKPK
Sbjct: 342 RKPK 345
>gi|37595582|gb|AAQ94635.1| Gata6 protein [Danio rerio]
gi|111185536|gb|AAH67710.2| Gata6 protein [Danio rerio]
Length = 501
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 85/125 (68%), Positives = 100/125 (80%), Gaps = 10/125 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ E RECVNCG++STPLWRRDGTGH+LCNACGLY KMNG++RPL+KP KR++++RR+GL
Sbjct: 290 EMVESRECVNCGSMSTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRMSSSRRIGL 349
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 350 SCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVP----------RPLAMKKEGIQTRK 399
Query: 124 RKPKK 128
RKPK
Sbjct: 400 RKPKS 404
>gi|432089717|gb|ELK23534.1| Trans-acting T-cell-specific transcription factor GATA-3 [Myotis
davidii]
Length = 444
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|326911086|ref|XP_003201893.1| PREDICTED: GATA-binding factor 3-like [Meleagris gallopavo]
Length = 408
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 224 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 283
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 284 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 331
>gi|291401984|ref|XP_002717407.1| PREDICTED: GATA binding protein 3 [Oryctolagus cuniculus]
Length = 444
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|410052588|ref|XP_003954430.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6 [Pan
troglodytes]
Length = 472
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 260 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 319
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 320 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGV----------PRPLAMKKEGIQTRK 369
Query: 124 RKPK 127
RKPK
Sbjct: 370 RKPK 373
>gi|113205712|ref|NP_001038032.1| trans-acting T-cell-specific transcription factor GATA-3 [Sus
scrofa]
gi|91177057|gb|ABE26883.1| GATA binding protein 3 [Sus scrofa]
Length = 444
Score = 187 bits (474), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|148236237|ref|NP_001081452.1| GATA-binding factor 6-A [Xenopus laevis]
gi|49899014|gb|AAH76718.1| LOC397845 protein [Xenopus laevis]
Length = 502
Score = 187 bits (474), Expect = 1e-45, Method: Composition-based stats.
Identities = 84/124 (67%), Positives = 98/124 (79%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ E RECVNCG++ TPLWRRDGTGH+LCNACGLY KMNG++RPL+KP KR+ ++RR+GL
Sbjct: 286 ELPESRECVNCGSVQTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRVPSSRRIGL 345
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 346 ACANCHTSTTTLWRRNTEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 395
Query: 124 RKPK 127
RKPK
Sbjct: 396 RKPK 399
>gi|410918524|ref|XP_003972735.1| PREDICTED: transcription factor GATA-3-like isoform 2 [Takifugu
rubripes]
Length = 451
Score = 187 bits (474), Expect = 1e-45, Method: Composition-based stats.
Identities = 85/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 265 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 324
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLY+KLHN+N RPL M+K+GIQTR R
Sbjct: 325 NCQTTTTTLWRRNANGDPVCNACGLYFKLHNIN----------RPLTMKKEGIQTRNR 372
>gi|440906268|gb|ELR56550.1| Trans-acting T-cell-specific transcription factor GATA-3 [Bos
grunniens mutus]
Length = 444
Score = 187 bits (474), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|449276666|gb|EMC85098.1| GATA-binding factor 2, partial [Columba livia]
Length = 233
Score = 187 bits (474), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 43 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 102
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 103 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 151
>gi|344277697|ref|XP_003410636.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3-like [Loxodonta africana]
Length = 444
Score = 187 bits (474), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|345803446|ref|XP_547642.3| PREDICTED: transcription factor GATA-6 [Canis lupus familiaris]
Length = 627
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/121 (71%), Positives = 97/121 (80%), Gaps = 10/121 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL C
Sbjct: 414 ESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCA 473
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKRKP
Sbjct: 474 NCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKRKP 523
Query: 127 K 127
K
Sbjct: 524 K 524
>gi|269785247|ref|NP_001161551.1| GATA2 transcription factor [Saccoglossus kowalevskii]
gi|268054087|gb|ACY92530.1| GATA2 transcription factor [Saccoglossus kowalevskii]
Length = 352
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/120 (72%), Positives = 93/120 (77%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 136 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSC 195
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC TTLWRRN +G+PVCNACGLYYKLH VN RPL M+KDGIQTR RK
Sbjct: 196 ANCQATQTTLWRRNANGDPVCNACGLYYKLHGVN----------RPLTMKKDGIQTRNRK 245
>gi|241998888|ref|XP_002434087.1| endothelial transcription factor GATA-2, putative [Ixodes
scapularis]
gi|215495846|gb|EEC05487.1| endothelial transcription factor GATA-2, putative [Ixodes
scapularis]
Length = 385
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/119 (73%), Positives = 94/119 (78%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 175 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSC 234
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +GEPVCNACGLY+KLHNVN RPL M+K+GIQTR R
Sbjct: 235 ANCKTTTTTLWRRNQNGEPVCNACGLYFKLHNVN----------RPLTMKKEGIQTRNR 283
>gi|194678210|ref|XP_001253597.2| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6 [Bos
taurus]
Length = 497
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/121 (71%), Positives = 97/121 (80%), Gaps = 10/121 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL C
Sbjct: 288 ESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCA 347
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKRKP
Sbjct: 348 NCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKRKP 397
Query: 127 K 127
K
Sbjct: 398 K 398
>gi|417401032|gb|JAA47421.1| Putative trans-acting t-cell-specific transcription factor gata-3
[Desmodus rotundus]
Length = 444
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|296222377|ref|XP_002807542.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Callithrix jacchus]
Length = 592
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 96/124 (77%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 380 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 439
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM K+GIQTRK
Sbjct: 440 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMXKEGIQTRK 489
Query: 124 RKPK 127
RKPK
Sbjct: 490 RKPK 493
>gi|297263167|ref|XP_001097903.2| PREDICTED: endothelial transcription factor GATA-2 isoform 3
[Macaca mulatta]
Length = 479
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
GRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 290 SGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 349
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 350 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 397
>gi|355786384|gb|EHH66567.1| hypothetical protein EGM_03585 [Macaca fascicularis]
Length = 480
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 86/119 (72%), Positives = 95/119 (79%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRR+GTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 290 SEGRECVNCGATATPLWRRNGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCC 349
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 350 ANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|348521214|ref|XP_003448121.1| PREDICTED: transcription factor GATA-3-like isoform 1 [Oreochromis
niloticus]
Length = 443
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 85/119 (71%), Positives = 94/119 (78%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 256 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSC 315
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLY+KLHN+N RPL M+K+GIQTR R
Sbjct: 316 ANCQTTTTTLWRRNANGDPVCNACGLYFKLHNIN----------RPLTMKKEGIQTRNR 364
>gi|73949076|ref|XP_849153.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
isoform 1 [Canis lupus familiaris]
Length = 444
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|397489370|ref|XP_003815701.1| PREDICTED: transcription factor GATA-6 [Pan paniscus]
Length = 402
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 190 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 249
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 250 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGV----------PRPLAMKKEGIQTRK 299
Query: 124 RKPK 127
RKPK
Sbjct: 300 RKPK 303
>gi|2494686|sp|Q91678.1|GAT6A_XENLA RecName: Full=GATA-binding factor 6-A; AltName: Full=Transcription
factor xGATA-6a
gi|1209880|gb|AAB05648.1| transcription factor xGATA-6 [Xenopus laevis]
Length = 391
Score = 186 bits (473), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 84/124 (67%), Positives = 98/124 (79%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ E RECVNCG++ TPLWRRDGTGH+LCNACGLY KMNG++RPL+KP KR+ ++RR+GL
Sbjct: 175 ELPESRECVNCGSVQTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRVPSSRRIGL 234
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 235 ACANCHTSTTTLWRRNTEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 284
Query: 124 RKPK 127
RKPK
Sbjct: 285 RKPK 288
>gi|18858739|ref|NP_571632.1| transcription factor GATA-6 [Danio rerio]
gi|6524990|gb|AAF15276.1|AF191578_1 zinc finger transcription factor Gata6 [Danio rerio]
Length = 383
Score = 186 bits (473), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 85/125 (68%), Positives = 100/125 (80%), Gaps = 10/125 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ E RECVNCG++STPLWRRDGTGH+LCNACGLY KMNG++RPL+KP KR++++RR+GL
Sbjct: 177 EMVESRECVNCGSMSTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRMSSSRRIGL 236
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 237 SCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVP----------RPLAMKKEGIQTRK 286
Query: 124 RKPKK 128
RKPK
Sbjct: 287 RKPKS 291
>gi|431917650|gb|ELK16915.1| Trans-acting T-cell-specific transcription factor GATA-3 [Pteropus
alecto]
Length = 409
Score = 186 bits (473), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 225 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 284
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 285 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 332
>gi|301769219|ref|XP_002920028.1| PREDICTED: trans-acting T-cell-specific transcription factor
GATA-3-like [Ailuropoda melanoleuca]
Length = 411
Score = 186 bits (473), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 227 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 286
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 287 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 334
>gi|297489684|ref|XP_002697773.1| PREDICTED: transcription factor GATA-6, partial [Bos taurus]
gi|296473832|tpg|DAA15947.1| TPA: GATA-6 [Bos taurus]
Length = 344
Score = 186 bits (473), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 132 DLPESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 191
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 192 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGV----------PRPLAMKKEGIQTRK 241
Query: 124 RKPK 127
RKPK
Sbjct: 242 RKPK 245
>gi|281339813|gb|EFB15397.1| hypothetical protein PANDA_008702 [Ailuropoda melanoleuca]
Length = 412
Score = 186 bits (473), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 228 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 287
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 288 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 335
>gi|355701861|gb|EHH29214.1| GATA-binding factor 6 [Macaca mulatta]
Length = 398
Score = 186 bits (472), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 186 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 245
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 246 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGV----------PRPLAMKKEGIQTRK 295
Query: 124 RKPK 127
RKPK
Sbjct: 296 RKPK 299
>gi|443686031|gb|ELT89440.1| hypothetical protein CAPTEDRAFT_149492, partial [Capitella teleta]
Length = 347
Score = 186 bits (472), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 95/120 (79%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 215 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTNC 274
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NCGT TTLWRRN++G+PVCNACGLYYKLHNVN RPL M+KDGIQT K
Sbjct: 275 ANCGTTTTTLWRRNHNGDPVCNACGLYYKLHNVN----------RPLTMKKDGIQTPNPK 324
>gi|116284294|gb|AAI24484.1| GATA-binding protein 1 [Danio rerio]
gi|182889206|gb|AAI64788.1| Gata1 protein [Danio rerio]
Length = 418
Score = 186 bits (472), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 86/125 (68%), Positives = 93/125 (74%), Gaps = 10/125 (8%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R
Sbjct: 228 MRLSPPEARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKR 287
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
G C NC T TTLWRRN GEPVCNACGLY+KLHNVN RPL M+KDGIQ
Sbjct: 288 AGTQCANCHTSTTTLWRRNASGEPVCNACGLYFKLHNVN----------RPLTMKKDGIQ 337
Query: 121 TRKRK 125
TR RK
Sbjct: 338 TRNRK 342
>gi|68364574|ref|XP_693371.1| PREDICTED: GATA-binding factor 2 [Danio rerio]
Length = 372
Score = 186 bits (471), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 85/125 (68%), Positives = 97/125 (77%), Gaps = 10/125 (8%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M+F + RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KR ++R
Sbjct: 188 MNFSPLDARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRPVVSKR 247
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+G C NC T TTLWRRN+ GEPVCNACGLY+KLHNVN RPLAM+K+GIQ
Sbjct: 248 IGTQCANCQTSTTTLWRRNSSGEPVCNACGLYFKLHNVN----------RPLAMKKEGIQ 297
Query: 121 TRKRK 125
TR RK
Sbjct: 298 TRNRK 302
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 49/97 (50%), Gaps = 1/97 (1%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
G +C NC +T LWRR+ +G +CNACGLY K++ +NRPL + + R
Sbjct: 249 GTQCANCQTSTTTLWRRNSSGEPVCNACGLYFKLHNVNRPLAMKKEGIQTRNRKVSSKNR 308
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM 104
G + +T+ P AC ++ +++ P A+
Sbjct: 309 KGKKFSTMEENLYCDFPKTPACDQHFDMYS-QSPAAL 344
>gi|426240701|ref|XP_004014232.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3 [Ovis aries]
Length = 421
Score = 186 bits (471), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 237 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 296
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 297 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 344
>gi|327290258|ref|XP_003229840.1| PREDICTED: trans-acting T-cell-specific transcription factor
GATA-3-like, partial [Anolis carolinensis]
Length = 364
Score = 186 bits (471), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 180 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 239
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 240 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 287
>gi|301349225|gb|ADK74261.1| Gata 4-like protein [Salmo salar]
Length = 181
Score = 186 bits (471), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 92/124 (74%), Positives = 103/124 (83%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHK+NG+NRPL+KP +RL+A+RR+GL
Sbjct: 61 DFAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKVNGINRPLIKPQRRLSASRRVGL 120
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CTNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 121 LCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 170
Query: 124 RKPK 127
RKPK
Sbjct: 171 RKPK 174
>gi|260809775|ref|XP_002599680.1| hypothetical protein BRAFLDRAFT_205741 [Branchiostoma floridae]
gi|229284961|gb|EEN55692.1| hypothetical protein BRAFLDRAFT_205741 [Branchiostoma floridae]
Length = 473
Score = 186 bits (471), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 93/118 (78%), Gaps = 10/118 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 273 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTQCAN 332
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
C T TTLWRRN +G+PVCNACGLYYKLH VN RPL M+KDGIQTR RK
Sbjct: 333 CKTTTTTLWRRNQNGDPVCNACGLYYKLHAVN----------RPLTMKKDGIQTRNRK 380
>gi|348510297|ref|XP_003442682.1| PREDICTED: GATA-binding factor 2-like [Oreochromis niloticus]
Length = 424
Score = 186 bits (471), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 85/124 (68%), Positives = 93/124 (75%), Gaps = 10/124 (8%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R
Sbjct: 233 MRISTPEARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKR 292
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
G C NC T TTLWRRN +GEPVCNACGLY+KLHNVN RPL M+KDGIQ
Sbjct: 293 AGTLCANCHTSTTTLWRRNANGEPVCNACGLYFKLHNVN----------RPLTMKKDGIQ 342
Query: 121 TRKR 124
TR R
Sbjct: 343 TRNR 346
>gi|324507780|gb|ADY43293.1| Transcription factor GATA-3 [Ascaris suum]
Length = 397
Score = 186 bits (471), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 84/120 (70%), Positives = 95/120 (79%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG NRPLVKP KR +A +R G+ C
Sbjct: 227 SEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLVKPKKRQSAQKRTGIEC 286
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN G+PVCNACGLY+KLHN++ RP++M+KDGIQTR RK
Sbjct: 287 VNCKTNNTTLWRRNAHGQPVCNACGLYHKLHNIS----------RPISMKKDGIQTRNRK 336
>gi|348576623|ref|XP_003474086.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Cavia porcellus]
Length = 418
Score = 186 bits (471), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 86/121 (71%), Positives = 96/121 (79%), Gaps = 10/121 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGREC NCG+ TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL C
Sbjct: 209 EGRECANCGSFQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCA 268
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKRKP
Sbjct: 269 NCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKRKP 318
Query: 127 K 127
K
Sbjct: 319 K 319
>gi|313224235|emb|CBY20024.1| unnamed protein product [Oikopleura dioica]
Length = 265
Score = 186 bits (471), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 88/119 (73%), Positives = 96/119 (80%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP KRL+A RR G C+
Sbjct: 38 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRPLIKPKKRLSAARRAGTSCS 97
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN++G+PVCNACGLYYKLH VN RPL M+K+GIQTR RK
Sbjct: 98 NCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVN----------RPLTMKKEGIQTRNRK 146
>gi|147903431|ref|NP_001083725.1| GATA-binding factor 6-B [Xenopus laevis]
gi|51950143|gb|AAH82349.1| Gata-6 protein [Xenopus laevis]
Length = 502
Score = 186 bits (471), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 84/123 (68%), Positives = 97/123 (78%), Gaps = 10/123 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ E RECVNCG++ TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RR+GL
Sbjct: 286 ELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRIGL 345
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 346 ACANCHTTTTTLWRRNTEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 395
Query: 124 RKP 126
RKP
Sbjct: 396 RKP 398
>gi|182998|gb|AAA35869.1| transcription factor GATA-2 [Homo sapiens]
Length = 474
Score = 185 bits (470), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 85/118 (72%), Positives = 93/118 (78%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACG YHKM G NRPL+KP +RL+A RR G CC
Sbjct: 285 EGRECVNCGATATPLWRRDGTGHYLCNACGFYHKMKGQNRPLIKPKRRLSAARRAGTCCA 344
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 345 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 392
>gi|38569869|gb|AAR24452.1| GATA transcription factor [Nematostella vectensis]
Length = 422
Score = 185 bits (470), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 85/122 (69%), Positives = 97/122 (79%), Gaps = 10/122 (8%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
F +G+GRECVNCGA STPLWRRDG+GHYLCNACGLYHKMNG +RPL+KP +RL+A RR G
Sbjct: 209 FSYGDGRECVNCGATSTPLWRRDGSGHYLCNACGLYHKMNGSSRPLIKPKRRLSAARRAG 268
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C NC T TTLWRRN +G+PVCNACGLY+KLH VN RPL+M+KDGIQTR
Sbjct: 269 TSCANCHTTQTTLWRRNQNGDPVCNACGLYWKLHAVN----------RPLSMKKDGIQTR 318
Query: 123 KR 124
R
Sbjct: 319 NR 320
>gi|3183008|sp|P70005.1|GAT6B_XENLA RecName: Full=GATA-binding factor 6-B; AltName: Full=Transcription
factor xGATA-6b
gi|1621281|emb|CAA70088.1| GATA-6 protein [Xenopus laevis]
Length = 391
Score = 185 bits (470), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 84/123 (68%), Positives = 97/123 (78%), Gaps = 10/123 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ E RECVNCG++ TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RR+GL
Sbjct: 175 ELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRIGL 234
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 235 ACANCHTTTTTLWRRNTEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 284
Query: 124 RKP 126
RKP
Sbjct: 285 RKP 287
>gi|395511636|ref|XP_003760062.1| PREDICTED: transcription factor GATA-6 [Sarcophilus harrisii]
Length = 461
Score = 185 bits (470), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ +TRRLGL
Sbjct: 249 DLSDSRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSTRRLGL 308
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 309 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGV----------PRPLAMKKEGIQTRK 358
Query: 124 RKPK 127
RKPK
Sbjct: 359 RKPK 362
>gi|74207086|dbj|BAE33319.1| unnamed protein product [Mus musculus]
Length = 442
Score = 185 bits (470), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 85/117 (72%), Positives = 93/117 (79%), Gaps = 10/117 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 259 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 318
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 319 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 365
>gi|4503929|ref|NP_002042.1| trans-acting T-cell-specific transcription factor GATA-3 isoform 2
[Homo sapiens]
gi|120962|sp|P23771.1|GATA3_HUMAN RecName: Full=Trans-acting T-cell-specific transcription factor
GATA-3; AltName: Full=GATA-binding factor 3
gi|31666|emb|CAA41102.1| hGATA3 transcription factor [Homo sapiens]
gi|13111766|gb|AAH03070.1| GATA binding protein 3 [Homo sapiens]
gi|39843077|gb|AAR32096.1| GATA binding protein 3 [Homo sapiens]
gi|119606773|gb|EAW86367.1| GATA binding protein 3, isoform CRA_a [Homo sapiens]
Length = 443
Score = 185 bits (470), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 85/117 (72%), Positives = 93/117 (79%), Gaps = 10/117 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 260 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 319
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 366
>gi|30584633|gb|AAP36569.1| Homo sapiens GATA binding protein 3 [synthetic construct]
gi|61372534|gb|AAX43861.1| GATA binding protein 3 [synthetic construct]
gi|61372538|gb|AAX43862.1| GATA binding protein 3 [synthetic construct]
Length = 444
Score = 185 bits (470), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 85/117 (72%), Positives = 93/117 (79%), Gaps = 10/117 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 260 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 319
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 366
>gi|19386566|gb|AAL86577.1| GATA-3 [Raja eglanteria]
Length = 444
Score = 185 bits (469), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 85/117 (72%), Positives = 93/117 (79%), Gaps = 10/117 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 256 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 315
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 316 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 362
>gi|313240837|emb|CBY33127.1| unnamed protein product [Oikopleura dioica]
Length = 249
Score = 185 bits (469), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 88/119 (73%), Positives = 96/119 (80%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP KRL+A RR G C+
Sbjct: 22 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGHNRPLIKPKKRLSAARRAGTSCS 81
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN++G+PVCNACGLYYKLH VN RPL M+K+GIQTR RK
Sbjct: 82 NCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVN----------RPLTMKKEGIQTRNRK 130
>gi|31662|emb|CAA38877.1| GATA-3 [Homo sapiens]
Length = 443
Score = 185 bits (469), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 85/117 (72%), Positives = 93/117 (79%), Gaps = 10/117 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 260 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 319
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 366
>gi|345310085|ref|XP_003428923.1| PREDICTED: LOW QUALITY PROTEIN: trans-acting T-cell-specific
transcription factor GATA-3-like [Ornithorhynchus
anatinus]
Length = 440
Score = 185 bits (469), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 256 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 315
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 316 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 363
>gi|45361689|ref|NP_989422.1| GATA binding protein 6 [Xenopus (Silurana) tropicalis]
gi|32442468|gb|AAP82292.1| zinc-finger transcription factor Gata6 [Xenopus (Silurana)
tropicalis]
Length = 524
Score = 185 bits (469), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 84/123 (68%), Positives = 97/123 (78%), Gaps = 10/123 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ E RECVNCG++ TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RR+GL
Sbjct: 309 ELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRIGL 368
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 369 ACANCHTTTTTLWRRNTEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 418
Query: 124 RKP 126
RKP
Sbjct: 419 RKP 421
>gi|393910073|gb|EJD75720.1| hypothetical protein LOAG_17191 [Loa loa]
Length = 395
Score = 185 bits (469), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 84/120 (70%), Positives = 96/120 (80%), Gaps = 10/120 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG NRPLVKP KR +A +R G+ C
Sbjct: 227 SEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLVKPKKRQSAQKRTGIEC 286
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN+ G+PVCNACGLY+KLHN++ RP++M+KDGIQTR RK
Sbjct: 287 VNCKTNNTTLWRRNSLGQPVCNACGLYHKLHNIS----------RPISMKKDGIQTRNRK 336
>gi|335353907|dbj|BAK39710.1| GATA binding protein 3 [Tursiops truncatus]
Length = 251
Score = 185 bits (469), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 117 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 176
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 177 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 224
>gi|18858731|ref|NP_571309.1| GATA binding protein 1a [Danio rerio]
gi|1132419|gb|AAA86090.1| zg1 [Danio rerio]
Length = 418
Score = 184 bits (468), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 85/125 (68%), Positives = 93/125 (74%), Gaps = 10/125 (8%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M E RECVNCGA +TPLWR+DGTGHYLCNACGLYHKMNG NRPL++P KRL ++R
Sbjct: 228 MRLSPPEARECVNCGATATPLWRQDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKR 287
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
G C NC T TTLWRRN GEPVCNACGLY+KLHNVN RPL M+KDGIQ
Sbjct: 288 AGTQCANCHTSTTTLWRRNASGEPVCNACGLYFKLHNVN----------RPLTMKKDGIQ 337
Query: 121 TRKRK 125
TR RK
Sbjct: 338 TRNRK 342
>gi|116003829|ref|NP_001070272.1| trans-acting T-cell-specific transcription factor GATA-3 [Bos
taurus]
gi|122132376|sp|Q08DV0.1|GATA3_BOVIN RecName: Full=Trans-acting T-cell-specific transcription factor
GATA-3; AltName: Full=GATA-binding factor 3
gi|115305262|gb|AAI23556.1| GATA binding protein 3 [Bos taurus]
gi|296481519|tpg|DAA23634.1| TPA: trans-acting T-cell-specific transcription factor GATA-3 [Bos
taurus]
Length = 443
Score = 184 bits (468), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 85/117 (72%), Positives = 93/117 (79%), Gaps = 10/117 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 260 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 319
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 366
>gi|356582220|ref|NP_001239112.1| trans-acting T-cell-specific transcription factor GATA-3 [Ovis
aries]
gi|342671952|dbj|BAK57314.1| transcription factor GATA3 [Ovis aries]
Length = 444
Score = 184 bits (468), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 85/117 (72%), Positives = 93/117 (79%), Gaps = 10/117 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 261 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 320
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 321 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|45382241|ref|NP_990751.1| transcription factor GATA-6 [Gallus gallus]
gi|1169849|sp|P43693.1|GATA6_CHICK RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
gi|511484|gb|AAA57505.1| GATA-6 transcription factor [Gallus gallus]
Length = 387
Score = 184 bits (468), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 86/123 (69%), Positives = 97/123 (78%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
E RECVNCG+I TPLWRRDGTG+YLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 175 LSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRPLIKPQKRVPSSRRLGLS 234
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKR
Sbjct: 235 CANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKR 284
Query: 125 KPK 127
KPK
Sbjct: 285 KPK 287
>gi|410920497|ref|XP_003973720.1| PREDICTED: GATA-binding factor 2-like [Takifugu rubripes]
Length = 421
Score = 184 bits (468), Expect = 8e-45, Method: Composition-based stats.
Identities = 83/118 (70%), Positives = 91/118 (77%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R G C
Sbjct: 235 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTLCA 294
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN GEPVCNACGLY+KLHNVN RPL M+K+GIQTR R
Sbjct: 295 NCHTSTTTLWRRNASGEPVCNACGLYFKLHNVN----------RPLTMKKEGIQTRNR 342
>gi|47223008|emb|CAG07095.1| unnamed protein product [Tetraodon nigroviridis]
Length = 225
Score = 184 bits (467), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 98/124 (79%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
GE RECVNCG+ISTPLWRRDGTGH+LCNACGLY KMNG++RPL+KP KR + +RR+GL
Sbjct: 77 DMGESRECVNCGSISTPLWRRDGTGHFLCNACGLYSKMNGLSRPLIKPQKRTSTSRRIGL 136
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 137 SCANCQTSTTTLWRRNAEGEPVCNACGLYTKLHGVP----------RPLAMKKEGIQTRK 186
Query: 124 RKPK 127
RKPK
Sbjct: 187 RKPK 190
>gi|410963189|ref|XP_003988148.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Felis catus]
Length = 492
Score = 184 bits (467), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 85/117 (72%), Positives = 93/117 (79%), Gaps = 10/117 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 309 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 368
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 369 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 415
>gi|342671958|dbj|BAK57317.1| transcription factor GATA6 [Ovis aries]
Length = 217
Score = 184 bits (467), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 5 DLPESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 64
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 65 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGV----------PRPLAMKKEGIQTRK 114
Query: 124 RKPK 127
RKPK
Sbjct: 115 RKPK 118
>gi|328697578|ref|XP_001945293.2| PREDICTED: hypothetical protein LOC100159936 [Acyrthosiphon pisum]
Length = 657
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 87/122 (71%), Positives = 94/122 (77%), Gaps = 13/122 (10%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TATRRLG 62
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +A RR G
Sbjct: 417 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLSLQSAARRAG 476
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C NC T TTLWRRN +GEPVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 477 TSCANCKTSTTTLWRRNQNGEPVCNACGLYYKLHNVN----------RPLTMKKEGIQTR 526
Query: 123 KR 124
R
Sbjct: 527 NR 528
>gi|327264202|ref|XP_003216904.1| PREDICTED: erythroid transcription factor-like [Anolis
carolinensis]
Length = 383
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 85/116 (73%), Positives = 93/116 (80%), Gaps = 10/116 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R G C+
Sbjct: 189 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCS 248
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
NC T TTLWRRN +GEPVCNACGLY+KLHNVNRPLAMR KDGIQTR
Sbjct: 249 NCQTTTTTLWRRNVNGEPVCNACGLYFKLHNVNRPLAMR----------KDGIQTR 294
>gi|258645094|ref|NP_001158260.1| grain [Tribolium castaneum]
Length = 492
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 88/123 (71%), Positives = 95/123 (77%), Gaps = 13/123 (10%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TATRRLG 62
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +A RR G
Sbjct: 264 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLSLQSAARRAG 323
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C NC T TTLWRRN +GEPVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 324 TSCANCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVN----------RPLTMKKEGIQTR 373
Query: 123 KRK 125
RK
Sbjct: 374 NRK 376
>gi|117646668|emb|CAL37449.1| hypothetical protein [synthetic construct]
gi|208966350|dbj|BAG73189.1| GATA binding protein 3 [synthetic construct]
Length = 443
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 84/117 (71%), Positives = 93/117 (79%), Gaps = 10/117 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 260 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 319
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C T TTLWRRN +G+PVCNACGLYY+LHN+N RPL M+K+GIQTR R
Sbjct: 320 CQTTTTTLWRRNANGDPVCNACGLYYELHNIN----------RPLTMKKEGIQTRNR 366
>gi|213511622|ref|NP_001133239.1| GATA-binding factor 1 [Salmo salar]
gi|209147311|gb|ACI32884.1| GATA-binding factor 3 [Salmo salar]
Length = 424
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 85/125 (68%), Positives = 93/125 (74%), Gaps = 10/125 (8%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M + RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R
Sbjct: 233 MRLSPPDARECVNCGATNTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKR 292
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
G C NC T TTLWRRN GEPVCNACGLY+KLHNVN RPL M+KDGIQ
Sbjct: 293 AGTQCANCNTSTTTLWRRNASGEPVCNACGLYFKLHNVN----------RPLTMKKDGIQ 342
Query: 121 TRKRK 125
TR RK
Sbjct: 343 TRNRK 347
>gi|395829517|ref|XP_003787903.1| PREDICTED: transcription factor GATA-5 [Otolemur garnettii]
Length = 435
Score = 184 bits (466), Expect = 1e-44, Method: Composition-based stats.
Identities = 94/122 (77%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL++TRR GLCC
Sbjct: 222 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSTRRAGLCC 281
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 282 TNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 331
Query: 126 PK 127
PK
Sbjct: 332 PK 333
>gi|449494011|ref|XP_002194999.2| PREDICTED: transcription factor GATA-6 [Taeniopygia guttata]
Length = 390
Score = 184 bits (466), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 85/123 (69%), Positives = 97/123 (78%), Gaps = 10/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
E RECVNCG+I TPLWRRDGTG+YLCNACGLY KMNG++RPL+KP KR+ ++RR+GL
Sbjct: 178 LSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRPLIKPQKRVPSSRRMGLS 237
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKR
Sbjct: 238 CANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKR 287
Query: 125 KPK 127
KPK
Sbjct: 288 KPK 290
>gi|117646902|emb|CAL37566.1| hypothetical protein [synthetic construct]
Length = 443
Score = 183 bits (465), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 84/115 (73%), Positives = 92/115 (80%), Gaps = 10/115 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 260 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 319
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR
Sbjct: 320 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTR 364
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 30/50 (60%)
Query: 53 KRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
R A G C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 251 SRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPL 300
>gi|395823099|ref|XP_003784834.1| PREDICTED: transcription factor GATA-6 [Otolemur garnettii]
Length = 589
Score = 183 bits (465), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 86/121 (71%), Positives = 96/121 (79%), Gaps = 10/121 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+K KR+ ++RRLGL C
Sbjct: 380 ESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKQQKRVPSSRRLGLSCA 439
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKRKP
Sbjct: 440 NCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKRKP 489
Query: 127 K 127
K
Sbjct: 490 K 490
>gi|194044318|ref|XP_001924217.1| PREDICTED: transcription factor GATA-5-like [Sus scrofa]
Length = 400
Score = 183 bits (465), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 98/135 (72%), Positives = 107/135 (79%), Gaps = 12/135 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 186 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 245
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN DGEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 246 TNCRTTNTTLWRRNADGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 295
Query: 126 PKK--QGGGGGGGSG 138
PK + G G SG
Sbjct: 296 PKSVAKTKGSSGSSG 310
>gi|323652552|gb|ADX98527.1| GATA-binding protein 3 [Coturnix japonica]
Length = 281
Score = 183 bits (464), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 85/117 (72%), Positives = 93/117 (79%), Gaps = 10/117 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 149 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 208
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 209 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 255
>gi|157278507|ref|NP_001098355.1| hematopietic transcription factor GATA-1 [Oryzias latipes]
gi|39748630|dbj|BAD04991.1| hematopietic transcription factor GATA-1 [Oryzias latipes]
Length = 417
Score = 183 bits (464), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 83/118 (70%), Positives = 92/118 (77%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R G C
Sbjct: 234 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCA 293
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +GEPVCNACGLY+KLHNVN RPL M+K+GIQTR R
Sbjct: 294 NCHTSTTTLWRRNANGEPVCNACGLYFKLHNVN----------RPLTMKKEGIQTRNR 341
>gi|126302721|ref|XP_001368185.1| PREDICTED: GATA-binding factor 5-A-like [Monodelphis domestica]
Length = 397
Score = 183 bits (464), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 94/122 (77%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL++TRR GLCC
Sbjct: 186 GEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSTRRAGLCC 245
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 246 TNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 295
Query: 126 PK 127
PK
Sbjct: 296 PK 297
>gi|270015717|gb|EFA12165.1| hypothetical protein TcasGA2_TC002315 [Tribolium castaneum]
Length = 409
Score = 182 bits (463), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 88/123 (71%), Positives = 95/123 (77%), Gaps = 13/123 (10%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL---TATRRLG 62
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +A RR G
Sbjct: 181 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLSLQSAARRAG 240
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C NC T TTLWRRN +GEPVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 241 TSCANCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVN----------RPLTMKKEGIQTR 290
Query: 123 KRK 125
RK
Sbjct: 291 NRK 293
>gi|332016552|gb|EGI57433.1| GATA-binding factor A [Acromyrmex echinatior]
Length = 704
Score = 182 bits (463), Expect = 3e-44, Method: Composition-based stats.
Identities = 87/137 (63%), Positives = 97/137 (70%), Gaps = 15/137 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP---LVKPSKRLTAT--RRL 61
E +ECVNC A TPLWRRDGTGHYLCNACGLY+KMNG+NRP KP + + T RR
Sbjct: 478 EPKECVNCAASMTPLWRRDGTGHYLCNACGLYNKMNGVNRPPMRCTKPKQSVAPTNVRRA 537
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G+ C NC T TTLWRRNN+GEPVCNACGLYYKLHNVN RPL+M+K+GIQT
Sbjct: 538 GVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVN----------RPLSMKKEGIQT 587
Query: 122 RKRKPKKQGGGGGGGSG 138
RKRKPK G G
Sbjct: 588 RKRKPKNHSNITGNLPG 604
>gi|395506703|ref|XP_003757670.1| PREDICTED: GATA-binding factor 5-A-like [Sarcophilus harrisii]
Length = 398
Score = 182 bits (462), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 94/122 (77%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL++TRR GLCC
Sbjct: 186 GEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSTRRAGLCC 245
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 246 TNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 295
Query: 126 PK 127
PK
Sbjct: 296 PK 297
>gi|221127849|ref|XP_002159183.1| PREDICTED: uncharacterized protein LOC100208265 [Hydra
magnipapillata]
Length = 408
Score = 182 bits (461), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 94/124 (75%), Gaps = 13/124 (10%)
Query: 2 DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
DF EGREC+NCGA STPLWRRD GHYLCNACGLYHKMNG NRPL+KP +RL+ RR
Sbjct: 223 DF---EGRECMNCGATSTPLWRRDTRGHYLCNACGLYHKMNGANRPLIKPKRRLSQARRT 279
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G+ C+NC T TTLWRRN GEPVCNACGLYYKLH VN RPL M+KDGIQT
Sbjct: 280 GIVCSNCKTSQTTLWRRNGSGEPVCNACGLYYKLHKVN----------RPLTMKKDGIQT 329
Query: 122 RKRK 125
R RK
Sbjct: 330 RNRK 333
>gi|326917513|ref|XP_003205043.1| PREDICTED: transcription factor GATA-6-like [Meleagris gallopavo]
Length = 402
Score = 182 bits (461), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 86/124 (69%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTG+YLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 189 DLSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRPLIKPQKRVPSSRRLGL 248
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 249 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 298
Query: 124 RKPK 127
RKPK
Sbjct: 299 RKPK 302
>gi|426241173|ref|XP_004014466.1| PREDICTED: transcription factor GATA-5 [Ovis aries]
Length = 403
Score = 182 bits (461), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 94/122 (77%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 189 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 248
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN DGEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 249 TNCHTTTTTLWRRNADGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 298
Query: 126 PK 127
PK
Sbjct: 299 PK 300
>gi|391332619|ref|XP_003740730.1| PREDICTED: GATA-binding factor 5-A-like [Metaseiulus occidentalis]
Length = 158
Score = 182 bits (461), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 88/127 (69%), Positives = 101/127 (79%), Gaps = 10/127 (7%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M+ GEGRECVNCGAISTPLWRRDGTGHYLCNACGLY+KMNG +RP++K +RL+A+RR
Sbjct: 9 MEMYAGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYNKMNGAHRPIIKTPRRLSASRR 68
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
GL C+NC T T+LWRRNN GEPVCNACGLY++LH V RPL M+KD IQ
Sbjct: 69 AGLTCSNCQTTNTSLWRRNNVGEPVCNACGLYFRLHGVP----------RPLTMKKDTIQ 118
Query: 121 TRKRKPK 127
TRKRKPK
Sbjct: 119 TRKRKPK 125
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
G C NC +T LWRR+ G +CNACGLY +++G+ RPL + +R C N
Sbjct: 70 GLTCSNCQTTNTSLWRRNNVGEPVCNACGLYFRLHGVPRPLTMKKDTIQTRKRKPKCSPN 129
>gi|77735401|ref|NP_001029393.1| transcription factor GATA-5 [Bos taurus]
gi|122140236|sp|Q3SZJ5.1|GATA5_BOVIN RecName: Full=Transcription factor GATA-5; AltName:
Full=GATA-binding factor 5
gi|74354092|gb|AAI02822.1| GATA binding protein 5 [Bos taurus]
gi|296481044|tpg|DAA23159.1| TPA: transcription factor GATA-5 [Bos taurus]
Length = 403
Score = 182 bits (461), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 98/135 (72%), Positives = 107/135 (79%), Gaps = 12/135 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 189 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 248
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN DGEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 249 TNCHTTTTTLWRRNVDGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 298
Query: 126 PK--KQGGGGGGGSG 138
PK + G G SG
Sbjct: 299 PKNIAKTKGSSGSSG 313
>gi|403265451|ref|XP_003924952.1| PREDICTED: transcription factor GATA-6-like [Saimiri boliviensis
boliviensis]
Length = 452
Score = 182 bits (461), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 85/121 (70%), Positives = 95/121 (78%), Gaps = 10/121 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCG+I TPLWRR G GHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL C
Sbjct: 243 ESRECVNCGSIQTPLWRRGGPGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCA 302
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKRKP
Sbjct: 303 NCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKRKP 352
Query: 127 K 127
K
Sbjct: 353 K 353
>gi|449278190|gb|EMC86134.1| Transcription factor GATA-6, partial [Columba livia]
Length = 396
Score = 181 bits (460), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 85/124 (68%), Positives = 97/124 (78%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTG+YLCNACGLY KMNG++RPL+KP KR+ ++RR+GL
Sbjct: 183 DLSESRECVNCGSIQTPLWRRDGTGNYLCNACGLYTKMNGLSRPLIKPQKRVPSSRRVGL 242
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 243 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 292
Query: 124 RKPK 127
RKPK
Sbjct: 293 RKPK 296
>gi|395752542|ref|XP_002830553.2| PREDICTED: transcription factor GATA-5 [Pongo abelii]
Length = 397
Score = 181 bits (460), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 184 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 243
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 244 TNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 293
Query: 126 PK 127
PK
Sbjct: 294 PK 295
>gi|183000|gb|AAA35870.1| Gata3 enhancer-binding protein [Homo sapiens]
Length = 444
Score = 181 bits (460), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 84/118 (71%), Positives = 92/118 (77%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLC CGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCRRCGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 319
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 320 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 367
>gi|73992691|ref|XP_543086.2| PREDICTED: transcription factor GATA-5 [Canis lupus familiaris]
Length = 400
Score = 181 bits (460), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 94/123 (76%), Positives = 102/123 (82%), Gaps = 10/123 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 187 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 246
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN DGEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 247 TNCHTTTTTLWRRNADGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 296
Query: 126 PKK 128
PK
Sbjct: 297 PKS 299
>gi|332858914|ref|XP_514767.3| PREDICTED: transcription factor GATA-5 [Pan troglodytes]
Length = 397
Score = 181 bits (460), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 184 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 243
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 244 TNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 293
Query: 126 PK 127
PK
Sbjct: 294 PK 295
>gi|335371117|gb|AEH57087.1| GATA456 [Bugula neritina]
Length = 512
Score = 181 bits (460), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 90/132 (68%), Positives = 96/132 (72%), Gaps = 20/132 (15%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKPSKRLTA------ 57
FGE RECVNCGA+ TPLWRRD GHYLCNACGLY KMNG+NRPL VKP+ T
Sbjct: 247 FGENRECVNCGAMHTPLWRRDTDGHYLCNACGLYQKMNGLNRPLQVKPAPPPTKKQTNST 306
Query: 58 ---TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAM 114
+RR GL C NC T TTLWRRNNDGEPVCNACGLYYKLHNVN RP +M
Sbjct: 307 AGLSRRTGLVCANCNTGTTTLWRRNNDGEPVCNACGLYYKLHNVN----------RPPSM 356
Query: 115 RKDGIQTRKRKP 126
+KDGIQTRKRKP
Sbjct: 357 KKDGIQTRKRKP 368
>gi|195499000|ref|XP_002096762.1| GE24873 [Drosophila yakuba]
gi|194182863|gb|EDW96474.1| GE24873 [Drosophila yakuba]
Length = 707
Score = 181 bits (460), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 87/125 (69%), Positives = 93/125 (74%), Gaps = 16/125 (12%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------ATR 59
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A +
Sbjct: 477 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSAAK 536
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R G C NC T TTLWRRN GEPVCNACGLYYKLHNVNRPL M+ K+GI
Sbjct: 537 RAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLTMK----------KEGI 586
Query: 120 QTRKR 124
QTR R
Sbjct: 587 QTRNR 591
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 33/54 (61%)
Query: 49 VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
VKP + T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 466 VKPVRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 519
>gi|17998698|ref|NP_536721.1| transcription factor GATA-5 [Homo sapiens]
gi|20138325|sp|Q9BWX5.1|GATA5_HUMAN RecName: Full=Transcription factor GATA-5; AltName:
Full=GATA-binding factor 5
gi|109658580|gb|AAI17359.1| GATA binding protein 5 [Homo sapiens]
gi|109658884|gb|AAI17357.1| GATA binding protein 5 [Homo sapiens]
gi|119595766|gb|EAW75360.1| GATA binding protein 5, isoform CRA_a [Homo sapiens]
gi|119595767|gb|EAW75361.1| GATA binding protein 5, isoform CRA_a [Homo sapiens]
gi|302313183|gb|ADL14516.1| GATA binding protein 5 [Homo sapiens]
Length = 397
Score = 181 bits (460), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 184 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 243
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 244 TNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 293
Query: 126 PK 127
PK
Sbjct: 294 PK 295
>gi|195330796|ref|XP_002032089.1| GM26366 [Drosophila sechellia]
gi|194121032|gb|EDW43075.1| GM26366 [Drosophila sechellia]
Length = 703
Score = 181 bits (460), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 87/125 (69%), Positives = 93/125 (74%), Gaps = 16/125 (12%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------ATR 59
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A +
Sbjct: 473 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSAAK 532
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R G C NC T TTLWRRN GEPVCNACGLYYKLHNVNRPL M+ K+GI
Sbjct: 533 RAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLTMK----------KEGI 582
Query: 120 QTRKR 124
QTR R
Sbjct: 583 QTRNR 587
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 33/54 (61%)
Query: 49 VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
VKP + T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 462 VKPVRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 515
>gi|47229585|emb|CAG06781.1| unnamed protein product [Tetraodon nigroviridis]
Length = 456
Score = 181 bits (460), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 84/118 (71%), Positives = 92/118 (77%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRD TGHYLC CGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 272 EGRECVNCGATSTPLWRRDSTGHYLCKRCGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 331
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLY+KLHNVN RPL M+K+GIQTR R
Sbjct: 332 NCQTTTTTLWRRNANGDPVCNACGLYFKLHNVN----------RPLTMKKEGIQTRNR 379
>gi|195037647|ref|XP_001990272.1| GH19248 [Drosophila grimshawi]
gi|193894468|gb|EDV93334.1| GH19248 [Drosophila grimshawi]
Length = 696
Score = 181 bits (460), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 87/125 (69%), Positives = 93/125 (74%), Gaps = 16/125 (12%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------ATR 59
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A +
Sbjct: 466 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSAAK 525
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R G C NC T TTLWRRN GEPVCNACGLYYKLHNVNRPL M+ K+GI
Sbjct: 526 RAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLTMK----------KEGI 575
Query: 120 QTRKR 124
QTR R
Sbjct: 576 QTRNR 580
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 33/54 (61%)
Query: 49 VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
VKP + T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 455 VKPVRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 508
>gi|342671956|dbj|BAK57316.1| transcription factor GATA5 [Ovis aries]
Length = 333
Score = 181 bits (459), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 94/122 (77%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 119 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 178
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN DGEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 179 TNCHTTTTTLWRRNADGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 228
Query: 126 PK 127
PK
Sbjct: 229 PK 230
>gi|384942494|gb|AFI34852.1| transcription factor GATA-5 [Macaca mulatta]
Length = 397
Score = 181 bits (459), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 184 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 243
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 244 TNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 293
Query: 126 PK 127
PK
Sbjct: 294 PK 295
>gi|221378310|ref|NP_001138026.1| grain, isoform B [Drosophila melanogaster]
gi|220903030|gb|ACL83485.1| grain, isoform B [Drosophila melanogaster]
Length = 699
Score = 181 bits (459), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 87/125 (69%), Positives = 93/125 (74%), Gaps = 16/125 (12%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------ATR 59
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A +
Sbjct: 469 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSAAK 528
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R G C NC T TTLWRRN GEPVCNACGLYYKLHNVNRPL M+ K+GI
Sbjct: 529 RAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLTMK----------KEGI 578
Query: 120 QTRKR 124
QTR R
Sbjct: 579 QTRNR 583
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 33/54 (61%)
Query: 49 VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
VKP + T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 458 VKPVRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 511
>gi|256083783|ref|XP_002578117.1| transcription factor GATA-1 [Schistosoma mansoni]
Length = 735
Score = 181 bits (459), Expect = 8e-44, Method: Composition-based stats.
Identities = 82/120 (68%), Positives = 95/120 (79%), Gaps = 11/120 (9%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATRRLGLCC 65
EGRECVNCGA STPLWRRDG G+YLCNACGLY KMNG NRPL+KP +RL +++RR G C
Sbjct: 198 EGRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNGQNRPLIKPKRRLQSSSRRTGTIC 257
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN +GEPVCNACGLY+KLHN+ RP++M+KDGIQTR RK
Sbjct: 258 SNCRTITTTLWRRNTNGEPVCNACGLYFKLHNIQ----------RPISMKKDGIQTRNRK 307
>gi|195390588|ref|XP_002053950.1| GJ23061 [Drosophila virilis]
gi|194152036|gb|EDW67470.1| GJ23061 [Drosophila virilis]
Length = 686
Score = 181 bits (459), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 87/125 (69%), Positives = 93/125 (74%), Gaps = 16/125 (12%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------ATR 59
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A +
Sbjct: 456 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSAAK 515
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R G C NC T TTLWRRN GEPVCNACGLYYKLHNVNRPL M+ K+GI
Sbjct: 516 RAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLTMK----------KEGI 565
Query: 120 QTRKR 124
QTR R
Sbjct: 566 QTRNR 570
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 33/54 (61%)
Query: 49 VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
VKP + T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 445 VKPVRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 498
>gi|348554147|ref|XP_003462887.1| PREDICTED: transcription factor GATA-5-like [Cavia porcellus]
Length = 337
Score = 181 bits (459), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 94/122 (77%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 124 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 183
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN DGEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 184 TNCHTTHTTLWRRNTDGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 233
Query: 126 PK 127
PK
Sbjct: 234 PK 235
>gi|3702858|gb|AAC62961.1| GATA transcription factor [Strongylocentrotus purpuratus]
Length = 119
Score = 181 bits (459), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 92/126 (73%), Positives = 99/126 (78%), Gaps = 11/126 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATRRLGLCCT 66
GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG NRPL+K +RL + +RR G+ C
Sbjct: 1 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGYNRPLIKNPRRLQSGSRREGITCA 60
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN DGEPVCNACGLY+KLH VN RPLAM+KDGIQTRKRKP
Sbjct: 61 NCHTSTTTLWRRNKDGEPVCNACGLYFKLHGVN----------RPLAMKKDGIQTRKRKP 110
Query: 127 KKQGGG 132
K G
Sbjct: 111 KNPNKG 116
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 30/43 (69%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
EG C NC +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 55 EGITCANCHTSTTTLWRRNKDGEPVCNACGLYFKLHGVNRPLA 97
>gi|426392409|ref|XP_004062545.1| PREDICTED: transcription factor GATA-5 [Gorilla gorilla gorilla]
Length = 479
Score = 181 bits (459), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 266 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 325
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 326 TNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 375
Query: 126 PK 127
PK
Sbjct: 376 PK 377
>gi|442617963|ref|NP_001262366.1| grain, isoform C [Drosophila melanogaster]
gi|440217190|gb|AGB95748.1| grain, isoform C [Drosophila melanogaster]
Length = 712
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 87/125 (69%), Positives = 93/125 (74%), Gaps = 16/125 (12%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------ATR 59
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A +
Sbjct: 469 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSAAK 528
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R G C NC T TTLWRRN GEPVCNACGLYYKLHNVNRPL M+ K+GI
Sbjct: 529 RAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLTMK----------KEGI 578
Query: 120 QTRKR 124
QTR R
Sbjct: 579 QTRNR 583
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 33/54 (61%)
Query: 49 VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
VKP + T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 458 VKPVRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 511
>gi|157169245|gb|ABV25957.1| GATA-binding transcription factor A1 [Capitella teleta]
Length = 372
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 84/114 (73%), Positives = 93/114 (81%), Gaps = 10/114 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKM+G NRPL+KP +RL+A RR G C
Sbjct: 269 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMSGQNRPLIKPKRRLSAARRAGTNCA 328
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
NCGT TTLWRRN++G+PVCNACGLYYKLHNVN RPL M+K+GIQ
Sbjct: 329 NCGTTTTTLWRRNHNGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQ 372
>gi|353228552|emb|CCD74723.1| putative gata binding factor [Schistosoma mansoni]
Length = 735
Score = 181 bits (458), Expect = 1e-43, Method: Composition-based stats.
Identities = 82/120 (68%), Positives = 95/120 (79%), Gaps = 11/120 (9%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATRRLGLCC 65
EGRECVNCGA STPLWRRDG G+YLCNACGLY KMNG NRPL+KP +RL +++RR G C
Sbjct: 198 EGRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNGQNRPLIKPKRRLQSSSRRTGTIC 257
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN +GEPVCNACGLY+KLHN+ RP++M+KDGIQTR RK
Sbjct: 258 SNCRTITTTLWRRNTNGEPVCNACGLYFKLHNIQ----------RPISMKKDGIQTRNRK 307
>gi|321467569|gb|EFX78559.1| hypothetical protein DAPPUDRAFT_24785 [Daphnia pulex]
Length = 118
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 88/118 (74%), Positives = 96/118 (81%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL+A RR G C
Sbjct: 1 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLSAARRAGTTCA 60
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T MTTLWRRN++GEPVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 61 NCKTTMTTLWRRNHNGEPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 108
>gi|198452168|ref|XP_001358657.2| GA21945, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198131814|gb|EAL27798.2| GA21945, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 699
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 87/125 (69%), Positives = 93/125 (74%), Gaps = 16/125 (12%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------ATR 59
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A +
Sbjct: 469 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSAAK 528
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R G C NC T TTLWRRN GEPVCNACGLYYKLHNVN RPL M+K+GI
Sbjct: 529 RAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVN----------RPLTMKKEGI 578
Query: 120 QTRKR 124
QTR R
Sbjct: 579 QTRNR 583
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 33/54 (61%)
Query: 49 VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
VKP + T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 458 VKPVRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 511
>gi|345308422|ref|XP_001505950.2| PREDICTED: GATA-binding factor 5-A-like [Ornithorhynchus anatinus]
Length = 397
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL+++RR GLCC
Sbjct: 185 GEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSSSRRAGLCC 244
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 245 TNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 294
Query: 126 PK 127
PK
Sbjct: 295 PK 296
>gi|156408197|ref|XP_001641743.1| predicted protein [Nematostella vectensis]
gi|156228883|gb|EDO49680.1| predicted protein [Nematostella vectensis]
Length = 423
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 85/123 (69%), Positives = 97/123 (78%), Gaps = 11/123 (8%)
Query: 3 FQFG-EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
F +G +GRECVNCGA STPLWRRDG+GHYLCNACGLYHKMNG +RPL+KP +RL+A RR
Sbjct: 209 FSYGADGRECVNCGATSTPLWRRDGSGHYLCNACGLYHKMNGSSRPLIKPKRRLSAARRA 268
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G C NC T TTLWRRN +G+PVCNACGLY+KLH VN RPL+M+KDGIQT
Sbjct: 269 GTSCANCHTTQTTLWRRNQNGDPVCNACGLYWKLHAVN----------RPLSMKKDGIQT 318
Query: 122 RKR 124
R R
Sbjct: 319 RNR 321
>gi|29126838|gb|AAH47790.1| GATA5 protein, partial [Homo sapiens]
Length = 417
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 204 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 263
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 264 TNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 313
Query: 126 PK 127
PK
Sbjct: 314 PK 315
>gi|195145358|ref|XP_002013663.1| GL23278 [Drosophila persimilis]
gi|194102606|gb|EDW24649.1| GL23278 [Drosophila persimilis]
Length = 705
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 87/125 (69%), Positives = 93/125 (74%), Gaps = 16/125 (12%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------ATR 59
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A +
Sbjct: 475 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSAAK 534
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R G C NC T TTLWRRN GEPVCNACGLYYKLHNVN RPL M+K+GI
Sbjct: 535 RAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVN----------RPLTMKKEGI 584
Query: 120 QTRKR 124
QTR R
Sbjct: 585 QTRNR 589
Score = 58.5 bits (140), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 33/54 (61%)
Query: 49 VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
VKP + T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 464 VKPVRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 517
>gi|358334312|dbj|GAA52740.1| transcription factor GATA-3 [Clonorchis sinensis]
Length = 1004
Score = 180 bits (457), Expect = 1e-43, Method: Composition-based stats.
Identities = 82/120 (68%), Positives = 95/120 (79%), Gaps = 11/120 (9%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATRRLGLCC 65
EGRECVNCGA STPLWRRDG G+YLCNACGLY KMNG NRPL+KP +RL +++RR G C
Sbjct: 611 EGRECVNCGATSTPLWRRDGQGNYLCNACGLYQKMNGQNRPLIKPKRRLQSSSRRTGTIC 670
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
+NC T TTLWRRN +GEPVCNACGLY+KLHN+ RP++M+KDGIQTR RK
Sbjct: 671 SNCRTVTTTLWRRNTNGEPVCNACGLYFKLHNIQ----------RPISMKKDGIQTRNRK 720
>gi|327269885|ref|XP_003219723.1| PREDICTED: transcription factor GATA-6-like [Anolis carolinensis]
Length = 504
Score = 180 bits (457), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 84/124 (67%), Positives = 96/124 (77%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ E RECVNCG+ TPLWR+DGTG+YLCNACGLY KMNG++RPL+KP KR+ + RRLGL
Sbjct: 289 ELQESRECVNCGSFQTPLWRKDGTGNYLCNACGLYTKMNGLSRPLIKPQKRVPSQRRLGL 348
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 349 SCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 398
Query: 124 RKPK 127
RKPK
Sbjct: 399 RKPK 402
>gi|194746249|ref|XP_001955593.1| GF16170 [Drosophila ananassae]
gi|190628630|gb|EDV44154.1| GF16170 [Drosophila ananassae]
Length = 703
Score = 180 bits (457), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 87/125 (69%), Positives = 93/125 (74%), Gaps = 16/125 (12%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------ATR 59
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A +
Sbjct: 473 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSAAK 532
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R G C NC T TTLWRRN GEPVCNACGLYYKLHNVNRPL M+ K+GI
Sbjct: 533 RAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLTMK----------KEGI 582
Query: 120 QTRKR 124
QTR R
Sbjct: 583 QTRNR 587
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 33/54 (61%)
Query: 49 VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
VKP + T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 462 VKPVRTKPRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 515
>gi|242020471|ref|XP_002430677.1| endothelial transcription factor GATA-2, putative [Pediculus
humanus corporis]
gi|212515857|gb|EEB17939.1| endothelial transcription factor GATA-2, putative [Pediculus
humanus corporis]
Length = 399
Score = 180 bits (457), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 84/115 (73%), Positives = 91/115 (79%), Gaps = 10/115 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL+A RR G C
Sbjct: 295 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLSAARRAGTSC 354
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
NC T TTLWRRN +GEPVCNACGLYYKLHNVN RPL M+K+GIQ
Sbjct: 355 ANCKTTTTTLWRRNQNGEPVCNACGLYYKLHNVN----------RPLTMKKEGIQ 399
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 36/60 (60%)
Query: 43 GMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
G++ L S+ + + G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 278 GLHSNLKNNSRNKSRSSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 337
>gi|390178038|ref|XP_003736551.1| GA21945, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859299|gb|EIM52624.1| GA21945, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 488
Score = 180 bits (456), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/127 (69%), Positives = 94/127 (74%), Gaps = 16/127 (12%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------A 57
Q EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A
Sbjct: 256 QREEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSA 315
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
+R G C NC T TTLWRRN GEPVCNACGLYYKLHNVNRPL M+ K+
Sbjct: 316 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLTMK----------KE 365
Query: 118 GIQTRKR 124
GIQTR R
Sbjct: 366 GIQTRNR 372
>gi|383851463|ref|XP_003701252.1| PREDICTED: GATA-binding factor C-like [Megachile rotundata]
Length = 292
Score = 180 bits (456), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 93/118 (78%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
+GRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL+A RR G C
Sbjct: 63 KGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLSAARRAGTSCA 122
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN GEPVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 123 NCKTATTTLWRRNQAGEPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 170
>gi|24644977|ref|NP_731211.1| grain, isoform A [Drosophila melanogaster]
gi|3183017|sp|P91623.1|GATC_DROME RecName: Full=GATA-binding factor C; AltName: Full=Protein grain;
AltName: Full=Transcription factor GATA-C; AltName:
Full=dGATA-C
gi|1815600|dbj|BAA09102.1| GATA transcription factor [Drosophila melanogaster]
gi|7298992|gb|AAF54195.1| grain, isoform A [Drosophila melanogaster]
gi|17945890|gb|AAL48991.1| RE40104p [Drosophila melanogaster]
gi|220942548|gb|ACL83817.1| grn-PA [synthetic construct]
gi|220952714|gb|ACL88900.1| grn-PA [synthetic construct]
Length = 486
Score = 180 bits (456), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/127 (69%), Positives = 94/127 (74%), Gaps = 16/127 (12%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------A 57
Q EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A
Sbjct: 254 QREEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSA 313
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
+R G C NC T TTLWRRN GEPVCNACGLYYKLHNVNRPL M+ K+
Sbjct: 314 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLTMK----------KE 363
Query: 118 GIQTRKR 124
GIQTR R
Sbjct: 364 GIQTRNR 370
>gi|397479159|ref|XP_003810895.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-5 [Pan
paniscus]
Length = 532
Score = 180 bits (456), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 319 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 378
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 379 TNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 428
Query: 126 PK 127
PK
Sbjct: 429 PK 430
>gi|194475182|gb|ACF74542.1| GATA binding protein 3 [Cervus elaphus]
Length = 191
Score = 179 bits (455), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 85/117 (72%), Positives = 93/117 (79%), Gaps = 10/117 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 69 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 128
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 129 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 175
>gi|440907455|gb|ELR57603.1| Transcription factor GATA-5 [Bos grunniens mutus]
Length = 403
Score = 179 bits (455), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/135 (71%), Positives = 106/135 (78%), Gaps = 12/135 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKM G+NRPLV+P KRL+++RR GLCC
Sbjct: 189 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMKGVNRPLVRPQKRLSSSRRAGLCC 248
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN DGEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 249 TNCHTTTTTLWRRNVDGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 298
Query: 126 PK--KQGGGGGGGSG 138
PK + G G SG
Sbjct: 299 PKNIAKTKGSSGSSG 313
>gi|360042733|emb|CCD78143.1| putative gata binding factor [Schistosoma mansoni]
Length = 565
Score = 179 bits (455), Expect = 2e-43, Method: Composition-based stats.
Identities = 79/119 (66%), Positives = 91/119 (76%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
+GRECVNCGA TPLWRRD GHYLCNACGLYHKMNG NRPL+KP +R++A R+LG C
Sbjct: 285 DGRECVNCGATQTPLWRRDEAGHYLCNACGLYHKMNGTNRPLIKPKRRISANRKLGTFCA 344
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN G+ VCNACGLYYKLH++N RPL+M+K+ IQTR RK
Sbjct: 345 NCRTSHTTLWRRNQQGDSVCNACGLYYKLHHIN----------RPLSMKKEVIQTRNRK 393
>gi|256073972|ref|XP_002573301.1| endothelial transcription factor GATA-2 [Schistosoma mansoni]
Length = 565
Score = 179 bits (455), Expect = 2e-43, Method: Composition-based stats.
Identities = 79/119 (66%), Positives = 91/119 (76%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
+GRECVNCGA TPLWRRD GHYLCNACGLYHKMNG NRPL+KP +R++A R+LG C
Sbjct: 285 DGRECVNCGATQTPLWRRDEAGHYLCNACGLYHKMNGTNRPLIKPKRRISANRKLGTFCA 344
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN G+ VCNACGLYYKLH++N RPL+M+K+ IQTR RK
Sbjct: 345 NCRTSHTTLWRRNQQGDSVCNACGLYYKLHHIN----------RPLSMKKEVIQTRNRK 393
>gi|194904144|ref|XP_001981009.1| GG17473 [Drosophila erecta]
gi|190652712|gb|EDV49967.1| GG17473 [Drosophila erecta]
Length = 487
Score = 179 bits (455), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/127 (69%), Positives = 94/127 (74%), Gaps = 16/127 (12%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------A 57
Q EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A
Sbjct: 255 QREEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSA 314
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
+R G C NC T TTLWRRN GEPVCNACGLYYKLHNVNRPL M+ K+
Sbjct: 315 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLTMK----------KE 364
Query: 118 GIQTRKR 124
GIQTR R
Sbjct: 365 GIQTRNR 371
>gi|343488529|ref|NP_999493.2| transcription factor GATA-6 [Sus scrofa]
Length = 592
Score = 179 bits (455), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/122 (68%), Positives = 94/122 (77%), Gaps = 10/122 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCG+I TPLWRRDGTGHYLCN CGLY KMNG++ PL+KP K + ++RRLGL C
Sbjct: 383 ESRECVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNGLSGPLIKPQKPVPSSRRLGLSCA 442
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKRKP
Sbjct: 443 NCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKRKP 492
Query: 127 KK 128
K
Sbjct: 493 KS 494
>gi|215274026|sp|Q95JA5.2|GATA6_PIG RecName: Full=Transcription factor GATA-6; AltName:
Full=GATA-binding factor 6
Length = 451
Score = 179 bits (455), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/125 (67%), Positives = 94/125 (75%), Gaps = 10/125 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCN CGLY KMNG++ PL+KP K + ++RRLGL
Sbjct: 239 DLPESRECVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNGLSGPLIKPQKPVPSSRRLGL 298
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 299 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 348
Query: 124 RKPKK 128
RKPK
Sbjct: 349 RKPKS 353
>gi|195453771|ref|XP_002073935.1| GK12883 [Drosophila willistoni]
gi|194170020|gb|EDW84921.1| GK12883 [Drosophila willistoni]
Length = 491
Score = 179 bits (455), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/127 (69%), Positives = 94/127 (74%), Gaps = 16/127 (12%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------A 57
Q EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A
Sbjct: 260 QREEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSA 319
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
+R G C NC T TTLWRRN GEPVCNACGLYYKLHNVNRPL M+ K+
Sbjct: 320 AKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLTMK----------KE 369
Query: 118 GIQTRKR 124
GIQTR R
Sbjct: 370 GIQTRNR 376
>gi|441637994|ref|XP_004090097.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-5
[Nomascus leucogenys]
Length = 369
Score = 179 bits (455), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 153 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 212
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 213 TNCRTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 262
Query: 126 PK 127
PK
Sbjct: 263 PK 264
>gi|195108873|ref|XP_001999017.1| GI24284 [Drosophila mojavensis]
gi|193915611|gb|EDW14478.1| GI24284 [Drosophila mojavensis]
Length = 378
Score = 179 bits (454), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 93/124 (75%), Gaps = 16/124 (12%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT------ATRR 60
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RLT A +R
Sbjct: 149 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLTLQSLQSAAKR 208
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
G C NC T TTLWRRN GEPVCNACGLYYKLHNVN RPL M+K+GIQ
Sbjct: 209 AGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVN----------RPLTMKKEGIQ 258
Query: 121 TRKR 124
TR R
Sbjct: 259 TRNR 262
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/44 (56%), Positives = 29/44 (65%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
R G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 147 REEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 190
>gi|477758|pir||B48099 transcription factor GATA-2, retinoic acid-inducible - mouse
(fragment)
Length = 118
Score = 179 bits (454), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 87/118 (73%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL A RR G CC
Sbjct: 1 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLPAARRAGTCCA 60
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 61 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 108
>gi|15824330|gb|AAL09304.1|AF295687_1 transcription factor GATA-6 [Sus scrofa]
Length = 446
Score = 179 bits (454), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/125 (67%), Positives = 94/125 (75%), Gaps = 10/125 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCN CGLY KMNG++ PL+KP K + ++RRLGL
Sbjct: 234 DLPESRECVNCGSIQTPLWRRDGTGHYLCNRCGLYSKMNGLSGPLIKPQKPVPSSRRLGL 293
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 294 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 343
Query: 124 RKPKK 128
RKPK
Sbjct: 344 RKPKS 348
>gi|301630601|ref|XP_002944405.1| PREDICTED: GATA-binding factor 1-B-like [Xenopus (Silurana)
tropicalis]
Length = 363
Score = 179 bits (454), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 83/122 (68%), Positives = 92/122 (75%), Gaps = 10/122 (8%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
FQ E RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++P KRL ++R G
Sbjct: 172 FQSTEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAG 231
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C+NC T TTLWRRN G+PVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 232 TQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTR 281
Query: 123 KR 124
R
Sbjct: 282 NR 283
>gi|62205319|gb|AAH93138.1| Gata5 protein, partial [Danio rerio]
Length = 285
Score = 179 bits (454), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 83/100 (83%), Positives = 89/100 (89%), Gaps = 1/100 (1%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT-RRLGLCC 65
EGRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL +T RR GLCC
Sbjct: 182 EGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLQSTSRRAGLCC 241
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
TNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+
Sbjct: 242 TNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMK 281
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 29/44 (65%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR 54
C NC +T LWRR+ G +CNACGLY K++G+ RPL K+
Sbjct: 241 CTNCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKKK 284
>gi|427930996|pdb|4HC7|A Chain A, Crystal Structure Of The Full Dna Binding Domain Of
Gata3-Complex 2
gi|427930999|pdb|4HC7|B Chain B, Crystal Structure Of The Full Dna Binding Domain Of
Gata3-Complex 2
gi|427931003|pdb|4HC9|A Chain A, Dna Binding By Gata Transcription Factor-Complex 3
gi|427931007|pdb|4HCA|A Chain A, Dna Binding By Gata Transcription Factor-Complex 1
Length = 115
Score = 179 bits (454), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 5 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 64
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
C T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR RK
Sbjct: 65 CQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNRK 112
>gi|1132421|gb|AAA86091.1| zg2, partial [Danio rerio]
Length = 194
Score = 179 bits (454), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 85/115 (73%), Positives = 92/115 (80%), Gaps = 10/115 (8%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
G ECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC N
Sbjct: 6 GAECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCAN 65
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 66 CQTTTTTLWRRNGNGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTR 110
>gi|344306282|ref|XP_003421817.1| PREDICTED: transcription factor GATA-5-like [Loxodonta africana]
Length = 409
Score = 179 bits (454), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 92/122 (75%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 196 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGINRPLVQPQKRLSSSRRAGLCC 255
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRR+ +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 256 TNCRTTNTTLWRRSAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 305
Query: 126 PK 127
PK
Sbjct: 306 PK 307
>gi|431896291|gb|ELK05707.1| Transcription factor GATA-6 [Pteropus alecto]
Length = 291
Score = 179 bits (453), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 87/121 (71%), Positives = 97/121 (80%), Gaps = 10/121 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL C
Sbjct: 81 ESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCA 140
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKRKP
Sbjct: 141 NCRTTTTTLWRRNAEGEPVCNACGLYMKLHGV----------PRPLAMKKEGIQTRKRKP 190
Query: 127 K 127
K
Sbjct: 191 K 191
>gi|213623822|gb|AAI70266.1| GATA binding factor-1b [Xenopus laevis]
Length = 364
Score = 179 bits (453), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 83/122 (68%), Positives = 92/122 (75%), Gaps = 10/122 (8%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
FQ E RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++P KRL ++R G
Sbjct: 172 FQSTEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRPLIRPKKRLIISKRAG 231
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C+NC T TTLWRRN G+PVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 232 TQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTR 281
Query: 123 KR 124
R
Sbjct: 282 NR 283
>gi|213626857|gb|AAI70262.1| GATA binding factor-1b [Xenopus laevis]
Length = 364
Score = 179 bits (453), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 83/122 (68%), Positives = 92/122 (75%), Gaps = 10/122 (8%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
FQ E RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++P KRL ++R G
Sbjct: 172 FQSTEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRPLIRPKKRLIISKRAG 231
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C+NC T TTLWRRN G+PVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 232 TQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTR 281
Query: 123 KR 124
R
Sbjct: 282 NR 283
>gi|432094055|gb|ELK25847.1| Transcription factor GATA-5, partial [Myotis davidii]
Length = 226
Score = 179 bits (453), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 95/122 (77%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL++TRR GLCC
Sbjct: 12 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSTRRAGLCC 71
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN DGEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 72 TNCHTTTTTLWRRNADGEPVCNACGLYTKLHGVP----------RPLAMKKESIQTRKRK 121
Query: 126 PK 127
PK
Sbjct: 122 PK 123
>gi|347971495|ref|XP_562743.4| AGAP004228-PA [Anopheles gambiae str. PEST]
gi|333468694|gb|EAL40667.4| AGAP004228-PA [Anopheles gambiae str. PEST]
Length = 603
Score = 179 bits (453), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 87/124 (70%), Positives = 93/124 (75%), Gaps = 15/124 (12%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-----TATRR 60
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +A RR
Sbjct: 333 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLVSSLQSAARR 392
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
G C NC T TTLWRRN GEPVCNACGLYYKLHNVN RPL M+K+GIQ
Sbjct: 393 AGTSCANCKTTTTTLWRRNQGGEPVCNACGLYYKLHNVN----------RPLTMKKEGIQ 442
Query: 121 TRKR 124
TR R
Sbjct: 443 TRNR 446
>gi|327271947|ref|XP_003220748.1| PREDICTED: GATA-binding factor 5-A-like [Anolis carolinensis]
Length = 399
Score = 178 bits (452), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 93/130 (71%), Positives = 104/130 (80%), Gaps = 15/130 (11%)
Query: 3 FQF-----GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTA 57
F+F GE RECVNCGA+STPLWR+DGTGHYLCNACGLYHKMNG+NRPL+KP KRL++
Sbjct: 178 FEFLEEFPGESRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNGINRPLIKPQKRLSS 237
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
+RR GLCCTNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+
Sbjct: 238 SRRAGLCCTNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKE 287
Query: 118 GIQTRKRKPK 127
IQTRKRKPK
Sbjct: 288 SIQTRKRKPK 297
>gi|297259418|ref|XP_001115055.2| PREDICTED: hypothetical protein LOC720048 [Macaca mulatta]
Length = 780
Score = 178 bits (451), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 204 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 263
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 264 TNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 313
Query: 126 PK 127
PK
Sbjct: 314 PK 315
>gi|351714890|gb|EHB17809.1| Transcription factor GATA-5, partial [Heterocephalus glaber]
Length = 223
Score = 178 bits (451), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 93/122 (76%), Positives = 102/122 (83%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 10 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCC 69
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 70 TNCHTTHTTLWRRNTEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 119
Query: 126 PK 127
PK
Sbjct: 120 PK 121
>gi|390462790|ref|XP_003732908.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-5
[Callithrix jacchus]
Length = 338
Score = 178 bits (451), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 92/122 (75%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KRL+++RR GLCC
Sbjct: 125 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRTGLCC 184
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRR+++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 185 TNCHTTNTTLWRRSSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 234
Query: 126 PK 127
PK
Sbjct: 235 PK 236
>gi|148232497|ref|NP_001079109.1| GATA-binding factor 1-A [Xenopus laevis]
gi|120966|sp|P23767.1|GAT1A_XENLA RecName: Full=GATA-binding factor 1-A; AltName: Full=Transcription
factor xGATA-1A
gi|214167|gb|AAA49721.1| GATA binding factor-1 [Xenopus laevis]
Length = 359
Score = 178 bits (451), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 82/120 (68%), Positives = 91/120 (75%), Gaps = 10/120 (8%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
FQ E RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++P KRL ++R G
Sbjct: 170 FQSTEDRECVNCGATVTPLWRRDMSGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAG 229
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C+NC T TTLWRRN G+PVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 230 TQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTR 279
>gi|213623646|gb|AAI70023.1| GATA binding factor-1 [Xenopus laevis]
gi|213625189|gb|AAI70025.1| GATA binding factor-1 [Xenopus laevis]
Length = 359
Score = 177 bits (450), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 82/120 (68%), Positives = 91/120 (75%), Gaps = 10/120 (8%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
FQ E RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++P KRL ++R G
Sbjct: 170 FQSTEDRECVNCGATVTPLWRRDMSGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAG 229
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C+NC T TTLWRRN G+PVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 230 TQCSNCHTSTTTLWRRNASGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTR 279
>gi|47227325|emb|CAF96874.1| unnamed protein product [Tetraodon nigroviridis]
Length = 190
Score = 177 bits (450), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 83/119 (69%), Positives = 91/119 (76%), Gaps = 10/119 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R G C
Sbjct: 1 AEARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTLC 60
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN GEPVCNACGLY+KLHNVN RPL M+K+GIQTR R
Sbjct: 61 ANCHTSTTTLWRRNASGEPVCNACGLYFKLHNVN----------RPLTMKKEGIQTRNR 109
>gi|355754933|gb|EHH58800.1| GATA-binding factor 6 [Macaca fascicularis]
Length = 410
Score = 177 bits (450), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 85/124 (68%), Positives = 95/124 (76%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPL R DGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 198 DLSESRECVNCGSIQTPLGRGDGTGHYLCNACGLYSKMNGISRPLIKPQKRVPSSRRLGL 257
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 258 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGV----------PRPLAMKKEGIQTRK 307
Query: 124 RKPK 127
RKPK
Sbjct: 308 RKPK 311
>gi|307198089|gb|EFN79142.1| GATA-binding factor A [Harpegnathos saltator]
Length = 728
Score = 177 bits (450), Expect = 9e-43, Method: Composition-based stats.
Identities = 86/137 (62%), Positives = 95/137 (69%), Gaps = 15/137 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV---KPSKRLTAT--RRL 61
E +EC NC A TPLWRRDGTGHYLCNACGLY KMNG NRP + KP + +T T RR
Sbjct: 501 EPKECANCAASLTPLWRRDGTGHYLCNACGLYSKMNGHNRPPMRCPKPKQTITPTGVRRT 560
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G+ C NC T TTLWRRNN GEPVCNACGLY+KLHN+N RPL+M+KDGIQT
Sbjct: 561 GVQCANCKTSNTTLWRRNNSGEPVCNACGLYFKLHNMN----------RPLSMKKDGIQT 610
Query: 122 RKRKPKKQGGGGGGGSG 138
RKRKPK G G
Sbjct: 611 RKRKPKNHTNLSGNLPG 627
>gi|148234737|ref|NP_001079244.1| GATA-binding factor 1-B [Xenopus laevis]
gi|120969|sp|P23768.1|GAT1B_XENLA RecName: Full=GATA-binding factor 1-B; AltName: Full=Transcription
factor xGATA-1B
gi|214169|gb|AAA49722.1| GATA binding factor-1 [Xenopus laevis]
Length = 364
Score = 177 bits (450), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 83/122 (68%), Positives = 92/122 (75%), Gaps = 10/122 (8%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
FQ E RECVNCGA TPLWRRD +GHYLCNACGLYHKMNG NRPL++P KRL ++R G
Sbjct: 172 FQSTEDRECVNCGATVTPLWRRDLSGHYLCNACGLYHKMNGQNRPLIRPKKRLIISKRAG 231
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C+NC T TTLWRRN G+PVCNACGLYYKLHNVN RPL M+K+GIQTR
Sbjct: 232 TQCSNCHTSTTTLWRRNAGGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTR 281
Query: 123 KR 124
R
Sbjct: 282 NR 283
>gi|112984186|ref|NP_001037446.1| transcription factor BCFI [Bombyx mori]
gi|483331|gb|AAA65734.1| transcription factor BCFI [Bombyx mori]
Length = 509
Score = 177 bits (450), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 88/123 (71%), Positives = 95/123 (77%), Gaps = 11/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAI TPLW RDGTGHYLCNACGLY+KMNGMNRPL +P +RL A +R G
Sbjct: 263 FTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPLKQP-RRLMAAKRPGTM 321
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T+LWRRN GE VCNACGLY+KLHNVN RPL M+KD IQTRKR
Sbjct: 322 CTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVN----------RPLTMKKDSIQTRKR 371
Query: 125 KPK 127
KPK
Sbjct: 372 KPK 374
>gi|1730200|sp|P52167.1|GATB_BOMMO RecName: Full=Transcription factor BCFI; AltName: Full=BmGATA-beta;
Short=GATA-beta
Length = 508
Score = 177 bits (450), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 88/123 (71%), Positives = 95/123 (77%), Gaps = 11/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAI TPLW RDGTGHYLCNACGLY+KMNGMNRPL +P +RL A +R G
Sbjct: 263 FTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPLKQP-RRLMAAKRPGTM 321
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T+LWRRN GE VCNACGLY+KLHNVN RPL M+KD IQTRKR
Sbjct: 322 CTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVN----------RPLTMKKDSIQTRKR 371
Query: 125 KPK 127
KPK
Sbjct: 372 KPK 374
>gi|603164|gb|AAA67886.1| BmGATA beta isoform 1, partial [Bombyx mori]
Length = 313
Score = 177 bits (449), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 88/123 (71%), Positives = 95/123 (77%), Gaps = 11/123 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
F EGRECVNCGAI TPLW RDGTGHYLCNACGLY+KMNGMNRPL +P +RL A +R G
Sbjct: 68 FTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPLKQP-RRLMAAKRPGTM 126
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T+LWRRN GE VCNACGLY+KLHNVN RPL M+KD IQTRKR
Sbjct: 127 CTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVN----------RPLTMKKDSIQTRKR 176
Query: 125 KPK 127
KPK
Sbjct: 177 KPK 179
>gi|237512877|dbj|BAH58791.1| GATA-binding protein 5 [Polypterus senegalus]
Length = 395
Score = 177 bits (448), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/123 (76%), Positives = 102/123 (82%), Gaps = 11/123 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATRRLGLC 64
GEGRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL T++RR GLC
Sbjct: 184 GEGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRLQTSSRRAGLC 243
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKR
Sbjct: 244 CTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKR 293
Query: 125 KPK 127
KPK
Sbjct: 294 KPK 296
>gi|57547700|gb|AAW52540.1| GATA transcription factor pannier-like protein [Schistocerca
americana]
Length = 88
Score = 177 bits (448), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 85/88 (96%), Positives = 86/88 (97%)
Query: 16 AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTL 75
AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTR TTL
Sbjct: 1 AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRTTTL 60
Query: 76 WRRNNDGEPVCNACGLYYKLHNVNRPLA 103
WRRNNDGEPVCNACGLY+KLH VNRPLA
Sbjct: 61 WRRNNDGEPVCNACGLYFKLHGVNRPLA 88
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/39 (58%), Positives = 29/39 (74%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 50 CTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPLA 88
>gi|45382623|ref|NP_990795.1| erythroid transcription factor [Gallus gallus]
gi|120955|sp|P17678.1|GATA1_CHICK RecName: Full=Erythroid transcription factor; AltName: Full=Eryf1;
AltName: Full=GATA-binding factor 1; Short=GATA-1;
AltName: Full=NF-E1 DNA-binding protein; Short=NF-E1a
gi|212629|gb|AAA49055.1| Eryf1 protein [Gallus gallus]
Length = 304
Score = 177 bits (448), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 83/124 (66%), Positives = 94/124 (75%), Gaps = 10/124 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRDGTGHYLCNACGLYH++NG NRPL++P KRL ++R G C+
Sbjct: 106 EARECVNCGATATPLWRRDGTGHYLCNACGLYHRLNGQNRPLIRPKKRLLVSKRAGTVCS 165
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRR+ G+PVCNACGLYYKLH VN RPL MRKDGIQTR RK
Sbjct: 166 NCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVN----------RPLTMRKDGIQTRNRKV 215
Query: 127 KKQG 130
+G
Sbjct: 216 SSKG 219
>gi|355564543|gb|EHH21043.1| hypothetical protein EGK_04020 [Macaca mulatta]
Length = 480
Score = 176 bits (447), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 86/118 (72%), Positives = 94/118 (79%), Gaps = 10/118 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +R +A RR G CC
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRRSAARRAGTCCA 350
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 351 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 398
>gi|307186220|gb|EFN71904.1| Transcription factor GATA-4 [Camponotus floridanus]
Length = 713
Score = 176 bits (446), Expect = 3e-42, Method: Composition-based stats.
Identities = 84/137 (61%), Positives = 96/137 (70%), Gaps = 14/137 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV---KPSKRL-TATRRLG 62
E +EC NC + TPLWRRDG GHYLCNACGLY KMNG+NRP + KP + + T RR G
Sbjct: 486 ETKECANCASQMTPLWRRDGAGHYLCNACGLYGKMNGVNRPPMRCPKPKQTVPTGVRRTG 545
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NC T TTLWRRNN+GEPVCNACGLY+KLHNVN RPL+M+K+GIQTR
Sbjct: 546 VQCANCRTNNTTLWRRNNNGEPVCNACGLYFKLHNVN----------RPLSMKKEGIQTR 595
Query: 123 KRKPKKQGGGGGGGSGG 139
KRKPK G G G
Sbjct: 596 KRKPKNHSGSITGNLPG 612
>gi|238863846|gb|ACR66217.1| transcription factor GATA456b, partial [Branchiostoma floridae]
Length = 380
Score = 176 bits (446), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 77/92 (83%), Positives = 84/92 (91%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL C
Sbjct: 289 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLIKPQRRLSASRRVGLQC 348
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHN 97
NC T TTLWRRNN+GEPVCNACGLYYKLHN
Sbjct: 349 ANCRTTQTTLWRRNNEGEPVCNACGLYYKLHN 380
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 37/58 (63%), Gaps = 6/58 (10%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL--AMRHLAMRPLAMRKDGIQ 120
C NCG T LWRR+ G +CNACGLY+K++ VNRPL R L+ A R+ G+Q
Sbjct: 294 CVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLIKPQRRLS----ASRRVGLQ 347
>gi|403282679|ref|XP_003932769.1| PREDICTED: transcription factor GATA-5 [Saimiri boliviensis
boliviensis]
Length = 288
Score = 176 bits (446), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 91/122 (74%), Positives = 103/122 (84%), Gaps = 10/122 (8%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL++P KRL+++RR GLCC
Sbjct: 75 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLLRPQKRLSSSRRAGLCC 134
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRR+++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 135 TNCHTTTTTLWRRSSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 184
Query: 126 PK 127
PK
Sbjct: 185 PK 186
>gi|328721860|ref|XP_001943307.2| PREDICTED: hypothetical protein LOC100166174 isoform 1
[Acyrthosiphon pisum]
gi|328721862|ref|XP_003247419.1| PREDICTED: hypothetical protein LOC100166174 isoform 2
[Acyrthosiphon pisum]
Length = 637
Score = 176 bits (446), Expect = 3e-42, Method: Composition-based stats.
Identities = 81/129 (62%), Positives = 96/129 (74%), Gaps = 14/129 (10%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR----LTATRRLGLC 64
+ECVNC A TPLWRRDGTGH+LCNACGLY+++NG+NRP V+ +++ T +R G+
Sbjct: 415 KECVNCAANVTPLWRRDGTGHHLCNACGLYNRINGVNRPPVRSAQKKTTQQTGNKRSGVA 474
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRNN+GEPVCNACGLY+KLHNVN RPL M+KDGIQTRKR
Sbjct: 475 CANCSTNTTTLWRRNNNGEPVCNACGLYFKLHNVN----------RPLTMKKDGIQTRKR 524
Query: 125 KPKKQGGGG 133
KPK GG
Sbjct: 525 KPKNPSIGG 533
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 31/53 (58%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC +T LWRR+ G +CNACGLY K++ +NRPL + +R
Sbjct: 472 GVACANCSTNTTTLWRRNNNGEPVCNACGLYFKLHNVNRPLTMKKDGIQTRKR 524
>gi|47551055|ref|NP_999704.1| GATA transcription factor [Strongylocentrotus purpuratus]
gi|3702856|gb|AAC62960.1| GATA transcription factor [Strongylocentrotus purpuratus]
Length = 431
Score = 176 bits (445), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 83/118 (70%), Positives = 91/118 (77%), Gaps = 11/118 (9%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDG GHYLCNACGLYHKMNG NRPL+KP +RL+A +R G C
Sbjct: 217 EGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNGQNRPLIKPKRRLSA-KRTGTSCA 275
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC TTLWRRN +G+PVCNACGLYYKLH VN RPL M+K+GIQTR R
Sbjct: 276 NCQATATTLWRRNPNGDPVCNACGLYYKLHGVN----------RPLTMKKEGIQTRNR 323
>gi|170060991|ref|XP_001866045.1| GATA-binding factor-C [Culex quinquefasciatus]
gi|167879282|gb|EDS42665.1| GATA-binding factor-C [Culex quinquefasciatus]
Length = 474
Score = 176 bits (445), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 83/124 (66%), Positives = 92/124 (74%), Gaps = 15/124 (12%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR-----LTATRRL 61
EGRECVNCGA STPLWRRD TGHYLCNACGLY+KMNG NRPL+KP ++ +A RR
Sbjct: 252 EGRECVNCGATSTPLWRRDATGHYLCNACGLYYKMNGQNRPLIKPKRKPVTSQQSAARRA 311
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G C NC T T+LWRRN GEPVCNACGLYYKLHNV+ RPL M+K+GIQT
Sbjct: 312 GTSCANCKTTTTSLWRRNQSGEPVCNACGLYYKLHNVD----------RPLTMKKEGIQT 361
Query: 122 RKRK 125
R RK
Sbjct: 362 RNRK 365
>gi|301753709|ref|XP_002912716.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Ailuropoda melanoleuca]
Length = 478
Score = 175 bits (444), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 84/121 (69%), Positives = 94/121 (77%), Gaps = 11/121 (9%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCG+I TPLWR G GHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL C
Sbjct: 270 ESRECVNCGSIQTPLWRXGG-GHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCA 328
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKRKP
Sbjct: 329 NCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRKRKP 378
Query: 127 K 127
K
Sbjct: 379 K 379
>gi|161511589|gb|ABX71821.1| GATA1/2/3 [Paracentrotus lividus]
gi|270154815|gb|ACZ62636.1| GATA1/2/3 [Paracentrotus lividus]
Length = 430
Score = 175 bits (444), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 83/118 (70%), Positives = 91/118 (77%), Gaps = 11/118 (9%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDG GHYLCNACGLYHKMNG NRPL+KP +RL+A +R G C
Sbjct: 216 EGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNGQNRPLIKPKRRLSA-KRTGTSCA 274
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC TTLWRRN +G+PVCNACGLYYKLH VN RPL M+K+GIQTR R
Sbjct: 275 NCQATATTLWRRNPNGDPVCNACGLYYKLHGVN----------RPLTMKKEGIQTRNR 322
>gi|1289502|gb|AAC52841.1| transcription factor GATA-6, partial [Mus musculus]
Length = 205
Score = 174 bits (442), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 87/132 (65%), Positives = 97/132 (73%), Gaps = 13/132 (9%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGT 70
CVNCG+I TPLWRRDGTGHYLCNACGLY KMN ++RPL+KP KR+ ++RRLGL C NC T
Sbjct: 1 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNALSRPLIKPQKRVPSSRRLGLSCANCHT 60
Query: 71 RMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK--- 127
TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKRKPK
Sbjct: 61 TTTTLWRRNAEGEPVCNACGLYMKLHGV----------PRPLAMKKEGIQTRKRKPKNIN 110
Query: 128 KQGGGGGGGSGG 139
K G SG
Sbjct: 111 KSKACSGNSSGS 122
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 52 GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 93
>gi|157115508|ref|XP_001658239.1| gata-binding factor-c (transcription factor gata-c) [Aedes aegypti]
gi|108876905|gb|EAT41130.1| AAEL007205-PA [Aedes aegypti]
Length = 509
Score = 174 bits (442), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 83/124 (66%), Positives = 92/124 (74%), Gaps = 15/124 (12%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR-----LTATRRL 61
EGRECVNCGA STPLWRRD TGHYLCNACGLY+KMNG NRPL+KP ++ +A RR
Sbjct: 242 EGRECVNCGATSTPLWRRDATGHYLCNACGLYYKMNGQNRPLIKPKRKPVTSQQSAARRA 301
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G C NC T T+LWRRN GEPVCNACGLYYKLHNV+ RPL M+K+GIQT
Sbjct: 302 GTSCANCKTTTTSLWRRNQSGEPVCNACGLYYKLHNVD----------RPLTMKKEGIQT 351
Query: 122 RKRK 125
R RK
Sbjct: 352 RNRK 355
>gi|74101554|gb|AAZ99709.1| GATA-6, partial [Bos taurus]
Length = 179
Score = 174 bits (442), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 83/116 (71%), Positives = 93/116 (80%), Gaps = 10/116 (8%)
Query: 12 VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTR 71
VNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL C NC T
Sbjct: 1 VNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCANCHTT 60
Query: 72 MTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRKRKPK
Sbjct: 61 TTTLWRRNAEGEPVCNACGLYMKLHGV----------PRPLAMKKEGIQTRKRKPK 106
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 51 GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 92
>gi|321467985|gb|EFX78972.1| hypothetical protein DAPPUDRAFT_28576 [Daphnia pulex]
Length = 115
Score = 174 bits (441), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 99/124 (79%), Positives = 107/124 (86%), Gaps = 11/124 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN-GMNRPLVKPSKRLTATRRLGL 63
+ EGRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKMN G+NRPL+KP +RL+ATRRLGL
Sbjct: 1 YLEGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNNGINRPLLKPPRRLSATRRLGL 60
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CCTNCGT TTLWRRN +GEPVCNACGLY+KLH VN RPLAMRKDGIQTRK
Sbjct: 61 CCTNCGTTTTTLWRRNAEGEPVCNACGLYHKLHGVN----------RPLAMRKDGIQTRK 110
Query: 124 RKPK 127
RKPK
Sbjct: 111 RKPK 114
>gi|73949078|ref|XP_858071.1| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
isoform 2 [Canis lupus familiaris]
Length = 439
Score = 174 bits (441), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 83/118 (70%), Positives = 90/118 (76%), Gaps = 15/118 (12%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL T C
Sbjct: 260 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLVRT-----SCA 314
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 315 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 362
>gi|351706522|gb|EHB09441.1| Histone deacetylase 6 [Heterocephalus glaber]
Length = 1489
Score = 174 bits (441), Expect = 1e-41, Method: Composition-based stats.
Identities = 89/133 (66%), Positives = 96/133 (72%), Gaps = 10/133 (7%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 174 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 233
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 234 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 283
Query: 127 KKQGGGGGGGSGG 139
+G G S G
Sbjct: 284 SGKGKKKRGSSLG 296
>gi|349804083|gb|AEQ17514.1| putative gata binding protein 6 [Hymenochirus curtipes]
Length = 164
Score = 173 bits (438), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 85/124 (68%), Positives = 98/124 (79%), Gaps = 10/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ E RECVNCG++ TPLWRRDGTGHYLCNACGLY KMNG +RPL+KP KR+ ++RR+GL
Sbjct: 5 ELPESRECVNCGSVQTPLWRRDGTGHYLCNACGLYSKMNGPSRPLIKPQKRVPSSRRIGL 64
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CTNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+++GIQTRK
Sbjct: 65 ACTNCHTTTTTLWRRNTEGEPVCNACGLYMKLHGV----------PRPLAMKREGIQTRK 114
Query: 124 RKPK 127
RKPK
Sbjct: 115 RKPK 118
>gi|126342847|ref|XP_001372037.1| PREDICTED: erythroid transcription factor-like [Monodelphis
domestica]
Length = 401
Score = 172 bits (437), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 88/119 (73%), Positives = 93/119 (78%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 203 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 262
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN GEPVCNACGLYYKLHNVN RPL MRKDGIQTR RK
Sbjct: 263 NCQTTTTTLWRRNASGEPVCNACGLYYKLHNVN----------RPLTMRKDGIQTRNRK 311
>gi|355784333|gb|EHH65184.1| GATA-binding factor 5, partial [Macaca fascicularis]
Length = 283
Score = 172 bits (436), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 92/122 (75%), Positives = 102/122 (83%), Gaps = 11/122 (9%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KR +++RR GLCC
Sbjct: 71 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKR-SSSRRAGLCC 129
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 130 TNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 179
Query: 126 PK 127
PK
Sbjct: 180 PK 181
>gi|260782140|ref|XP_002586149.1| hypothetical protein BRAFLDRAFT_255109 [Branchiostoma floridae]
gi|229271242|gb|EEN42160.1| hypothetical protein BRAFLDRAFT_255109 [Branchiostoma floridae]
Length = 107
Score = 171 bits (434), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 79/97 (81%), Positives = 87/97 (89%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL C
Sbjct: 1 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLIKPQRRLSASRRVGLQC 60
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
NC T TTLWRRNN+GEPVCNACGLYYKLHNV+ L
Sbjct: 61 ANCRTTQTTLWRRNNEGEPVCNACGLYYKLHNVSTTL 97
>gi|45382243|ref|NP_990752.1| transcription factor GATA-5 [Gallus gallus]
gi|1169848|sp|P43692.1|GATA5_CHICK RecName: Full=Transcription factor GATA-5; AltName:
Full=GATA-binding factor 5
gi|511482|gb|AAA57504.1| GATA-5 transcription factor [Gallus gallus]
Length = 391
Score = 171 bits (434), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 91/122 (74%), Positives = 101/122 (82%), Gaps = 11/122 (9%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
G+GRECVNCGA+STPLWR+DGTGHYLCNACGLYHKMNG+NRPL KP KRL+++RR GLCC
Sbjct: 181 GDGRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNGINRPL-KPQKRLSSSRRAGLCC 239
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 240 TNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 289
Query: 126 PK 127
PK
Sbjct: 290 PK 291
>gi|322782909|gb|EFZ10627.1| hypothetical protein SINV_01972 [Solenopsis invicta]
Length = 726
Score = 171 bits (434), Expect = 6e-41, Method: Composition-based stats.
Identities = 84/138 (60%), Positives = 98/138 (71%), Gaps = 16/138 (11%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN-GMNRPLVK-PSKRLTAT----RR 60
E +ECVNC A TPLWRRDG+GHYLCNACGLY+KMN G+NRP ++ P + T RR
Sbjct: 499 EPKECVNCAASMTPLWRRDGSGHYLCNACGLYNKMNNGLNRPPMRCPKPKQTVAPANLRR 558
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
G+ C NC T TTLWRRNN+GEPVCNACGLY+KLHNVN RPL+M+K+GIQ
Sbjct: 559 TGVQCANCRTSNTTLWRRNNNGEPVCNACGLYFKLHNVN----------RPLSMKKEGIQ 608
Query: 121 TRKRKPKKQGGGGGGGSG 138
TRKRKPK G +G
Sbjct: 609 TRKRKPKNHSNITGSLAG 626
>gi|357609511|gb|EHJ66491.1| GATA transcription factor GATAc [Danaus plexippus]
Length = 633
Score = 171 bits (434), Expect = 6e-41, Method: Composition-based stats.
Identities = 90/176 (51%), Positives = 104/176 (59%), Gaps = 45/176 (25%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV-----KPSKRL--TAT 58
GE +ECVNC A +TPLWRRDGTGHYLCNACGLY ++NG+NRP + KP + L
Sbjct: 447 GEVKECVNCAAATTPLWRRDGTGHYLCNACGLYTRINGVNRPPLKGQKTKPQQALPTNGN 506
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHN--------------------- 97
RR+G+ C NC T TTLWRRNN+GEPVCNACGLYYKLHN
Sbjct: 507 RRVGVTCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNIVDINTTRGRASITVVVVESL 566
Query: 98 VNRPLAMRHL-----------------AMRPLAMRKDGIQTRKRKPKKQGGGGGGG 136
V+ R+L RPL+M+KDGIQTRKRKPK G GG G
Sbjct: 567 VDDAEEARYLIGAVQQLSGDSSANIGPVNRPLSMKKDGIQTRKRKPKSMGSGGASG 622
>gi|326931899|ref|XP_003212061.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-5-like
[Meleagris gallopavo]
Length = 386
Score = 171 bits (434), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 91/122 (74%), Positives = 101/122 (82%), Gaps = 11/122 (9%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
G+GRECVNCGA+STPLWR+DGTGHYLCNACGLYHKMNG+NRPL KP KRL+++RR GLCC
Sbjct: 176 GDGRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNGINRPL-KPQKRLSSSRRAGLCC 234
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 235 TNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 284
Query: 126 PK 127
PK
Sbjct: 285 PK 286
>gi|405960917|gb|EKC26787.1| Trans-acting T-cell-specific transcription factor GATA-3
[Crassostrea gigas]
Length = 518
Score = 171 bits (433), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 75/92 (81%), Positives = 80/92 (86%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 209 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSC 268
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHN 97
NCGT TTLWRRN +G+PVCNACGLYYKLHN
Sbjct: 269 ANCGTNTTTLWRRNGNGDPVCNACGLYYKLHN 300
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 68/133 (51%), Gaps = 24/133 (18%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKM----NGMNRPLVKPSKRLTATRRLGL 63
G C NCG +T LWRR+G G +CNACGLY+K+ +G+N + + ++ +
Sbjct: 265 GTSCANCGTNTTTLWRRNGNGDPVCNACGLYYKLHNDRSGVNLSRSRGALDDSSHHKWPF 324
Query: 64 CCT--NCGTR-MTTLWRRNNDGEP-------VCNAC-GLYYKLHNVNRPLAMRHLAMRPL 112
T N G R +T+L + + +P + C G+Y A+ L RPL
Sbjct: 325 LTTGRNMGARALTSLNNKPRNSQPRNSYRNILPRECTGVY---------AAISDLVNRPL 375
Query: 113 AMRKDGIQTRKRK 125
M+KDGIQTR RK
Sbjct: 376 TMKKDGIQTRNRK 388
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 41/74 (55%), Gaps = 5/74 (6%)
Query: 29 HYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNA 88
H A ++ + ++RP K R +A G C NCG T LWRR+ G +CNA
Sbjct: 183 HSFHPAASVFKATSALSRPRTK--ARSSAE---GRECVNCGATSTPLWRRDGTGHYLCNA 237
Query: 89 CGLYYKLHNVNRPL 102
CGLY+K++ NRPL
Sbjct: 238 CGLYHKMNGQNRPL 251
>gi|270008082|gb|EFA04530.1| grain [Tribolium castaneum]
Length = 537
Score = 171 bits (433), Expect = 9e-41, Method: Composition-based stats.
Identities = 83/122 (68%), Positives = 95/122 (77%), Gaps = 13/122 (10%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT---RRLGLCC 65
+ECVNCGA TPLWRRDGTGHYLCNACGLY+K+NG+NRP V+P K+ A RR G+ C
Sbjct: 285 KECVNCGASVTPLWRRDGTGHYLCNACGLYNKINGVNRPPVRPPKKPQAQPGPRRNGVQC 344
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRNN GEPVCNACGLY+KLHNVN RPL+M+K+GIQTRKR+
Sbjct: 345 ANCKTGNTTLWRRNNQGEPVCNACGLYFKLHNVN----------RPLSMKKEGIQTRKRR 394
Query: 126 PK 127
PK
Sbjct: 395 PK 396
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 33/53 (62%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G +C NC +T LWRR+ G +CNACGLY K++ +NRPL + + +R
Sbjct: 341 GVQCANCKTGNTTLWRRNNQGEPVCNACGLYFKLHNVNRPLSMKKEGIQTRKR 393
>gi|42566590|gb|AAS21059.1| GATA factor, partial [Nematostella vectensis]
Length = 208
Score = 171 bits (432), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 81/116 (69%), Positives = 91/116 (78%), Gaps = 10/116 (8%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNC 68
RECVNCGA STPLWRRDG+GHYLCNACGLYHKMNG +RPL+KP +RL+A RR G C NC
Sbjct: 1 RECVNCGATSTPLWRRDGSGHYLCNACGLYHKMNGSSRPLIKPKRRLSAARRAGTSCANC 60
Query: 69 GTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
T TTLWRRN +G+PVCNACGLY+ LH VN RPL+M+KDGIQTR R
Sbjct: 61 HTTQTTLWRRNQNGDPVCNACGLYWMLHAVN----------RPLSMKKDGIQTRNR 106
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 26/41 (63%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC T LWRR+ G +CNACGLY ++ +NRPL
Sbjct: 54 GTSCANCHTTQTTLWRRNQNGDPVCNACGLYWMLHAVNRPL 94
>gi|355562973|gb|EHH19535.1| GATA-binding factor 5, partial [Macaca mulatta]
Length = 237
Score = 171 bits (432), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/122 (75%), Positives = 102/122 (83%), Gaps = 11/122 (9%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
GEGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLV+P KR +++RR GLCC
Sbjct: 25 GEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKR-SSSRRAGLCC 83
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 84 TNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 133
Query: 126 PK 127
PK
Sbjct: 134 PK 135
>gi|449486444|ref|XP_002194802.2| PREDICTED: transcription factor GATA-5 [Taeniopygia guttata]
Length = 260
Score = 171 bits (432), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/122 (74%), Positives = 101/122 (82%), Gaps = 11/122 (9%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
G+GRECVNCGA+STPLWR+DGTGHYLCNACGLYHKMNG+NRPL KP KRL+++RR GLCC
Sbjct: 48 GDGRECVNCGAMSTPLWRKDGTGHYLCNACGLYHKMNGINRPL-KPQKRLSSSRRAGLCC 106
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
TNC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+ IQTRKRK
Sbjct: 107 TNCHTTNTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKESIQTRKRK 156
Query: 126 PK 127
PK
Sbjct: 157 PK 158
>gi|196011732|ref|XP_002115729.1| hypothetical protein TRIADDRAFT_59720 [Trichoplax adhaerens]
gi|190581505|gb|EDV21581.1| hypothetical protein TRIADDRAFT_59720 [Trichoplax adhaerens]
Length = 468
Score = 170 bits (431), Expect = 2e-40, Method: Composition-based stats.
Identities = 75/114 (65%), Positives = 90/114 (78%), Gaps = 10/114 (8%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNC 68
R C NCG ISTPLWRRDG+G YLCNACGLYHK+NG+NRPLVKP+KRL++TRR+ L C NC
Sbjct: 290 RACANCGVISTPLWRRDGSGQYLCNACGLYHKINGVNRPLVKPNKRLSSTRRMDLICANC 349
Query: 69 GTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
T TTLWRRN G+PVCNACGLY+KLH +N RP++++K+ IQTR
Sbjct: 350 NTDRTTLWRRNKYGQPVCNACGLYFKLHEIN----------RPMSLKKEWIQTR 393
>gi|358335481|dbj|GAA54062.1| endothelial transcription factor GATA-2 [Clonorchis sinensis]
Length = 412
Score = 169 bits (429), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 78/121 (64%), Positives = 92/121 (76%), Gaps = 10/121 (8%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
+GRECVNCGA TPLWRRD TGHYLCNACGLYHKMNG +RPL+KP +R++ATR+ G
Sbjct: 62 LTDGRECVNCGATQTPLWRRDETGHYLCNACGLYHKMNGTSRPLIKPKRRMSATRKSGTI 121
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN G+ VCNACGLYYKLH++N RPL+M+K+ IQTR R
Sbjct: 122 CANCRTAHTTLWRRNQHGDSVCNACGLYYKLHHIN----------RPLSMKKEIIQTRNR 171
Query: 125 K 125
+
Sbjct: 172 R 172
>gi|344269962|ref|XP_003406816.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Loxodonta africana]
Length = 390
Score = 169 bits (429), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 84/124 (67%), Positives = 92/124 (74%), Gaps = 11/124 (8%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACG Y KMNG++ P +P KR A+RRLGL
Sbjct: 178 DLPESRECVNCGSIQTPLWRRDGTGHYLCNACGXYSKMNGLSGP-DQPPKRAPASRRLGL 236
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 237 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAMKKEGIQTRK 286
Query: 124 RKPK 127
RKPK
Sbjct: 287 RKPK 290
>gi|443687408|gb|ELT90399.1| hypothetical protein CAPTEDRAFT_175842 [Capitella teleta]
Length = 341
Score = 169 bits (428), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 84/127 (66%), Positives = 92/127 (72%), Gaps = 23/127 (18%)
Query: 17 ISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTA-------------TRRLGL 63
+STPLWRRDGTGHYLCNACGLYHKMNG+NRPLVKP +R T +RRLGL
Sbjct: 1 MSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVKPQRRSTGGGSSTTSPVPNLGSRRLGL 60
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRNN+GEPVCNACGLYYKLH V RP++M+KDGIQTRK
Sbjct: 61 QCANCSTTTTTLWRRNNEGEPVCNACGLYYKLHQV----------ARPISMKKDGIQTRK 110
Query: 124 RKPKKQG 130
RKPK G
Sbjct: 111 RKPKGSG 117
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 29/41 (70%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G +C NC +T LWRR+ G +CNACGLY+K++ + RP+
Sbjct: 59 GLQCANCSTTTTTLWRRNNEGEPVCNACGLYYKLHQVARPI 99
>gi|47208918|emb|CAF91187.1| unnamed protein product [Tetraodon nigroviridis]
Length = 334
Score = 169 bits (427), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 74/95 (77%), Positives = 82/95 (86%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 239 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSC 298
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNR 100
NC T TTLWRRN +G+PVCNACGLY+KLHNV++
Sbjct: 299 ANCQTTTTTLWRRNANGDPVCNACGLYFKLHNVSK 333
Score = 52.0 bits (123), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 244 CVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPL 281
>gi|444724388|gb|ELW64993.1| Trans-acting T-cell-specific transcription factor GATA-3 [Tupaia
chinensis]
Length = 363
Score = 169 bits (427), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 75/93 (80%), Positives = 81/93 (87%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 259 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 318
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVN 99
NC T TTLWRRN +G+PVCNACGLYYKLHNV+
Sbjct: 319 NCQTTTTTLWRRNANGDPVCNACGLYYKLHNVS 351
Score = 52.0 bits (123), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 263 CVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPL 300
>gi|1078989|pir||S53811 BmGATA beta isoform 2 - silkworm (fragment)
gi|603165|gb|AAA67885.1| BmGATA beta isoform 2, partial [Bombyx mori]
Length = 327
Score = 169 bits (427), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 86/136 (63%), Positives = 94/136 (69%), Gaps = 23/136 (16%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL--------- 55
F EGRECVNCGAI TPLW RDGTGHYLCNACGLY+KMNGMNRPL +P + +
Sbjct: 68 FTEGRECVNCGAIHTPLWHRDGTGHYLCNACGLYNKMNGMNRPLKQPRRLVRQRHAALAA 127
Query: 56 ----TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
A +R G CTNC T T+LWRRN GE VCNACGLY+KLHNVN RP
Sbjct: 128 PPHDMAAKRPGTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVN----------RP 177
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQTRKRKPK
Sbjct: 178 LTMKKDSIQTRKRKPK 193
>gi|198415760|ref|XP_002122567.1| PREDICTED: zinc finger protein [Ciona intestinalis]
gi|93003144|tpd|FAA00155.1| TPA: zinc finger protein [Ciona intestinalis]
Length = 518
Score = 168 bits (426), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 86/119 (72%), Positives = 94/119 (78%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP KRL+A RR G C+
Sbjct: 292 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKKRLSAARRAGTSCS 351
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN G+PVCNACGLY+KLH VN RPL M+K+GIQTR RK
Sbjct: 352 NCSTTTTTLWRRNASGDPVCNACGLYFKLHGVN----------RPLTMKKEGIQTRNRK 400
>gi|70569766|dbj|BAE06473.1| transcription factor protein [Ciona intestinalis]
Length = 485
Score = 168 bits (426), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 86/119 (72%), Positives = 94/119 (78%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP KRL+A RR G C+
Sbjct: 259 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKKRLSAARRAGTSCS 318
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN G+PVCNACGLY+KLH VN RPL M+K+GIQTR RK
Sbjct: 319 NCSTTTTTLWRRNASGDPVCNACGLYFKLHGVN----------RPLTMKKEGIQTRNRK 367
>gi|70569760|dbj|BAE06472.1| transcription factor protein [Ciona intestinalis]
Length = 553
Score = 167 bits (424), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 86/119 (72%), Positives = 94/119 (78%), Gaps = 10/119 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP KRL+A RR G C+
Sbjct: 327 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKKRLSAARRAGTSCS 386
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN G+PVCNACGLY+KLH VN RPL M+K+GIQTR RK
Sbjct: 387 NCSTTTTTLWRRNASGDPVCNACGLYFKLHGVN----------RPLTMKKEGIQTRNRK 435
>gi|74007379|ref|XP_538026.2| PREDICTED: erythroid transcription factor [Canis lupus familiaris]
Length = 412
Score = 167 bits (422), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 92/137 (67%), Positives = 100/137 (72%), Gaps = 14/137 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 199 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 258
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VN RPL MRKDGIQTR RK
Sbjct: 259 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVN----------RPLTMRKDGIQTRNRKA 308
Query: 127 ----KKQGGGGGGGSGG 139
KK+ G GG+GG
Sbjct: 309 SGKGKKKRGSSLGGTGG 325
>gi|268551963|ref|XP_002633964.1| C. briggsae CBR-ELT-1 protein [Caenorhabditis briggsae]
Length = 483
Score = 167 bits (422), Expect = 2e-39, Method: Composition-based stats.
Identities = 79/125 (63%), Positives = 89/125 (71%), Gaps = 11/125 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR-LTATRRLGLCC 65
E RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVKP KR A +R G+ C
Sbjct: 280 EDRECVNCGVHATPLWRRDGSGNYLCNACGLYFKMNHQARPLVKPKKRQQNAQKRTGIEC 339
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G PVCNACGLYYKLH V RP+AM+K+GIQTR RK
Sbjct: 340 VNCHTNNTTLWRRNGEGHPVCNACGLYYKLHKVE----------RPMAMKKEGIQTRNRK 389
Query: 126 PKKQG 130
+G
Sbjct: 390 LSSKG 394
>gi|74192130|dbj|BAE34273.1| unnamed protein product [Mus musculus]
Length = 365
Score = 167 bits (422), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 76/104 (73%), Positives = 82/104 (78%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C N
Sbjct: 259 GRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCAN 318
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
C T TTLWRRN +G+PVCNACGLYYKLHN RH +P
Sbjct: 319 CQTTTTTLWRRNANGDPVCNACGLYYKLHNTPDYEERRHPDPKP 362
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 30/50 (60%)
Query: 53 KRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
R A G C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 250 SRPKARSSTGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPL 299
>gi|157125526|ref|XP_001654370.1| transcription factor GATA-4 (GATA binding factor-4) [Aedes aegypti]
gi|108873605|gb|EAT37830.1| AAEL010224-PA [Aedes aegypti]
Length = 719
Score = 166 bits (421), Expect = 2e-39, Method: Composition-based stats.
Identities = 77/129 (59%), Positives = 86/129 (66%), Gaps = 18/129 (13%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN-GMNRPLVK-------PSKRLTAT 58
E RECVNCG+ TPLWRRD GH LCNAC LY++ N G NRP + P +
Sbjct: 473 EPRECVNCGSSDTPLWRRDNVGHTLCNACALYNRQNPGTNRPPNRSHKAKNPPKTPVPGN 532
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
RR G+ C NC T TTLWRRNN GEPVCNACGLY+KLHNV+ RPL M+KDG
Sbjct: 533 RRTGVVCANCSTTTTTLWRRNNRGEPVCNACGLYHKLHNVD----------RPLTMKKDG 582
Query: 119 IQTRKRKPK 127
IQTRKRKPK
Sbjct: 583 IQTRKRKPK 591
>gi|354485961|ref|XP_003505150.1| PREDICTED: erythroid transcription factor-like [Cricetulus griseus]
Length = 412
Score = 166 bits (421), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 86/124 (69%), Positives = 93/124 (75%), Gaps = 10/124 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 199 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 258
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 259 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 308
Query: 127 KKQG 130
+G
Sbjct: 309 SGKG 312
>gi|341575748|gb|AEK80449.1| gata4 [Bubalus bubalis]
Length = 215
Score = 166 bits (421), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 78/106 (73%), Positives = 86/106 (81%), Gaps = 10/106 (9%)
Query: 22 WRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNND 81
WRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL C NC T TTLWRRN +
Sbjct: 1 WRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLSCANCQTTTTTLWRRNAE 60
Query: 82 GEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
GEPVCNACGLY KLH V RPLAMRK+GIQTRKRKPK
Sbjct: 61 GEPVCNACGLYMKLHGVP----------RPLAMRKEGIQTRKRKPK 96
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 41 GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 82
>gi|432110143|gb|ELK33920.1| Erythroid transcription factor [Myotis davidii]
Length = 391
Score = 166 bits (421), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 178 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 237
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 238 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 287
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 288 SGKGKKKRGSSLGGTG 303
>gi|431893570|gb|ELK03433.1| Erythroid transcription factor [Pteropus alecto]
Length = 413
Score = 166 bits (421), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 310 SGKGKKKRGSSLGGTG 325
>gi|56900908|gb|AAW31747.1| GATA transcription factor GATAc [Aedes aegypti]
Length = 719
Score = 166 bits (421), Expect = 2e-39, Method: Composition-based stats.
Identities = 77/129 (59%), Positives = 86/129 (66%), Gaps = 18/129 (13%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN-GMNRPLVK-------PSKRLTAT 58
E RECVNCG+ TPLWRRD GH LCNAC LY++ N G NRP + P +
Sbjct: 472 EPRECVNCGSSDTPLWRRDNVGHTLCNACALYNRQNPGTNRPPNRSHKAKNPPKTPVPGN 531
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
RR G+ C NC T TTLWRRNN GEPVCNACGLY+KLHNV+ RPL M+KDG
Sbjct: 532 RRTGVVCANCSTTTTTLWRRNNRGEPVCNACGLYHKLHNVD----------RPLTMKKDG 581
Query: 119 IQTRKRKPK 127
IQTRKRKPK
Sbjct: 582 IQTRKRKPK 590
>gi|345491204|ref|XP_001607832.2| PREDICTED: hypothetical protein LOC100124020 [Nasonia vitripennis]
Length = 737
Score = 166 bits (421), Expect = 2e-39, Method: Composition-based stats.
Identities = 79/128 (61%), Positives = 92/128 (71%), Gaps = 16/128 (12%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT-----RRLG 62
GRECVNCGA +TPLWRRDGT YLCNACG+ K NG++RP + +K T+ RRLG
Sbjct: 508 GRECVNCGAHTTPLWRRDGTT-YLCNACGICSKTNGISRPPTQRAKPKTSVPPTGGRRLG 566
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NC T TTLWRRNN+GEPVCNACGLY+KLH VN RP++M+KDGIQTR
Sbjct: 567 VRCANCSTTTTTLWRRNNNGEPVCNACGLYFKLHGVN----------RPMSMKKDGIQTR 616
Query: 123 KRKPKKQG 130
KRKPK
Sbjct: 617 KRKPKNHA 624
>gi|6978877|ref|NP_036896.1| erythroid transcription factor [Rattus norvegicus]
gi|392355488|ref|XP_003752057.1| PREDICTED: erythroid transcription factor-like [Rattus norvegicus]
gi|1169842|sp|P43429.1|GATA1_RAT RecName: Full=Erythroid transcription factor; AltName: Full=Eryf1;
AltName: Full=GATA-binding factor 1; Short=GATA-1;
Short=GF-1; AltName: Full=NF-E1 DNA-binding protein
gi|644896|dbj|BAA02735.1| GATA-1 [Rattus norvegicus]
gi|149028423|gb|EDL83808.1| GATA binding protein 1 [Rattus norvegicus]
Length = 413
Score = 166 bits (420), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 85/124 (68%), Positives = 93/124 (75%), Gaps = 10/124 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KR+ ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRMIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VN RPL MRKDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVN----------RPLTMRKDGIQTRNRKA 309
Query: 127 KKQG 130
+G
Sbjct: 310 SGKG 313
>gi|402910093|ref|XP_003917725.1| PREDICTED: erythroid transcription factor [Papio anubis]
Length = 413
Score = 166 bits (420), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 310 SGKGKKKRGSSLGGTG 325
>gi|321467981|gb|EFX78968.1| hypothetical protein DAPPUDRAFT_52915 [Daphnia pulex]
Length = 108
Score = 166 bits (419), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/116 (70%), Positives = 91/116 (78%), Gaps = 11/116 (9%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN-GMNRPLVKPSKRLTATRRLGLCC 65
+GRECVNCG+ISTPLWRRDGTGHYLCNACGLYHKMN G RPL+K ++RL+ TRRLGL C
Sbjct: 1 DGRECVNCGSISTPLWRRDGTGHYLCNACGLYHKMNNGTQRPLIKQTRRLSTTRRLGLRC 60
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
NC T T+LWRRNN GE VCNACGLY+KLH VN RPLAM+KD IQ
Sbjct: 61 ANCATTTTSLWRRNNQGETVCNACGLYFKLHGVN----------RPLAMKKDNIQV 106
>gi|355704777|gb|EHH30702.1| Eryf1 [Macaca mulatta]
gi|355757338|gb|EHH60863.1| Eryf1 [Macaca fascicularis]
Length = 413
Score = 166 bits (419), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 310 SGKGKKKRGSSLGGTG 325
>gi|291407452|ref|XP_002719945.1| PREDICTED: GATA binding protein 1 [Oryctolagus cuniculus]
Length = 413
Score = 166 bits (419), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 86/124 (69%), Positives = 93/124 (75%), Gaps = 10/124 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 KKQG 130
+G
Sbjct: 310 SGKG 313
>gi|395854434|ref|XP_003799696.1| PREDICTED: erythroid transcription factor [Otolemur garnettii]
Length = 413
Score = 166 bits (419), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 310 SGKGKKKRGSSLGGTG 325
>gi|335305975|ref|XP_003360352.1| PREDICTED: erythroid transcription factor [Sus scrofa]
Length = 412
Score = 166 bits (419), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 199 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 258
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 259 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 308
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 309 SGKGKKKRGSSLGGTG 324
>gi|124001863|gb|ABM87879.1| GATA4 [Papio hamadryas]
Length = 97
Score = 165 bits (418), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 76/96 (79%), Positives = 84/96 (87%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR
Sbjct: 1 MFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRR 60
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLH 96
+GL C NC T TTLWRRN +GEPVCNACGLY KLH
Sbjct: 61 VGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLH 96
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 11 CVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPL 48
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG 43
G C NC +T LWRR+ G +CNACGLY K++G
Sbjct: 62 GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHG 97
>gi|403297486|ref|XP_003939593.1| PREDICTED: erythroid transcription factor [Saimiri boliviensis
boliviensis]
Length = 413
Score = 165 bits (418), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 86/124 (69%), Positives = 93/124 (75%), Gaps = 10/124 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 KKQG 130
+G
Sbjct: 310 SGKG 313
>gi|54696410|gb|AAV38577.1| GATA binding protein 1 (globin transcription factor 1) [synthetic
construct]
gi|61368845|gb|AAX43246.1| GATA binding protein 1 [synthetic construct]
Length = 414
Score = 165 bits (418), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 310 SGKGKKKRGSSLGGTG 325
>gi|397471373|ref|XP_003807270.1| PREDICTED: erythroid transcription factor [Pan paniscus]
gi|426395812|ref|XP_004064154.1| PREDICTED: erythroid transcription factor [Gorilla gorilla gorilla]
Length = 413
Score = 165 bits (418), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 310 SGKGKKKRGSSLGGTG 325
>gi|301764787|ref|XP_002917807.1| PREDICTED: erythroid transcription factor-like [Ailuropoda
melanoleuca]
gi|281353655|gb|EFB29239.1| hypothetical protein PANDA_006176 [Ailuropoda melanoleuca]
Length = 413
Score = 165 bits (418), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 310 SGKGKKKRGSSLGGTG 325
>gi|4503925|ref|NP_002040.1| erythroid transcription factor [Homo sapiens]
gi|120956|sp|P15976.1|GATA1_HUMAN RecName: Full=Erythroid transcription factor; AltName: Full=Eryf1;
AltName: Full=GATA-binding factor 1; Short=GATA-1;
Short=GF-1; AltName: Full=NF-E1 DNA-binding protein
gi|31243|emb|CAA35120.1| unnamed protein product [Homo sapiens]
gi|183072|gb|AAA35885.1| erythroid DNA-binding protein [Homo sapiens]
gi|119571136|gb|EAW50751.1| GATA binding protein 1 (globin transcription factor 1), isoform
CRA_c [Homo sapiens]
gi|208966346|dbj|BAG73187.1| GATA binding protein 1 [synthetic construct]
gi|226698|prf||1603357A transcription factor
Length = 413
Score = 165 bits (418), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 310 SGKGKKKRGSSLGGTG 325
>gi|344250001|gb|EGW06105.1| Erythroid transcription factor [Cricetulus griseus]
Length = 330
Score = 165 bits (418), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 86/124 (69%), Positives = 93/124 (75%), Gaps = 10/124 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 117 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 176
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 177 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 226
Query: 127 KKQG 130
+G
Sbjct: 227 SGKG 230
>gi|323319537|gb|ADX36139.1| GATA123a [Schmidtea polychroa]
Length = 101
Score = 165 bits (418), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 73/98 (74%), Positives = 85/98 (86%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL++ R++G C
Sbjct: 4 EGRECVNCGATQTPLWRRDGTGHYLCNACGLYHKMNGTNRPLIKPKRRLSSARKVGTTCI 63
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM 104
NCGT TTLWRRN G+ VCNACGLYYKLH+++RP++M
Sbjct: 64 NCGTNHTTLWRRNQQGDSVCNACGLYYKLHHISRPISM 101
>gi|407025371|gb|AFS65552.1| Gata4/5/6, partial [Parastichopus parvimensis]
Length = 302
Score = 165 bits (417), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 72/95 (75%), Positives = 82/95 (86%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +R++ +RR G+ C N
Sbjct: 195 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGYNRPLLKPQRRMSGSRREGITCAN 254
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C T TTLWRRN +GEPVCNACG Y+K++ NRPL
Sbjct: 255 CHTSTTTLWRRNKEGEPVCNACGFYHKMNGQNRPL 289
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 31/53 (58%), Positives = 38/53 (71%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATR 59
EG C NC +T LWRR+ G +CNACG YHKMNG NRPL+KP +RL++ R
Sbjct: 248 EGITCANCHTSTTTLWRRNKEGEPVCNACGFYHKMNGQNRPLIKPKRRLSSKR 300
Score = 52.4 bits (124), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 29/47 (61%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
T G C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 189 TGPPEYGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGYNRPL 235
>gi|410988529|ref|XP_004000536.1| PREDICTED: erythroid transcription factor [Felis catus]
Length = 413
Score = 165 bits (417), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 310 SGKGKKKRGSSLGGTG 325
>gi|332255512|ref|XP_003276877.1| PREDICTED: LOW QUALITY PROTEIN: erythroid transcription factor
[Nomascus leucogenys]
Length = 413
Score = 165 bits (417), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 310 SGKGKKKRGSSLGGTG 325
>gi|30851257|gb|AAH52653.1| GATA binding protein 1 [Mus musculus]
Length = 413
Score = 165 bits (417), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 84/124 (67%), Positives = 93/124 (75%), Gaps = 10/124 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KR+ ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRMIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLY+KLH VN RPL MRKDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYFKLHQVN----------RPLTMRKDGIQTRNRKA 309
Query: 127 KKQG 130
+G
Sbjct: 310 SGKG 313
>gi|348553638|ref|XP_003462633.1| PREDICTED: erythroid transcription factor-like [Cavia porcellus]
Length = 409
Score = 165 bits (417), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 86/124 (69%), Positives = 93/124 (75%), Gaps = 10/124 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 197 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 256
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 257 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 306
Query: 127 KKQG 130
+G
Sbjct: 307 SGKG 310
>gi|6679947|ref|NP_032115.1| erythroid transcription factor [Mus musculus]
gi|120957|sp|P17679.1|GATA1_MOUSE RecName: Full=Erythroid transcription factor; AltName: Full=Eryf1;
AltName: Full=GATA-binding factor 1; Short=GATA-1;
Short=GF-1; AltName: Full=NF-E1 DNA-binding protein
gi|51064|emb|CAA33769.1| DNA-binding protein, GF-1 [Mus musculus]
gi|74147266|dbj|BAE27527.1| unnamed protein product [Mus musculus]
gi|148701991|gb|EDL33938.1| GATA binding protein 1 [Mus musculus]
gi|226387|prf||1509337A DNA binding protein
Length = 413
Score = 164 bits (416), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 84/124 (67%), Positives = 93/124 (75%), Gaps = 10/124 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KR+ ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRMIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLY+KLH VN RPL MRKDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYFKLHQVN----------RPLTMRKDGIQTRNRKA 309
Query: 127 KKQG 130
+G
Sbjct: 310 SGKG 313
>gi|405957946|gb|EKC24122.1| GATA-binding factor A [Crassostrea gigas]
Length = 358
Score = 164 bits (416), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 90/160 (56%), Positives = 98/160 (61%), Gaps = 41/160 (25%)
Query: 2 DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN-GMNRPLVK---------- 50
D+ E RECVNCGAISTPLWRRDGTGHYLCNACGL HKMN G +R VK
Sbjct: 9 DYFNNECRECVNCGAISTPLWRRDGTGHYLCNACGLLHKMNPGQSRTPVKQPNREESPIS 68
Query: 51 -------PSKRLTATR-------------RLGLCCTNCGTRMTTLWRRNNDGEPVCNACG 90
P KR+ + R+GL C NC T TTLWRRN +GEPVCNACG
Sbjct: 69 EEEKAPPPIKRIDDAKFDKISGINNNNRSRMGLSCANCNTSTTTLWRRNGEGEPVCNACG 128
Query: 91 LYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQG 130
LYYKLH VN RPL+M+KDGIQTRKRKPK G
Sbjct: 129 LYYKLHQVN----------RPLSMKKDGIQTRKRKPKTPG 158
>gi|297709913|ref|XP_002831655.1| PREDICTED: erythroid transcription factor [Pongo abelii]
Length = 358
Score = 164 bits (416), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 310 SGKGKKKRGSSLGGTG 325
>gi|329664512|ref|NP_001193169.1| erythroid transcription factor [Bos taurus]
gi|356582212|ref|NP_001239108.1| erythroid transcription factor [Ovis aries]
gi|296470754|tpg|DAA12869.1| TPA: GATA binding protein 1-like [Bos taurus]
gi|342671949|dbj|BAK57313.1| transcription factor GATA1 [Ovis aries]
gi|440912761|gb|ELR62302.1| Erythroid transcription factor [Bos grunniens mutus]
Length = 413
Score = 164 bits (416), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 86/124 (69%), Positives = 93/124 (75%), Gaps = 10/124 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 309
Query: 127 KKQG 130
+G
Sbjct: 310 SGKG 313
>gi|297303806|ref|XP_001104486.2| PREDICTED: erythroid transcription factor-like [Macaca mulatta]
Length = 389
Score = 164 bits (414), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 176 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 235
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 236 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 285
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 286 SGKGKKKRGSSLGGTG 301
>gi|194227895|ref|XP_001493895.2| PREDICTED: erythroid transcription factor-like [Equus caballus]
Length = 378
Score = 164 bits (414), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 91/136 (66%), Positives = 99/136 (72%), Gaps = 14/136 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 165 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 224
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLYYKLH VNRPL MR KDGIQTR RK
Sbjct: 225 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMR----------KDGIQTRNRKA 274
Query: 127 ----KKQGGGGGGGSG 138
KK+ G GG+G
Sbjct: 275 SGKGKKKRGSSLGGTG 290
>gi|392900106|ref|NP_001255402.1| Protein ELT-1, isoform d [Caenorhabditis elegans]
gi|379657042|emb|CCG28208.1| Protein ELT-1, isoform d [Caenorhabditis elegans]
Length = 487
Score = 164 bits (414), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 80/129 (62%), Positives = 90/129 (69%), Gaps = 11/129 (8%)
Query: 4 QFG-EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
QF E RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVKP KR A +R G
Sbjct: 281 QFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKRQNAQKRTG 340
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NC T TTLWRRN +G PVCNACGLY+KLH V RP+ M+KDGIQTR
Sbjct: 341 IECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVE----------RPITMKKDGIQTR 390
Query: 123 KRKPKKQGG 131
RK +G
Sbjct: 391 NRKLSAKGS 399
>gi|308478267|ref|XP_003101345.1| CRE-ELT-1 protein [Caenorhabditis remanei]
gi|308263246|gb|EFP07199.1| CRE-ELT-1 protein [Caenorhabditis remanei]
Length = 412
Score = 162 bits (411), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 82/129 (63%), Positives = 92/129 (71%), Gaps = 12/129 (9%)
Query: 4 QFG-EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR-LTATRRL 61
QFG E RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVKP KR A +R
Sbjct: 208 QFGTEDRECVNCGVHATPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKRQQNAQKRT 267
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G+ C NC T TTLWRRN +G PVCNACGLYYKLH V RP+AM+K+GIQT
Sbjct: 268 GIECVNCHTNTTTLWRRNGEGHPVCNACGLYYKLHKVE----------RPMAMKKEGIQT 317
Query: 122 RKRKPKKQG 130
R RK +G
Sbjct: 318 RNRKLSSKG 326
>gi|170043916|ref|XP_001849613.1| GATA transcription factor GATAc [Culex quinquefasciatus]
gi|167867188|gb|EDS30571.1| GATA transcription factor GATAc [Culex quinquefasciatus]
Length = 578
Score = 162 bits (410), Expect = 4e-38, Method: Composition-based stats.
Identities = 77/129 (59%), Positives = 86/129 (66%), Gaps = 18/129 (13%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN-GMNRPLVK-------PSKRLTAT 58
E RECVNCG+ TPLWRRD GH LCNAC LY++ N G NRP + P +
Sbjct: 447 EHRECVNCGSSDTPLWRRDNVGHTLCNACALYNRQNPGTNRPPNRSQKAKQAPKAPVPGN 506
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
RR G+ C NC T TTLWRRNN GEPVCNACGLY+KLHNV+ RPL M+KDG
Sbjct: 507 RRTGVTCANCQTTTTTLWRRNNRGEPVCNACGLYHKLHNVD----------RPLTMKKDG 556
Query: 119 IQTRKRKPK 127
IQTRKRKPK
Sbjct: 557 IQTRKRKPK 565
>gi|341894024|gb|EGT49959.1| hypothetical protein CAEBREN_00985 [Caenorhabditis brenneri]
Length = 375
Score = 162 bits (409), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 80/125 (64%), Positives = 89/125 (71%), Gaps = 11/125 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR-LTATRRLGLCC 65
E RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVKP KR A +R G+ C
Sbjct: 208 EDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKRQQNAQKRTGIEC 267
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G PVCNACGLYYKLH V RP+AM+KDGIQTR RK
Sbjct: 268 VNCHTNNTTLWRRNGEGHPVCNACGLYYKLHKVE----------RPMAMKKDGIQTRNRK 317
Query: 126 PKKQG 130
+G
Sbjct: 318 LSSKG 322
>gi|321467983|gb|EFX78970.1| hypothetical protein DAPPUDRAFT_305027 [Daphnia pulex]
Length = 321
Score = 162 bits (409), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 81/121 (66%), Positives = 88/121 (72%), Gaps = 14/121 (11%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGREC NC AI+TPLWRRDG HYLCNACGLY NG NRP PS + RR GL C+
Sbjct: 3 EGRECANCSAIATPLWRRDGNNHYLCNACGLYKLTNGTNRP---PSSG-SGNRRAGLTCS 58
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN +GEPVCNACGLY+KLHNV RPLAM+K+GIQTRKRKP
Sbjct: 59 NCNTSTTTLWRRNANGEPVCNACGLYFKLHNV----------CRPLAMKKEGIQTRKRKP 108
Query: 127 K 127
K
Sbjct: 109 K 109
>gi|341880193|gb|EGT36128.1| CBN-ELT-1 protein [Caenorhabditis brenneri]
Length = 393
Score = 161 bits (408), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 80/125 (64%), Positives = 89/125 (71%), Gaps = 11/125 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR-LTATRRLGLCC 65
E RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVKP KR A +R G+ C
Sbjct: 208 EDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKRQQNAQKRTGIEC 267
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NC T TTLWRRN +G PVCNACGLYYKLH V RP+AM+KDGIQTR RK
Sbjct: 268 VNCHTNNTTLWRRNGEGHPVCNACGLYYKLHKVE----------RPMAMKKDGIQTRNRK 317
Query: 126 PKKQG 130
+G
Sbjct: 318 LSSKG 322
>gi|449480617|ref|XP_004176564.1| PREDICTED: LOW QUALITY PROTEIN: GATA-binding factor 3-like
[Taeniopygia guttata]
Length = 445
Score = 160 bits (406), Expect = 1e-37, Method: Composition-based stats.
Identities = 78/120 (65%), Positives = 87/120 (72%), Gaps = 13/120 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHY--LCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
EGRECVNCG STPLWRRDG H+ LCNA L HKMNG NRP +KP +RL+A RR G
Sbjct: 260 EGRECVNCGDTSTPLWRRDGXXHWAILCNAWSL-HKMNGQNRPXIKPKRRLSAARRAGTS 318
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 319 CANCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 368
>gi|7861539|dbj|BAA95683.1| transcription factor GATA-6 [Mus musculus]
Length = 470
Score = 160 bits (405), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 70/93 (75%), Positives = 78/93 (83%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
E RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR+ ++RRLGL
Sbjct: 377 DLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGL 436
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLH 96
C NC T TTLWRRN +GEPVCNACGLY KLH
Sbjct: 437 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLH 469
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 28/38 (73%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG+ T LWRR+ G +CNACGLY K++ ++RPL
Sbjct: 384 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPL 421
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG 43
G C NC +T LWRR+ G +CNACGLY K++G
Sbjct: 435 GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHG 470
>gi|242010040|ref|XP_002425784.1| GATA binding factor-1B, putative [Pediculus humanus corporis]
gi|212509717|gb|EEB13046.1| GATA binding factor-1B, putative [Pediculus humanus corporis]
Length = 907
Score = 159 bits (402), Expect = 3e-37, Method: Composition-based stats.
Identities = 76/137 (55%), Positives = 88/137 (64%), Gaps = 28/137 (20%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTA----------- 57
+ECVNC TPLWRRD GHYLCNACGLY+K+NG+NRPL+K + +
Sbjct: 645 KECVNCATCRTPLWRRDDDGHYLCNACGLYNKVNGVNRPLIKANSKKYNSVSSSSNSASP 704
Query: 58 -------TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
RR G+ C NC T TTLWRRNN GEPVCNACGLY+KLH V R
Sbjct: 705 YSESPANNRRTGVECANCRTTNTTLWRRNNTGEPVCNACGLYFKLHGVP----------R 754
Query: 111 PLAMRKDGIQTRKRKPK 127
PL+M+KDGIQ+RKRKPK
Sbjct: 755 PLSMKKDGIQSRKRKPK 771
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 34/53 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G EC NC +T LWRR+ TG +CNACGLY K++G+ RPL + + +R
Sbjct: 716 GVECANCRTTNTTLWRRNNTGEPVCNACGLYFKLHGVPRPLSMKKDGIQSRKR 768
>gi|86575041|ref|NP_001033435.1| Protein ELT-1, isoform a [Caenorhabditis elegans]
gi|74834771|emb|CAJ30234.1| Protein ELT-1, isoform a [Caenorhabditis elegans]
Length = 488
Score = 159 bits (402), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 80/130 (61%), Positives = 90/130 (69%), Gaps = 12/130 (9%)
Query: 4 QFG-EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATRRL 61
QF E RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVKP KR A +R
Sbjct: 281 QFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKRQQNAQKRT 340
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G+ C NC T TTLWRRN +G PVCNACGLY+KLH V RP+ M+KDGIQT
Sbjct: 341 GIECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVE----------RPITMKKDGIQT 390
Query: 122 RKRKPKKQGG 131
R RK +G
Sbjct: 391 RNRKLSAKGS 400
>gi|392900108|ref|NP_001255403.1| Protein ELT-1, isoform c [Caenorhabditis elegans]
gi|379657041|emb|CCG28207.1| Protein ELT-1, isoform c [Caenorhabditis elegans]
Length = 487
Score = 159 bits (402), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 80/130 (61%), Positives = 90/130 (69%), Gaps = 12/130 (9%)
Query: 4 QFG-EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATRRL 61
QF E RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVKP KR A +R
Sbjct: 280 QFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKRQQNAQKRT 339
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G+ C NC T TTLWRRN +G PVCNACGLY+KLH V RP+ M+KDGIQT
Sbjct: 340 GIECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVE----------RPITMKKDGIQT 389
Query: 122 RKRKPKKQGG 131
R RK +G
Sbjct: 390 RNRKLSAKGS 399
>gi|86575043|ref|NP_001033436.1| Protein ELT-1, isoform b [Caenorhabditis elegans]
gi|119299|sp|P28515.1|ELT1_CAEEL RecName: Full=Transcription factor elt-1
gi|6702|emb|CAA40967.1| elt-1 [Caenorhabditis elegans]
gi|74834772|emb|CAJ30235.1| Protein ELT-1, isoform b [Caenorhabditis elegans]
Length = 416
Score = 158 bits (400), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 80/130 (61%), Positives = 90/130 (69%), Gaps = 12/130 (9%)
Query: 4 QFG-EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL-TATRRL 61
QF E RECVNCG +TPLWRRDG+G+YLCNACGLY KMN RPLVKP KR A +R
Sbjct: 209 QFSTEDRECVNCGVHNTPLWRRDGSGNYLCNACGLYFKMNHHARPLVKPKKRQQNAQKRT 268
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G+ C NC T TTLWRRN +G PVCNACGLY+KLH V RP+ M+KDGIQT
Sbjct: 269 GIECVNCRTNTTTLWRRNGEGHPVCNACGLYFKLHKVE----------RPITMKKDGIQT 318
Query: 122 RKRKPKKQGG 131
R RK +G
Sbjct: 319 RNRKLSAKGS 328
>gi|347967441|ref|XP_001230593.3| AGAP002236-PA [Anopheles gambiae str. PEST]
gi|333466294|gb|EAU77738.3| AGAP002236-PA [Anopheles gambiae str. PEST]
Length = 747
Score = 158 bits (400), Expect = 6e-37, Method: Composition-based stats.
Identities = 77/130 (59%), Positives = 84/130 (64%), Gaps = 19/130 (14%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN-GMNRPLVKPSKR--------LTA 57
E RECVNCG+ TPLWRRD GH LCNAC LY + N G NRP + K
Sbjct: 492 EHRECVNCGSSDTPLWRRDIVGHTLCNACALYTRQNPGTNRPPNRSQKAKQTVKTPPAQG 551
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
RR G+ C NC T TTLWRRNN G+PVCNACGLYYKLH+VN RPL M+KD
Sbjct: 552 NRRSGVTCANCQTTTTTLWRRNNQGDPVCNACGLYYKLHSVN----------RPLTMKKD 601
Query: 118 GIQTRKRKPK 127
GIQTRKRKPK
Sbjct: 602 GIQTRKRKPK 611
>gi|313234089|emb|CBY19666.1| unnamed protein product [Oikopleura dioica]
Length = 553
Score = 157 bits (398), Expect = 9e-37, Method: Composition-based stats.
Identities = 72/117 (61%), Positives = 83/117 (70%), Gaps = 10/117 (8%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNC 68
REC NC A +TPLWRRD G+YLCNACGLY+K+NG +RPL+KP KR+ +R+G C NC
Sbjct: 325 RECANCAATTTPLWRRDKCGNYLCNACGLYYKVNGHSRPLIKPKKRVAPNKRIGTICVNC 384
Query: 69 GTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
T TTLWRR+ GEPVCNACGLY KLH V RP M+KDGIQTR RK
Sbjct: 385 KTSQTTLWRRSLKGEPVCNACGLYEKLHGVP----------RPKTMKKDGIQTRNRK 431
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 26/40 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G CVNC T LWRR G +CNACGLY K++G+ RP
Sbjct: 378 GTICVNCKTSQTTLWRRSLKGEPVCNACGLYEKLHGVPRP 417
>gi|108744055|gb|ABG02399.1| GATA binding protein 4 [Bubalus bubalis]
Length = 172
Score = 157 bits (398), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 75/103 (72%), Positives = 83/103 (80%), Gaps = 10/103 (9%)
Query: 25 DGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEP 84
DGTGHYLCNACGLYHKMNG+NRPL+KP +RL+A+RR+GL C NC T TTLWRRN +GEP
Sbjct: 1 DGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLSCANCQTTTTTLWRRNAEGEP 60
Query: 85 VCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
VCNACGLY KLH V RPLAMRK+GIQTRKRKPK
Sbjct: 61 VCNACGLYMKLHGVP----------RPLAMRKEGIQTRKRKPK 93
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 38 GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 79
>gi|444509358|gb|ELV09217.1| Erythroid transcription factor [Tupaia chinensis]
Length = 408
Score = 157 bits (397), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 91/146 (62%), Positives = 99/146 (67%), Gaps = 24/146 (16%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL----------T 56
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL
Sbjct: 185 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLGIISHHFIPQI 244
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
++R G CTNC T TTLWRRN G+PVCNACGLYYKLH VN RPL MRK
Sbjct: 245 VSKRAGTQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVN----------RPLTMRK 294
Query: 117 DGIQTRKRKP----KKQGGGGGGGSG 138
DGIQTR RK KK+ G GG+G
Sbjct: 295 DGIQTRNRKASGKGKKKRGSSLGGTG 320
>gi|442570364|pdb|3VD6|C Chain C, Both Zn Fingers Of Gata1 Bound To Palindromic Dna
Recognition Site, P21 Crystal Form
gi|442570368|pdb|3VEK|C Chain C, Both Zn Fingers Of Gata1 Bound To Palindromic Dna
Recognition Site, P1 Crystal Form
gi|442570371|pdb|3VEK|F Chain F, Both Zn Fingers Of Gata1 Bound To Palindromic Dna
Recognition Site, P1 Crystal Form
Length = 119
Score = 157 bits (397), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 84/124 (67%), Positives = 93/124 (75%), Gaps = 10/124 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KR+ ++R G CT
Sbjct: 1 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRMIVSKRAGTQCT 60
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
NC T TTLWRRN G+PVCNACGLY+KLH VN RPL MRKDGIQTR RK
Sbjct: 61 NCQTTTTTLWRRNASGDPVCNACGLYFKLHQVN----------RPLTMRKDGIQTRNRKA 110
Query: 127 KKQG 130
+G
Sbjct: 111 SGKG 114
>gi|323319539|gb|ADX36140.1| GATA123b [Schmidtea polychroa]
Length = 239
Score = 157 bits (396), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 72/110 (65%), Positives = 87/110 (79%), Gaps = 10/110 (9%)
Query: 16 AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTL 75
A++TPLWRRDG G+YLCNACGLY KMNG NRPL+KP +RL+A+RR G C+NC T TTL
Sbjct: 1 AVNTPLWRRDGQGNYLCNACGLYQKMNGQNRPLIKPKRRLSASRRTGTICSNCNTSTTTL 60
Query: 76 WRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
WRRN++GEPVCNACGLY+KLH+V+ RP M+K+GIQTR RK
Sbjct: 61 WRRNSNGEPVCNACGLYFKLHSVS----------RPPTMKKEGIQTRNRK 100
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 27/40 (67%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NC +T LWRR+ G +CNACGLY K++ ++RP
Sbjct: 47 GTICSNCNTSTTTLWRRNSNGEPVCNACGLYFKLHSVSRP 86
>gi|313232258|emb|CBY09367.1| unnamed protein product [Oikopleura dioica]
Length = 459
Score = 156 bits (395), Expect = 2e-36, Method: Composition-based stats.
Identities = 75/117 (64%), Positives = 87/117 (74%), Gaps = 10/117 (8%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNC 68
R+CVNC A+STPLWRRD G+YLCNACGLYHK+NG NRPL+KP KR+T ++R G CTNC
Sbjct: 278 RQCVNCAAVSTPLWRRDSNGNYLCNACGLYHKVNGCNRPLIKPKKRVTQSKRTGAKCTNC 337
Query: 69 GTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
T TTLWRR G+ VCNACGLY KLH VN RP+ M+KDGIQTR R+
Sbjct: 338 NTTQTTLWRRTTTGDAVCNACGLYQKLHGVN----------RPITMKKDGIQTRNRR 384
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/42 (54%), Positives = 29/42 (69%), Gaps = 1/42 (2%)
Query: 21 LWRRDGTGHYLCNACGLYHKMNGMNRPL-VKPSKRLTATRRL 61
LWRR TG +CNACGLY K++G+NRP+ +K T RRL
Sbjct: 344 LWRRTTTGDAVCNACGLYQKLHGVNRPITMKKDGIQTRNRRL 385
>gi|313245922|emb|CBY34903.1| unnamed protein product [Oikopleura dioica]
Length = 459
Score = 156 bits (395), Expect = 2e-36, Method: Composition-based stats.
Identities = 75/117 (64%), Positives = 87/117 (74%), Gaps = 10/117 (8%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNC 68
R+CVNC A+STPLWRRD G+YLCNACGLYHK+NG NRPL+KP KR+T ++R G CTNC
Sbjct: 278 RQCVNCAAVSTPLWRRDSNGNYLCNACGLYHKVNGCNRPLIKPKKRVTQSKRTGAKCTNC 337
Query: 69 GTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
T TTLWRR G+ VCNACGLY KLH VN RP+ M+KDGIQTR R+
Sbjct: 338 NTTQTTLWRRTTTGDAVCNACGLYQKLHGVN----------RPITMKKDGIQTRNRR 384
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/42 (54%), Positives = 29/42 (69%), Gaps = 1/42 (2%)
Query: 21 LWRRDGTGHYLCNACGLYHKMNGMNRPL-VKPSKRLTATRRL 61
LWRR TG +CNACGLY K++G+NRP+ +K T RRL
Sbjct: 344 LWRRTTTGDAVCNACGLYQKLHGVNRPITMKKDGIQTRNRRL 385
>gi|157169241|gb|ABV25955.1| GATA-binding transcription factor B2 [Capitella teleta]
Length = 282
Score = 154 bits (390), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 76/129 (58%), Positives = 93/129 (72%), Gaps = 12/129 (9%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSK-RLTATRRL 61
+ G GREC NCG+ PLWR +GTGH LCNACG+ H M+G +P++K S R + +RR+
Sbjct: 66 MEGGVGRECANCGSTYAPLWRWNGTGHLLCNACGV-HVMSGFAKPVMKTSGGRRSVSRRV 124
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
GL C NC T TTLWRRNN+GEPVCNACGLY+KLH VN RP++M+K+GIQT
Sbjct: 125 GLSCANCHTSTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPMSMKKEGIQT 174
Query: 122 RKRKPKKQG 130
RKRKPK G
Sbjct: 175 RKRKPKGSG 183
>gi|303387041|gb|ADM15548.1| GATA-1, partial [Carassius auratus langsdorfii]
Length = 214
Score = 153 bits (387), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 68/94 (72%), Positives = 74/94 (78%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R
Sbjct: 121 MRLSPPEARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKR 180
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYK 94
G C NC T TTLWRRN GEPVCNACGLY+K
Sbjct: 181 AGTQCANCHTSTTTLWRRNASGEPVCNACGLYFK 214
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 131 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPL 168
>gi|443718381|gb|ELU09033.1| hypothetical protein CAPTEDRAFT_121668 [Capitella teleta]
Length = 131
Score = 150 bits (380), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 76/129 (58%), Positives = 93/129 (72%), Gaps = 12/129 (9%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSK-RLTATRRL 61
+ G GREC NCG+ PLWR +GTGH LCNACG+ H M+G +P++K S R + +RR+
Sbjct: 1 MEGGVGRECANCGSTYAPLWRWNGTGHLLCNACGV-HVMSGFAKPVMKTSGGRRSVSRRV 59
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
GL C NC T TTLWRRNN+GEPVCNACGLY+KLH VN RP++M+K+GIQT
Sbjct: 60 GLSCANCHTSTTTLWRRNNEGEPVCNACGLYFKLHGVN----------RPMSMKKEGIQT 109
Query: 122 RKRKPKKQG 130
RKRKPK G
Sbjct: 110 RKRKPKGSG 118
>gi|313232905|emb|CBY09588.1| unnamed protein product [Oikopleura dioica]
Length = 368
Score = 147 bits (371), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 71/125 (56%), Positives = 84/125 (67%), Gaps = 11/125 (8%)
Query: 2 DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
D E REC NC A +TPLWRRD G+YLCNACGLY+K+NG +RPLVKP +R +R
Sbjct: 184 DASASESRECANCKATTTPLWRRDDAGNYLCNACGLYYKVNGKSRPLVKPKRRSVPNKRE 243
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG-IQ 120
G C NC T T+LWR++N+ + VCNACGLY KLH V RPL M+KDG IQ
Sbjct: 244 GTICDNCKTTETSLWRKSNEQKAVCNACGLYEKLHGV----------ARPLTMKKDGAIQ 293
Query: 121 TRKRK 125
TR RK
Sbjct: 294 TRNRK 298
>gi|395548125|ref|XP_003775205.1| PREDICTED: erythroid transcription factor [Sarcophilus harrisii]
Length = 302
Score = 147 bits (371), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 72/93 (77%), Positives = 77/93 (82%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 203 EARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 262
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVN 99
NC T TTLWRRN GEPVCNACGLYYKLHN
Sbjct: 263 NCQTTTTTLWRRNASGEPVCNACGLYYKLHNAE 295
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 207 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPL 244
>gi|313219632|emb|CBY30553.1| unnamed protein product [Oikopleura dioica]
Length = 368
Score = 147 bits (370), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 71/125 (56%), Positives = 84/125 (67%), Gaps = 11/125 (8%)
Query: 2 DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
D E REC NC A +TPLWRRD G+YLCNACGLY+K+NG +RPLVKP +R +R
Sbjct: 184 DAAASESRECANCKATTTPLWRRDDAGNYLCNACGLYYKVNGKSRPLVKPKRRSVPNKRE 243
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG-IQ 120
G C NC T T+LWR++N+ + VCNACGLY KLH V RPL M+KDG IQ
Sbjct: 244 GTICDNCKTTETSLWRKSNEQKAVCNACGLYEKLHGV----------ARPLTMKKDGAIQ 293
Query: 121 TRKRK 125
TR RK
Sbjct: 294 TRNRK 298
>gi|357609507|gb|EHJ66487.1| BmGATA beta isoform 2 - silkworm [Danaus plexippus]
Length = 267
Score = 145 bits (366), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 87/175 (49%), Positives = 93/175 (53%), Gaps = 59/175 (33%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT---- 56
M F EGRECVNCGAI TPLWRRDGTGHYLCNACGLY KMNGMNRPL KP +RL
Sbjct: 1 MAEVFTEGRECVNCGAIDTPLWRRDGTGHYLCNACGLYTKMNGMNRPL-KPPRRLVRQRH 59
Query: 57 ---------ATRRLGL-----------------------------------CCTNCGTRM 72
R L L C+NC T +
Sbjct: 60 AAQAPAPAPDVRSLALTTSARPTLPLHHPATLALPAPARNPRPSMGTKRQGVCSNCETTI 119
Query: 73 TTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
TTLWRRN GE VCNACGLY+KLH +N RP M+KD IQTRKRK K
Sbjct: 120 TTLWRRNPLGENVCNACGLYFKLHGIN----------RPKNMKKDSIQTRKRKSK 164
>gi|335353905|dbj|BAK39709.1| GATA binding protein 2 [Tursiops truncatus]
Length = 83
Score = 144 bits (363), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 66/83 (79%), Positives = 72/83 (86%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNC 68
RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC NC
Sbjct: 1 RECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCANC 60
Query: 69 GTRMTTLWRRNNDGEPVCNACGL 91
T TTLWRRN +G+PVCNACGL
Sbjct: 61 QTTTTTLWRRNANGDPVCNACGL 83
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 3 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPL 40
Score = 36.6 bits (83), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 19/30 (63%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGL 37
G C NC +T LWRR+ G +CNACGL
Sbjct: 54 GTCCANCQTTTTTLWRRNANGDPVCNACGL 83
>gi|57116082|gb|AAW33557.1| GATA transcription factor pannier [Schistocerca americana]
Length = 75
Score = 144 bits (362), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 69/70 (98%), Positives = 69/70 (98%)
Query: 16 AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTL 75
AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTR TTL
Sbjct: 1 AISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRTTTL 60
Query: 76 WRRNNDGEPV 85
WRRNNDGEPV
Sbjct: 61 WRRNNDGEPV 70
Score = 42.7 bits (99), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 24/31 (77%)
Query: 73 TTLWRRNNDGEPVCNACGLYYKLHNVNRPLA 103
T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 4 TPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 34
>gi|387538429|gb|AFJ79491.1| GATA binding protein 4/5/6 isoform 2a, partial [Branchiostoma
lanceolatum]
Length = 184
Score = 142 bits (359), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 63/79 (79%), Positives = 71/79 (89%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA STPLWRRDGTGHYLCNACGLYH+MNG+NRPL+KP +RL+A+RR+GL C
Sbjct: 106 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHRMNGVNRPLIKPQRRLSASRRVGLQC 165
Query: 66 TNCGTRMTTLWRRNNDGEP 84
NC T TTLWRRNN+GEP
Sbjct: 166 ANCRTTQTTLWRRNNEGEP 184
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 37/58 (63%), Gaps = 6/58 (10%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL--AMRHLAMRPLAMRKDGIQ 120
C NCG T LWRR+ G +CNACGLY++++ VNRPL R L+ A R+ G+Q
Sbjct: 111 CVNCGATSTPLWRRDGTGHYLCNACGLYHRMNGVNRPLIKPQRRLS----ASRRVGLQ 164
>gi|326927942|ref|XP_003210146.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Meleagris
gallopavo]
Length = 452
Score = 142 bits (357), Expect = 5e-32, Method: Composition-based stats.
Identities = 80/118 (67%), Positives = 87/118 (73%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T
Sbjct: 277 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTT-------- 328
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
TTLWRRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 329 ------TTLWRRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 370
>gi|410951842|ref|XP_003982602.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Felis
catus]
Length = 466
Score = 141 bits (356), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 78/119 (65%), Positives = 85/119 (71%), Gaps = 24/119 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 346
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR RK
Sbjct: 347 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNRK 385
>gi|301764533|ref|XP_002917682.1| PREDICTED: endothelial transcription factor GATA-2-like isoform 2
[Ailuropoda melanoleuca]
Length = 466
Score = 141 bits (356), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 78/119 (65%), Positives = 85/119 (71%), Gaps = 24/119 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 346
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR RK
Sbjct: 347 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNRK 385
>gi|410918526|ref|XP_003972736.1| PREDICTED: transcription factor GATA-3-like isoform 3 [Takifugu
rubripes]
Length = 430
Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats.
Identities = 79/118 (66%), Positives = 87/118 (73%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T
Sbjct: 258 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTT-------- 309
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
TTLWRRN +G+PVCNACGLY+KLHN+N RPL M+K+GIQTR R
Sbjct: 310 ------TTLWRRNANGDPVCNACGLYFKLHNIN----------RPLTMKKEGIQTRNR 351
>gi|348521216|ref|XP_003448122.1| PREDICTED: transcription factor GATA-3-like isoform 2 [Oreochromis
niloticus]
Length = 429
Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats.
Identities = 79/118 (66%), Positives = 87/118 (73%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T
Sbjct: 257 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTT-------- 308
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
TTLWRRN +G+PVCNACGLY+KLHN+N RPL M+K+GIQTR R
Sbjct: 309 ------TTLWRRNANGDPVCNACGLYFKLHNIN----------RPLTMKKEGIQTRNR 350
>gi|296474637|tpg|DAA16752.1| TPA: GATA binding protein 2 [Bos taurus]
Length = 466
Score = 141 bits (355), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/119 (65%), Positives = 85/119 (71%), Gaps = 24/119 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 346
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR RK
Sbjct: 347 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNRK 385
>gi|312384257|gb|EFR29024.1| hypothetical protein AND_02351 [Anopheles darlingi]
Length = 806
Score = 140 bits (354), Expect = 1e-31, Method: Composition-based stats.
Identities = 77/132 (58%), Positives = 85/132 (64%), Gaps = 20/132 (15%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN-GMNRP---------LVKPSKRL 55
E RECVNCG+ TPLWRRD GH LCNAC LY + N G NRP V+ +
Sbjct: 584 AEHRECVNCGSSDTPLWRRDVVGHTLCNACALYTRQNPGTNRPPTRSHKAKQTVQKAPPA 643
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
RR G+ C NC T TTLWRRNN G+PVCNACGLY+KLHNVN RPL M+
Sbjct: 644 QGNRRSGVTCANCNTTTTTLWRRNNQGDPVCNACGLYFKLHNVN----------RPLTMK 693
Query: 116 KDGIQTRKRKPK 127
KDGIQTRKRKPK
Sbjct: 694 KDGIQTRKRKPK 705
>gi|383863207|ref|XP_003707073.1| PREDICTED: uncharacterized protein LOC100875315 [Megachile
rotundata]
Length = 703
Score = 140 bits (354), Expect = 1e-31, Method: Composition-based stats.
Identities = 77/136 (56%), Positives = 91/136 (66%), Gaps = 18/136 (13%)
Query: 9 RECVNCGAISTPLWRRD-GTGHYLCNACGLYHKMNGMNRP---LVKPSKRL--TATRRLG 62
+EC C ++T +WRRD TGH C+ C LY+KMNG NRP L KP + + T RR G
Sbjct: 441 KECSGCANLTT-IWRRDDATGHCYCHTC-LYNKMNGTNRPSMRLGKPKQAVAPTGVRRTG 498
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NC T TTLWRRNN+GEPVCNACGLYYKLHNVN RPL+M+K+GIQTR
Sbjct: 499 VQCANCRTTNTTLWRRNNNGEPVCNACGLYYKLHNVN----------RPLSMKKEGIQTR 548
Query: 123 KRKPKKQGGGGGGGSG 138
KRKPK G G +G
Sbjct: 549 KRKPKNHSGISGNLAG 564
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 34/53 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL + + +R
Sbjct: 498 GVQCANCRTTNTTLWRRNNNGEPVCNACGLYYKLHNVNRPLSMKKEGIQTRKR 550
>gi|402887125|ref|XP_003906955.1| PREDICTED: endothelial transcription factor GATA-2 isoform 3 [Papio
anubis]
Length = 466
Score = 140 bits (352), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 84/118 (71%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 346
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 347 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 384
>gi|109098098|ref|XP_001097707.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1
[Macaca mulatta]
Length = 466
Score = 140 bits (352), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 84/118 (71%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 346
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 347 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 384
>gi|380017690|ref|XP_003692780.1| PREDICTED: uncharacterized protein LOC100866028 [Apis florea]
Length = 809
Score = 139 bits (351), Expect = 3e-31, Method: Composition-based stats.
Identities = 75/136 (55%), Positives = 88/136 (64%), Gaps = 17/136 (12%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLY-HKMNGMNRPLVKPSKRLTAT-----RRLG 62
+EC NC AI T + RRD G+YLC +C +KMNG+NR +K K A RR G
Sbjct: 543 KECSNC-AILTTVLRRDEAGNYLCQSCAYTTNKMNGINRASIKCGKPKQAVATAGVRRTG 601
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NC T TTLWRRNN+GEPVCNACGLYYKLHNVN RPL+M+K+GIQTR
Sbjct: 602 VQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVN----------RPLSMKKEGIQTR 651
Query: 123 KRKPKKQGGGGGGGSG 138
KRKPK G G +G
Sbjct: 652 KRKPKNNSGISGNLAG 667
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 34/53 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL + + +R
Sbjct: 601 GVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVNRPLSMKKEGIQTRKR 653
>gi|301607041|ref|XP_002933112.1| PREDICTED: GATA-binding factor 2-like isoform 3 [Xenopus (Silurana)
tropicalis]
Length = 437
Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/119 (64%), Positives = 84/119 (70%), Gaps = 24/119 (20%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 261 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW--- 317
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 318 -----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 355
>gi|14602571|gb|AAH09797.1| GATA1 protein [Homo sapiens]
Length = 335
Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 69/91 (75%), Positives = 76/91 (83%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHN 97
NC T TTLWRRN G+PVCNACGLYYKLH+
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHH 290
Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%), Gaps = 4/55 (7%)
Query: 52 SKRLTATRRLGLC----CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
S +L T L C C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 187 SPKLRGTLPLPPCEARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPL 241
>gi|119571134|gb|EAW50749.1| GATA binding protein 1 (globin transcription factor 1), isoform
CRA_a [Homo sapiens]
Length = 334
Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 69/91 (75%), Positives = 75/91 (82%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G CT
Sbjct: 200 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCT 259
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHN 97
NC T TTLWRRN G+PVCNACGLYYKLH
Sbjct: 260 NCQTTTTTLWRRNASGDPVCNACGLYYKLHQ 290
Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%), Gaps = 4/55 (7%)
Query: 52 SKRLTATRRLGLC----CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
S +L T L C C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 187 SPKLRGTLPLPPCEARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPL 241
>gi|426341994|ref|XP_004036302.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Gorilla gorilla gorilla]
Length = 466
Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 84/118 (71%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 346
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 347 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 384
>gi|354482835|ref|XP_003503601.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Cricetulus griseus]
Length = 466
Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 84/118 (71%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 346
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 347 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 384
>gi|338714471|ref|XP_003363086.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2 [Equus
caballus]
Length = 466
Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 84/118 (71%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 346
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 347 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 384
>gi|224611701|ref|NP_001139134.1| endothelial transcription factor GATA-2 isoform 2 [Homo sapiens]
gi|114589045|ref|XP_001138192.1| PREDICTED: endothelial transcription factor GATA-2 isoform 1 [Pan
troglodytes]
gi|332261775|ref|XP_003279942.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Nomascus leucogenys]
gi|15990428|gb|AAH15577.1| GATA2 protein [Homo sapiens]
gi|17512029|gb|AAH18988.1| GATA2 protein [Homo sapiens]
gi|30410878|gb|AAH51272.1| GATA2 protein [Homo sapiens]
gi|119599720|gb|EAW79314.1| GATA binding protein 2, isoform CRA_b [Homo sapiens]
Length = 466
Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 84/118 (71%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 346
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 347 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 384
>gi|395847145|ref|XP_003796244.1| PREDICTED: endothelial transcription factor GATA-2 isoform 2
[Otolemur garnettii]
Length = 466
Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 84/118 (71%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 346
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 347 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 384
>gi|291393344|ref|XP_002713201.1| PREDICTED: GATA binding protein 2 isoform 1 [Oryctolagus cuniculus]
Length = 466
Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 84/118 (71%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 346
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 347 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 384
>gi|334342417|ref|XP_003341811.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Monodelphis
domestica]
Length = 459
Score = 139 bits (350), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/119 (64%), Positives = 84/119 (70%), Gaps = 24/119 (20%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCC 65
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 283 SEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW--- 339
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 340 -----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 377
>gi|431913709|gb|ELK15199.1| Endothelial transcription factor GATA-2 [Pteropus alecto]
Length = 383
Score = 139 bits (350), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 84/118 (71%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 208 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 263
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 264 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 301
>gi|390465072|ref|XP_002750080.2| PREDICTED: trans-acting T-cell-specific transcription factor GATA-3
[Callithrix jacchus]
Length = 567
Score = 139 bits (350), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 84/118 (71%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 397 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 452
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHN+N RPL M+K+GIQTR R
Sbjct: 453 ----------RRNANGDPVCNACGLYYKLHNIN----------RPLTMKKEGIQTRNR 490
>gi|348507928|ref|XP_003441507.1| PREDICTED: GATA-binding factor 2-like isoform 3 [Oreochromis
niloticus]
Length = 441
Score = 139 bits (349), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 82/118 (69%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 272 EGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 327
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 328 ----------RRNAHGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 365
>gi|407025369|gb|AFS65551.1| Gata1/2/3, partial [Parastichopus parvimensis]
Length = 261
Score = 138 bits (348), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 62/82 (75%), Positives = 68/82 (82%), Gaps = 1/82 (1%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDG GHYLCNACGLYHKMNG NRPL+KP +RL +++R G C
Sbjct: 181 EGRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNGQNRPLIKPKRRL-SSKRTGTSCA 239
Query: 67 NCGTRMTTLWRRNNDGEPVCNA 88
NC TTLWRRN +GEPVCNA
Sbjct: 240 NCQATTTTLWRRNPNGEPVCNA 261
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 47 PLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
P KP + ++ G C NCG T LWRR+ +G +CNACGLY+K++ NRPL
Sbjct: 168 PSTKPRAKSRSSTE-GRECVNCGATSTPLWRRDGNGHYLCNACGLYHKMNGQNRPL 222
>gi|410919501|ref|XP_003973223.1| PREDICTED: GATA-binding factor 2-like isoform 2 [Takifugu rubripes]
Length = 442
Score = 138 bits (347), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 77/118 (65%), Positives = 83/118 (70%), Gaps = 24/118 (20%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRD TGHYLCNACGLYHKMNG NRPL+KP +RLT T L
Sbjct: 272 EGRECVNCGATSTPLWRRDSTGHYLCNACGLYHKMNGQNRPLIKPKRRLTTTTTLW---- 327
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
RRN +G+PVCNACGLYYKLHNVN RPL M+K+GIQTR R
Sbjct: 328 ----------RRNANGDPVCNACGLYYKLHNVN----------RPLTMKKEGIQTRNR 365
>gi|340719407|ref|XP_003398145.1| PREDICTED: hypothetical protein LOC100650330 [Bombus terrestris]
Length = 813
Score = 137 bits (345), Expect = 2e-30, Method: Composition-based stats.
Identities = 72/135 (53%), Positives = 88/135 (65%), Gaps = 18/135 (13%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK--PSKRLTAT---RRLGL 63
+EC CG T W+RD TG + C++C +KMNG+NR ++ K+ AT RR G+
Sbjct: 539 KECFGCGN-PTSSWKRDETGRFYCHSC--IYKMNGINRSSMRCGKPKQTVATAGVRRTGV 595
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRNN+GEPVCNACGLYYKLHNVN RPL+M+K+GIQTRK
Sbjct: 596 QCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVN----------RPLSMKKEGIQTRK 645
Query: 124 RKPKKQGGGGGGGSG 138
RKPK G G +G
Sbjct: 646 RKPKNHSGMSGSLAG 660
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 34/53 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL + + +R
Sbjct: 594 GVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVNRPLSMKKEGIQTRKR 646
>gi|328786511|ref|XP_001121244.2| PREDICTED: hypothetical protein LOC725389 [Apis mellifera]
Length = 808
Score = 137 bits (344), Expect = 2e-30, Method: Composition-based stats.
Identities = 76/137 (55%), Positives = 92/137 (67%), Gaps = 19/137 (13%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLY--HKMNGMNRPLVK--PSKRLTAT---RRL 61
+EC NC AI T + RRD TG+Y+C C +Y +K+NG+NR +K K+ AT RR
Sbjct: 543 KECPNC-AILTNVLRRDETGNYVCQNC-IYAANKINGINRSSIKCGKPKQAVATAGVRRT 600
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G+ C NC T TTLWRRNN+GEPVCNACGLYYKLHNVN RPL+M+K+GIQT
Sbjct: 601 GVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVN----------RPLSMKKEGIQT 650
Query: 122 RKRKPKKQGGGGGGGSG 138
RKRKPK G G +G
Sbjct: 651 RKRKPKNNSGISGNLAG 667
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 34/53 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G +C NC +T LWRR+ G +CNACGLY+K++ +NRPL + + +R
Sbjct: 601 GVQCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNVNRPLSMKKEGIQTRKR 653
>gi|193083007|ref|NP_001122335.1| GATAa protein [Ciona intestinalis]
gi|70569756|dbj|BAE06471.1| transcription factor protein [Ciona intestinalis]
Length = 641
Score = 136 bits (343), Expect = 2e-30, Method: Composition-based stats.
Identities = 69/127 (54%), Positives = 81/127 (63%), Gaps = 13/127 (10%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
Q+GE RECVNCGAIS WRRD +GH+LC+ CG + M P+ K T R++
Sbjct: 300 QYGEPRECVNCGAISATSWRRDASGHFLCSTCGACRSGSYMRAPVKSKGKLATCRRQV-- 357
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C+NC T +TTLWRR+ DG PVCNACGLY KLH V RP M+KD IQTRK
Sbjct: 358 -CSNCSTTVTTLWRRSPDGNPVCNACGLYQKLHGVP----------RPRTMKKDSIQTRK 406
Query: 124 RKPKKQG 130
RKPK QG
Sbjct: 407 RKPKGQG 413
>gi|348517021|ref|XP_003446034.1| PREDICTED: transcription factor GATA-3-like [Oreochromis niloticus]
Length = 170
Score = 133 bits (335), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/121 (56%), Positives = 80/121 (66%), Gaps = 12/121 (9%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLC 64
+ RECV+CG S PLWRRD G LCN CGL + N N PL++P +R + +R G
Sbjct: 27 VDQQRECVSCGMQSAPLWRRDAAGRLLCNTCGLRQEAN--NTPLLRPKRRASRIQRKGTQ 84
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRN+ GE VCNACGLYY+LH VN RPLA++KDGIQTRKR
Sbjct: 85 CVNCLTERTTLWRRNSAGEAVCNACGLYYRLHRVN----------RPLALKKDGIQTRKR 134
Query: 125 K 125
K
Sbjct: 135 K 135
>gi|312379466|gb|EFR25730.1| hypothetical protein AND_08684 [Anopheles darlingi]
Length = 566
Score = 133 bits (335), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 65/82 (79%), Positives = 67/82 (81%), Gaps = 10/82 (12%)
Query: 53 KRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPL 112
K+ TATRRLGLCCTNCGTR TTLWRRNNDGEPVCNACGLY+KLH VN RPL
Sbjct: 173 KKQTATRRLGLCCTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVN----------RPL 222
Query: 113 AMRKDGIQTRKRKPKKQGGGGG 134
AMRKDGIQTRKRKPKK GG GG
Sbjct: 223 AMRKDGIQTRKRKPKKAGGSGG 244
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/39 (58%), Positives = 29/39 (74%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
C NCG +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 185 CTNCGTRTTTLWRRNNDGEPVCNACGLYFKLHGVNRPLA 223
>gi|23094291|emb|CAD45643.1| GATAa protein [Ciona intestinalis]
Length = 431
Score = 131 bits (330), Expect = 8e-29, Method: Composition-based stats.
Identities = 68/127 (53%), Positives = 80/127 (62%), Gaps = 13/127 (10%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
Q+ E RECVNCGAIS WRRD +GH+LC+ CG + M P+ K T R++
Sbjct: 90 QYVEPRECVNCGAISATSWRRDASGHFLCSTCGACRSGSYMRAPVKSKGKLATCRRQV-- 147
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C+NC T +TTLWRR+ DG PVCNACGLY KLH V RP M+KD IQTRK
Sbjct: 148 -CSNCSTTVTTLWRRSPDGNPVCNACGLYQKLHGVP----------RPRTMKKDSIQTRK 196
Query: 124 RKPKKQG 130
RKPK QG
Sbjct: 197 RKPKGQG 203
>gi|380861645|gb|AFF18488.1| GATA456 [Schmidtea mediterranea]
Length = 529
Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats.
Identities = 64/138 (46%), Positives = 85/138 (61%), Gaps = 28/138 (20%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT---------- 58
RECVNCGA +T LW RD +G+YLC+ C + + N N + P+ +++++
Sbjct: 258 RECVNCGASNTQLWSRDNSGYYLCDECDRFSQNNSRNLEKL-PTNQVSSSTENEFMKKSN 316
Query: 59 -------RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
+R L C+NC T+LWRRNN+GEPVCNACGLYYKLH ++RP
Sbjct: 317 ANFQQYGKRSDLECSNCKITKTSLWRRNNEGEPVCNACGLYYKLHK----------SLRP 366
Query: 112 LAMRKDGIQTRKRKPKKQ 129
L+MRK+GIQTRKRK K Q
Sbjct: 367 LSMRKEGIQTRKRKRKTQ 384
>gi|323319541|gb|ADX36141.1| GATA456a [Schmidtea polychroa]
Length = 528
Score = 130 bits (326), Expect = 2e-28, Method: Composition-based stats.
Identities = 64/138 (46%), Positives = 85/138 (61%), Gaps = 28/138 (20%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT---------- 58
RECVNCGA +T LW RD TG+YLC+ C + + N N + P+ +++++
Sbjct: 258 RECVNCGASNTQLWSRDNTGYYLCDECDRFSQNNSRNIEKL-PTNQVSSSTENEFMKKSN 316
Query: 59 -------RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
+R L C+NC T+LWRRNN+GEPVCNACGLYYKLH ++RP
Sbjct: 317 ANFQQYGKRSDLECSNCKITKTSLWRRNNEGEPVCNACGLYYKLHK----------SLRP 366
Query: 112 LAMRKDGIQTRKRKPKKQ 129
L+MRK+GIQTR+RK K Q
Sbjct: 367 LSMRKEGIQTRRRKRKTQ 384
>gi|308097152|gb|ADO14130.1| GATA binding protein 4 [Notophthalmus viridescens]
Length = 184
Score = 129 bits (324), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 61/90 (67%), Positives = 71/90 (78%), Gaps = 10/90 (11%)
Query: 38 YHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHN 97
YHKMNG+NRPL+KP +RL+A+RR+GL C NC T +TTLWRRN +GEPVCNACGLY KLH
Sbjct: 1 YHKMNGINRPLIKPQRRLSASRRVGLSCANCHTTITTLWRRNAEGEPVCNACGLYMKLHG 60
Query: 98 VNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
V RPLAM+K+GIQTRKRKPK
Sbjct: 61 V----------PRPLAMKKEGIQTRKRKPK 80
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 27/42 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 25 GLSCANCHTTITTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 66
>gi|340374204|ref|XP_003385628.1| PREDICTED: hypothetical protein LOC100636783 [Amphimedon
queenslandica]
Length = 611
Score = 126 bits (316), Expect = 3e-27, Method: Composition-based stats.
Identities = 60/122 (49%), Positives = 77/122 (63%), Gaps = 10/122 (8%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
+ G R+C NCG+ +TPLWRR+ GHYLCNACGLY+++NG NR + K + +
Sbjct: 287 YNAGIARQCNNCGSSNTPLWRRNSEGHYLCNACGLYYRVNGTNRQGHQKKKVRATLKSMN 346
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
C+NCGT T LWRR +G+PVCN CGLYYKL+ V RP ++ KD IQTR
Sbjct: 347 NKCSNCGTTKTVLWRRLENGDPVCNPCGLYYKLNGV----------ARPKSLCKDTIQTR 396
Query: 123 KR 124
R
Sbjct: 397 NR 398
>gi|28950682|gb|AAO47082.1| GATA-binding protein 3 [Marmota monax]
Length = 108
Score = 125 bits (313), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 60/92 (65%), Positives = 69/92 (75%), Gaps = 10/92 (10%)
Query: 33 NACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLY 92
NACGLY+KMNG NRPL+KP +RL+A RR G C NC T TTLWRRN +G+PVCNACGLY
Sbjct: 1 NACGLYYKMNGQNRPLIKPKRRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLY 60
Query: 93 YKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
YKLHN+N RPL M+K+GIQTR R
Sbjct: 61 YKLHNIN----------RPLTMKKEGIQTRNR 82
Score = 55.1 bits (131), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 29/42 (69%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 30 GTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 71
>gi|355689778|gb|AER98943.1| Transcription factor GATA-6 [Mustela putorius furo]
Length = 180
Score = 120 bits (301), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 58/89 (65%), Positives = 67/89 (75%), Gaps = 10/89 (11%)
Query: 39 HKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNV 98
KMNG++RPL+KP KR+ ++RRLGL C NC T TTLWRRN +GEPVCNACGLY KLH V
Sbjct: 1 SKMNGLSRPLIKPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGV 60
Query: 99 NRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
RPLAM+K+GIQTRKRKPK
Sbjct: 61 ----------PRPLAMKKEGIQTRKRKPK 79
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 24 GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 65
>gi|345306897|ref|XP_001506818.2| PREDICTED: transcription factor GATA-6-like, partial
[Ornithorhynchus anatinus]
Length = 237
Score = 119 bits (299), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 57/87 (65%), Positives = 66/87 (75%), Gaps = 10/87 (11%)
Query: 41 MNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNR 100
MNG++RPL+KP KR+ ++RRLGL C NC T TTLWRRN +GEPVCNACGLY KLH V
Sbjct: 61 MNGLSRPLIKPQKRVPSSRRLGLSCANCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVP- 119
Query: 101 PLAMRHLAMRPLAMRKDGIQTRKRKPK 127
RPLAM+K+GIQTRKRKPK
Sbjct: 120 ---------RPLAMKKEGIQTRKRKPK 137
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 82 GLSCANCHTSTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 123
>gi|417079353|gb|AFX60071.1| GATA binding protein 2, partial [Mesocricetus auratus]
Length = 309
Score = 119 bits (297), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 51/62 (82%), Positives = 55/62 (88%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G CC
Sbjct: 248 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCA 307
Query: 67 NC 68
NC
Sbjct: 308 NC 309
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 252 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPL 289
>gi|355689768|gb|AER98940.1| GATA binding protein 3 [Mustela putorius furo]
Length = 328
Score = 117 bits (294), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 52/64 (81%), Positives = 55/64 (85%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL+A RR G C
Sbjct: 264 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCA 323
Query: 67 NCGT 70
NC T
Sbjct: 324 NCQT 327
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 268 CVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPL 305
>gi|345491199|ref|XP_001607828.2| PREDICTED: hypothetical protein LOC100124017 [Nasonia vitripennis]
Length = 461
Score = 113 bits (282), Expect = 3e-23, Method: Composition-based stats.
Identities = 46/53 (86%), Positives = 50/53 (94%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTA 57
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK S+R+ +
Sbjct: 406 FTEGRECVNCGAVSTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQSRRMVS 458
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 24/48 (50%), Positives = 31/48 (64%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPL 112
C NCG T LWRR+ G +CNACGLY+K++ +NRPL + M L
Sbjct: 412 CVNCGAVSTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQSRRMVSL 459
>gi|196049282|dbj|BAG68610.1| GATA-binding protein 1 [Cyprinus carpio]
Length = 277
Score = 112 bits (281), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 49/70 (70%), Positives = 54/70 (77%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M E RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL ++R
Sbjct: 207 MRLSPPEARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKR 266
Query: 61 LGLCCTNCGT 70
G C NC T
Sbjct: 267 AGTQCANCHT 276
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 217 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPL 254
>gi|157125534|ref|XP_001654374.1| transcription factor GATA-4 (GATA binding factor-4) [Aedes aegypti]
gi|108873609|gb|EAT37834.1| AAEL010222-PB [Aedes aegypti]
Length = 1003
Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats.
Identities = 51/78 (65%), Positives = 59/78 (75%), Gaps = 10/78 (12%)
Query: 50 KPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAM 109
K S+RL++ RR+GL C+NC T T+LWRRN GEPVCNACGLYYKLHNVN
Sbjct: 630 KQSRRLSSARRVGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVN---------- 679
Query: 110 RPLAMRKDGIQTRKRKPK 127
RPLAM+KD IQ+RKRKPK
Sbjct: 680 RPLAMKKDNIQSRKRKPK 697
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 35/53 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G +C NC ++T LWRR+ G +CNACGLY+K++ +NRPL + + +R
Sbjct: 642 GLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRPLAMKKDNIQSRKR 694
>gi|41059166|gb|AAR99325.1| GATA transcription factor [Aedes aegypti]
Length = 1003
Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats.
Identities = 52/85 (61%), Positives = 60/85 (70%), Gaps = 10/85 (11%)
Query: 50 KPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAM 109
K S+RL++ RR+GL C+NC T T+LWRRN GEPVCNACGLYYKLHNVN
Sbjct: 630 KQSRRLSSARRVGLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVN---------- 679
Query: 110 RPLAMRKDGIQTRKRKPKKQGGGGG 134
RPLAM+KD IQ+RKRKPK G
Sbjct: 680 RPLAMKKDNIQSRKRKPKGSKNADG 704
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 35/53 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G +C NC ++T LWRR+ G +CNACGLY+K++ +NRPL + + +R
Sbjct: 642 GLQCSNCNTMNTSLWRRNQVGEPVCNACGLYYKLHNVNRPLAMKKDNIQSRKR 694
>gi|384499201|gb|EIE89692.1| hypothetical protein RO3G_14403 [Rhizopus delemar RA 99-880]
Length = 281
Score = 110 bits (276), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/133 (45%), Positives = 75/133 (56%), Gaps = 7/133 (5%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
+D + + C NCGA STPLWRR LCNACGLY N N L + S R
Sbjct: 114 IDEENNQMPVCSNCGASSTPLWRRSANAQILCNACGLYVSPNTNN--LTEDSVD-EDNRP 170
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
G C+NCGT T LWRRN +G P+CNACGLYYKLHN RPL+M+ ++ ++ +
Sbjct: 171 SGTVCSNCGTNKTPLWRRNAEGSPLCNACGLYYKLHNEKRPLSMKTDVIK----KRQRTE 226
Query: 121 TRKRKPKKQGGGG 133
T K+Q G
Sbjct: 227 TLIANDKEQSEAG 239
>gi|384489810|gb|EIE81032.1| hypothetical protein RO3G_05737 [Rhizopus delemar RA 99-880]
Length = 393
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 53/99 (53%), Positives = 64/99 (64%), Gaps = 4/99 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP-LVKP---SKRLTATRRLGLCCT 66
C NCGA TPLWRR LCNACGLY K++ RP +KP K L L C+
Sbjct: 234 CTNCGATKTPLWRRSIEDDLLCNACGLYQKLHNAPRPKSLKPHNSKKELKEVEGPKLVCS 293
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
NC T T LWRR+++G P+CNACGLYYKLH+ RPL+M+
Sbjct: 294 NCSTIKTPLWRRDDEGAPLCNACGLYYKLHHERRPLSMK 332
>gi|425906025|gb|AFY10809.1| Gata456 [Isodiametra pulchra]
Length = 194
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 53/79 (67%), Positives = 61/79 (77%), Gaps = 10/79 (12%)
Query: 53 KRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPL 112
+++T +RR+GL C NC T TTLWRRNNDG+PVCNACGLYYKLH+VN RPL
Sbjct: 7 RKMTTSRRMGLRCANCHTDSTTLWRRNNDGDPVCNACGLYYKLHHVN----------RPL 56
Query: 113 AMRKDGIQTRKRKPKKQGG 131
AM+KDGIQTRKRKPK GG
Sbjct: 57 AMKKDGIQTRKRKPKSGGG 75
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/42 (54%), Positives = 29/42 (69%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC ST LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 16 GLRCANCHTDSTTLWRRNNDGDPVCNACGLYYKLHHVNRPLA 57
>gi|213409195|ref|XP_002175368.1| GATA zinc finger domain-containing protein [Schizosaccharomyces
japonicus yFS275]
gi|212003415|gb|EEB09075.1| GATA zinc finger domain-containing protein [Schizosaccharomyces
japonicus yFS275]
Length = 585
Score = 110 bits (274), Expect = 2e-22, Method: Composition-based stats.
Identities = 54/115 (46%), Positives = 68/115 (59%), Gaps = 11/115 (9%)
Query: 11 CVNCGAISTPLWRRD-GTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCG 69
C NC TPLWRR H+LCNACGLY K G+ RPL+ +K GL CTNC
Sbjct: 393 CFNCKVTHTPLWRRTPDRKHFLCNACGLYAKQYGIMRPLLPRTKPAHNKDNAGLVCTNCQ 452
Query: 70 TRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
T+ T+LWR++ G+ VCNACGLY +LH N RP+ +RK+ I R+R
Sbjct: 453 TKKTSLWRKSPQGQTVCNACGLYARLHGQN----------RPVNLRKEKISRRRR 497
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 32/54 (59%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
G C NC T LWR+ G +CNACGLY +++G NRP+ ++++ RR
Sbjct: 445 GLVCTNCQTKKTSLWRKSPQGQTVCNACGLYARLHGQNRPVNLRKEKISRRRRF 498
>gi|405974156|gb|EKC38824.1| Transcription factor GATA-4 [Crassostrea gigas]
Length = 443
Score = 109 bits (273), Expect = 3e-22, Method: Composition-based stats.
Identities = 55/85 (64%), Positives = 60/85 (70%), Gaps = 11/85 (12%)
Query: 43 GMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
GM P P +A+RR+GL C NC T TTLWRRNN+GEPVCNACGLYYKLH VN
Sbjct: 250 GMMYPDYNPLP-YSASRRVGLSCANCSTTTTTLWRRNNEGEPVCNACGLYYKLHGVN--- 305
Query: 103 AMRHLAMRPLAMRKDGIQTRKRKPK 127
RPLAM+KDGIQTRKRKPK
Sbjct: 306 -------RPLAMKKDGIQTRKRKPK 323
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/53 (45%), Positives = 33/53 (62%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC +T LWRR+ G +CNACGLY+K++G+NRPL + +R
Sbjct: 268 GLSCANCSTTTTTLWRRNNEGEPVCNACGLYYKLHGVNRPLAMKKDGIQTRKR 320
>gi|338728062|ref|XP_003365610.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor GATA-6-like
[Equus caballus]
Length = 333
Score = 108 bits (271), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 58/124 (46%), Positives = 72/124 (58%), Gaps = 20/124 (16%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL 63
+ R+CVNC + T LWR D +C + ++ ++ ++RRLGL
Sbjct: 130 DLPKSRQCVNCASSQTRLWREDRPRSIIC--AEAWRRLQAK--------RKDPSSRRLGL 179
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM+K+GIQTRK
Sbjct: 180 SCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGV----------PRPLAMKKEGIQTRK 229
Query: 124 RKPK 127
RKPK
Sbjct: 230 RKPK 233
>gi|388855917|emb|CCF50492.1| related to gata transcription factor [Ustilago hordei]
Length = 1129
Score = 108 bits (271), Expect = 5e-22, Method: Composition-based stats.
Identities = 61/166 (36%), Positives = 78/166 (46%), Gaps = 36/166 (21%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP---------------------- 47
+C NCGA STPLWRRD LCNACGLY K++ RP
Sbjct: 612 QCTNCGATSTPLWRRDPNDLLLCNACGLYLKLHKTPRPKSLKNHHSHSHSGHVTPSATPG 671
Query: 48 -LVKPSKRLTATRRLG-------LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVN 99
P R + R G + C NCGT T LWR+++ G VCNACGLY KLHN +
Sbjct: 672 GASAPGSRAGSPSRDGEPGAEDMMSCFNCGTYTTPLWRKDDAGHTVCNACGLYLKLHNEH 731
Query: 100 RPLAMRHLAMRPLAM------RKDGIQTRKRKPKKQGGGGGGGSGG 139
RP+ MR ++ + R +R+ P+ Q G G+ G
Sbjct: 732 RPVTMRADVIKKRSRYDEKRGRASAANSRRTSPQPQSGSSWAGAQG 777
>gi|19075466|ref|NP_587966.1| transcription factor, zf-GATA type (predicted) [Schizosaccharomyces
pombe 972h-]
gi|74627051|sp|O94720.1|YCF8_SCHPO RecName: Full=GATA zinc finger domain-containing protein C1393.08
gi|4468727|emb|CAB38164.1| transcription factor, zf-GATA type (predicted) [Schizosaccharomyces
pombe]
Length = 557
Score = 108 bits (271), Expect = 6e-22, Method: Composition-based stats.
Identities = 46/96 (47%), Positives = 62/96 (64%), Gaps = 2/96 (2%)
Query: 11 CVNCGAISTPLWRRDGTG-HYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCG 69
C NCG TPLWRR ++LCNACGLY+K G+ RPL P + ++ L C NC
Sbjct: 365 CFNCGVTETPLWRRTSDKLNFLCNACGLYNKQYGVMRPL-SPRNKGSSKALENLVCANCS 423
Query: 70 TRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
+ T+LWR++ G+ VCNACGLY +LH NRP+ ++
Sbjct: 424 STKTSLWRKDRHGQTVCNACGLYARLHGHNRPIGLK 459
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 21/46 (45%), Positives = 30/46 (65%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT 56
C NC + T LWR+D G +CNACGLY +++G NRP+ ++T
Sbjct: 419 CANCSSTKTSLWRKDRHGQTVCNACGLYARLHGHNRPIGLKKNKIT 464
>gi|195389406|ref|XP_002053368.1| GJ23377 [Drosophila virilis]
gi|194151454|gb|EDW66888.1| GJ23377 [Drosophila virilis]
Length = 771
Score = 108 bits (269), Expect = 9e-22, Method: Composition-based stats.
Identities = 61/127 (48%), Positives = 73/127 (57%), Gaps = 14/127 (11%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHK-MNGMNRPLVKPSKRLTA--TRRLGLC 64
G + + C TP R+ GT C C Y + G + P + K A RR G+
Sbjct: 499 GLQELKCETCQTPFVRK-GTDWPNCPNCTSYVRGQAGRHVPQRQKPKAAAAPNNRRSGVI 557
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T TTLWRRNN+G PVCNACGLY+KLHN+N RPL+M+KDGIQ RKR
Sbjct: 558 CANCHTNSTTLWRRNNEGNPVCNACGLYFKLHNMN----------RPLSMKKDGIQKRKR 607
Query: 125 KPKKQGG 131
KPK GG
Sbjct: 608 KPKNNGG 614
>gi|384500433|gb|EIE90924.1| hypothetical protein RO3G_15635 [Rhizopus delemar RA 99-880]
Length = 379
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 56/118 (47%), Positives = 69/118 (58%), Gaps = 14/118 (11%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP-LVKPS---KRLTATRRLGLCCT 66
C NCGA STPLWRR LCNACGLY K++ RP +KP K L C+
Sbjct: 187 CTNCGATSTPLWRRSAEDELLCNACGLYQKLHNAPRPKTLKPHNARKESKDDEGSQLVCS 246
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
NC T T LWRR+++G P+CNACGLY KLH+ RPL+M+ D I+ R+R
Sbjct: 247 NCSTTTTPLWRRDDEGAPLCNACGLYLKLHH----------ERRPLSMKTDIIKKRQR 294
>gi|195389408|ref|XP_002053369.1| GJ23376 [Drosophila virilis]
gi|194151455|gb|EDW66889.1| GJ23376 [Drosophila virilis]
Length = 1236
Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats.
Identities = 48/76 (63%), Positives = 56/76 (73%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A+RR G+ C+NC T T+LWRRN GEPVCNACGLYYKLHNV +RP
Sbjct: 699 SRRLSASRRAGMSCSNCQTSYTSLWRRNPGGEPVCNACGLYYKLHNV----------VRP 748
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 749 LTMKKDTIQKRKRKPK 764
Score = 53.1 bits (126), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY+K++ + RPL + +R
Sbjct: 709 GMSCSNCQTSYTSLWRRNPGGEPVCNACGLYYKLHNVVRPLTMKKDTIQKRKR 761
>gi|194767721|ref|XP_001965963.1| GF11880 [Drosophila ananassae]
gi|190619806|gb|EDV35330.1| GF11880 [Drosophila ananassae]
Length = 710
Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats.
Identities = 57/125 (45%), Positives = 75/125 (60%), Gaps = 17/125 (13%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG-MNRPLVKPSKRLTAT-RRLGLCCTN 67
+C NC + L+ R G+ Y C +C + ++ + + KP + RR G+ C N
Sbjct: 457 KCENCHS----LYVRKGS-EYFCPSCTNFMRLPARIPQRQAKPKQAAAPNNRRNGVTCAN 511
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
C T TTLWRRNN+G PVCNACGLY+KLHN+N RPL+M+K+GIQ RKRKPK
Sbjct: 512 CQTNSTTLWRRNNEGNPVCNACGLYFKLHNMN----------RPLSMKKEGIQKRKRKPK 561
Query: 128 KQGGG 132
GG
Sbjct: 562 NNGGA 566
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 33/53 (62%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL + + +R
Sbjct: 506 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPLSMKKEGIQKRKR 558
>gi|24647326|ref|NP_650516.2| GATAe, isoform A [Drosophila melanogaster]
gi|23171431|gb|AAF55262.2| GATAe, isoform A [Drosophila melanogaster]
gi|28603696|gb|AAO47880.1| LD08432p [Drosophila melanogaster]
gi|37951320|dbj|BAD00020.1| transcription factor dGATAe [Drosophila melanogaster]
gi|220942528|gb|ACL83807.1| GATAe-PA [synthetic construct]
gi|220952748|gb|ACL88917.1| GATAe-PA [synthetic construct]
Length = 746
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 57/125 (45%), Positives = 68/125 (54%), Gaps = 17/125 (13%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG--MNRPLVKPSKRLTATRRLGLCCTN 67
+C NC P R+ Y C C + +M R + RR G+ C N
Sbjct: 487 KCENCHG---PFLRKGS--EYFCPNCPAFMRMAPRITQRQAKPKAAAAPNNRRNGVTCAN 541
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
C T TTLWRRNN+G PVCNACGLYYKLHN+N RPL+M+K+GIQ RKRKPK
Sbjct: 542 CQTNSTTLWRRNNEGNPVCNACGLYYKLHNMN----------RPLSMKKEGIQKRKRKPK 591
Query: 128 KQGGG 132
GG
Sbjct: 592 NNGGA 596
Score = 59.3 bits (142), Expect = 6e-07, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 34/53 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY+K++ MNRPL + + +R
Sbjct: 536 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRPLSMKKEGIQKRKR 588
>gi|195570482|ref|XP_002103236.1| GD19066 [Drosophila simulans]
gi|194199163|gb|EDX12739.1| GD19066 [Drosophila simulans]
Length = 736
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 57/125 (45%), Positives = 69/125 (55%), Gaps = 17/125 (13%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM--NRPLVKPSKRLTATRRLGLCCTN 67
+C NC A P R+ Y C C + +M R + RR G+ C N
Sbjct: 477 KCENCHA---PFLRKGN--EYFCPNCPSFMRMTPRMPQRQAKPKAAAAPNNRRNGVTCAN 531
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
C T TTLWRRNN+G PVCNACGLY+KLHN+N RPL+M+K+GIQ RKRKPK
Sbjct: 532 CQTNSTTLWRRNNEGNPVCNACGLYFKLHNMN----------RPLSMKKEGIQKRKRKPK 581
Query: 128 KQGGG 132
GG
Sbjct: 582 NNGGA 586
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 33/53 (62%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL + + +R
Sbjct: 526 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPLSMKKEGIQKRKR 578
>gi|12641859|dbj|BAB21550.1| GATA factor e [Drosophila melanogaster]
Length = 704
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 57/125 (45%), Positives = 68/125 (54%), Gaps = 17/125 (13%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG--MNRPLVKPSKRLTATRRLGLCCTN 67
+C NC P R+ Y C C + +M R + RR G+ C N
Sbjct: 445 KCENCHG---PFLRKGS--EYFCPNCPAFMRMAPRITQRQAKPKAAAAPNNRRNGVTCAN 499
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
C T TTLWRRNN+G PVCNACGLYYKLHN+N RPL+M+K+GIQ RKRKPK
Sbjct: 500 CQTNSTTLWRRNNEGNPVCNACGLYYKLHNMN----------RPLSMKKEGIQKRKRKPK 549
Query: 128 KQGGG 132
GG
Sbjct: 550 NNGGA 554
Score = 59.3 bits (142), Expect = 6e-07, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 34/53 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY+K++ MNRPL + + +R
Sbjct: 494 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRPLSMKKEGIQKRKR 546
>gi|320542891|ref|NP_001189231.1| GATAe, isoform B [Drosophila melanogaster]
gi|320542893|ref|NP_001189232.1| GATAe, isoform C [Drosophila melanogaster]
gi|318068787|gb|ADV37322.1| GATAe, isoform B [Drosophila melanogaster]
gi|318068788|gb|ADV37323.1| GATAe, isoform C [Drosophila melanogaster]
Length = 731
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 57/125 (45%), Positives = 68/125 (54%), Gaps = 17/125 (13%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG--MNRPLVKPSKRLTATRRLGLCCTN 67
+C NC P R+ Y C C + +M R + RR G+ C N
Sbjct: 472 KCENCHG---PFLRKGS--EYFCPNCPAFMRMAPRITQRQAKPKAAAAPNNRRNGVTCAN 526
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
C T TTLWRRNN+G PVCNACGLYYKLHN+N RPL+M+K+GIQ RKRKPK
Sbjct: 527 CQTNSTTLWRRNNEGNPVCNACGLYYKLHNMN----------RPLSMKKEGIQKRKRKPK 576
Query: 128 KQGGG 132
GG
Sbjct: 577 NNGGA 581
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 34/53 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY+K++ MNRPL + + +R
Sbjct: 521 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRPLSMKKEGIQKRKR 573
>gi|195452168|ref|XP_002073242.1| GK13254 [Drosophila willistoni]
gi|194169327|gb|EDW84228.1| GK13254 [Drosophila willistoni]
Length = 746
Score = 107 bits (266), Expect = 2e-21, Method: Composition-based stats.
Identities = 59/123 (47%), Positives = 69/123 (56%), Gaps = 14/123 (11%)
Query: 12 VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTA--TRRLGLCCTNCG 69
+ C +PL R+ G C +C Y M P K A RR G+ C NC
Sbjct: 472 LKCEGCQSPLVRK-GHNTAFCPSC-TYMPMPPRVAPRQAKPKVPAAPNNRRNGVTCANCS 529
Query: 70 TRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQ 129
T TTLWRRNN+G PVCNACGLYYKLHN+N RPL+M+K+GIQ RKRKPK
Sbjct: 530 TNSTTLWRRNNEGNPVCNACGLYYKLHNMN----------RPLSMKKEGIQKRKRKPKNN 579
Query: 130 GGG 132
GG
Sbjct: 580 GGA 582
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 34/53 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY+K++ MNRPL + + +R
Sbjct: 522 GVTCANCSTNSTTLWRRNNEGNPVCNACGLYYKLHNMNRPLSMKKEGIQKRKR 574
>gi|47224312|emb|CAG09158.1| unnamed protein product [Tetraodon nigroviridis]
Length = 365
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 46/56 (82%), Positives = 50/56 (89%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATR 59
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL R
Sbjct: 173 DFAEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLVGDR 228
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 180 CVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPL 217
>gi|195501114|ref|XP_002097664.1| GE24358 [Drosophila yakuba]
gi|194183765|gb|EDW97376.1| GE24358 [Drosophila yakuba]
Length = 738
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 57/125 (45%), Positives = 69/125 (55%), Gaps = 17/125 (13%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM--NRPLVKPSKRLTATRRLGLCCTN 67
+C NC TP R+ Y C C + +M R + RR G+ C N
Sbjct: 483 KCENC---HTPFIRKGSD--YFCPNCPPFMRMPPRMPQRQAKPKAAAAPNNRRNGVTCAN 537
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
C T TTLWRRNN+G PVCNACGLY+KLHN+N RPL+M+K+GIQ RKRKPK
Sbjct: 538 CQTNSTTLWRRNNEGNPVCNACGLYFKLHNMN----------RPLSMKKEGIQKRKRKPK 587
Query: 128 KQGGG 132
GG
Sbjct: 588 NNGGA 592
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 33/53 (62%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL + + +R
Sbjct: 532 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPLSMKKEGIQKRKR 584
>gi|242218586|ref|XP_002475082.1| hypothetical iron transporter biosynthesis regulating transcription
factor [Postia placenta Mad-698-R]
gi|220725761|gb|EED79735.1| hypothetical iron transporter biosynthesis regulating transcription
factor [Postia placenta Mad-698-R]
Length = 111
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/118 (49%), Positives = 71/118 (60%), Gaps = 12/118 (10%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL--GLCCTNC 68
C NCGA TPLWRRDG G +CNACGLY+K +G +RP S + R L C NC
Sbjct: 1 CANCGAEVTPLWRRDGVGKTVCNACGLYYKAHGSSRPGTISSGGRSRMRAAVGALSCANC 60
Query: 69 GTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKP 126
GT T LWRR++ G +CNACGLY+KLH + RP +M+K I+ RKR P
Sbjct: 61 GTSTTPLWRRDDVGNNICNACGLYFKLHGTH----------RPNSMKKTVIKRRKRVP 108
>gi|334350414|ref|XP_001371963.2| PREDICTED: histone deacetylase 6-like [Monodelphis domestica]
Length = 1117
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 43/54 (79%), Positives = 47/54 (87%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATR 59
G+ RECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL++P KRL R
Sbjct: 28 GKARECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIRPKKRLGQKR 81
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 33 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPL 70
>gi|195328567|ref|XP_002030986.1| GM24276 [Drosophila sechellia]
gi|194119929|gb|EDW41972.1| GM24276 [Drosophila sechellia]
Length = 733
Score = 106 bits (264), Expect = 4e-21, Method: Composition-based stats.
Identities = 57/125 (45%), Positives = 69/125 (55%), Gaps = 17/125 (13%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM--NRPLVKPSKRLTATRRLGLCCTN 67
+C NC A P R+ Y C C + +M R + RR G+ C N
Sbjct: 478 KCENCHA---PFLRKGN--EYFCPNCPSFMRMTPRMPQRQAKPKAAAAPNNRRNGVTCAN 532
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
C T TTLWRRNN+G PVCNACGLY+KLHN+N RPL+M+K+GIQ RKRKPK
Sbjct: 533 CQTNSTTLWRRNNEGNPVCNACGLYFKLHNMN----------RPLSMKKEGIQKRKRKPK 582
Query: 128 KQGGG 132
GG
Sbjct: 583 NTGGA 587
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 33/53 (62%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL + + +R
Sbjct: 527 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPLSMKKEGIQKRKR 579
>gi|402589688|gb|EJW83619.1| GATA zinc finger family protein, partial [Wuchereria bancrofti]
Length = 276
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 44/49 (89%), Positives = 45/49 (91%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR 54
EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG NRPLVKP KR
Sbjct: 227 SEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLVKPKKR 275
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLA 103
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 232 CVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLV 270
>gi|170589177|ref|XP_001899350.1| GATA zinc finger family protein [Brugia malayi]
gi|158593563|gb|EDP32158.1| GATA zinc finger family protein [Brugia malayi]
Length = 278
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 44/49 (89%), Positives = 45/49 (91%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR 54
EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG NRPLVKP KR
Sbjct: 227 SEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLVKPKKR 275
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLA 103
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 232 CVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLV 270
>gi|198454788|ref|XP_001359721.2| GA10213 [Drosophila pseudoobscura pseudoobscura]
gi|198132954|gb|EAL28873.2| GA10213 [Drosophila pseudoobscura pseudoobscura]
Length = 759
Score = 105 bits (261), Expect = 8e-21, Method: Composition-based stats.
Identities = 55/122 (45%), Positives = 70/122 (57%), Gaps = 13/122 (10%)
Query: 12 VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKP-SKRLTATRRLGLCCTNCGT 70
+ C + +P R+ GT + C C M + KP + RR G+ C NC T
Sbjct: 493 LKCESCHSPFVRK-GT-DFFCPNCSFIRMTPRMPQRQAKPKTPAAPNNRRNGVTCANCQT 550
Query: 71 RMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQG 130
TTLWRRNN+G PVCNACGLY+KLH++N RPL+M+K+GIQ RKRKPK G
Sbjct: 551 NSTTLWRRNNEGNPVCNACGLYFKLHSLN----------RPLSMKKEGIQKRKRKPKNNG 600
Query: 131 GG 132
G
Sbjct: 601 GA 602
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/53 (45%), Positives = 33/53 (62%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY K++ +NRPL + + +R
Sbjct: 542 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHSLNRPLSMKKEGIQKRKR 594
>gi|195036114|ref|XP_001989516.1| GH18757 [Drosophila grimshawi]
gi|193893712|gb|EDV92578.1| GH18757 [Drosophila grimshawi]
Length = 978
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 47/76 (61%), Positives = 56/76 (73%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A+RR G+ C+NC T T+LWRRN GEPVCNACGLY+KLHNV +RP
Sbjct: 496 SRRLSASRRAGMSCSNCQTSYTSLWRRNPGGEPVCNACGLYFKLHNV----------VRP 545
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 546 LTMKKDTIQKRKRKPK 561
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 29/53 (54%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY K++ + RPL + +R
Sbjct: 506 GMSCSNCQTSYTSLWRRNPGGEPVCNACGLYFKLHNVVRPLTMKKDTIQKRKR 558
>gi|312084748|ref|XP_003144401.1| GATA zinc finger family protein [Loa loa]
Length = 278
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 44/49 (89%), Positives = 45/49 (91%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR 54
EGRECVNCGA+ TPLWRRDGTGHYLCNACGLYHKMNG NRPLVKP KR
Sbjct: 227 SEGRECVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLVKPKKR 275
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLA 103
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 232 CVNCGAVQTPLWRRDGTGHYLCNACGLYHKMNGQNRPLV 270
>gi|194901184|ref|XP_001980132.1| GG16970 [Drosophila erecta]
gi|190651835|gb|EDV49090.1| GG16970 [Drosophila erecta]
Length = 749
Score = 104 bits (260), Expect = 1e-20, Method: Composition-based stats.
Identities = 55/124 (44%), Positives = 70/124 (56%), Gaps = 16/124 (12%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM-NRPLVKPSKRLTATRRLGLCCTNC 68
+C NC + L R+ Y C C + ++ + R + RR G+ C NC
Sbjct: 495 KCENC---HSQLLRKGSD--YFCPNCPSFMRIQRIPQRQAKPKAAAAPNNRRNGVTCANC 549
Query: 69 GTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKK 128
T TTLWRRNN+G PVCNACGLY+KLHN+N RPL+M+K+GIQ RKRKPK
Sbjct: 550 QTNSTTLWRRNNEGNPVCNACGLYFKLHNMN----------RPLSMKKEGIQKRKRKPKN 599
Query: 129 QGGG 132
GG
Sbjct: 600 NGGA 603
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 33/53 (62%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL + + +R
Sbjct: 543 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPLSMKKEGIQKRKR 595
>gi|410933019|ref|XP_003979890.1| PREDICTED: GATA-binding factor 5-A-like, partial [Takifugu
rubripes]
Length = 226
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 44/49 (89%), Positives = 48/49 (97%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL 55
EGRECVNCG++STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP KRL
Sbjct: 178 EGRECVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQKRL 226
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 29/38 (76%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG+ T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 182 CVNCGSVSTPLWRRDGTGHYLCNACGLYHKMNGINRPL 219
>gi|195450286|ref|XP_002072447.1| GK22843 [Drosophila willistoni]
gi|194168532|gb|EDW83433.1| GK22843 [Drosophila willistoni]
Length = 1047
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 47/75 (62%), Positives = 53/75 (70%), Gaps = 10/75 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL A+RR GL C+NC T T+LWRRN GEPVCNACGLY+KLHNV RP
Sbjct: 709 SRRLNASRRAGLSCSNCLTTYTSLWRRNPSGEPVCNACGLYFKLHNV----------ARP 758
Query: 112 LAMRKDGIQTRKRKP 126
L M+KD IQ RKRKP
Sbjct: 759 LTMKKDTIQKRKRKP 773
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ +G +CNACGLY K++ + RPL + +R
Sbjct: 719 GLSCSNCLTTYTSLWRRNPSGEPVCNACGLYFKLHNVARPLTMKKDTIQKRKR 771
>gi|426253995|ref|XP_004020674.1| PREDICTED: transcription factor GATA-6 [Ovis aries]
Length = 409
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/85 (61%), Positives = 60/85 (70%), Gaps = 10/85 (11%)
Query: 43 GMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
GM + + P KR+ ++RRLGL C NC T TTLWRRN +GEPVCNACGLY KLH V
Sbjct: 236 GMFKLIGFPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP--- 292
Query: 103 AMRHLAMRPLAMRKDGIQTRKRKPK 127
RPLAM+K+GIQTRKRKPK
Sbjct: 293 -------RPLAMKKEGIQTRKRKPK 310
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 255 GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 296
>gi|195157148|ref|XP_002019458.1| GL12213 [Drosophila persimilis]
gi|194116049|gb|EDW38092.1| GL12213 [Drosophila persimilis]
Length = 760
Score = 103 bits (258), Expect = 2e-20, Method: Composition-based stats.
Identities = 55/122 (45%), Positives = 69/122 (56%), Gaps = 13/122 (10%)
Query: 12 VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKP-SKRLTATRRLGLCCTNCGT 70
+ C + +P R+ GT + C C M KP + RR G+ C NC T
Sbjct: 490 LKCESCHSPFVRK-GT-DFFCPNCSFIRMTPRMPPRQAKPKTPAAPNNRRNGVTCANCQT 547
Query: 71 RMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQG 130
TTLWRRNN+G PVCNACGLY+KLH++N RPL+M+K+GIQ RKRKPK G
Sbjct: 548 NSTTLWRRNNEGNPVCNACGLYFKLHSLN----------RPLSMKKEGIQKRKRKPKNNG 597
Query: 131 GG 132
G
Sbjct: 598 GA 599
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/53 (45%), Positives = 33/53 (62%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY K++ +NRPL + + +R
Sbjct: 539 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHSLNRPLSMKKEGIQKRKR 591
>gi|195111294|ref|XP_002000214.1| GI22647 [Drosophila mojavensis]
gi|193916808|gb|EDW15675.1| GI22647 [Drosophila mojavensis]
Length = 1005
Score = 103 bits (256), Expect = 3e-20, Method: Composition-based stats.
Identities = 46/76 (60%), Positives = 56/76 (73%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A+RR G+ C+NC T T+LWRRN GEPVCNACGLY+KLH+V +RP
Sbjct: 484 SRRLSASRRAGMSCSNCQTSYTSLWRRNPGGEPVCNACGLYFKLHSV----------VRP 533
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 534 LTMKKDTIQKRKRKPK 549
Score = 52.8 bits (125), Expect = 4e-05, Method: Composition-based stats.
Identities = 29/78 (37%), Positives = 39/78 (50%), Gaps = 18/78 (23%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV-----------KP----- 51
G C NC T LWRR+ G +CNACGLY K++ + RPL KP
Sbjct: 494 GMSCSNCQTSYTSLWRRNPGGEPVCNACGLYFKLHSVVRPLTMKKDTIQKRKRKPKGTKS 553
Query: 52 --SKRLTATRRLGLCCTN 67
SK++ A++ GL +N
Sbjct: 554 EKSKKMRASQAAGLVASN 571
>gi|122054007|gb|ABM66009.1| GATA4 [Ateles geoffroyi]
Length = 55
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/52 (86%), Positives = 49/52 (94%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL 55
F EGRECVNCGA+STPLWRRDGTGHYLCNACGLYHKMNG+NRPL+KP +RL
Sbjct: 4 DFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRL 55
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 11 CVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPL 48
>gi|198454790|ref|XP_002137945.1| GA26230, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132955|gb|EDY68503.1| GA26230, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 1316
Score = 102 bits (255), Expect = 4e-20, Method: Composition-based stats.
Identities = 47/76 (61%), Positives = 56/76 (73%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLY+KLH+V RP
Sbjct: 823 SRRLSASKRNGLSCSNCLTTYTSLWRRNPAGEPVCNACGLYFKLHSVT----------RP 872
Query: 112 LAMRKDGIQTRKRKPK 127
LAM+KD IQ RKRKPK
Sbjct: 873 LAMKKDTIQKRKRKPK 888
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 29/53 (54%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY K++ + RPL + +R
Sbjct: 833 GLSCSNCLTTYTSLWRRNPAGEPVCNACGLYFKLHSVTRPLAMKKDTIQKRKR 885
>gi|194767717|ref|XP_001965961.1| GF11857 [Drosophila ananassae]
gi|190619804|gb|EDV35328.1| GF11857 [Drosophila ananassae]
Length = 1271
Score = 102 bits (255), Expect = 4e-20, Method: Composition-based stats.
Identities = 46/76 (60%), Positives = 55/76 (72%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+R++A +R GL C+NC T T+LWRRN +GEPVCNACGLY+KLHNV RP
Sbjct: 806 SRRMSAQKRAGLSCSNCHTTYTSLWRRNPNGEPVCNACGLYFKLHNVP----------RP 855
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 856 LTMKKDTIQKRKRKPK 871
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 31/60 (51%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
M Q G C NC T LWRR+ G +CNACGLY K++ + RPL + +R
Sbjct: 809 MSAQKRAGLSCSNCHTTYTSLWRRNPNGEPVCNACGLYFKLHNVPRPLTMKKDTIQKRKR 868
>gi|340726756|ref|XP_003401719.1| PREDICTED: endothelial transcription factor GATA-2-like [Bombus
terrestris]
Length = 362
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 44/53 (83%), Positives = 48/53 (90%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT 58
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +T
Sbjct: 310 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLVST 362
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 29/45 (64%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 308 TSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 352
>gi|380020456|ref|XP_003694099.1| PREDICTED: trans-acting T-cell-specific transcription factor
GATA-3-like [Apis florea]
Length = 310
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 44/52 (84%), Positives = 48/52 (92%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT 58
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +T
Sbjct: 259 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLVST 310
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 29/45 (64%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 256 TSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 300
>gi|71019249|ref|XP_759855.1| hypothetical protein UM03708.1 [Ustilago maydis 521]
gi|46099653|gb|EAK84886.1| hypothetical protein UM03708.1 [Ustilago maydis 521]
Length = 1104
Score = 102 bits (255), Expect = 4e-20, Method: Composition-based stats.
Identities = 61/158 (38%), Positives = 73/158 (46%), Gaps = 40/158 (25%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-------------VKPSKRLT 56
+C NCGA STPLWRRD LCNACGLY K++ RP + PS
Sbjct: 546 QCTNCGATSTPLWRRDPNDLLLCNACGLYLKLHKTPRPRSLKSHHSHSHSGHITPSATPG 605
Query: 57 ATRRLG-----------------LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVN 99
G C NCGT T LWR+++ G VCNACGLY KLHN +
Sbjct: 606 GASAPGSRAGSPSRSGSPSGEDMTSCFNCGTYTTPLWRKDDAGHTVCNACGLYLKLHNEH 665
Query: 100 RPLAMRHLAMRPLAMRKDGIQTRKRKPKKQGGGGGGGS 137
RP+ MR D I+ R R +K+G GS
Sbjct: 666 ----------RPVTMRADVIKKRSRYDEKRGRASAAGS 693
>gi|328784695|ref|XP_003250485.1| PREDICTED: endothelial transcription factor GATA-2-like [Apis
mellifera]
Length = 311
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 44/53 (83%), Positives = 48/53 (90%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT 58
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +T
Sbjct: 259 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLVST 311
Score = 55.1 bits (131), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 29/45 (64%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 257 TSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 301
>gi|332029518|gb|EGI69407.1| Endothelial transcription factor GATA-2 [Acromyrmex echinatior]
Length = 303
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 44/52 (84%), Positives = 48/52 (92%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT 58
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +T
Sbjct: 252 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLVST 303
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 29/45 (64%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 249 TSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 293
>gi|343425496|emb|CBQ69031.1| related to gata transcription factor [Sporisorium reilianum SRZ2]
Length = 1061
Score = 102 bits (254), Expect = 5e-20, Method: Composition-based stats.
Identities = 61/158 (38%), Positives = 74/158 (46%), Gaps = 40/158 (25%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP---------------------- 47
+C NCGA STPLWRRD LCNACGLY K++ RP
Sbjct: 611 QCTNCGATSTPLWRRDPNDLLLCNACGLYLKLHKTPRPKSLKNHHSHSHAGHMTPSATPG 670
Query: 48 -LVKPSKRLTATRRLG-------LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVN 99
P R + R G + C NCGT T LWR+++ G VCNACGLY KLHN +
Sbjct: 671 GASAPGSRAGSPSRPGSPGAEDMMSCFNCGTYTTPLWRKDDAGHTVCNACGLYLKLHNEH 730
Query: 100 RPLAMRHLAMRPLAMRKDGIQTRKRKPKKQGGGGGGGS 137
RP+ MR D I+ R R +K+G S
Sbjct: 731 ----------RPVTMRADVIKKRSRYDEKRGRASAASS 758
>gi|164662389|ref|XP_001732316.1| hypothetical protein MGL_0091 [Malassezia globosa CBS 7966]
gi|159106219|gb|EDP45102.1| hypothetical protein MGL_0091 [Malassezia globosa CBS 7966]
Length = 435
Score = 102 bits (254), Expect = 5e-20, Method: Composition-based stats.
Identities = 52/110 (47%), Positives = 68/110 (61%), Gaps = 10/110 (9%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV-KPSKRLTATR---RLGLC-- 64
C NCG +TPLWRR+ LCNACGLY K++ +RPL+ + ++L +TR C
Sbjct: 172 CSNCGTDNTPLWRRNHNMLLLCNACGLYLKIHKTHRPLLLRKRQQLNSTRASQSQDPCSG 231
Query: 65 ----CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
CTNCGT++T LWR+ G +CNACGLY KLH NRP+ R +R
Sbjct: 232 PSSGCTNCGTKVTPLWRKGISGAVLCNACGLYLKLHQSNRPVRYRADVIR 281
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 23/43 (53%), Positives = 28/43 (65%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NCG TPLWR+ +G LCNACGLY K++ NRP+
Sbjct: 231 GPSSGCTNCGTKVTPLWRKGISGAVLCNACGLYLKLHQSNRPV 273
>gi|339236855|ref|XP_003379982.1| zinc finger protein [Trichinella spiralis]
gi|316977267|gb|EFV60390.1| zinc finger protein [Trichinella spiralis]
Length = 419
Score = 102 bits (253), Expect = 6e-20, Method: Composition-based stats.
Identities = 47/74 (63%), Positives = 54/74 (72%), Gaps = 10/74 (13%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
T ++R GL C NC T TTLWRRN +G+PVCNACGLY+KLH VN RPL+M+
Sbjct: 116 TTSKRQGLVCVNCQTMQTTLWRRNQNGDPVCNACGLYFKLHRVN----------RPLSMK 165
Query: 116 KDGIQTRKRKPKKQ 129
KDGIQTRKRKPK Q
Sbjct: 166 KDGIQTRKRKPKNQ 179
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 33/54 (61%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
+G CVNC + T LWRR+ G +CNACGLY K++ +NRPL + +R
Sbjct: 121 QGLVCVNCQTMQTTLWRRNQNGDPVCNACGLYFKLHRVNRPLSMKKDGIQTRKR 174
>gi|195157150|ref|XP_002019459.1| GL12212 [Drosophila persimilis]
gi|194116050|gb|EDW38093.1| GL12212 [Drosophila persimilis]
Length = 1126
Score = 102 bits (253), Expect = 6e-20, Method: Composition-based stats.
Identities = 47/76 (61%), Positives = 56/76 (73%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLY+KLH+V RP
Sbjct: 820 SRRLSASKRNGLSCSNCLTTYTSLWRRNPAGEPVCNACGLYFKLHSVT----------RP 869
Query: 112 LAMRKDGIQTRKRKPK 127
LAM+KD IQ RKRKPK
Sbjct: 870 LAMKKDTIQKRKRKPK 885
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 29/53 (54%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY K++ + RPL + +R
Sbjct: 830 GLSCSNCLTTYTSLWRRNPAGEPVCNACGLYFKLHSVTRPLAMKKDTIQKRKR 882
>gi|45551908|ref|NP_732100.2| serpent, isoform A [Drosophila melanogaster]
gi|73920793|sp|P52172.2|SRP_DROME RecName: Full=Box A-binding factor; Short=ABF; AltName:
Full=GATA-binding factor B; AltName: Full=Protein
serpent; AltName: Full=Transcription factor GATA-B;
AltName: Full=dGATA-B
gi|45446505|gb|AAF55261.2| serpent, isoform A [Drosophila melanogaster]
Length = 1264
Score = 102 bits (253), Expect = 6e-20, Method: Composition-based stats.
Identities = 47/76 (61%), Positives = 55/76 (72%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLYYKLH+V RP
Sbjct: 790 SRRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVP----------RP 839
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 840 LTMKKDTIQKRKRKPK 855
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY+K++ + RPL + +R
Sbjct: 800 GLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPLTMKKDTIQKRKR 852
>gi|350422369|ref|XP_003493143.1| PREDICTED: hypothetical protein LOC100742684 [Bombus impatiens]
Length = 215
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 44/53 (83%), Positives = 48/53 (90%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT 58
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +T
Sbjct: 163 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLVST 215
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 29/45 (64%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 161 TSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 205
>gi|307212940|gb|EFN88533.1| GATA-binding factor 2 [Harpegnathos saltator]
Length = 306
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 44/52 (84%), Positives = 48/52 (92%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT 58
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +T
Sbjct: 255 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLVST 306
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 29/45 (64%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 252 TSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 296
>gi|194901182|ref|XP_001980131.1| GG16969 [Drosophila erecta]
gi|190651834|gb|EDV49089.1| GG16969 [Drosophila erecta]
Length = 1244
Score = 101 bits (252), Expect = 8e-20, Method: Composition-based stats.
Identities = 47/76 (61%), Positives = 55/76 (72%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLYYKLH+V RP
Sbjct: 780 SRRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVP----------RP 829
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 830 LTMKKDTIQKRKRKPK 845
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY+K++ + RPL + +R
Sbjct: 790 GLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPLTMKKDTIQKRKR 842
>gi|358339242|dbj|GAA47341.1| GATA-binding protein 4/5/6 [Clonorchis sinensis]
Length = 725
Score = 101 bits (252), Expect = 8e-20, Method: Composition-based stats.
Identities = 50/94 (53%), Positives = 59/94 (62%), Gaps = 12/94 (12%)
Query: 36 GLYHKMNGMNRPLVKPS--KRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYY 93
G K NG P + + A RR G CTNC T TTLWRRN +G+PVCNACGLYY
Sbjct: 558 GFQRKQNGSQTPTFESTMLNLHQANRRPGQICTNCNTSATTLWRRNAEGDPVCNACGLYY 617
Query: 94 KLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
KLH VN RP++M+K+GIQTRKRKP+
Sbjct: 618 KLHKVN----------RPISMKKEGIQTRKRKPR 641
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 34/53 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G+ C NC +T LWRR+ G +CNACGLY+K++ +NRP+ + + +R
Sbjct: 586 GQICTNCNTSATTLWRRNAEGDPVCNACGLYYKLHKVNRPISMKKEGIQTRKR 638
>gi|345495111|ref|XP_003427437.1| PREDICTED: hypothetical protein LOC100680343 [Nasonia vitripennis]
Length = 478
Score = 101 bits (252), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 43/51 (84%), Positives = 47/51 (92%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT 56
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL+
Sbjct: 412 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLS 462
Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 29/45 (64%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
T G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 410 TSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 454
>gi|242002772|ref|XP_002436029.1| GATA binding factor-1B, putative [Ixodes scapularis]
gi|215499365|gb|EEC08859.1| GATA binding factor-1B, putative [Ixodes scapularis]
Length = 425
Score = 101 bits (252), Expect = 9e-20, Method: Composition-based stats.
Identities = 43/63 (68%), Positives = 53/63 (84%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
+++ GEGRECVNCGAISTPLWRRDGTGHYLCNACGLY+KMNG +RP++K +RL + +
Sbjct: 357 LEYFGGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYNKMNGAHRPIIKTPRRLVSRGK 416
Query: 61 LGL 63
L
Sbjct: 417 WNL 419
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 20/38 (52%), Positives = 26/38 (68%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY K++ +RP+
Sbjct: 367 CVNCGAISTPLWRRDGTGHYLCNACGLYNKMNGAHRPI 404
>gi|390595728|gb|EIN05132.1| hypothetical protein PUNSTDRAFT_122479 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 839
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 56/140 (40%), Positives = 72/140 (51%), Gaps = 20/140 (14%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP----------LVKPSKRL 55
G EC NCGA TPLWRR CNACGLY K++ RP + + R
Sbjct: 410 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKTMRNSHGEGRTQVAPRQ 469
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+ C NC T T LWR++++G+ VCNACGLYYKLH A RP++M+
Sbjct: 470 DTVDVMTAQCYNCHTTTTPLWRKDDEGKTVCNACGLYYKLHG----------AARPISMK 519
Query: 116 KDGIQTRKRKPKKQGGGGGG 135
D I+ R R ++ G G G
Sbjct: 520 SDVIRKRSRHDARRVGTGVG 539
>gi|165292337|dbj|BAF98874.1| GATA binding protein 3 isoform 1 [Carassius auratus langsdorfii]
Length = 178
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/51 (86%), Positives = 46/51 (90%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT 56
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL
Sbjct: 121 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLV 171
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 126 CVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPL 163
>gi|195328569|ref|XP_002030987.1| GM24275 [Drosophila sechellia]
gi|194119930|gb|EDW41973.1| GM24275 [Drosophila sechellia]
Length = 1238
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 47/76 (61%), Positives = 55/76 (72%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLYYKLH+V RP
Sbjct: 778 SRRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVP----------RP 827
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 828 LTMKKDTIQKRKRKPK 843
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY+K++ + RPL + +R
Sbjct: 788 GLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPLTMKKDTIQKRKR 840
>gi|165292345|dbj|BAF98878.1| GATA binding protein 3 isoform 5 [Carassius auratus langsdorfii]
Length = 159
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/50 (88%), Positives = 46/50 (92%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT 56
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL
Sbjct: 103 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLV 152
Score = 51.6 bits (122), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 107 CVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPL 144
>gi|348551454|ref|XP_003461545.1| PREDICTED: endothelial transcription factor GATA-2-like [Cavia
porcellus]
Length = 455
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 43/50 (86%), Positives = 46/50 (92%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT 56
EGRECVNCGA +TPLWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL
Sbjct: 291 EGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLV 340
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 295 CVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPL 332
>gi|17862982|gb|AAL39968.1| SD07261p [Drosophila melanogaster]
gi|220942384|gb|ACL83735.1| srp-PD [synthetic construct]
gi|220952630|gb|ACL88858.1| srp-PD [synthetic construct]
Length = 950
Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats.
Identities = 47/76 (61%), Positives = 55/76 (72%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLYYKLH+V RP
Sbjct: 476 SRRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVP----------RP 525
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 526 LTMKKDTIQKRKRKPK 541
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY+K++ + RPL + +R
Sbjct: 486 GLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPLTMKKDTIQKRKR 538
>gi|146760638|gb|ABQ44507.1| GATA-binding protein 4 [Oryzias latipes]
Length = 218
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/60 (76%), Positives = 53/60 (88%)
Query: 37 LYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLH 96
LYHKMNG+NRPL+KP +RL+A+RR+GL CTNC T TTLWRRN +GEPVCNACGLY KLH
Sbjct: 158 LYHKMNGINRPLIKPQRRLSASRRVGLSCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLH 217
Score = 43.1 bits (100), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG 43
G C NC +T LWRR+ G +CNACGLY K++G
Sbjct: 183 GLSCTNCHTTTTTLWRRNAEGEPVCNACGLYMKLHG 218
>gi|336363936|gb|EGN92304.1| hypothetical protein SERLA73DRAFT_65990 [Serpula lacrymans var.
lacrymans S7.3]
Length = 772
Score = 100 bits (250), Expect = 1e-19, Method: Composition-based stats.
Identities = 54/138 (39%), Positives = 71/138 (51%), Gaps = 20/138 (14%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP----------LVKPSKRL 55
G EC NCGA TPLWRR CNACGLY K++ RP + + R
Sbjct: 369 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRSNHGEGRTQAAPRQ 428
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+ C NC T T LWR++++G+ VCNACGLYYKLH + RP++M+
Sbjct: 429 ETVDVMVAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHG----------SARPISMK 478
Query: 116 KDGIQTRKRKPKKQGGGG 133
D I+ R R ++ G G
Sbjct: 479 SDVIRKRSRHDARRVGPG 496
>gi|195111292|ref|XP_002000213.1| GI22648 [Drosophila mojavensis]
gi|193916807|gb|EDW15674.1| GI22648 [Drosophila mojavensis]
Length = 750
Score = 100 bits (250), Expect = 1e-19, Method: Composition-based stats.
Identities = 55/123 (44%), Positives = 68/123 (55%), Gaps = 14/123 (11%)
Query: 12 VNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNR-PLVKPSKRLTA--TRRLGLCCTNC 68
+ C P R+ G+ C C Y + + P + K A RR G+ C NC
Sbjct: 472 IKCETCQIPFIRK-GSDWPNCPNCTSYGRAQPVRHVPQRQKPKAAAAPNNRRSGVVCANC 530
Query: 69 GTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKK 128
T TTLWRRNN+G PVCNACGLY+KLHN+N RPL+M+K+GIQ RKRKPK
Sbjct: 531 QTNSTTLWRRNNEGNPVCNACGLYFKLHNMN----------RPLSMKKEGIQKRKRKPKN 580
Query: 129 QGG 131
G
Sbjct: 581 NGA 583
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 33/53 (62%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL + + +R
Sbjct: 524 GVVCANCQTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPLSMKKEGIQKRKR 576
>gi|336379100|gb|EGO20256.1| hypothetical protein SERLADRAFT_452950 [Serpula lacrymans var.
lacrymans S7.9]
Length = 784
Score = 100 bits (250), Expect = 1e-19, Method: Composition-based stats.
Identities = 57/137 (41%), Positives = 71/137 (51%), Gaps = 19/137 (13%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSK----RLTATRRL 61
G EC NCGA TPLWRR CNACGLY K++ RP S R A R
Sbjct: 406 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRSNHGEGRTQAAPRQ 465
Query: 62 GLC-----CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
C NC T T LWR++++G+ VCNACGLYYKLH + RP++M+
Sbjct: 466 ETVDVMAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHG----------SARPISMKS 515
Query: 117 DGIQTRKRKPKKQGGGG 133
D I+ R R ++ G G
Sbjct: 516 DVIRKRSRHDARRVGPG 532
>gi|195501117|ref|XP_002097665.1| GE24357 [Drosophila yakuba]
gi|194183766|gb|EDW97377.1| GE24357 [Drosophila yakuba]
Length = 1272
Score = 100 bits (250), Expect = 2e-19, Method: Composition-based stats.
Identities = 45/75 (60%), Positives = 54/75 (72%), Gaps = 10/75 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLY+KLH+V RP
Sbjct: 794 SRRLSASKRAGLSCSNCNTTHTSLWRRNPAGEPVCNACGLYFKLHSVP----------RP 843
Query: 112 LAMRKDGIQTRKRKP 126
L M+KD IQ RKRKP
Sbjct: 844 LTMKKDTIQKRKRKP 858
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 29/53 (54%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY K++ + RPL + +R
Sbjct: 804 GLSCSNCNTTHTSLWRRNPAGEPVCNACGLYFKLHSVPRPLTMKKDTIQKRKR 856
>gi|357619685|gb|EHJ72160.1| grain [Danaus plexippus]
Length = 304
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 43/50 (86%), Positives = 46/50 (92%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL 55
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL
Sbjct: 198 AEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRL 247
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 48/102 (47%), Gaps = 12/102 (11%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCN---ACGLYHKMNG---MNRPLVKPSKRLTATRR 60
EG + +C S P +G Y A G Y+ G + KP T T
Sbjct: 145 EGAQPDSCPQESQP------SGEYNAQYNAAAGDYYNYQGKGYSGQQQSKPRPNKTRTSA 198
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
G C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 199 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 240
>gi|159884077|gb|ABX00717.1| GH11649p [Drosophila melanogaster]
Length = 1009
Score = 100 bits (250), Expect = 2e-19, Method: Composition-based stats.
Identities = 47/76 (61%), Positives = 55/76 (72%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLYYKLH+V RP
Sbjct: 812 SRRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVP----------RP 861
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 862 LTMKKDTIQKRKRKPK 877
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY+K++ + RPL + +R
Sbjct: 822 GLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPLTMKKDTIQKRKR 874
>gi|384489627|gb|EIE80849.1| hypothetical protein RO3G_05554 [Rhizopus delemar RA 99-880]
Length = 221
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 66/134 (49%), Gaps = 35/134 (26%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL--------------------- 48
C NCG +TPLWRR G +CNACGLY+K RP+
Sbjct: 88 SCSNCGTTTTPLWRRSPLGETICNACGLYYKARNTCRPVWLKRNLSKKNLVRQQPPLLAP 147
Query: 49 ------------VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLH 96
++P R L C NC T+ T LWRR+ G+P+CNACGLY+KLH
Sbjct: 148 ALTEKKSHQVIVIQPPNRTEENN--NLVCANCSTKTTPLWRRDESGQPICNACGLYFKLH 205
Query: 97 NVNRPLAMRHLAMR 110
NV+RP+ M+ ++
Sbjct: 206 NVHRPMTMKRSTIK 219
>gi|195036112|ref|XP_001989515.1| GH18758 [Drosophila grimshawi]
gi|193893711|gb|EDV92577.1| GH18758 [Drosophila grimshawi]
Length = 749
Score = 100 bits (249), Expect = 2e-19, Method: Composition-based stats.
Identities = 56/123 (45%), Positives = 68/123 (55%), Gaps = 14/123 (11%)
Query: 12 VNCGAISTPLWRRDGTGHYLCNACGLYHK-MNGMNRPLVKPSKRLTA--TRRLGLCCTNC 68
+ C T R+ G C C Y + G + P + K A RR G+ C NC
Sbjct: 470 LKCDTCQTSFMRK-GNEWPNCPNCTSYGRNQAGRHVPQRQKPKAAAAPNNRRNGVTCANC 528
Query: 69 GTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKK 128
T TTLWRRNN+G PVCNACGLY+KLHN+N RPL+M+K+GIQ RKRKPK
Sbjct: 529 STNSTTLWRRNNEGNPVCNACGLYFKLHNMN----------RPLSMKKEGIQKRKRKPKN 578
Query: 129 QGG 131
G
Sbjct: 579 NGA 581
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 33/53 (62%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC ST LWRR+ G+ +CNACGLY K++ MNRPL + + +R
Sbjct: 522 GVTCANCSTNSTTLWRRNNEGNPVCNACGLYFKLHNMNRPLSMKKEGIQKRKR 574
>gi|388580872|gb|EIM21184.1| iron transporter biosynthesis regulating transcription factor,
partial [Wallemia sebi CBS 633.66]
Length = 113
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/121 (47%), Positives = 65/121 (53%), Gaps = 15/121 (12%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPS-----KRLTATRRLGLCC 65
C NCG +TPLWRR LCNACGLY K RP S + T L C
Sbjct: 1 CSNCGTSNTPLWRRGLNDQTLCNACGLYEKNRNTPRPTTLQSTTINQSDINKTSGSSLQC 60
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRK 125
NCGT T LWRR +DG+P CNACGLY KLHN A RP+ M+K I+ RKR
Sbjct: 61 ANCGTTTTPLWRRTDDGKPQCNACGLYQKLHN----------APRPVHMKKTIIKRRKRV 110
Query: 126 P 126
P
Sbjct: 111 P 111
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 29/48 (60%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
++ G +C NCG +TPLWRR G CNACGLY K++ RP+
Sbjct: 50 INKTSGSSLQCANCGTTTTPLWRRTDDGKPQCNACGLYQKLHNAPRPV 97
>gi|270008081|gb|EFA04529.1| serpent [Tribolium castaneum]
Length = 528
Score = 100 bits (249), Expect = 2e-19, Method: Composition-based stats.
Identities = 58/96 (60%), Positives = 65/96 (67%), Gaps = 16/96 (16%)
Query: 38 YHKMNGMNRPLV------KPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGL 91
YH NG NR K S+RL+A+RR+GL CTNC T T+LWRRN GEPVCNACGL
Sbjct: 303 YHGYNGANRAPALHTTEEKSSRRLSASRRVGLTCTNCHTSTTSLWRRNTVGEPVCNACGL 362
Query: 92 YYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
Y+KLH VN RPLAM+KD IQTRKRKPK
Sbjct: 363 YFKLHGVN----------RPLAMKKDSIQTRKRKPK 388
Score = 58.5 bits (140), Expect = 7e-07, Method: Composition-based stats.
Identities = 24/53 (45%), Positives = 32/53 (60%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC +T LWRR+ G +CNACGLY K++G+NRPL + +R
Sbjct: 333 GLTCTNCHTSTTSLWRRNTVGEPVCNACGLYFKLHGVNRPLAMKKDSIQTRKR 385
>gi|391347263|ref|XP_003747884.1| PREDICTED: uncharacterized protein LOC100905208 [Metaseiulus
occidentalis]
Length = 594
Score = 100 bits (248), Expect = 2e-19, Method: Composition-based stats.
Identities = 49/73 (67%), Positives = 52/73 (71%), Gaps = 10/73 (13%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
+R GL C+NC T TTLWRRNN GEPVCNACGLYYKLH VN RP AM+KDG
Sbjct: 372 KRPGLVCSNCDTNNTTLWRRNNQGEPVCNACGLYYKLHGVN----------RPAAMKKDG 421
Query: 119 IQTRKRKPKKQGG 131
IQTRKRKPK G
Sbjct: 422 IQTRKRKPKSVEG 434
Score = 80.9 bits (198), Expect = 1e-13, Method: Composition-based stats.
Identities = 33/43 (76%), Positives = 37/43 (86%)
Query: 5 FGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
+ + +ECVNCGA+STPLWRRD GHYLCNACGLY KMNG NRP
Sbjct: 266 YQDSKECVNCGAVSTPLWRRDMAGHYLCNACGLYTKMNGSNRP 308
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 32/53 (60%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC +T LWRR+ G +CNACGLY+K++G+NRP + +R
Sbjct: 375 GLVCSNCDTNNTTLWRRNNQGEPVCNACGLYYKLHGVNRPAAMKKDGIQTRKR 427
Score = 52.4 bits (124), Expect = 7e-05, Method: Composition-based stats.
Identities = 22/42 (52%), Positives = 26/42 (61%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRH 106
C NCG T LWRR+ G +CNACGLY K++ NRP H
Sbjct: 272 CVNCGAVSTPLWRRDMAGHYLCNACGLYTKMNGSNRPPGRNH 313
>gi|119571135|gb|EAW50750.1| GATA binding protein 1 (globin transcription factor 1), isoform
CRA_b [Homo sapiens]
Length = 183
Score = 100 bits (248), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 43/56 (76%), Positives = 48/56 (85%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL ++R G
Sbjct: 117 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAG 172
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%), Gaps = 4/55 (7%)
Query: 52 SKRLTATRRLGLC----CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
S +L T L C C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 104 SPKLRGTLPLPPCEARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPL 158
>gi|260785648|ref|XP_002587872.1| hypothetical protein BRAFLDRAFT_87259 [Branchiostoma floridae]
gi|229273027|gb|EEN43883.1| hypothetical protein BRAFLDRAFT_87259 [Branchiostoma floridae]
Length = 974
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/70 (64%), Positives = 55/70 (78%), Gaps = 1/70 (1%)
Query: 31 LCNACGLYHKMNGMNRP-LVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNAC 89
L GLY ++NG +P L KP K+L+A+RR+GL C NC T TTLWRRNN+GEPVCNAC
Sbjct: 620 LYQNYGLYQQLNGAQKPGLAKPPKKLSASRRVGLQCANCRTTQTTLWRRNNEGEPVCNAC 679
Query: 90 GLYYKLHNVN 99
GLY+KLHNV+
Sbjct: 680 GLYFKLHNVS 689
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 59/164 (35%), Gaps = 76/164 (46%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
G +C NC T LWRR+ G +CNACGLY K++
Sbjct: 652 GLQCANCRTTQTTLWRRNNEGEPVCNACGLYFKLHN------------------------ 687
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRH--------------------- 106
V ACGLYYKLHNV+ L
Sbjct: 688 -----------------VSTACGLYYKLHNVSTTLLQTTQCKYNPTTNYTIPPYVSCYPV 730
Query: 107 --------------LAMRPLAMRKDGIQTRKRKPKKQGGGGGGG 136
RPLAM+KDGIQTRKRKPK G GGG
Sbjct: 731 PANLKPGKGNLGTGQVNRPLAMKKDGIQTRKRKPKTLNKGKGGG 774
Score = 43.9 bits (102), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 21/27 (77%), Positives = 22/27 (81%)
Query: 110 RPLAMRKDGIQTRKRKPKKQGGGGGGG 136
RPLAM+KDGIQTRKRKPK G GGG
Sbjct: 845 RPLAMKKDGIQTRKRKPKTLNKGKGGG 871
>gi|1556394|emb|CAA68943.1| GATA factor [Drosophila melanogaster]
Length = 949
Score = 99.8 bits (247), Expect = 3e-19, Method: Composition-based stats.
Identities = 47/76 (61%), Positives = 55/76 (72%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLYYKLH+V RP
Sbjct: 476 SRRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVP----------RP 525
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 526 LTMKKDTIQKRKRKPK 541
Score = 53.1 bits (126), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY+K++ + RPL + +R
Sbjct: 486 GLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPLTMKKDTIQKRKR 538
>gi|169854435|ref|XP_001833892.1| GATA-binding factor 6-B [Coprinopsis cinerea okayama7#130]
gi|116505027|gb|EAU87922.1| GATA-binding factor 6-B [Coprinopsis cinerea okayama7#130]
Length = 877
Score = 99.8 bits (247), Expect = 3e-19, Method: Composition-based stats.
Identities = 55/137 (40%), Positives = 72/137 (52%), Gaps = 25/137 (18%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------------LVKPSK 53
G EC NCGA TPLWRR CNACGLY K++ RP V+P +
Sbjct: 436 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNNHGEGRQQVQPRQ 495
Query: 54 RLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLA 113
T + C NC T T LWR++++G+ VCNACGLYYKLH + RP++
Sbjct: 496 E---TVDVMAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHG----------SARPIS 542
Query: 114 MRKDGIQTRKRKPKKQG 130
M+ D I+ R R ++G
Sbjct: 543 MKSDVIRKRSRHDARRG 559
>gi|449295695|gb|EMC91716.1| hypothetical protein BAUCODRAFT_52987, partial [Baudoinia
compniacensis UAMH 10762]
Length = 105
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 64/114 (56%), Gaps = 10/114 (8%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGT 70
C NCG TPLWRR TG +CNACGLY+K RP+ + + + C NCGT
Sbjct: 1 CSNCGTTKTPLWRRSPTGSIICNACGLYYKARNQMRPVGLKRVQQAQITNVVVACQNCGT 60
Query: 71 RMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
+T LWRR+ G +CNACGLY KLH A RP+ M+K I+ RKR
Sbjct: 61 TITPLWRRDEAGHTLCNACGLYQKLHG----------AHRPVQMKKPEIKRRKR 104
>gi|452838991|gb|EME40931.1| hypothetical protein DOTSEDRAFT_106841, partial [Dothistroma
septosporum NZE10]
Length = 106
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 47/100 (47%), Positives = 61/100 (61%), Gaps = 2/100 (2%)
Query: 13 NCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLTATRRLGLCCTNCGT 70
NCG TPLWRR G +CNACGLY+K RP L + + A + + C NCGT
Sbjct: 2 NCGTTKTPLWRRSPAGAVICNACGLYYKARNQMRPVGLKRGTPTTQAPTNVVIACQNCGT 61
Query: 71 RMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+T LWRR+ G +CNACGLYYKLH +RP+ M+ ++
Sbjct: 62 TITPLWRRDEAGHTICNACGLYYKLHGAHRPVQMKKAEIK 101
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/38 (63%), Positives = 30/38 (78%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NCG TPLWRRD GH +CNACGLY+K++G +RP+
Sbjct: 56 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGAHRPV 93
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 28/40 (70%)
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
+NCGT T LWRR+ G +CNACGLYYK N RP+ ++
Sbjct: 1 SNCGTTKTPLWRRSPAGAVICNACGLYYKARNQMRPVGLK 40
>gi|170044813|ref|XP_001850027.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867945|gb|EDS31328.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 110
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 43/52 (82%), Positives = 47/52 (90%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT 58
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +
Sbjct: 44 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLVSV 95
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/38 (60%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 48 CVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 85
>gi|307190871|gb|EFN74708.1| GATA-binding factor C [Camponotus floridanus]
Length = 57
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 44/52 (84%), Positives = 48/52 (92%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT 58
EGRECVNCGA STPLWRRDGTGHYLCNACGLY+KMNG NRPL+KP +RL +T
Sbjct: 6 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLVST 57
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 23/38 (60%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLYYK++ NRPL
Sbjct: 10 CVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPL 47
>gi|384499461|gb|EIE89952.1| hypothetical protein RO3G_14663 [Rhizopus delemar RA 99-880]
Length = 374
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 68/149 (45%), Gaps = 40/149 (26%)
Query: 2 DFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL------------- 48
D C NCG +TPLWRR G +CNACGLY+K +RP+
Sbjct: 78 DIGSTTATSCSNCGTTTTPLWRRSPLGETICNACGLYYKARNTSRPVWLKRNYLKQRQQQ 137
Query: 49 ------VKPSKRLTATRR---------------------LGLCCTNCGTRMTTLWRRNND 81
+P AT++ C NC T T LWRR+
Sbjct: 138 KHLAPRQQPPLLAPATQKPIDPPPLPLVLPTAPAKVEHTTDFVCANCSTETTPLWRRDES 197
Query: 82 GEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
G+P+CNACGLYYKLH+V+RP+ M+ ++
Sbjct: 198 GQPICNACGLYYKLHHVHRPVTMKRSTIK 226
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/65 (46%), Positives = 40/65 (61%), Gaps = 4/65 (6%)
Query: 45 NRPLVKPSKRLTATRRLG----LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNR 100
N ++PS AT +G C+NCGT T LWRR+ GE +CNACGLYYK N +R
Sbjct: 63 NNSFIEPSNASVATSDIGSTTATSCSNCGTTTTPLWRRSPLGETICNACGLYYKARNTSR 122
Query: 101 PLAMR 105
P+ ++
Sbjct: 123 PVWLK 127
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 2/66 (3%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGT 70
C NC +TPLWRRD +G +CNACGLY+K++ ++RP+ KR T RR + N T
Sbjct: 181 CANCSTETTPLWRRDESGQPICNACGLYYKLHHVHRPVTM--KRSTIKRRKRVTAANLTT 238
Query: 71 RMTTLW 76
++
Sbjct: 239 HKPMVY 244
>gi|410056444|ref|XP_001139973.3| PREDICTED: erythroid transcription factor [Pan troglodytes]
Length = 242
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 48/73 (65%), Positives = 55/73 (75%), Gaps = 4/73 (5%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++P KRL R C T
Sbjct: 167 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLV---RPQTCFT 223
Query: 67 -NCGTRMTTLWRR 78
GT + TL+++
Sbjct: 224 LYTGTPVLTLYKK 236
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%), Gaps = 4/55 (7%)
Query: 52 SKRLTATRRLGLC----CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
S +L T L C C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 154 SPKLRGTLPLPPCEARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPL 208
>gi|409042948|gb|EKM52431.1| hypothetical protein PHACADRAFT_260812 [Phanerochaete carnosa
HHB-10118-sp]
Length = 828
Score = 99.0 bits (245), Expect = 6e-19, Method: Composition-based stats.
Identities = 55/136 (40%), Positives = 72/136 (52%), Gaps = 19/136 (13%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK-----PSKRLTATRR 60
G EC NCGA TPLWRR CNACGLY K++ RP + TA R+
Sbjct: 406 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNNHGEGRSQTAPRQ 465
Query: 61 ----LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
+ C NC T T LWR++++G+ VCNACGLYYKLH + RP++M+
Sbjct: 466 ESQEVVAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHG----------SARPISMKS 515
Query: 117 DGIQTRKRKPKKQGGG 132
D I+ R R ++ G
Sbjct: 516 DVIRKRARHDARRASG 531
>gi|78706774|ref|NP_001027190.1| serpent, isoform D [Drosophila melanogaster]
gi|442619343|ref|NP_001262618.1| serpent, isoform F [Drosophila melanogaster]
gi|42415419|gb|AAS15679.1| LD44281p [Drosophila melanogaster]
gi|71854562|gb|AAN13692.3| serpent, isoform D [Drosophila melanogaster]
gi|440217477|gb|AGB95998.1| serpent, isoform F [Drosophila melanogaster]
Length = 746
Score = 99.0 bits (245), Expect = 6e-19, Method: Composition-based stats.
Identities = 47/76 (61%), Positives = 55/76 (72%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLYYKLH+V RP
Sbjct: 272 SRRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVP----------RP 321
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 322 LTMKKDTIQKRKRKPK 337
Score = 52.8 bits (125), Expect = 4e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY+K++ + RPL + +R
Sbjct: 282 GLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPLTMKKDTIQKRKR 334
>gi|449543979|gb|EMD34953.1| hypothetical protein CERSUDRAFT_116480 [Ceriporiopsis subvermispora
B]
Length = 830
Score = 98.6 bits (244), Expect = 7e-19, Method: Composition-based stats.
Identities = 54/137 (39%), Positives = 71/137 (51%), Gaps = 19/137 (13%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV---------KPSKRLT 56
G EC NCGA TPLWRR CNACGLY K++ RP S
Sbjct: 428 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNHSGEGRTQSAPRQ 487
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
++ + C NC T T LWR++++G+ VCNACGLYYKLH + RP++M+
Sbjct: 488 ESQEVIAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHG----------SARPISMKS 537
Query: 117 DGIQTRKRKPKKQGGGG 133
D I+ R R ++ G G
Sbjct: 538 DVIRKRARHDARRVGPG 554
>gi|170097681|ref|XP_001880060.1| siderophore biosynthesis regulatory protein [Laccaria bicolor
S238N-H82]
gi|164645463|gb|EDR09711.1| siderophore biosynthesis regulatory protein [Laccaria bicolor
S238N-H82]
Length = 928
Score = 98.6 bits (244), Expect = 7e-19, Method: Composition-based stats.
Identities = 53/137 (38%), Positives = 69/137 (50%), Gaps = 21/137 (15%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP-----------LVKPSKR 54
G EC NCGA TPLWRR CNACGLY K++ RP ++
Sbjct: 528 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNTHGEGRAQAAPRQ 587
Query: 55 LTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAM 114
T C NC T T LWR++++G+ VCNACGLYYKLH + RP++M
Sbjct: 588 ETVDVMTAAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHG----------SARPISM 637
Query: 115 RKDGIQTRKRKPKKQGG 131
+ D I+ R R ++ G
Sbjct: 638 KSDVIRKRSRHDARRSG 654
>gi|297275072|ref|XP_002800933.1| PREDICTED: transcription factor GATA-6-like [Macaca mulatta]
Length = 653
Score = 98.2 bits (243), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/92 (56%), Positives = 63/92 (68%), Gaps = 14/92 (15%)
Query: 38 YHKMNGMNRPLVKPSKRLT--ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKL 95
+H +N RP++ + L+ ++RRLGL C NC T TTLWRRN +GEPVCNACGLY KL
Sbjct: 475 FHDLN--LRPVIVHRQLLSTPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKL 532
Query: 96 HNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
H V RPLAM+K+GIQTRKRKPK
Sbjct: 533 HGVP----------RPLAMKKEGIQTRKRKPK 554
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 499 GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 540
>gi|195570486|ref|XP_002103238.1| GD19064 [Drosophila simulans]
gi|194199165|gb|EDX12741.1| GD19064 [Drosophila simulans]
Length = 696
Score = 97.8 bits (242), Expect = 1e-18, Method: Composition-based stats.
Identities = 47/76 (61%), Positives = 55/76 (72%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLYYKLH+V RP
Sbjct: 568 SRRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVP----------RP 617
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 618 LTMKKDTIQKRKRKPK 633
Score = 52.4 bits (124), Expect = 5e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC T LWRR+ G +CNACGLY+K++ + RPL + +R
Sbjct: 578 GLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPLTMKKDTIQKRKR 630
>gi|193683287|ref|XP_001943242.1| PREDICTED: hypothetical protein LOC100168211 [Acyrthosiphon pisum]
Length = 888
Score = 97.8 bits (242), Expect = 1e-18, Method: Composition-based stats.
Identities = 48/79 (60%), Positives = 55/79 (69%), Gaps = 11/79 (13%)
Query: 50 KPSKRL-TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLA 108
K ++R+ A+RR GL CTNC T T+LWRRN GEPVCNACGLY+KLH V
Sbjct: 707 KTTRRMQNASRRQGLQCTNCQTATTSLWRRNQVGEPVCNACGLYFKLHGVK--------- 757
Query: 109 MRPLAMRKDGIQTRKRKPK 127
RPL M+KD IQTRKRKPK
Sbjct: 758 -RPLTMKKDSIQTRKRKPK 775
Score = 58.5 bits (140), Expect = 7e-07, Method: Composition-based stats.
Identities = 23/54 (42%), Positives = 33/54 (61%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
+G +C NC +T LWRR+ G +CNACGLY K++G+ RPL + +R
Sbjct: 719 QGLQCTNCQTATTSLWRRNQVGEPVCNACGLYFKLHGVKRPLTMKKDSIQTRKR 772
>gi|432105435|gb|ELK31650.1| Transcription factor GATA-6 [Myotis davidii]
Length = 270
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/73 (65%), Positives = 53/73 (72%), Gaps = 10/73 (13%)
Query: 55 LTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAM 114
L ++RRLGL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM
Sbjct: 108 LPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAM 157
Query: 115 RKDGIQTRKRKPK 127
+K+GIQTRKRKPK
Sbjct: 158 KKEGIQTRKRKPK 170
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 115 GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 156
>gi|442619345|ref|NP_001262619.1| serpent, isoform G [Drosophila melanogaster]
gi|440217478|gb|AGB95999.1| serpent, isoform G [Drosophila melanogaster]
Length = 469
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/76 (61%), Positives = 55/76 (72%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLYYKLH+V RP
Sbjct: 272 SRRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSV----------PRP 321
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 322 LTMKKDTIQKRKRKPK 337
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC T LWRR+ G +CNACGLY+K++ + RPL
Sbjct: 282 GLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPLT 323
>gi|441492|emb|CAA53807.1| a box binding factor (ABF), also called dGATA-b [Drosophila
melanogaster]
Length = 779
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/76 (61%), Positives = 55/76 (72%), Gaps = 10/76 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S+RL+A++R GL C+NC T T+LWRRN GEPVCNACGLYYKLH+V RP
Sbjct: 306 SRRLSASKRAGLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVP----------RP 355
Query: 112 LAMRKDGIQTRKRKPK 127
L M+KD IQ RKRKPK
Sbjct: 356 LTMKKDTIQKRKRKPK 371
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC T LWRR+ G +CNACGLY+K++ + RPL
Sbjct: 316 GLSCSNCHTTHTSLWRRNPAGEPVCNACGLYYKLHSVPRPLT 357
>gi|358341608|dbj|GAA49239.1| GATA-binding factor A [Clonorchis sinensis]
Length = 492
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/178 (35%), Positives = 75/178 (42%), Gaps = 76/178 (42%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLC-------NACGLYHKMNGMNRPL------------ 48
RECV CG ++T W+ DGTGHYLC N LY + + PL
Sbjct: 8 SRECVKCGQLTTSFWQPDGTGHYLCEVCGREQNPATLY--FDQLRSPLVNGRVDNVPHCA 65
Query: 49 ---------VKPSK------------------------------------RLTATRRLGL 63
VKP++ R + RR+GL
Sbjct: 66 TRQMGGYEQVKPAENASGDYKLFQQDPATLFNSLSYRRSVRSNLNKTLLSRRSVARRIGL 125
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
CTNC T TTLWRRN DG+PVCNACGLY KLH RP +MRKD IQT
Sbjct: 126 VCTNCETTQTTLWRRNADGQPVCNACGLYQKLHG----------RTRPSSMRKDAIQT 173
>gi|345490814|ref|XP_003426466.1| PREDICTED: hypothetical protein LOC100678632 [Nasonia vitripennis]
Length = 262
Score = 96.3 bits (238), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 45/75 (60%), Positives = 54/75 (72%), Gaps = 10/75 (13%)
Query: 53 KRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPL 112
+RL+A+RR GL C+NC T T+LWRRN G+ VCNACGLY+KLH +N RPL
Sbjct: 32 RRLSASRRAGLSCSNCKTTTTSLWRRNAQGDAVCNACGLYFKLHGIN----------RPL 81
Query: 113 AMRKDGIQTRKRKPK 127
M+KD IQTRKRKPK
Sbjct: 82 TMKKDAIQTRKRKPK 96
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/42 (54%), Positives = 29/42 (69%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+NRPL
Sbjct: 41 GLSCSNCKTTTTSLWRRNAQGDAVCNACGLYFKLHGINRPLT 82
>gi|392587614|gb|EIW76948.1| hypothetical protein CONPUDRAFT_110709 [Coniophora puteana
RWD-64-598 SS2]
Length = 807
Score = 95.9 bits (237), Expect = 5e-18, Method: Composition-based stats.
Identities = 54/158 (34%), Positives = 72/158 (45%), Gaps = 42/158 (26%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------------------ 47
G ECVNCGA TPLWRR CNACGLY K++ RP
Sbjct: 395 GVKAECVNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKTMRSNHGEGRTQAAPRQ 454
Query: 48 --------------LVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYY 93
+ R+ +T C NC T T LWR++++G+ VCNACGLY+
Sbjct: 455 ETVDVVGGYKASPSMTPVQTRVASTPAGTAQCYNCHTTATPLWRKDDEGKTVCNACGLYF 514
Query: 94 KLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQGG 131
KLH + RP++M+ D I+ R R ++ G
Sbjct: 515 KLHG----------SARPISMKSDVIRKRSRHDARRAG 542
>gi|324507577|gb|ADY43212.1| Transcription factor elt-2 [Ascaris suum]
Length = 502
Score = 95.9 bits (237), Expect = 5e-18, Method: Composition-based stats.
Identities = 46/90 (51%), Positives = 59/90 (65%), Gaps = 10/90 (11%)
Query: 43 GMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
G +RP K ++ ++ RR GL C NC TTLWRRN +GEPVCNACGLY+KLHNV+RP
Sbjct: 274 GKSRPPQKKNQS-SSQRRQGLVCANCHGTNTTLWRRNAEGEPVCNACGLYFKLHNVHRPA 332
Query: 103 AMRHLAMRPLAMRKDGIQTRKRKPKKQGGG 132
+M+ ++ +QTRKRK K G G
Sbjct: 333 SMK---------KEGTLQTRKRKQKTDGSG 353
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 33/57 (57%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
Q +G C NC +T LWRR+ G +CNACGLY K++ ++RP + TR+
Sbjct: 288 QRRQGLVCANCHGTNTTLWRRNAEGEPVCNACGLYFKLHNVHRPASMKKEGTLQTRK 344
>gi|440903489|gb|ELR54140.1| Transcription factor GATA-6 [Bos grunniens mutus]
Length = 481
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 47/73 (64%), Positives = 53/73 (72%), Gaps = 10/73 (13%)
Query: 55 LTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAM 114
+ ++RRLGL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLAM
Sbjct: 320 VPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLAM 369
Query: 115 RKDGIQTRKRKPK 127
+K+GIQTRKRKPK
Sbjct: 370 KKEGIQTRKRKPK 382
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 327 GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 368
>gi|384491758|gb|EIE82954.1| hypothetical protein RO3G_07659 [Rhizopus delemar RA 99-880]
Length = 383
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 53/135 (39%), Positives = 65/135 (48%), Gaps = 34/135 (25%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLT----------- 56
EC NCG +TPLWRR G +CNACGLY K RP L K SK+
Sbjct: 8 ECANCGTTTTPLWRRGPNGETICNACGLYLKARNTLRPPTLKKASKKEKNDCGGGTCPGG 67
Query: 57 ---------------------ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKL 95
R L C NC T T LWRR+ G +CNACGLYYKL
Sbjct: 68 GQCNGTGGSTSCAGCPAFNQHQVNRQALICANCRTTTTPLWRRDEAGNTICNACGLYYKL 127
Query: 96 HNVNRPLAMRHLAMR 110
H+V+RP++M+ ++
Sbjct: 128 HHVHRPVSMKRSVIK 142
>gi|302693230|ref|XP_003036294.1| hypothetical protein SCHCODRAFT_66326 [Schizophyllum commune H4-8]
gi|300109990|gb|EFJ01392.1| hypothetical protein SCHCODRAFT_66326 [Schizophyllum commune H4-8]
Length = 428
Score = 95.1 bits (235), Expect = 7e-18, Method: Composition-based stats.
Identities = 51/114 (44%), Positives = 61/114 (53%), Gaps = 14/114 (12%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKM-----------NGMNRPLVKPS 52
Q G EC NCGA TPLWRR CNACGLY+K NG N+P
Sbjct: 244 QDGGKAECANCGATHTPLWRRGLNDELNCNACGLYYKQHRVPRPKTTARNGANKPAPSSV 303
Query: 53 KRLTATRRLGLC---CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLA 103
K+ +C C NC T +T LWR++ DG VCNACGLY+KLH RP++
Sbjct: 304 KQEAEPPVGIICDAQCHNCQTTVTPLWRKDPDGRTVCNACGLYFKLHGSPRPIS 357
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 24/53 (45%), Positives = 31/53 (58%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
+C NC TPLWR+D G +CNACGLY K++G RP+ S + R G
Sbjct: 318 QCHNCQTTVTPLWRKDPDGRTVCNACGLYFKLHGSPRPISGSSDVIRRRSRYG 370
>gi|410977466|ref|XP_003995126.1| PREDICTED: transcription factor GATA-6-like [Felis catus]
Length = 495
Score = 95.1 bits (235), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 47/74 (63%), Positives = 53/74 (71%), Gaps = 10/74 (13%)
Query: 54 RLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLA 113
+ ++RRLGL C NC T TTLWRRN +GEPVCNACGLY KLH V RPLA
Sbjct: 333 KAPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVP----------RPLA 382
Query: 114 MRKDGIQTRKRKPK 127
M+K+GIQTRKRKPK
Sbjct: 383 MKKEGIQTRKRKPK 396
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 341 GLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLA 382
>gi|33439247|gb|AAQ18782.1| GATA transcription factor [Haemonchus contortus]
Length = 417
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 50/86 (58%), Positives = 55/86 (63%), Gaps = 11/86 (12%)
Query: 50 KPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAM 109
KPS + RR GL C+NCG TTLWRRN +GEPVCNACGLYYKLHNV
Sbjct: 228 KPSAAQNSQRRQGLVCSNCGGTNTTLWRRNAEGEPVCNACGLYYKLHNV----------Q 277
Query: 110 RPLAMRKDG-IQTRKRKPKKQGGGGG 134
RP M+KDG +QTRKRK K G G
Sbjct: 278 RPPTMKKDGQLQTRKRKAKSDGTNTG 303
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 29/41 (70%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
+G C NCG +T LWRR+ G +CNACGLY+K++ + RP
Sbjct: 239 QGLVCSNCGGTNTTLWRRNAEGEPVCNACGLYYKLHNVQRP 279
>gi|393219358|gb|EJD04845.1| hypothetical protein FOMMEDRAFT_27143 [Fomitiporia mediterranea
MF3/22]
Length = 930
Score = 95.1 bits (235), Expect = 9e-18, Method: Composition-based stats.
Identities = 54/150 (36%), Positives = 74/150 (49%), Gaps = 32/150 (21%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK-----PSKRLTATR- 59
G EC NCGA TPLWRR CNACGLY K++ RP ++ +A R
Sbjct: 413 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKKPRPKSMRNQHGEGRQQSAPRN 472
Query: 60 ----------------RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLA 103
++ C NC T T LWR++++G+ VCNACGLY+KLH
Sbjct: 473 DNSDAMGEPVLINWFTKISAQCYNCHTTTTPLWRKDDEGKTVCNACGLYFKLHG------ 526
Query: 104 MRHLAMRPLAMRKDGIQTRKRKPKKQGGGG 133
+ RP++M+ D I+ R R ++ G G
Sbjct: 527 ----SSRPISMKSDIIRKRSRHDARRVGSG 552
>gi|33439249|gb|AAQ18783.1| GATA transcription factor [Haemonchus contortus]
Length = 417
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 50/86 (58%), Positives = 55/86 (63%), Gaps = 11/86 (12%)
Query: 50 KPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAM 109
KPS + RR GL C+NCG TTLWRRN +GEPVCNACGLYYKLHNV
Sbjct: 228 KPSAAQNSQRRQGLVCSNCGGTNTTLWRRNAEGEPVCNACGLYYKLHNV----------Q 277
Query: 110 RPLAMRKDG-IQTRKRKPKKQGGGGG 134
RP M+KDG +QTRKRK K G G
Sbjct: 278 RPPTMKKDGQLQTRKRKAKSDGTNTG 303
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 29/41 (70%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
+G C NCG +T LWRR+ G +CNACGLY+K++ + RP
Sbjct: 239 QGLVCSNCGGTNTTLWRRNAEGEPVCNACGLYYKLHNVQRP 279
>gi|342320721|gb|EGU12660.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
Length = 1471
Score = 94.7 bits (234), Expect = 1e-17, Method: Composition-based stats.
Identities = 47/108 (43%), Positives = 62/108 (57%), Gaps = 15/108 (13%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR------ 60
+G C +CG+I+TPLWRR LCNACGLY K++ RP K + A++R
Sbjct: 1152 DGPVCSHCGSITTPLWRRGPDDELLCNACGLYLKLHSKPRP--KTFGKSNASKRSSNGAA 1209
Query: 61 -------LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
+ C+NCG T +WR++ +G CNAC LYYKLH VNRP
Sbjct: 1210 AQAAASGVPPSCSNCGATSTPMWRKDQEGRLCCNACSLYYKLHKVNRP 1257
>gi|384498142|gb|EIE88633.1| hypothetical protein RO3G_13344 [Rhizopus delemar RA 99-880]
Length = 228
Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/104 (48%), Positives = 62/104 (59%), Gaps = 13/104 (12%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG------- 62
+C CGA TPLWRR G LCNACGL H M+RP K K ++ + G
Sbjct: 20 KCSLCGATKTPLWRRGPHGEALCNACGLKH----MHRP-AKVKKNDVSSHQAGSFTVFNK 74
Query: 63 -LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
+ C NC T T LWRR+ G +CNACGLYYKLH V+RP++M+
Sbjct: 75 LMMCANCRTTTTPLWRRDEAGNTICNACGLYYKLHQVHRPVSMK 118
Score = 37.7 bits (86), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%)
Query: 49 VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYY 93
+ P+ LT L C+ CG T LWRR GE +CNACGL +
Sbjct: 5 LAPTTCLTIDSILVNKCSLCGATKTPLWRRGPHGEALCNACGLKH 49
>gi|384488088|gb|EIE80268.1| hypothetical protein RO3G_04973 [Rhizopus delemar RA 99-880]
Length = 336
Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 66/145 (45%), Gaps = 40/145 (27%)
Query: 10 ECVNCGAISTPLWRRDGT-GHYLCNACGLYHKMNGMNRPL-------------------- 48
EC NC TPLWRR H LCNACGLY+K +RPL
Sbjct: 151 ECFNCKVTKTPLWRRTPDRKHSLCNACGLYYKQYNHHRPLHVRNKTHTVRAHPYDRGLPV 210
Query: 49 --VKPSKRLTA-------TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVN 99
VKP + T + C NC T LWR+N GEP+CNACGLY KLHN +
Sbjct: 211 TIVKPELSYPSSPPSTEQTVEMNQECANCHQTQTPLWRKNERGEPLCNACGLYAKLHNRD 270
Query: 100 RPLAMRHLAMRPLAMRKDGIQTRKR 124
RP MRK IQ R+R
Sbjct: 271 ----------RPAEMRKTTIQRRRR 285
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 27/40 (67%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
+EC NC TPLWR++ G LCNACGLY K++ +RP
Sbjct: 233 NQECANCHQTQTPLWRKNERGEPLCNACGLYAKLHNRDRP 272
>gi|392563238|gb|EIW56417.1| hypothetical protein TRAVEDRAFT_73009 [Trametes versicolor
FP-101664 SS1]
Length = 820
Score = 92.0 bits (227), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 48/111 (43%), Positives = 64/111 (57%), Gaps = 11/111 (9%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL----------VKPSKRLTATR 59
EC NCGA TPLWRR CNACGLY K++ RP + + R ++
Sbjct: 420 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNNHGEGRAQAAPR-QESQ 478
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NC T T LWR++++G+ VCNACGLYYKLH RP++M+ +R
Sbjct: 479 EVVAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHGSARPISMKSDVIR 529
>gi|165292339|dbj|BAF98875.1| GATA binding protein 3 isoform 2 [Carassius auratus langsdorfii]
Length = 261
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 43/70 (61%), Positives = 49/70 (70%), Gaps = 10/70 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A RR G C NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+
Sbjct: 136 SAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMK 185
Query: 116 KDGIQTRKRK 125
K+GIQTR RK
Sbjct: 186 KEGIQTRNRK 195
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 29/42 (69%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 142 GTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 183
>gi|389743019|gb|EIM84204.1| hypothetical protein STEHIDRAFT_123062 [Stereum hirsutum FP-91666
SS1]
Length = 939
Score = 91.7 bits (226), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 49/109 (44%), Positives = 63/109 (57%), Gaps = 8/109 (7%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV---KPSKRLTATRR-----L 61
EC NCGA TPLWRR CNACGLY K++ RP +R A R +
Sbjct: 440 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKTMRNNHGERSQAAPRPESNEV 499
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
C NC T T LWR++++G+ VCNACGLY+KLH RP++M+ +R
Sbjct: 500 MAKCYNCNTTATPLWRKDDEGKTVCNACGLYFKLHGSPRPISMKSDVIR 548
>gi|384497544|gb|EIE88035.1| hypothetical protein RO3G_12746 [Rhizopus delemar RA 99-880]
Length = 556
Score = 91.7 bits (226), Expect = 9e-17, Method: Composition-based stats.
Identities = 43/91 (47%), Positives = 53/91 (58%), Gaps = 5/91 (5%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKPSKRLTATR----R 60
G+ C NC +TPLWRR LCNACGLY K++ RP +KP T+
Sbjct: 466 GKAPICSNCNTTTTPLWRRSVDDELLCNACGLYLKLHNAPRPKHLKPQSSRKDTKDEENM 525
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGL 91
+ C+NCGT T LWRR+ DG P+CNACGL
Sbjct: 526 IQPVCSNCGTSTTPLWRRDVDGSPLCNACGL 556
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 30/53 (56%), Positives = 36/53 (67%), Gaps = 5/53 (9%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
C+NC T T LWRR+ D E +CNACGLY KLHN RP +HL +P + RKD
Sbjct: 471 CSNCNTTTTPLWRRSVDDELLCNACGLYLKLHNAPRP---KHL--KPQSSRKD 518
>gi|302677711|ref|XP_003028538.1| hypothetical protein SCHCODRAFT_11907 [Schizophyllum commune H4-8]
gi|300102227|gb|EFI93635.1| hypothetical protein SCHCODRAFT_11907 [Schizophyllum commune H4-8]
Length = 279
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/108 (42%), Positives = 65/108 (60%), Gaps = 5/108 (4%)
Query: 9 RECVNCGAISTPLWRRDGTG-HYLCNACGLYHKMNGMNRP--LVKPSKRLTATRRL--GL 63
++C +CG TPLWRRD + LCNACGL++K M+RP L++ +
Sbjct: 159 KQCSHCGVKQTPLWRRDPSNFQLLCNACGLFYKQRHMHRPKVLIEADQEDDTGEDDPNAP 218
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
C++CGT T++WRR DG VCNACG+Y +L RPLA++ +RP
Sbjct: 219 TCSHCGTHRTSVWRRGKDGTQVCNACGVYSRLRGKERPLALKKNKIRP 266
>gi|380022377|ref|XP_003695026.1| PREDICTED: uncharacterized protein LOC100864759 [Apis florea]
Length = 588
Score = 91.3 bits (225), Expect = 1e-16, Method: Composition-based stats.
Identities = 40/72 (55%), Positives = 50/72 (69%), Gaps = 10/72 (13%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
T+++ + CTNCGT TT+WRRN GE VCNACGLYYKLH VN RP+ MR+D
Sbjct: 410 TKKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVN----------RPVTMRRD 459
Query: 118 GIQTRKRKPKKQ 129
I TR+R+PK +
Sbjct: 460 TIHTRRRRPKGE 471
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 34/50 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+ + RR
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPVTMRRDTIHTRRR 466
>gi|256088981|ref|XP_002580598.1| GATA binding factor-1b (transcription factor xgata-1b) [Schistosoma
mansoni]
gi|360042778|emb|CCD78188.1| putative gata binding factor [Schistosoma mansoni]
Length = 919
Score = 91.3 bits (225), Expect = 1e-16, Method: Composition-based stats.
Identities = 43/74 (58%), Positives = 49/74 (66%), Gaps = 10/74 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
+++ RR+GL C+NC T TTLWRRN DGEPVCNACGLY KLH RP
Sbjct: 516 NRKSQTNRRIGLICSNCETTKTTLWRRNLDGEPVCNACGLYQKLHG----------RTRP 565
Query: 112 LAMRKDGIQTRKRK 125
+MRKD IQTRKRK
Sbjct: 566 TSMRKDAIQTRKRK 579
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCT 66
G C NC T LWRR+ G +CNACGLY K++G RP K TR+ C T
Sbjct: 526 GLICSNCETTKTTLWRRNLDGEPVCNACGLYQKLHGRTRP-TSMRKDAIQTRKRKSCST 583
>gi|195572641|ref|XP_002104304.1| GD20889 [Drosophila simulans]
gi|194200231|gb|EDX13807.1| GD20889 [Drosophila simulans]
Length = 292
Score = 90.9 bits (224), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/69 (62%), Positives = 47/69 (68%), Gaps = 10/69 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A +R G C NC T TTLWRRN GEPVCNACGLYYKLHNVN RPL M+
Sbjct: 118 SAAKRAGTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVN----------RPLTMK 167
Query: 116 KDGIQTRKR 124
K+GIQTR R
Sbjct: 168 KEGIQTRNR 176
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 30/42 (71%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 124 GTSCANCKTTTTTLWRRNASGEPVCNACGLYYKLHNVNRPLT 165
>gi|170587074|ref|XP_001898304.1| GATA zinc finger family protein [Brugia malayi]
gi|158594699|gb|EDP33283.1| GATA zinc finger family protein [Brugia malayi]
Length = 567
Score = 90.9 bits (224), Expect = 2e-16, Method: Composition-based stats.
Identities = 41/76 (53%), Positives = 50/76 (65%), Gaps = 9/76 (11%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
+ RR GL C NC TTLWRR+ DG PVCNACGLYYKLH V RP++M+ ++
Sbjct: 352 SQRRQGLVCANCRGTNTTLWRRDADGHPVCNACGLYYKLHQVQRPISMK---------KE 402
Query: 117 DGIQTRKRKPKKQGGG 132
+QTRKRK K +G G
Sbjct: 403 GTLQTRKRKQKSEGNG 418
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 35/57 (61%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
Q +G C NC +T LWRRD GH +CNACGLY+K++ + RP+ + TR+
Sbjct: 353 QRRQGLVCANCRGTNTTLWRRDADGHPVCNACGLYYKLHQVQRPISMKKEGTLQTRK 409
>gi|307172112|gb|EFN63680.1| GATA-binding factor A [Camponotus floridanus]
Length = 595
Score = 90.9 bits (224), Expect = 2e-16, Method: Composition-based stats.
Identities = 39/71 (54%), Positives = 49/71 (69%), Gaps = 10/71 (14%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
+++ + CTNCGT TT+WRRN GE VCNACGLYYKLH VN RP+ MR+D
Sbjct: 418 KKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVN----------RPVTMRRDT 467
Query: 119 IQTRKRKPKKQ 129
I TR+R+PK +
Sbjct: 468 IHTRRRRPKGE 478
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 34/50 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+ + RR
Sbjct: 424 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPVTMRRDTIHTRRR 473
>gi|332023110|gb|EGI63371.1| GATA-binding factor A [Acromyrmex echinatior]
Length = 595
Score = 90.9 bits (224), Expect = 2e-16, Method: Composition-based stats.
Identities = 39/71 (54%), Positives = 49/71 (69%), Gaps = 10/71 (14%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
+++ + CTNCGT TT+WRRN GE VCNACGLYYKLH VN RP+ MR+D
Sbjct: 419 KKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVN----------RPVTMRRDT 468
Query: 119 IQTRKRKPKKQ 129
I TR+R+PK +
Sbjct: 469 IHTRRRRPKGE 479
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 34/50 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+ + RR
Sbjct: 425 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPVTMRRDTIHTRRR 474
>gi|302677612|ref|XP_003028489.1| hypothetical protein SCHCODRAFT_86018 [Schizophyllum commune H4-8]
gi|300102177|gb|EFI93586.1| hypothetical protein SCHCODRAFT_86018 [Schizophyllum commune H4-8]
Length = 469
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 65/117 (55%), Gaps = 15/117 (12%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV------------KPSK 53
G EC NCGA TPLWRR CNACGLY K++ RP +P +
Sbjct: 54 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNSNDGSRNNNQPRQ 113
Query: 54 RLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ + C NC T T LWR++++G+ VCNACGLYYKLH +RP++M+ +R
Sbjct: 114 EVA---DVMAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHGTSRPISMKSDIIR 167
>gi|383865617|ref|XP_003708269.1| PREDICTED: uncharacterized protein LOC100877935 [Megachile
rotundata]
Length = 586
Score = 90.1 bits (222), Expect = 3e-16, Method: Composition-based stats.
Identities = 39/71 (54%), Positives = 49/71 (69%), Gaps = 10/71 (14%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
+++ + CTNCGT TT+WRRN GE VCNACGLYYKLH VN RP+ MR+D
Sbjct: 409 KKVDMSCTNCGTMTTTIWRRNMKGEIVCNACGLYYKLHGVN----------RPVTMRRDT 458
Query: 119 IQTRKRKPKKQ 129
I TR+R+PK +
Sbjct: 459 IHTRRRRPKGE 469
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 34/50 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+ + RR
Sbjct: 415 CTNCGTMTTTIWRRNMKGEIVCNACGLYYKLHGVNRPVTMRRDTIHTRRR 464
>gi|166234101|emb|CAP72497.1| GATA binding protein 6 [Sus scrofa]
Length = 46
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 39/46 (84%), Positives = 43/46 (93%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKR 54
RECVNCG+I TPLWRRDGTGHYLCNACGLY KMNG++RPL+KP KR
Sbjct: 1 RECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKR 46
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 28/38 (73%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG+ T LWRR+ G +CNACGLY K++ ++RPL
Sbjct: 3 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPL 40
>gi|322789591|gb|EFZ14817.1| hypothetical protein SINV_01115 [Solenopsis invicta]
Length = 547
Score = 90.1 bits (222), Expect = 3e-16, Method: Composition-based stats.
Identities = 39/71 (54%), Positives = 49/71 (69%), Gaps = 10/71 (14%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
+++ + CTNCGT TT+WRRN GE VCNACGLYYKLH VN RP+ MR+D
Sbjct: 373 KKIDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVN----------RPVTMRRDT 422
Query: 119 IQTRKRKPKKQ 129
I TR+R+PK +
Sbjct: 423 IHTRRRRPKGE 433
Score = 62.4 bits (150), Expect = 7e-08, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 34/50 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+ + RR
Sbjct: 379 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPVTMRRDTIHTRRR 428
>gi|345486907|ref|XP_003425585.1| PREDICTED: hypothetical protein LOC100678172 [Nasonia vitripennis]
Length = 647
Score = 90.1 bits (222), Expect = 3e-16, Method: Composition-based stats.
Identities = 40/71 (56%), Positives = 50/71 (70%), Gaps = 10/71 (14%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
+++ + CTNCGT TT+WRRN GE VCNACGLYYKLH VN RP+AMR+D
Sbjct: 460 KKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVN----------RPVAMRRDT 509
Query: 119 IQTRKRKPKKQ 129
I TR+R+PK +
Sbjct: 510 IHTRRRRPKGE 520
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 34/50 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+ + RR
Sbjct: 466 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPVAMRRDTIHTRRR 515
>gi|1078990|pir||S53812 BmGATA beta isoform 3 - silkworm (fragment)
gi|603166|gb|AAA67887.1| BmGATA beta isoform 3, partial [Bombyx mori]
Length = 241
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 60/103 (58%), Gaps = 15/103 (14%)
Query: 30 YLCNACGLYHKMNGMNR---PLVKPSKRLT--ATRRLGLCCTNCGTRMTTLWRRNNDGEP 84
+L N H + R P++ + R A +R G CTNC T T+LWRRN GE
Sbjct: 15 FLLNGIQASHALPAPQRTRVPILLTTSRRKGMAAKRPGTMCTNCQTTATSLWRRNVQGET 74
Query: 85 VCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
VCNACGLY+KLHNVN RPL M+KD IQTRKRKPK
Sbjct: 75 VCNACGLYFKLHNVN----------RPLTMKKDSIQTRKRKPK 107
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 52 GTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRPLT 93
>gi|409075785|gb|EKM76161.1| hypothetical protein AGABI1DRAFT_131482 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1114
Score = 89.7 bits (221), Expect = 3e-16, Method: Composition-based stats.
Identities = 55/180 (30%), Positives = 75/180 (41%), Gaps = 56/180 (31%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN----------------------- 42
G EC NCGA TPLWRR CNACGLY K++
Sbjct: 389 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNSHGGSGSGGSNNN 448
Query: 43 ----GMNRPLVKPSKRLTATRRLGLC-------------------CTNCGTRMTTLWRRN 79
+ RPL + + + G+ C NC T T LWR++
Sbjct: 449 TGNAAVERPLTVSNASIVQSVTHGIGGGSNSRQDPVDVLGSQTAQCYNCNTTATPLWRKD 508
Query: 80 NDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQGGGGGGGSGG 139
++G+ VCNACGLYYKLH + RP++M+ D I+ R R ++G + G
Sbjct: 509 DEGKTVCNACGLYYKLHG----------SPRPISMKSDVIRKRSRHDARRGSNAASSAPG 558
>gi|313217859|emb|CBY41259.1| unnamed protein product [Oikopleura dioica]
Length = 185
Score = 89.7 bits (221), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 43/70 (61%), Positives = 50/70 (71%), Gaps = 10/70 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A RR G C+NC T TTLWRRN++G+PVCNACGLYYKLH VN RPL M+
Sbjct: 7 SAARRAGTSCSNCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVN----------RPLTMK 56
Query: 116 KDGIQTRKRK 125
K+GIQTR RK
Sbjct: 57 KEGIQTRNRK 66
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/42 (54%), Positives = 29/42 (69%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC T LWRR+ G +CNACGLY+K++G+NRPL
Sbjct: 13 GTSCSNCHTTQTTLWRRNSNGDPVCNACGLYYKLHGVNRPLT 54
>gi|307197634|gb|EFN78813.1| GATA-binding factor A [Harpegnathos saltator]
Length = 594
Score = 89.7 bits (221), Expect = 3e-16, Method: Composition-based stats.
Identities = 38/72 (52%), Positives = 50/72 (69%), Gaps = 10/72 (13%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
T+++ + C+NCGT TT+WRRN GE VCNACGLYYKLH +N RP+ MR+D
Sbjct: 415 TKKVEMTCSNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGIN----------RPVTMRRD 464
Query: 118 GIQTRKRKPKKQ 129
I TR+R+PK +
Sbjct: 465 TIHTRRRRPKGE 476
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 34/50 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+ + RR
Sbjct: 422 CSNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGINRPVTMRRDTIHTRRR 471
>gi|165292347|dbj|BAF98879.1| GATA binding protein 3 isoform 6 [Carassius auratus langsdorfii]
Length = 242
Score = 89.7 bits (221), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/70 (60%), Positives = 48/70 (68%), Gaps = 10/70 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A RR G C NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL +
Sbjct: 117 SAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTTK 166
Query: 116 KDGIQTRKRK 125
K+GIQTR RK
Sbjct: 167 KEGIQTRNRK 176
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 29/42 (69%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 123 GTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 164
>gi|328785124|ref|XP_001120276.2| PREDICTED: hypothetical protein LOC724420 [Apis mellifera]
Length = 588
Score = 89.7 bits (221), Expect = 3e-16, Method: Composition-based stats.
Identities = 39/71 (54%), Positives = 49/71 (69%), Gaps = 10/71 (14%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
+++ + CTNCGT TT+WRRN GE VCNACGLYYKLH VN RP+ MR+D
Sbjct: 411 KKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVN----------RPVTMRRDT 460
Query: 119 IQTRKRKPKKQ 129
I TR+R+PK +
Sbjct: 461 IHTRRRRPKGE 471
Score = 62.4 bits (150), Expect = 7e-08, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 34/50 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+ + RR
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPVTMRRDTIHTRRR 466
>gi|189240277|ref|XP_001812551.1| PREDICTED: similar to GATAd CG5034-PA [Tribolium castaneum]
Length = 451
Score = 89.7 bits (221), Expect = 3e-16, Method: Composition-based stats.
Identities = 40/69 (57%), Positives = 48/69 (69%), Gaps = 10/69 (14%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
RR+ L CTNCGT+ TT+WRRN GE VCNACGLYYKLH ++ RP MR+D
Sbjct: 280 RRVDLSCTNCGTQTTTIWRRNMKGEMVCNACGLYYKLHGID----------RPHTMRRDT 329
Query: 119 IQTRKRKPK 127
I TR+R+PK
Sbjct: 330 IHTRRRRPK 338
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 22/50 (44%), Positives = 32/50 (64%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG +T +WRR+ G +CNACGLY+K++G++RP + RR
Sbjct: 286 CTNCGTQTTTIWRRNMKGEMVCNACGLYYKLHGIDRPHTMRRDTIHTRRR 335
>gi|124054318|gb|ABM89372.1| GATA3 [Pongo pygmaeus]
Length = 136
Score = 89.7 bits (221), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/69 (60%), Positives = 48/69 (69%), Gaps = 10/69 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A RR G C NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+
Sbjct: 1 SAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMK 50
Query: 116 KDGIQTRKR 124
K+GIQTR R
Sbjct: 51 KEGIQTRNR 59
Score = 55.5 bits (132), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 29/42 (69%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 7 GTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 48
>gi|157112345|ref|XP_001651800.1| gata-binding factor-c (transcription factor gata-c) [Aedes aegypti]
gi|108878107|gb|EAT42332.1| AAEL006114-PA, partial [Aedes aegypti]
Length = 169
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/69 (63%), Positives = 47/69 (68%), Gaps = 10/69 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A RR G C NC T TTLWRRN GEPVCNACGLYYKLHNVN RPL M+
Sbjct: 4 SAARRAGTSCANCKTTTTTLWRRNQGGEPVCNACGLYYKLHNVN----------RPLTMK 53
Query: 116 KDGIQTRKR 124
K+GIQTR R
Sbjct: 54 KEGIQTRNR 62
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 29/42 (69%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 10 GTSCANCKTTTTTLWRRNQGGEPVCNACGLYYKLHNVNRPLT 51
>gi|426193772|gb|EKV43705.1| hypothetical protein AGABI2DRAFT_121841 [Agaricus bisporus var.
bisporus H97]
Length = 1116
Score = 89.4 bits (220), Expect = 4e-16, Method: Composition-based stats.
Identities = 55/180 (30%), Positives = 75/180 (41%), Gaps = 56/180 (31%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN----------------------- 42
G EC NCGA TPLWRR CNACGLY K++
Sbjct: 389 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNSHGGSGSGGSSNN 448
Query: 43 ----GMNRPLVKPSKRLTATRRLGLC-------------------CTNCGTRMTTLWRRN 79
+ RPL + + + G+ C NC T T LWR++
Sbjct: 449 TGNAAVERPLTVSNASIVQSVTHGIGGGSNSRQDPVDVLGSQTAQCYNCNTTATPLWRKD 508
Query: 80 NDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQGGGGGGGSGG 139
++G+ VCNACGLYYKLH + RP++M+ D I+ R R ++G + G
Sbjct: 509 DEGKTVCNACGLYYKLHG----------SPRPISMKSDVIRKRSRHDARRGSNAASSAPG 558
>gi|384484811|gb|EIE76991.1| hypothetical protein RO3G_01695 [Rhizopus delemar RA 99-880]
Length = 326
Score = 89.4 bits (220), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/140 (39%), Positives = 65/140 (46%), Gaps = 35/140 (25%)
Query: 10 ECVNCGAISTPLWRRDGTGHY-LCNACGLYHKMNGMNRPL-------------------- 48
EC NC TPLWRR Y LCNACGLY+K +RPL
Sbjct: 150 ECFNCKVTKTPLWRRTPDRKYSLCNACGLYYKQYNHHRPLHVRNKTHTVRAHPYDRSLPL 209
Query: 49 --VKPSKRLTATRRLGLC--CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM 104
KP A + + + C NC T LWR+N GEP+CNACGLY KL +
Sbjct: 210 SITKPELASFAEQSVEISQECANCHQTQTPLWRKNERGEPLCNACGLYAKLRQRD----- 264
Query: 105 RHLAMRPLAMRKDGIQTRKR 124
RP+ MRK IQ R+R
Sbjct: 265 -----RPVEMRKTTIQRRRR 279
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 34/60 (56%), Gaps = 2/60 (3%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
+EC NC TPLWR++ G LCNACGLY K+ +RP+ ++ T RR C N
Sbjct: 227 SQECANCHQTQTPLWRKNERGEPLCNACGLYAKLRQRDRPV--EMRKTTIQRRRRDCWVN 284
>gi|124001857|gb|ABM87875.1| GATA3 [Papio hamadryas]
Length = 136
Score = 89.4 bits (220), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 42/69 (60%), Positives = 48/69 (69%), Gaps = 10/69 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A RR G C NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+
Sbjct: 1 SAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMK 50
Query: 116 KDGIQTRKR 124
K+GIQTR R
Sbjct: 51 KEGIQTRNR 59
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 29/42 (69%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 7 GTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 48
>gi|349603634|gb|AEP99422.1| Trans-acting T-cell-specific transcription factor GATA-3-like
protein, partial [Equus caballus]
Length = 136
Score = 89.4 bits (220), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 42/69 (60%), Positives = 48/69 (69%), Gaps = 10/69 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A RR G C NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+
Sbjct: 1 SAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMK 50
Query: 116 KDGIQTRKR 124
K+GIQTR R
Sbjct: 51 KEGIQTRNR 59
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 29/42 (69%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 7 GTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 48
>gi|308495029|ref|XP_003109703.1| CRE-ELT-2 protein [Caenorhabditis remanei]
gi|308245893|gb|EFO89845.1| CRE-ELT-2 protein [Caenorhabditis remanei]
Length = 426
Score = 89.4 bits (220), Expect = 4e-16, Method: Composition-based stats.
Identities = 40/75 (53%), Positives = 52/75 (69%), Gaps = 9/75 (12%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A+RR GL C+NC TTLWRRN +G+PVCNACGLY+KLH+V RP +M+ +
Sbjct: 225 SASRRQGLICSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHVARPTSMK---------K 275
Query: 116 KDGIQTRKRKPKKQG 130
+ +QTRKRK K G
Sbjct: 276 EGALQTRKRKTKNSG 290
Score = 51.2 bits (121), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 31/54 (57%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
+G C NC +T LWRR+ G +CNACGLY K++ + RP + TR+
Sbjct: 230 QGLICSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHVARPTSMKKEGALQTRK 283
>gi|350398203|ref|XP_003485118.1| PREDICTED: hypothetical protein LOC100747914 [Bombus impatiens]
Length = 588
Score = 89.4 bits (220), Expect = 4e-16, Method: Composition-based stats.
Identities = 39/71 (54%), Positives = 49/71 (69%), Gaps = 10/71 (14%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
+++ + CTNCGT TT+WRRN GE VCNACGLYYKLH VN RP+ MR+D
Sbjct: 411 KKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVN----------RPVTMRRDT 460
Query: 119 IQTRKRKPKKQ 129
I TR+R+PK +
Sbjct: 461 IHTRRRRPKGE 471
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 34/50 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+ + RR
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPVTMRRDTIHTRRR 466
>gi|340724648|ref|XP_003400693.1| PREDICTED: hypothetical protein LOC100646437 [Bombus terrestris]
Length = 588
Score = 89.4 bits (220), Expect = 4e-16, Method: Composition-based stats.
Identities = 39/71 (54%), Positives = 49/71 (69%), Gaps = 10/71 (14%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
+++ + CTNCGT TT+WRRN GE VCNACGLYYKLH VN RP+ MR+D
Sbjct: 411 KKVDMSCTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVN----------RPVTMRRDT 460
Query: 119 IQTRKRKPKKQ 129
I TR+R+PK +
Sbjct: 461 IHTRRRRPKGE 471
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 34/50 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG ++T +WRR+ G +CNACGLY+K++G+NRP+ + RR
Sbjct: 417 CTNCGTMTTTIWRRNMKGEMVCNACGLYYKLHGVNRPVTMRRDTIHTRRR 466
>gi|402589823|gb|EJW83754.1| GATA zinc finger family protein [Wuchereria bancrofti]
Length = 553
Score = 89.4 bits (220), Expect = 5e-16, Method: Composition-based stats.
Identities = 41/76 (53%), Positives = 50/76 (65%), Gaps = 9/76 (11%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
+ RR GL C NC TTLWRR+ DG PVCNACGLYYKLH V RP++M+ ++
Sbjct: 342 SQRRQGLICANCRGTNTTLWRRDADGHPVCNACGLYYKLHQVQRPISMK---------KE 392
Query: 117 DGIQTRKRKPKKQGGG 132
+QTRKRK K +G G
Sbjct: 393 GTLQTRKRKQKSEGNG 408
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 35/57 (61%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
Q +G C NC +T LWRRD GH +CNACGLY+K++ + RP+ + TR+
Sbjct: 343 QRRQGLICANCRGTNTTLWRRDADGHPVCNACGLYYKLHQVQRPISMKKEGTLQTRK 399
>gi|603162|gb|AAC13781.1| BmGATA beta isoform 3, partial [Bombyx mori]
Length = 440
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 59/105 (56%), Gaps = 19/105 (18%)
Query: 30 YLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGL-------CCTNCGTRMTTLWRRNNDG 82
+L N H + R V LT +RR G+ CTNC T T+LWRRN G
Sbjct: 213 FLLNGIQASHALPAPQRTRVP--ILLTTSRRKGMGAKRPGTMCTNCQTTATSLWRRNVQG 270
Query: 83 EPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
E VCNACGLY+KLHNVN RPL M+KD IQTRKRKPK
Sbjct: 271 ETVCNACGLYFKLHNVN----------RPLTMKKDSIQTRKRKPK 305
Score = 52.4 bits (124), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY K++ +NRPL
Sbjct: 250 GTMCTNCQTTATSLWRRNVQGETVCNACGLYFKLHNVNRPLT 291
>gi|384490311|gb|EIE81533.1| hypothetical protein RO3G_06238 [Rhizopus delemar RA 99-880]
Length = 462
Score = 89.0 bits (219), Expect = 5e-16, Method: Composition-based stats.
Identities = 43/86 (50%), Positives = 50/86 (58%), Gaps = 5/86 (5%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKP-SKRLTATRRLGLC---C 65
C NC +TPLWRR LCNACGLY K++ RP +KP S R A C
Sbjct: 377 CSNCSTTTTPLWRRSAEDELLCNACGLYLKLHNTPRPKHLKPQSSRKDAKDEESFTQPIC 436
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGL 91
+NC T T LWRR+ DG P+CNACGL
Sbjct: 437 SNCATSTTPLWRRDVDGSPLCNACGL 462
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 29/54 (53%), Positives = 36/54 (66%), Gaps = 5/54 (9%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
C+NC T T LWRR+ + E +CNACGLY KLHN RP +HL +P + RKD
Sbjct: 377 CSNCSTTTTPLWRRSAEDELLCNACGLYLKLHNTPRP---KHL--KPQSSRKDA 425
>gi|355689759|gb|AER98937.1| GATA binding protein 1 [Mustela putorius furo]
Length = 76
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/86 (61%), Positives = 59/86 (68%), Gaps = 10/86 (11%)
Query: 45 NRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM 104
NRPL++P KRL ++R G CTNC T TTLWRRN G+PVCNACGLYYKLH VN
Sbjct: 1 NRPLIRPKKRLIVSKRAGTQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVN----- 55
Query: 105 RHLAMRPLAMRKDGIQTRKRKPKKQG 130
RPL MRKDGIQTR RK +G
Sbjct: 56 -----RPLTMRKDGIQTRNRKASGKG 76
Score = 44.7 bits (104), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 24/29 (82%)
Query: 21 LWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 31 LWRRNASGDPVCNACGLYYKLHQVNRPLT 59
>gi|270012803|gb|EFA09251.1| GATAd [Tribolium castaneum]
Length = 745
Score = 87.8 bits (216), Expect = 1e-15, Method: Composition-based stats.
Identities = 40/69 (57%), Positives = 48/69 (69%), Gaps = 10/69 (14%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
RR+ L CTNCGT+ TT+WRRN GE VCNACGLYYKLH ++ RP MR+D
Sbjct: 574 RRVDLSCTNCGTQTTTIWRRNMKGEMVCNACGLYYKLHGID----------RPHTMRRDT 623
Query: 119 IQTRKRKPK 127
I TR+R+PK
Sbjct: 624 IHTRRRRPK 632
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 22/50 (44%), Positives = 32/50 (64%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG +T +WRR+ G +CNACGLY+K++G++RP + RR
Sbjct: 580 CTNCGTQTTTIWRRNMKGEMVCNACGLYYKLHGIDRPHTMRRDTIHTRRR 629
>gi|194709238|pdb|3DFV|D Chain D, Adjacent Gata Dna Binding
gi|194709239|pdb|3DFV|C Chain C, Adjacent Gata Dna Binding
gi|194709242|pdb|3DFX|A Chain A, Opposite Gata Dna Binding
gi|194709243|pdb|3DFX|B Chain B, Opposite Gata Dna Binding
Length = 63
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/70 (61%), Positives = 49/70 (70%), Gaps = 10/70 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A RR G C NC T TTLWRRN +G+PVCNACGLYYKLHN+N RPL M+
Sbjct: 1 SAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN----------RPLTMK 50
Query: 116 KDGIQTRKRK 125
K+GIQTR RK
Sbjct: 51 KEGIQTRNRK 60
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 29/42 (69%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR+ G +CNACGLY+K++ +NRPL
Sbjct: 7 GTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLT 48
>gi|268578523|ref|XP_002644244.1| C. briggsae CBR-ELT-2 protein [Caenorhabditis briggsae]
Length = 460
Score = 87.8 bits (216), Expect = 1e-15, Method: Composition-based stats.
Identities = 38/72 (52%), Positives = 51/72 (70%), Gaps = 9/72 (12%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A+RR GL C+NC TTLWRRN +G+PVCNACGLY+KLH++ RP +M+ +
Sbjct: 237 SASRRQGLVCSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHIARPTSMK---------K 287
Query: 116 KDGIQTRKRKPK 127
+ +QTRKRK K
Sbjct: 288 EGALQTRKRKSK 299
Score = 52.0 bits (123), Expect = 7e-05, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 31/54 (57%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
+G C NC +T LWRR+ G +CNACGLY K++ + RP + TR+
Sbjct: 242 QGLVCSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHIARPTSMKKEGALQTRK 295
>gi|393236397|gb|EJD43946.1| glucocorticoid receptor-like (DNA-binding domain), partial
[Auricularia delicata TFB-10046 SS5]
Length = 131
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 64/113 (56%), Gaps = 9/113 (7%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMN-RPLVKPSKRLTATRRLG------ 62
EC NCGA TPLWRR CNACGLY K+N + + V+ R A G
Sbjct: 18 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLNFRSGQDGVRAGSRSGAGFADGDGVGPT 77
Query: 63 --LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLA 113
+ C NC T T LWR++ +G VCNACGLY KLH +RPL+M+ ++R A
Sbjct: 78 EPVSCHNCHTTATPLWRKDEEGRTVCNACGLYSKLHGASRPLSMKSESIRKRA 130
Score = 55.1 bits (131), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL 55
E C NC +TPLWR+D G +CNACGLY K++G +RPL S+ +
Sbjct: 78 EPVSCHNCHTTATPLWRKDEEGRTVCNACGLYSKLHGASRPLSMKSESI 126
>gi|429965956|gb|ELA47953.1| hypothetical protein VCUG_00536 [Vavraia culicis 'floridensis']
Length = 463
Score = 87.4 bits (215), Expect = 2e-15, Method: Composition-based stats.
Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 4/90 (4%)
Query: 9 RECVNCGAISTPLWRRDGTGH-YLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG---LC 64
R C NC +TP WRR G LCNACGLY K++G RP + T + G +
Sbjct: 131 RVCSNCVTTTTPSWRRSTDGKKLLCNACGLYQKLHGRPRPYSTTPEGKTKALKSGFDKIK 190
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYK 94
C NCGT T+ WRR +G P+CN+CGLY++
Sbjct: 191 CGNCGTTETSFWRRGINGHPLCNSCGLYFR 220
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 29/61 (47%), Positives = 35/61 (57%), Gaps = 5/61 (8%)
Query: 44 MNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPV-CNACGLYYKLHNVNRPL 102
MN + K SKR R C+NC T T WRR+ DG+ + CNACGLY KLH RP
Sbjct: 116 MNMRMHKDSKRKPKMR----VCSNCVTTTTPSWRRSTDGKKLLCNACGLYQKLHGRPRPY 171
Query: 103 A 103
+
Sbjct: 172 S 172
Score = 46.2 bits (108), Expect = 0.005, Method: Composition-based stats.
Identities = 18/33 (54%), Positives = 21/33 (63%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 42
+C NCG T WRR GH LCN+CGLY + N
Sbjct: 190 KCGNCGTTETSFWRRGINGHPLCNSCGLYFRDN 222
>gi|238607357|ref|XP_002396957.1| hypothetical protein MPER_02700 [Moniliophthora perniciosa FA553]
gi|215470472|gb|EEB97887.1| hypothetical protein MPER_02700 [Moniliophthora perniciosa FA553]
Length = 198
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/152 (36%), Positives = 72/152 (47%), Gaps = 37/152 (24%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK-----PSKRLTATRRLGL- 63
EC NCGA TPLWRR CNACGLY K++ RP + TA R+ +
Sbjct: 12 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKTMRNSHGEGRSQTAPRQETVD 71
Query: 64 ---------------------CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NC T T LWR++++G+ VCNACGLYYKLH
Sbjct: 72 VVGKYSFRPIHDGVSNFFPLAQCYNCHTTATPLWRKDDEGKTVCNACGLYYKLHG----- 126
Query: 103 AMRHLAMRPLAMRKDGIQTRKRKPKKQGGGGG 134
+ RP++M+ D I+ R R ++ GG
Sbjct: 127 -----SARPISMKSDVIRKRSRHDARRTANGG 153
>gi|393234217|gb|EJD41782.1| glucocorticoid receptor-like (DNA-binding domain), partial
[Auricularia delicata TFB-10046 SS5]
Length = 157
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/129 (38%), Positives = 58/129 (44%), Gaps = 29/129 (22%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP----------------------- 47
C NC TPLWRR G +CNACGLY K M RP
Sbjct: 25 CSNCRTTDTPLWRRGADGKSICNACGLYQKSRRMARPTNLQRTPPPSASAQSPQQQNGNG 84
Query: 48 ----LVKPSKR--LTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
L PS T L C NCGT T LWRR++ G +CNACGLY KLH RP
Sbjct: 85 NGTSLAMPSHSGASTPASPPSLSCHNCGTSTTPLWRRDDAGNNICNACGLYLKLHGTQRP 144
Query: 102 LAMRHLAMR 110
+M+ ++
Sbjct: 145 NSMKKTIIK 153
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 37/73 (50%), Gaps = 9/73 (12%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T T LWRR DG+ +CNACGLY K + RP ++ R +
Sbjct: 25 CSNCRTTDTPLWRRGADGKSICNACGLYQKSRRMARPTNLQ---------RTPPPSASAQ 75
Query: 125 KPKKQGGGGGGGS 137
P++Q G G G S
Sbjct: 76 SPQQQNGNGNGTS 88
>gi|440492336|gb|ELQ74911.1| GATA-4/5/6 transcription factor [Trachipleistophora hominis]
Length = 463
Score = 86.7 bits (213), Expect = 3e-15, Method: Composition-based stats.
Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 4/90 (4%)
Query: 9 RECVNCGAISTPLWRRDGTGH-YLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG---LC 64
R C NC +TP WRR G LCNACGLY K++G RP + T + G +
Sbjct: 131 RVCSNCVTTTTPSWRRSTDGKKLLCNACGLYQKLHGRPRPYSTTPEGKTKALKSGFDKIK 190
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYK 94
C NCGT T+ WRR +G P+CN+CGLY++
Sbjct: 191 CGNCGTTETSFWRRGINGHPLCNSCGLYFR 220
Score = 52.8 bits (125), Expect = 4e-05, Method: Composition-based stats.
Identities = 29/61 (47%), Positives = 35/61 (57%), Gaps = 5/61 (8%)
Query: 44 MNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPV-CNACGLYYKLHNVNRPL 102
MN + K SKR R C+NC T T WRR+ DG+ + CNACGLY KLH RP
Sbjct: 116 MNMRMHKDSKRKPKMR----VCSNCVTTTTPSWRRSTDGKKLLCNACGLYQKLHGRPRPY 171
Query: 103 A 103
+
Sbjct: 172 S 172
Score = 45.8 bits (107), Expect = 0.006, Method: Composition-based stats.
Identities = 18/33 (54%), Positives = 21/33 (63%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 42
+C NCG T WRR GH LCN+CGLY + N
Sbjct: 190 KCGNCGTTETSFWRRGINGHPLCNSCGLYFRDN 222
>gi|283468189|emb|CAN84590.1| GATA-3 protein [Melanogrammus aeglefinus]
Length = 251
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 37/40 (92%), Positives = 37/40 (92%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMN 45
EGRECVNCGA STPLWRRDGTGHYLCNACGLYHKMNG N
Sbjct: 212 SEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQN 251
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 24/35 (68%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVN 99
C NCG T LWRR+ G +CNACGLY+K++ N
Sbjct: 217 CVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQN 251
>gi|443712897|gb|ELU05981.1| hypothetical protein CAPTEDRAFT_130016, partial [Capitella
teleta]
Length = 79
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 45/80 (56%), Positives = 51/80 (63%), Gaps = 6/80 (7%)
Query: 19 TPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRR 78
T WRR+ TG LCNACG Y K++ R S R T C NCGT+ TT+WRR
Sbjct: 1 TTTWRRNLTGEPLCNACGCYLKLHKRWRNTASSSSRETHN------CANCGTQQTTMWRR 54
Query: 79 NNDGEPVCNACGLYYKLHNV 98
N DGEPVCNACGLY+KLH V
Sbjct: 55 NTDGEPVCNACGLYFKLHKV 74
Score = 43.5 bits (101), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGM 44
E C NCG T +WRR+ G +CNACGLY K++ +
Sbjct: 37 ETHNCANCGTQQTTMWRRNTDGEPVCNACGLYFKLHKV 74
>gi|195050653|ref|XP_001992938.1| GH13551 [Drosophila grimshawi]
gi|193899997|gb|EDV98863.1| GH13551 [Drosophila grimshawi]
Length = 911
Score = 86.3 bits (212), Expect = 4e-15, Method: Composition-based stats.
Identities = 38/66 (57%), Positives = 47/66 (71%), Gaps = 10/66 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C+NCGT TT+WRR+ GE VCNACGLYYKLH VNRP H +MR+D I TR
Sbjct: 745 MSCSNCGTLTTTIWRRSARGEMVCNACGLYYKLHGVNRP----H------SMRRDTIHTR 794
Query: 123 KRKPKK 128
+R+PK+
Sbjct: 795 RRRPKE 800
Score = 62.0 bits (149), Expect = 9e-08, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 29/37 (78%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY+K++G+NRP
Sbjct: 747 CSNCGTLTTTIWRRSARGEMVCNACGLYYKLHGVNRP 783
>gi|312066897|ref|XP_003136488.1| GATA zinc finger family protein [Loa loa]
gi|307768347|gb|EFO27581.1| GATA zinc finger family protein [Loa loa]
Length = 564
Score = 85.9 bits (211), Expect = 4e-15, Method: Composition-based stats.
Identities = 42/87 (48%), Positives = 55/87 (63%), Gaps = 10/87 (11%)
Query: 45 NRPLVKPSK-RLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLA 103
NR L + ++ + + RR GL C NC TTLWRR+ DG PVCNACGLY+KLH V RP++
Sbjct: 342 NRQLQRKNQPQHNSQRRQGLVCANCRGTNTTLWRRDADGHPVCNACGLYFKLHQVQRPIS 401
Query: 104 MRHLAMRPLAMRKDGIQTRKRKPKKQG 130
M+ ++ +QTRKRK K G
Sbjct: 402 MK---------KEGTLQTRKRKQKSDG 419
Score = 58.5 bits (140), Expect = 7e-07, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 34/57 (59%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
Q +G C NC +T LWRRD GH +CNACGLY K++ + RP+ + TR+
Sbjct: 356 QRRQGLVCANCRGTNTTLWRRDADGHPVCNACGLYFKLHQVQRPISMKKEGTLQTRK 412
>gi|402589624|gb|EJW83555.1| hypothetical protein WUBG_05536, partial [Wuchereria bancrofti]
Length = 119
Score = 85.5 bits (210), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 51/70 (72%), Gaps = 10/70 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A +R G+ C NC T TTLWRRN+ G+PVCNACGLY+KLHN++ RP++M+
Sbjct: 1 SAQKRTGIECVNCKTNNTTLWRRNSLGQPVCNACGLYHKLHNIS----------RPISMK 50
Query: 116 KDGIQTRKRK 125
KDGIQTR RK
Sbjct: 51 KDGIQTRNRK 60
Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 31/41 (75%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G ECVNC +T LWRR+ G +CNACGLYHK++ ++RP+
Sbjct: 7 GIECVNCKTNNTTLWRRNSLGQPVCNACGLYHKLHNISRPI 47
>gi|60594093|pdb|1Y0J|A Chain A, Zinc Fingers As Protein Recognition Motifs: Structural
Basis For The Gata-1FRIEND OF GATA INTERACTION
Length = 46
Score = 85.5 bits (210), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 37/45 (82%), Positives = 40/45 (88%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVK 50
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL++
Sbjct: 2 SEARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIR 46
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 7 CVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPL 44
>gi|25146629|ref|NP_509755.2| Protein ELT-2 [Caenorhabditis elegans]
gi|1706638|sp|Q10655.1|ELT2_CAEEL RecName: Full=Transcription factor elt-2
gi|847811|gb|AAC36130.1| GATA-factor [Caenorhabditis elegans]
gi|22265747|emb|CAA90029.2| Protein ELT-2 [Caenorhabditis elegans]
Length = 433
Score = 85.5 bits (210), Expect = 7e-15, Method: Composition-based stats.
Identities = 38/71 (53%), Positives = 50/71 (70%), Gaps = 9/71 (12%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
A+RR GL C+NC TTLWRRN +G+PVCNACGLY+KLH++ RP +M+ ++
Sbjct: 229 ASRRQGLVCSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHIPRPTSMK---------KE 279
Query: 117 DGIQTRKRKPK 127
+QTRKRK K
Sbjct: 280 GALQTRKRKSK 290
Score = 51.2 bits (121), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 31/54 (57%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
+G C NC +T LWRR+ G +CNACGLY K++ + RP + TR+
Sbjct: 233 QGLVCSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHIPRPTSMKKEGALQTRK 286
>gi|395328952|gb|EJF61341.1| hypothetical protein DICSQDRAFT_147115 [Dichomitus squalens
LYAD-421 SS1]
Length = 811
Score = 85.1 bits (209), Expect = 8e-15, Method: Composition-based stats.
Identities = 53/138 (38%), Positives = 69/138 (50%), Gaps = 23/138 (16%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSK----RLTATRR- 60
G EC NCGA TPLWRR CNACGLY K++ RP + R A R
Sbjct: 420 GVKAECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNNHGEGRAQAAPRQ 479
Query: 61 ----LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
+ C NC T T LWR++++G+ VCNA YKLH + RP++M+
Sbjct: 480 ESQEVVAQCYNCHTTATPLWRKDDEGKTVCNA----YKLHG----------SARPISMKS 525
Query: 117 DGIQTRKRKPKKQGGGGG 134
D I+ R R ++G G G
Sbjct: 526 DVIRKRARHDARRGVGNG 543
>gi|312084746|ref|XP_003144400.1| hypothetical protein LOAG_08822 [Loa loa]
Length = 149
Score = 85.1 bits (209), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 51/70 (72%), Gaps = 10/70 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+A +R G+ C NC T TTLWRRN+ G+PVCNACGLY+KLHN++ RP++M+
Sbjct: 31 SAQKRTGIECVNCKTNNTTLWRRNSLGQPVCNACGLYHKLHNIS----------RPISMK 80
Query: 116 KDGIQTRKRK 125
KDGIQTR RK
Sbjct: 81 KDGIQTRNRK 90
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 31/41 (75%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G ECVNC +T LWRR+ G +CNACGLYHK++ ++RP+
Sbjct: 37 GIECVNCKTNNTTLWRRNSLGQPVCNACGLYHKLHNISRPI 77
>gi|357625879|gb|EHJ76168.1| hypothetical protein KGM_07907 [Danaus plexippus]
Length = 514
Score = 85.1 bits (209), Expect = 8e-15, Method: Composition-based stats.
Identities = 39/71 (54%), Positives = 47/71 (66%), Gaps = 10/71 (14%)
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
R+ L C+NCGT TT+WRR+ GE VCNACGLYYKLH V RP AMR+D I
Sbjct: 350 RVELSCSNCGTHTTTIWRRDARGEMVCNACGLYYKLHGVP----------RPSAMRRDTI 399
Query: 120 QTRKRKPKKQG 130
TR+R+P+ G
Sbjct: 400 HTRRRRPRHDG 410
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 31/50 (62%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG +T +WRRD G +CNACGLY+K++G+ RP + RR
Sbjct: 355 CSNCGTHTTTIWRRDARGEMVCNACGLYYKLHGVPRPSAMRRDTIHTRRR 404
>gi|195397608|ref|XP_002057420.1| GJ18119 [Drosophila virilis]
gi|194141074|gb|EDW57493.1| GJ18119 [Drosophila virilis]
Length = 860
Score = 85.1 bits (209), Expect = 9e-15, Method: Composition-based stats.
Identities = 38/72 (52%), Positives = 50/72 (69%), Gaps = 10/72 (13%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
+T + + C+NCGT TT+WRR+ GE VCNACGLY+KLH VNRP H +MR+
Sbjct: 690 STSQKDMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP----H------SMRR 739
Query: 117 DGIQTRKRKPKK 128
D I TR+R+PK+
Sbjct: 740 DTIHTRRRRPKE 751
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 5/80 (6%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGT 70
C NCG ++T +WRR G +CNACGLY K++G+NRP + RR C
Sbjct: 698 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRPHSMRRDTIHTRRRRPKECEKSKK 757
Query: 71 RM-----TTLWRRNNDGEPV 85
R TT+ D P+
Sbjct: 758 RNRQIPGTTVVEYERDNSPI 777
>gi|159162363|pdb|1GNF|A Chain A, Solution Structure Of The N-Terminal Zinc Finger Of
Murine Gata-1, Nmr, 25 Structures
Length = 46
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 37/44 (84%), Positives = 39/44 (88%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG NRPL+
Sbjct: 2 SEARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLI 45
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NCG T LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 7 CVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPL 44
>gi|323319543|gb|ADX36142.1| GATA456b [Schmidtea polychroa]
Length = 363
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/111 (41%), Positives = 61/111 (54%), Gaps = 17/111 (15%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACG-------------LYHKMNGMNRPLVKPSKR 54
+ECV CG + + +DG ++ C C +K N MN+ L R
Sbjct: 165 AKECVICGMENGDTYHQDGNENFHCLNCVRKNPCVNPSLDLIPSYKKNQMNKNLA----R 220
Query: 55 LTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
+R GL C+NC T TTLWRRN++G+PVCNACGLYY+LH +RP MR
Sbjct: 221 KQTAKRTGLQCSNCKTENTTLWRRNSEGQPVCNACGLYYRLHKTHRPPTMR 271
>gi|126002036|ref|XP_001352245.1| GA18611 [Drosophila pseudoobscura pseudoobscura]
gi|54640243|gb|EAL29293.1| GA18611 [Drosophila pseudoobscura pseudoobscura]
Length = 803
Score = 84.3 bits (207), Expect = 2e-14, Method: Composition-based stats.
Identities = 36/66 (54%), Positives = 47/66 (71%), Gaps = 10/66 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C+NCGT TT+WRR+ GE VCNACGLY+KLH +NRP H +MR+D I TR
Sbjct: 654 MSCSNCGTLTTTIWRRSARGEMVCNACGLYFKLHGINRP----H------SMRRDTIHTR 703
Query: 123 KRKPKK 128
+R+PK+
Sbjct: 704 RRRPKE 709
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 656 CSNCGTLTTTIWRRSARGEMVCNACGLYFKLHGINRP 692
>gi|341903414|gb|EGT59349.1| hypothetical protein CAEBREN_06149 [Caenorhabditis brenneri]
Length = 425
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 56/85 (65%), Gaps = 13/85 (15%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+ +RR GL C+NC TTLWRRN +G+PVCNACGLY+KLH++ RP +M+
Sbjct: 226 STSRRQGLVCSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHI----------PRPNSMK 275
Query: 116 KDG-IQTRKRKPKKQGGGGGGGSGG 139
KDG +QTRKRK K G G SGG
Sbjct: 276 KDGALQTRKRKSK--NGDGSNSSGG 298
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 27/41 (65%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
+G C NC +T LWRR+ G +CNACGLY K++ + RP
Sbjct: 231 QGLVCSNCNGTNTTLWRRNAEGDPVCNACGLYFKLHHIPRP 271
>gi|24583358|ref|NP_609383.2| GATAd, isoform A [Drosophila melanogaster]
gi|442627216|ref|NP_001260326.1| GATAd, isoform B [Drosophila melanogaster]
gi|442627218|ref|NP_001260327.1| GATAd, isoform C [Drosophila melanogaster]
gi|22946136|gb|AAF52916.2| GATAd, isoform A [Drosophila melanogaster]
gi|379699074|gb|AFD10759.1| FI19405p1 [Drosophila melanogaster]
gi|440213644|gb|AGB92861.1| GATAd, isoform B [Drosophila melanogaster]
gi|440213645|gb|AGB92862.1| GATAd, isoform C [Drosophila melanogaster]
Length = 842
Score = 84.3 bits (207), Expect = 2e-14, Method: Composition-based stats.
Identities = 37/66 (56%), Positives = 47/66 (71%), Gaps = 10/66 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C+NCGT TT+WRR+ GE VCNACGLY+KLH VNRP H +MR+D I TR
Sbjct: 690 MSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP----H------SMRRDTIHTR 739
Query: 123 KRKPKK 128
+R+PK+
Sbjct: 740 RRRPKE 745
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 692 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 728
>gi|195578139|ref|XP_002078923.1| GD23678 [Drosophila simulans]
gi|194190932|gb|EDX04508.1| GD23678 [Drosophila simulans]
Length = 844
Score = 84.3 bits (207), Expect = 2e-14, Method: Composition-based stats.
Identities = 37/66 (56%), Positives = 47/66 (71%), Gaps = 10/66 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C+NCGT TT+WRR+ GE VCNACGLY+KLH VNRP H +MR+D I TR
Sbjct: 694 MSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP----H------SMRRDTIHTR 743
Query: 123 KRKPKK 128
+R+PK+
Sbjct: 744 RRRPKE 749
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 696 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 732
>gi|195473609|ref|XP_002089085.1| GE18923 [Drosophila yakuba]
gi|194175186|gb|EDW88797.1| GE18923 [Drosophila yakuba]
Length = 838
Score = 84.3 bits (207), Expect = 2e-14, Method: Composition-based stats.
Identities = 37/66 (56%), Positives = 47/66 (71%), Gaps = 10/66 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C+NCGT TT+WRR+ GE VCNACGLY+KLH VNRP H +MR+D I TR
Sbjct: 688 MSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP----H------SMRRDTIHTR 737
Query: 123 KRKPKK 128
+R+PK+
Sbjct: 738 RRRPKE 743
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 690 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 726
>gi|195339757|ref|XP_002036483.1| GM18170 [Drosophila sechellia]
gi|194130363|gb|EDW52406.1| GM18170 [Drosophila sechellia]
Length = 844
Score = 84.3 bits (207), Expect = 2e-14, Method: Composition-based stats.
Identities = 37/66 (56%), Positives = 47/66 (71%), Gaps = 10/66 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C+NCGT TT+WRR+ GE VCNACGLY+KLH VNRP H +MR+D I TR
Sbjct: 694 MSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP----H------SMRRDTIHTR 743
Query: 123 KRKPKK 128
+R+PK+
Sbjct: 744 RRRPKE 749
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 696 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 732
>gi|194761760|ref|XP_001963094.1| GF15767 [Drosophila ananassae]
gi|190616791|gb|EDV32315.1| GF15767 [Drosophila ananassae]
Length = 810
Score = 84.3 bits (207), Expect = 2e-14, Method: Composition-based stats.
Identities = 37/66 (56%), Positives = 47/66 (71%), Gaps = 10/66 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C+NCGT TT+WRR+ GE VCNACGLY+KLH VNRP H +MR+D I TR
Sbjct: 665 MSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP----H------SMRRDTIHTR 714
Query: 123 KRKPKK 128
+R+PK+
Sbjct: 715 RRRPKE 720
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 667 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 703
>gi|408489445|pdb|3GAT|A Chain A, Solution Nmr Structure Of The C-Terminal Domain Of Chicken
Gata-1 Bound To Dna, 34 Structures
gi|425684924|pdb|2GAT|A Chain A, Solution Structure Of The C-Terminal Domain Of Chicken
Gata- 1 Bound To Dna, Nmr, Regularized Mean Structure
Length = 66
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/72 (59%), Positives = 48/72 (66%), Gaps = 10/72 (13%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
+R G C+NC T TTLWRR+ G+PVCNACGLYYKLH VN RPL MRKDG
Sbjct: 1 KRAGTVCSNCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVN----------RPLTMRKDG 50
Query: 119 IQTRKRKPKKQG 130
IQTR RK +G
Sbjct: 51 IQTRNRKVSSKG 62
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR G +CNACGLY+K++ +NRPL
Sbjct: 4 GTVCSNCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVNRPLT 45
>gi|443726960|gb|ELU13928.1| hypothetical protein CAPTEDRAFT_116469, partial [Capitella
teleta]
Length = 75
Score = 83.6 bits (205), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/76 (55%), Positives = 53/76 (69%), Gaps = 2/76 (2%)
Query: 3 FQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPS-KRLTATRRL 61
+ G GREC NCG+ PLWR +GTGH LCNACG+ H M+G +P++K S R + +RR+
Sbjct: 1 MEGGVGRECANCGSTYAPLWRWNGTGHLLCNACGV-HVMSGFAKPVMKTSGGRRSVSRRV 59
Query: 62 GLCCTNCGTRMTTLWR 77
GL C NC T TTLWR
Sbjct: 60 GLSCANCHTSTTTLWR 75
Score = 35.0 bits (79), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLY 92
C NCG+ LWR N G +CNACG++
Sbjct: 9 CANCGSTYAPLWRWNGTGHLLCNACGVH 36
>gi|195146794|ref|XP_002014369.1| GL19159 [Drosophila persimilis]
gi|194106322|gb|EDW28365.1| GL19159 [Drosophila persimilis]
Length = 803
Score = 83.6 bits (205), Expect = 2e-14, Method: Composition-based stats.
Identities = 36/66 (54%), Positives = 47/66 (71%), Gaps = 10/66 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C+NCGT TT+WRR+ GE VCNACGLY+KLH +NRP H +MR+D I TR
Sbjct: 654 MSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGINRP----H------SMRRDTIHTR 703
Query: 123 KRKPKK 128
+R+PK+
Sbjct: 704 RRRPKE 709
Score = 59.3 bits (142), Expect = 4e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 656 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGINRP 692
>gi|194859800|ref|XP_001969453.1| GG10108 [Drosophila erecta]
gi|190661320|gb|EDV58512.1| GG10108 [Drosophila erecta]
Length = 838
Score = 83.6 bits (205), Expect = 2e-14, Method: Composition-based stats.
Identities = 36/66 (54%), Positives = 47/66 (71%), Gaps = 10/66 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C+NCGT TT+WRR+ GE VCNACGLY+KLH +NRP H +MR+D I TR
Sbjct: 688 MSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGINRP----H------SMRRDTIHTR 737
Query: 123 KRKPKK 128
+R+PK+
Sbjct: 738 RRRPKE 743
Score = 59.3 bits (142), Expect = 4e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 690 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGINRP 726
>gi|195457472|ref|XP_002075579.1| GK18561 [Drosophila willistoni]
gi|194171664|gb|EDW86565.1| GK18561 [Drosophila willistoni]
Length = 858
Score = 83.2 bits (204), Expect = 3e-14, Method: Composition-based stats.
Identities = 37/66 (56%), Positives = 47/66 (71%), Gaps = 10/66 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C+NCGT TT+WRR+ GE VCNACGLY+KLH VNRP H +MR+D I TR
Sbjct: 697 MSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP----H------SMRRDTIHTR 746
Query: 123 KRKPKK 128
+R+PK+
Sbjct: 747 RRRPKE 752
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 699 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 735
>gi|442928|pdb|1GAT|A Chain A, Solution Structure Of The Specific Dna Complex Of The Zinc
Containing Dna Binding Domain Of The Erythroid
Transcription Factor Gata-1 By Multidimensional Nmr
gi|442930|pdb|1GAU|A Chain A, Solution Structure Of The Specific Dna Complex Of The Zinc
Containing Dna Binding Domain Of The Erythroid
Transcription Factor Gata-1 By Multidimensional Nmr
Length = 60
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/67 (62%), Positives = 46/67 (68%), Gaps = 10/67 (14%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDG 118
+R G C+NC T TTLWRR+ G+PVCNACGLYYKLH VN RPL MRKDG
Sbjct: 1 KRAGTVCSNCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVN----------RPLTMRKDG 50
Query: 119 IQTRKRK 125
IQTR RK
Sbjct: 51 IQTRNRK 57
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NC +T LWRR G +CNACGLY+K++ +NRPL
Sbjct: 4 GTVCSNCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVNRPLT 45
>gi|402220446|gb|EJU00517.1| glucocorticoid receptor-like protein, partial [Dacryopinax sp.
DJM-731 SS1]
Length = 138
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 57/108 (52%), Gaps = 9/108 (8%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATR------- 59
E +C NCGA TPLWRR CNACGLY K + RPL A +
Sbjct: 22 EEAQCANCGAKRTPLWRRGLGDELNCNACGLYVKQHKKPRPLNLQRDANAAYKNGAGGSG 81
Query: 60 --RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
+ C+NC T T LWR++ DG +CNACGLY KLH RP ++R
Sbjct: 82 QPQEPTRCSNCDTTNTPLWRKDADGSTLCNACGLYMKLHGAPRPRSLR 129
>gi|21069165|gb|AAM33847.1|AF467985_1 uterine GATA-4 transcription factor [Oryctolagus cuniculus]
Length = 55
Score = 82.4 bits (202), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 37/55 (67%), Positives = 43/55 (78%)
Query: 34 ACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNA 88
ACGLYHKMNG NRPL+KP +RL+A + CC NC T TTLWRRN +G+PVCNA
Sbjct: 1 ACGLYHKMNGQNRPLIKPKRRLSACQESRTCCANCQTTTTTLWRRNANGDPVCNA 55
>gi|195119257|ref|XP_002004148.1| GI18290 [Drosophila mojavensis]
gi|193914723|gb|EDW13590.1| GI18290 [Drosophila mojavensis]
Length = 469
Score = 82.0 bits (201), Expect = 7e-14, Method: Composition-based stats.
Identities = 36/64 (56%), Positives = 45/64 (70%), Gaps = 10/64 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C+NCGT TT+WRR+ GE VCNACGLY+KLH VNRP H +MR+D I TR
Sbjct: 406 MSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP----H------SMRRDTIHTR 455
Query: 123 KRKP 126
+R+P
Sbjct: 456 RRRP 459
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 408 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 444
>gi|302684915|ref|XP_003032138.1| hypothetical protein SCHCODRAFT_56035 [Schizophyllum commune H4-8]
gi|300105831|gb|EFI97235.1| hypothetical protein SCHCODRAFT_56035 [Schizophyllum commune H4-8]
Length = 190
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/107 (44%), Positives = 59/107 (55%), Gaps = 6/107 (5%)
Query: 11 CVNCGAISTPLWRRDG-TGHYLCNACGLYHKMNGMNRP--LVKPSKRLTATRRL---GLC 64
C +C +TPLWRRD LCNACGLY G RP L+ R G
Sbjct: 78 CWHCRTRTTPLWRRDTRIPGLLCNACGLYLSQRGKLRPRELIDADDDSDVVREANYSGPE 137
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
CT+C TR T++WRRN G VCNACG+Y +L RPL++R +RP
Sbjct: 138 CTHCHTRTTSVWRRNKVGAQVCNACGVYERLKGKERPLSLRRDKIRP 184
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 27/41 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G EC +C +T +WRR+ G +CNACG+Y ++ G RPL
Sbjct: 135 GPECTHCHTRTTSVWRRNKVGAQVCNACGVYERLKGKERPL 175
>gi|339236827|ref|XP_003379968.1| erythroid transcription factor [Trichinella spiralis]
gi|316977299|gb|EFV60416.1| erythroid transcription factor [Trichinella spiralis]
Length = 321
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 56/102 (54%), Gaps = 12/102 (11%)
Query: 10 ECVNCGAISTPLWRRDG-TGHYLCNACGLY------HKMNGMNRPLVKPS-----KRLTA 57
+C CGA + LW TG +C++C L + +R + S K T
Sbjct: 183 DCSRCGAFQSALWHSTADTGALICDSCFLSGVGKNTQASSSCSRIITNYSSKTTRKTATT 242
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVN 99
++R GL C NC T TTLWRRN +G+PVCNACGLY+KLH V
Sbjct: 243 SKRQGLVCVNCQTMQTTLWRRNQNGDPVCNACGLYFKLHRVQ 284
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 25/36 (69%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 42
+G CVNC + T LWRR+ G +CNACGLY K++
Sbjct: 246 QGLVCVNCQTMQTTLWRRNQNGDPVCNACGLYFKLH 281
>gi|389738761|gb|EIM79957.1| glucocorticoid receptor-like protein, partial [Stereum hirsutum
FP-91666 SS1]
Length = 207
Score = 79.7 bits (195), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 67/142 (47%), Gaps = 37/142 (26%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------------------ 47
G+ EC NC A TPLWRR CN CGLY K++ RP
Sbjct: 7 GDKAECSNCSATHTPLWRRGLNDELNCNTCGLYCKLHKCPRPKTMRNNHGERSRAAPHPE 66
Query: 48 ---LVKPSKRLTA-----TRRLGLC-----------CTNCGTRMTTLWRRNNDGEPVCNA 88
++ S L+ T RL C NC T T LWR++++G+ VCNA
Sbjct: 67 SNEVMGQSAYLSPRLHHMTNRLHFLRLFISPAVESKCYNCNTTATPLWRKDDEGKTVCNA 126
Query: 89 CGLYYKLHNVNRPLAMRHLAMR 110
CGLY+KLH +RP++M+ +R
Sbjct: 127 CGLYFKLHGSSRPISMKSDVIR 148
>gi|241954936|ref|XP_002420189.1| nitrogen regulatory GATA-factor, putative; transcriptional
activator with GATA-1-type Zn finger DNA-binding motif,
putative [Candida dubliniensis CD36]
gi|223643530|emb|CAX42412.1| nitrogen regulatory GATA-factor, putative [Candida dubliniensis
CD36]
Length = 753
Score = 79.7 bits (195), Expect = 4e-13, Method: Composition-based stats.
Identities = 35/80 (43%), Positives = 49/80 (61%), Gaps = 10/80 (12%)
Query: 45 NRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM 104
+ P K + +A G+ CTNCGT+ T LWRRN G+P+CNACGL+ KLH V
Sbjct: 492 SNPTAKGTATGSAASNAGVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGV------ 545
Query: 105 RHLAMRPLAMRKDGIQTRKR 124
+RPL+++ D I+ R+R
Sbjct: 546 ----VRPLSLKTDVIKKRQR 561
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 509 GVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 549
>gi|255729218|ref|XP_002549534.1| hypothetical protein CTRG_03831 [Candida tropicalis MYA-3404]
gi|240132603|gb|EER32160.1| hypothetical protein CTRG_03831 [Candida tropicalis MYA-3404]
Length = 589
Score = 79.3 bits (194), Expect = 4e-13, Method: Composition-based stats.
Identities = 34/64 (53%), Positives = 44/64 (68%), Gaps = 10/64 (15%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNCGT+ T LWRRN G+P+CNACGL+ KLH V +RPLA++ D I+ R+R
Sbjct: 474 CTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGV----------VRPLALKTDVIKKRQR 523
Query: 125 KPKK 128
+ KK
Sbjct: 524 QSKK 527
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 474 CTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 511
>gi|170036724|ref|XP_001846212.1| GATA transcription factor GATAb-2 [Culex quinquefasciatus]
gi|167879609|gb|EDS42992.1| GATA transcription factor GATAb-2 [Culex quinquefasciatus]
Length = 720
Score = 79.3 bits (194), Expect = 4e-13, Method: Composition-based stats.
Identities = 38/65 (58%), Positives = 44/65 (67%), Gaps = 10/65 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ CTNCGT TT+WRRN GE VCNACGLY+KLH VN RP MR+D I TR
Sbjct: 550 MSCTNCGTTTTTIWRRNIRGEMVCNACGLYFKLHGVN----------RPHTMRRDTIHTR 599
Query: 123 KRKPK 127
+R+PK
Sbjct: 600 RRRPK 604
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 31/50 (62%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG +T +WRR+ G +CNACGLY K++G+NRP + RR
Sbjct: 552 CTNCGTTTTTIWRRNIRGEMVCNACGLYFKLHGVNRPHTMRRDTIHTRRR 601
>gi|402882071|ref|XP_003904577.1| PREDICTED: transcription factor GATA-5, partial [Papio anubis]
Length = 269
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 47/72 (65%), Positives = 54/72 (75%), Gaps = 10/72 (13%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+++RR GLCCTNC T TTLWRRN++GEPVCNACGLY KLH V RPLAM+
Sbjct: 106 SSSRRAGLCCTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVP----------RPLAMK 155
Query: 116 KDGIQTRKRKPK 127
K+ IQTRKRKPK
Sbjct: 156 KESIQTRKRKPK 167
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 35/73 (47%), Gaps = 12/73 (16%)
Query: 21 LWRRDGTGHYLCNACGLYHKMNGMNRPLV-----------KPSKRLTATRRLGLCCTNCG 69
LWRR+ G +CNACGLY K++G+ RPL KP K +T TR TN
Sbjct: 125 LWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQTRKRKP-KTITKTRGSSGSTTNAS 183
Query: 70 TRMTTLWRRNNDG 82
+ + +N
Sbjct: 184 ASPSAVPSTDNSA 196
>gi|194224624|ref|XP_001915060.1| PREDICTED: transcription factor GATA-5-like [Equus caballus]
Length = 255
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 50/85 (58%), Positives = 58/85 (68%), Gaps = 12/85 (14%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMR 115
+++RR GLCCTNC T TTLWRRN DGEPVCNACGLY KLH V RPLAM+
Sbjct: 91 SSSRRAGLCCTNCHTTTTTLWRRNADGEPVCNACGLYMKLHGVP----------RPLAMK 140
Query: 116 KDGIQTRKRKPK--KQGGGGGGGSG 138
K+ IQTRKR+PK + G G +G
Sbjct: 141 KESIQTRKRRPKNVAKTKGSAGSTG 165
Score = 44.7 bits (104), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 21 LWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
LWRR+ G +CNACGLY K++G+ RPL
Sbjct: 110 LWRRNADGEPVCNACGLYMKLHGVPRPLA 138
>gi|242213967|ref|XP_002472809.1| hypothetical transcription factor [Postia placenta Mad-698-R]
gi|220728105|gb|EED82006.1| hypothetical transcription factor [Postia placenta Mad-698-R]
Length = 759
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 66/133 (49%), Gaps = 23/133 (17%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV---------KPSKRLTATRR 60
EC NCGA TPLWRR CNACGLY K++ RP S ++
Sbjct: 403 ECSNCGATHTPLWRRGLNDELNCNACGLYCKLHKRPRPKSMRNNHGEGRTQSAPRQESQE 462
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+ C NC T T LWR++++G+ VCNA YKLH + RP++M+ D I+
Sbjct: 463 VVAQCYNCHTTATPLWRKDDEGKTVCNA----YKLHG----------SARPISMKSDVIR 508
Query: 121 TRKRKPKKQGGGG 133
R R ++ G G
Sbjct: 509 KRARHDARRAGNG 521
>gi|390597548|gb|EIN06947.1| hypothetical protein PUNSTDRAFT_20481, partial [Punctularia
strigosozonata HHB-11173 SS5]
Length = 109
Score = 79.0 bits (193), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 59/103 (57%), Gaps = 4/103 (3%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATR--RLGLCCTNC 68
C NCG + WRR G CNACGLY + ++RP + L + + L C NC
Sbjct: 1 CSNCGITHSANWRRGQDGKRTCNACGLYFQR--VSRPETPHLQLLNFLQNGKALLRCENC 58
Query: 69 GTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
T +TT+WRR+ VCNACGLYY++H +RP+ +R +RP
Sbjct: 59 DTSITTVWRRDARRRTVCNACGLYYRIHGCDRPIELRTDIIRP 101
>gi|170086289|ref|XP_001874368.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164651920|gb|EDR16160.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 169
Score = 79.0 bits (193), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 64/115 (55%), Gaps = 9/115 (7%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGH-YLCNACGLYHKMNGMNRP--LVKPSKRLTATRR-- 60
G ++C +C A STPLWRRD T LCNACGLY + RP L+ T
Sbjct: 47 GVAKKCSHCQATSTPLWRRDPTTFKTLCNACGLYLQQRNKFRPQELIDADTDDGDTTDSS 106
Query: 61 ----LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
+G C++C T T++WRR+ G +CNACG+Y +L RPL+++ ++P
Sbjct: 107 DGNYIGPECSHCRTHHTSVWRRSKTGAQLCNACGVYARLRGKPRPLSLKRNKIKP 161
>gi|312384504|gb|EFR29218.1| hypothetical protein AND_02033 [Anopheles darlingi]
Length = 787
Score = 78.6 bits (192), Expect = 7e-13, Method: Composition-based stats.
Identities = 38/65 (58%), Positives = 44/65 (67%), Gaps = 10/65 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ CTNCGT TT+WRRN GE VCNACGLY+KLH VN RP MR+D I TR
Sbjct: 623 MSCTNCGTTTTTIWRRNIRGEMVCNACGLYFKLHGVN----------RPHTMRRDTIHTR 672
Query: 123 KRKPK 127
+R+PK
Sbjct: 673 RRRPK 677
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 31/50 (62%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG +T +WRR+ G +CNACGLY K++G+NRP + RR
Sbjct: 625 CTNCGTTTTTIWRRNIRGEMVCNACGLYFKLHGVNRPHTMRRDTIHTRRR 674
>gi|391327583|ref|XP_003738277.1| PREDICTED: GATA-binding factor C-like [Metaseiulus occidentalis]
Length = 227
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/123 (43%), Positives = 63/123 (51%), Gaps = 24/123 (19%)
Query: 14 CGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR----LGLCCTNCG 69
CGA + W D G+ LC C P S + +R G+ C NC
Sbjct: 2 CGATHSSHWAVDEGGNTLCKMCAQC-------APPRSESDWNSREKRNGGSAGMVCHNCN 54
Query: 70 TRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQ 129
T TTLWRRN +G PVCNACGLYYKLHN N RPLA++K IQTR+R K
Sbjct: 55 TTTTTLWRRNIEGFPVCNACGLYYKLHNTN----------RPLALKKKDIQTRRR---KS 101
Query: 130 GGG 132
GGG
Sbjct: 102 GGG 104
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 32/53 (60%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
G C NC +T LWRR+ G +CNACGLY+K++ NRPL K + RR
Sbjct: 47 GMVCHNCNTTTTTLWRRNIEGFPVCNACGLYYKLHNTNRPLALKKKDIQTRRR 99
>gi|344234670|gb|EGV66538.1| hypothetical protein CANTEDRAFT_91703 [Candida tenuis ATCC 10573]
Length = 499
Score = 77.8 bits (190), Expect = 1e-12, Method: Composition-based stats.
Identities = 53/178 (29%), Positives = 71/178 (39%), Gaps = 62/178 (34%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNA----------------------------- 34
+ +G+ C NCG +TPLWRR G +CNA
Sbjct: 85 EVSDGQSCSNCGTKTTPLWRRSTNGTLICNACGLYLRSNNTNRPVNLKKTPNLITVENKK 144
Query: 35 ---CGLYHKMNGM-------------NRPLVK-------PSKRLTATRRLGLCCTNCGTR 71
C + NG NR +K P + L + C NC T
Sbjct: 145 HGSCAGDGQCNGTGGSAACKGCPAYNNRMFLKKIHEEEKPKAPTPQSDDLAIACFNCNTT 204
Query: 72 MTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQ 129
+T LWRR++ G +CNACGLYYKLH + RP+ M+K I+ RKR K+
Sbjct: 205 ITPLWRRDDGGNTICNACGLYYKLHG----------SHRPVRMKKATIKRRKRNIDKE 252
>gi|12641865|dbj|BAB21556.1| GATA factor d [Drosophila melanogaster]
Length = 352
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 37/67 (55%), Positives = 47/67 (70%), Gaps = 10/67 (14%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
+ C+NCGT TT+WRR+ GE VCNACGLY+KLH VNRP H +MR+D I T
Sbjct: 199 DMSCSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP----H------SMRRDTIHT 248
Query: 122 RKRKPKK 128
R+R+PK+
Sbjct: 249 RRRRPKE 255
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG ++T +WRR G +CNACGLY K++G+NRP
Sbjct: 202 CSNCGTLTTTIWRRSVRGEMVCNACGLYFKLHGVNRP 238
>gi|409079370|gb|EKM79731.1| hypothetical protein AGABI1DRAFT_56893 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 386
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 65/112 (58%), Gaps = 8/112 (7%)
Query: 8 GRECVNCGAISTPLWRRDGTG-HYLCNACGLYHKMNGMNRP--LVKPSKRLTATRRL--- 61
G++C +C A STPLWRR+ T LCNACGLY + +RP L+ + +
Sbjct: 262 GKKCSHCNATSTPLWRREPTTLKPLCNACGLYLQQRHRHRPRELIDADQEDEESEEEDQN 321
Query: 62 --GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
G C++C T T++WRR+ G +CNACG+Y +L +RPL ++ ++P
Sbjct: 322 YNGPECSHCHTHRTSVWRRSKTGAQLCNACGVYARLRGKDRPLTLKRKKIKP 373
>gi|406607700|emb|CCH40972.1| Nitrogen regulatory protein areA [Wickerhamomyces ciferrii]
Length = 618
Score = 77.4 bits (189), Expect = 2e-12, Method: Composition-based stats.
Identities = 36/84 (42%), Positives = 51/84 (60%), Gaps = 10/84 (11%)
Query: 41 MNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNR 100
++G N P+KR +T CTNC T+ T LWRRN +G+P+CNACGL+ KLH V
Sbjct: 379 VSGNNLSSSAPNKREDSTSNANTRCTNCNTQTTPLWRRNPEGQPLCNACGLFLKLHGV-- 436
Query: 101 PLAMRHLAMRPLAMRKDGIQTRKR 124
+RPL+++ D I+ R+R
Sbjct: 437 --------VRPLSLKTDVIKKRQR 452
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 403 CTNCNTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 440
>gi|426192692|gb|EKV42628.1| hypothetical protein AGABI2DRAFT_122836 [Agaricus bisporus var.
bisporus H97]
Length = 386
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 65/112 (58%), Gaps = 8/112 (7%)
Query: 8 GRECVNCGAISTPLWRRDGTG-HYLCNACGLYHKMNGMNRP--LVKPSKRLTATRRL--- 61
G++C +C A STPLWRR+ T LCNACGLY + +RP L+ + +
Sbjct: 262 GKKCSHCNATSTPLWRREPTTLKPLCNACGLYLQQRHRHRPRELIDADQEDEESEEEDQN 321
Query: 62 --GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
G C++C T T++WRR+ G +CNACG+Y +L +RPL ++ ++P
Sbjct: 322 YNGPECSHCHTHRTSVWRRSKTGAQLCNACGVYARLRGKDRPLTLKRKKIKP 373
>gi|328858702|gb|EGG07814.1| hypothetical protein MELLADRAFT_55799 [Melampsora larici-populina
98AG31]
Length = 186
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG--- 62
E C NC TPLWRR LCNACG+++K++ +RP + T + G
Sbjct: 11 AESPTCSNCRGTQTPLWRRGPDDELLCNACGVFYKVHKKHRPHNLAKVKNTFRGKFGSRR 70
Query: 63 -------LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
+ CTNC T +WR+ DG +CNAC LY+K H RP+
Sbjct: 71 HPRDTQPIKCTNCNATATPMWRKAPDGALLCNACALYFKCHKRPRPM 117
>gi|31559064|gb|AAP50501.1| nitrogen regulatory GATA-factor [Candida albicans]
Length = 755
Score = 77.0 bits (188), Expect = 2e-12, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 44/68 (64%), Gaps = 10/68 (14%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
A G+ CTNCGT+ T LWRRN G+P+CNACGL+ KLH V +RPL+++
Sbjct: 497 AASNTGVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGV----------VRPLSLKT 546
Query: 117 DGIQTRKR 124
D I+ R+R
Sbjct: 547 DVIKKRQR 554
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 502 GVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 542
>gi|345110699|pdb|2L6Y|A Chain A, Haddock Model Of Gata1nf:lmo2lim2-Ldb1lid
gi|345110701|pdb|2L6Z|A Chain A, Haddock Model Of Gata1nf:lmo2lim2-Ldb1lid With Fog
Length = 39
Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/39 (87%), Positives = 35/39 (89%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMN 45
E RECVNCGA +TPLWRRD TGHYLCNACGLYHKMNG N
Sbjct: 1 EARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQN 39
Score = 43.9 bits (102), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 24/35 (68%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVN 99
C NCG T LWRR+ G +CNACGLY+K++ N
Sbjct: 5 CVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQN 39
>gi|308097154|gb|ADO14131.1| GATA binding protein 5 [Notophthalmus viridescens]
Length = 56
Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/56 (76%), Positives = 49/56 (87%)
Query: 39 HKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYK 94
HKMNG+NRPL+KP KRL+++RR GLCCTNC T TTLWRRN +GEPVCNACGLY K
Sbjct: 1 HKMNGINRPLIKPQKRLSSSRRAGLCCTNCHTTTTTLWRRNTEGEPVCNACGLYMK 56
>gi|238881297|gb|EEQ44935.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 582
Score = 76.6 bits (187), Expect = 3e-12, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 44/68 (64%), Gaps = 10/68 (14%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
A G+ CTNCGT+ T LWRRN G+P+CNACGL+ KLH V +RPL+++
Sbjct: 324 AASNTGVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGV----------VRPLSLKT 373
Query: 117 DGIQTRKR 124
D I+ R+R
Sbjct: 374 DVIKKRQR 381
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 329 GVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 369
>gi|68478842|ref|XP_716583.1| hypothetical protein CaO19.1275 [Candida albicans SC5314]
gi|68478951|ref|XP_716530.1| hypothetical protein CaO19.8862 [Candida albicans SC5314]
gi|46438200|gb|EAK97535.1| hypothetical protein CaO19.8862 [Candida albicans SC5314]
gi|46438254|gb|EAK97588.1| hypothetical protein CaO19.1275 [Candida albicans SC5314]
Length = 688
Score = 76.6 bits (187), Expect = 3e-12, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 44/68 (64%), Gaps = 10/68 (14%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
A G+ CTNCGT+ T LWRRN G+P+CNACGL+ KLH V +RPL+++
Sbjct: 430 AASNTGVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGV----------VRPLSLKT 479
Query: 117 DGIQTRKR 124
D I+ R+R
Sbjct: 480 DVIKKRQR 487
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 435 GVSCTNCGTKTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 475
>gi|196049284|dbj|BAG68611.1| GATA-binding protein 2 [Cyprinus carpio]
Length = 244
Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 32/33 (96%), Positives = 32/33 (96%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYH 39
EGRECVNCGA STPLWRRDGTGHYLCNACGLYH
Sbjct: 212 EGRECVNCGATSTPLWRRDGTGHYLCNACGLYH 244
Score = 43.5 bits (101), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 31/61 (50%), Gaps = 3/61 (4%)
Query: 36 GLYHK---MNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLY 92
GL+H ++G K T + G C NCG T LWRR+ G +CNACGLY
Sbjct: 184 GLFHPGTLLSGSASSFTPKCKSKTRSCSEGRECVNCGATSTPLWRRDGTGHYLCNACGLY 243
Query: 93 Y 93
+
Sbjct: 244 H 244
>gi|256085819|ref|XP_002579109.1| GATA binding factor-1b (transcription factor xgata-1b) [Schistosoma
mansoni]
gi|360044426|emb|CCD81974.1| putative gata zinc finger domain-containing protein [Schistosoma
mansoni]
Length = 227
Score = 75.9 bits (185), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/59 (61%), Positives = 43/59 (72%), Gaps = 4/59 (6%)
Query: 40 KMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNV 98
K++ ++ PL P+ +TRR G CTNC T TTLWRRN +GEPVCNACGLYYKLH V
Sbjct: 113 KISKLSNPLNIPN----STRRSGQFCTNCNTSATTLWRRNTEGEPVCNACGLYYKLHKV 167
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 5/49 (10%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL 55
G+ C NC +T LWRR+ G +CNACGLY+K++ VKP K L
Sbjct: 130 SGQFCTNCNTSATTLWRRNTEGEPVCNACGLYYKLHK-----VKPMKTL 173
>gi|193636743|ref|XP_001951486.1| PREDICTED: hypothetical protein LOC100167435 isoform 1
[Acyrthosiphon pisum]
Length = 639
Score = 75.9 bits (185), Expect = 5e-12, Method: Composition-based stats.
Identities = 36/81 (44%), Positives = 47/81 (58%), Gaps = 11/81 (13%)
Query: 51 PSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
P KR + C+NCGT +T+WRR N+ +PVCNACGLYYKLH +R
Sbjct: 513 PRKRTARESSVQRMCSNCGTTCSTIWRRLNE-DPVCNACGLYYKLHG----------KIR 561
Query: 111 PLAMRKDGIQTRKRKPKKQGG 131
P MR+D I TR+R+ + G
Sbjct: 562 PPTMRRDTIHTRQRRKRSDNG 582
Score = 45.1 bits (105), Expect = 0.011, Method: Composition-based stats.
Identities = 19/39 (48%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
R C NCG + +WRR +CNACGLY+K++G RP
Sbjct: 525 RMCSNCGTTCSTIWRRLNEDP-VCNACGLYYKLHGKIRP 562
>gi|328709165|ref|XP_003243884.1| PREDICTED: hypothetical protein LOC100167435 isoform 2
[Acyrthosiphon pisum]
Length = 588
Score = 75.1 bits (183), Expect = 9e-12, Method: Composition-based stats.
Identities = 36/81 (44%), Positives = 47/81 (58%), Gaps = 11/81 (13%)
Query: 51 PSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
P KR + C+NCGT +T+WRR N+ +PVCNACGLYYKLH +R
Sbjct: 519 PRKRTARESSVQRMCSNCGTTCSTIWRRLNE-DPVCNACGLYYKLHG----------KIR 567
Query: 111 PLAMRKDGIQTRKRKPKKQGG 131
P MR+D I TR+R+ + G
Sbjct: 568 PPTMRRDTIHTRQRRKRSDNG 588
Score = 44.3 bits (103), Expect = 0.015, Method: Composition-based stats.
Identities = 19/39 (48%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 9 RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
R C NCG + +WRR +CNACGLY+K++G RP
Sbjct: 531 RMCSNCGTTCSTIWRRLNEDP-VCNACGLYYKLHGKIRP 568
>gi|114691382|ref|XP_521049.2| PREDICTED: erythroid transcription factor-like, partial [Pan
troglodytes]
Length = 165
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/74 (60%), Positives = 49/74 (66%), Gaps = 10/74 (13%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
++R G CTNC T TTLWRRN G+PVCNACGLYYKLH VN RPL MRK
Sbjct: 2 VSKRAGTQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVN----------RPLTMRK 51
Query: 117 DGIQTRKRKPKKQG 130
DGIQTR RK +G
Sbjct: 52 DGIQTRNRKASGKG 65
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 24/29 (82%)
Query: 21 LWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
LWRR+ +G +CNACGLY+K++ +NRPL
Sbjct: 20 LWRRNASGDPVCNACGLYYKLHQVNRPLT 48
>gi|448121454|ref|XP_004204211.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
gi|358349750|emb|CCE73029.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
Length = 697
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 32/62 (51%), Positives = 42/62 (67%), Gaps = 10/62 (16%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ CTNCGTR T LWRRN G+P+CNACGL+ KLH V +RPL+++ D I+ R
Sbjct: 528 IACTNCGTRTTPLWRRNPQGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKR 577
Query: 123 KR 124
+R
Sbjct: 578 QR 579
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 29/71 (40%), Positives = 41/71 (57%), Gaps = 6/71 (8%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGT 70
C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL + + +R N GT
Sbjct: 530 CTNCGTRTTPLWRRNPQGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRQR------NSGT 583
Query: 71 RMTTLWRRNND 81
T+ ++ D
Sbjct: 584 SKKTINSKDGD 594
>gi|50292241|ref|XP_448553.1| hypothetical protein [Candida glabrata CBS 138]
gi|49527865|emb|CAG61516.1| unnamed protein product [Candida glabrata]
Length = 459
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/65 (50%), Positives = 43/65 (66%), Gaps = 10/65 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
L CTNCGT T LWR++ D +P+CNACGL++KLH V MRPL+++ D I+ R
Sbjct: 256 LICTNCGTTNTPLWRKDIDRKPLCNACGLFFKLHGV----------MRPLSLKTDVIKKR 305
Query: 123 KRKPK 127
KR K
Sbjct: 306 KRTAK 310
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 34/57 (59%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
C NCG +TPLWR+D LCNACGL+ K++G+ RPL + + +R TN
Sbjct: 258 CTNCGTTNTPLWRKDIDRKPLCNACGLFFKLHGVMRPLSLKTDVIKKRKRTAKIKTN 314
>gi|448123846|ref|XP_004204769.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
gi|358249402|emb|CCE72468.1| Piso0_000039 [Millerozyma farinosa CBS 7064]
Length = 697
Score = 74.3 bits (181), Expect = 1e-11, Method: Composition-based stats.
Identities = 32/62 (51%), Positives = 42/62 (67%), Gaps = 10/62 (16%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ CTNCGTR T LWRRN G+P+CNACGL+ KLH V +RPL+++ D I+ R
Sbjct: 528 ISCTNCGTRTTPLWRRNPQGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKR 577
Query: 123 KR 124
+R
Sbjct: 578 QR 579
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 29/71 (40%), Positives = 41/71 (57%), Gaps = 6/71 (8%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGT 70
C NCG +TPLWRR+ G LCNACGL+ K++G+ RPL + + +R N GT
Sbjct: 530 CTNCGTRTTPLWRRNPQGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRQR------NSGT 583
Query: 71 RMTTLWRRNND 81
T+ ++ D
Sbjct: 584 SKKTIGTKDGD 594
>gi|378754340|gb|EHY64374.1| hypothetical protein NERG_02545 [Nematocida sp. 1 ERTm2]
Length = 214
Score = 74.3 bits (181), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 6/90 (6%)
Query: 9 RECVNCGAISTPLWRRDGTGHYL-CNACGLYHKMNGMNRPL-VKPSKRLTATR---RLGL 63
R C NCG +TP WRR + L CNACGLY +++G NRP V P + A + G+
Sbjct: 49 RVCTNCGTTTTPAWRRSTSNKILLCNACGLYQRLHGSNRPFSVTPDGKTKAIKNNIEKGI 108
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYY 93
C CG T LW+R N E +C++CGL Y
Sbjct: 109 C-RGCGVVQTPLWKRGNSNEWLCSSCGLLY 137
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 28/41 (68%), Gaps = 1/41 (2%)
Query: 65 CTNCGTRMTTLWRRNNDGEPV-CNACGLYYKLHNVNRPLAM 104
CTNCGT T WRR+ + + CNACGLY +LH NRP ++
Sbjct: 51 CTNCGTTTTPAWRRSTSNKILLCNACGLYQRLHGSNRPFSV 91
>gi|146420517|ref|XP_001486214.1| hypothetical protein PGUG_01885 [Meyerozyma guilliermondii ATCC
6260]
Length = 568
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 31/62 (50%), Positives = 43/62 (69%), Gaps = 3/62 (4%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC TR T LWRRN G+P+CNACGL+ KLH V RPL+++ ++ R+ G+ + +
Sbjct: 418 CTNCHTRTTPLWRRNPQGQPLCNACGLFLKLHGVVRPLSLKTDVIKK---RQRGLGSTTK 474
Query: 125 KP 126
KP
Sbjct: 475 KP 476
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 418 CTNCHTRTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 455
>gi|238878590|gb|EEQ42228.1| hypothetical protein CAWG_00430 [Candida albicans WO-1]
Length = 517
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 56/194 (28%), Positives = 77/194 (39%), Gaps = 75/194 (38%)
Query: 7 EGRECVNCGAISTPLWRR--DGT------------------------------------- 27
+G++C NCG TPLWRR DGT
Sbjct: 56 DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRPPNTIAVVKEEEGS 115
Query: 28 -----------GHYLCNACGLYH-------------KMNGMNRPLVKPSKRLTAT----- 58
G C C Y+ K + P+ K KR T+T
Sbjct: 116 CKGDGRCNGTGGSAACKGCPAYNNRIVAKKALEKSPKNDSSRAPIDKSLKRSTSTDATTE 175
Query: 59 --RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP-LAMR 115
L + C NCGT +T LWRR++ G +CNACGL+Y+LH +RP+ M+ RP + R
Sbjct: 176 DESSLAIACFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMK----RPTIKRR 231
Query: 116 KDGIQTRKRKPKKQ 129
K + +K K + Q
Sbjct: 232 KRNVSDKKSKDEVQ 245
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 33/81 (40%), Positives = 46/81 (56%), Gaps = 7/81 (8%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP---LAMRKDG 118
G C+NCGT T LWRR DG +CNACGLYY+ +N +RP+ ++ RP +A+ K+
Sbjct: 57 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLK----RPPNTIAVVKEE 112
Query: 119 IQTRKRKPKKQGGGGGGGSGG 139
+ K + G GG G
Sbjct: 113 EGSCKGDGRCNGTGGSAACKG 133
>gi|21745321|gb|AAM77345.1|AF520973_1 transcription factor SFU1 [Candida albicans]
Length = 518
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 56/194 (28%), Positives = 77/194 (39%), Gaps = 75/194 (38%)
Query: 7 EGRECVNCGAISTPLWRR--DGT------------------------------------- 27
+G++C NCG TPLWRR DGT
Sbjct: 57 DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRPPNTIAVVKEEEGS 116
Query: 28 -----------GHYLCNACGLYH-------------KMNGMNRPLVKPSKRLTAT----- 58
G C C Y+ K + P+ K KR T+T
Sbjct: 117 CKGDGRCNGTGGSAACKGCPAYNNRIVAKKALEKSPKNDSSRAPIDKSLKRSTSTDATTE 176
Query: 59 --RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP-LAMR 115
L + C NCGT +T LWRR++ G +CNACGL+Y+LH +RP+ M+ RP + R
Sbjct: 177 DESSLAIACFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMK----RPTIKRR 232
Query: 116 KDGIQTRKRKPKKQ 129
K + +K K + Q
Sbjct: 233 KRNVSDKKSKDEVQ 246
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 33/81 (40%), Positives = 46/81 (56%), Gaps = 7/81 (8%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP---LAMRKDG 118
G C+NCGT T LWRR DG +CNACGLYY+ +N +RP+ ++ RP +A+ K+
Sbjct: 58 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLK----RPPNTIAVVKEE 113
Query: 119 IQTRKRKPKKQGGGGGGGSGG 139
+ K + G GG G
Sbjct: 114 EGSCKGDGRCNGTGGSAACKG 134
>gi|68464951|ref|XP_723553.1| negative regulator of iron uptake genes [Candida albicans SC5314]
gi|68465330|ref|XP_723364.1| negative regulator of iron uptake genes [Candida albicans SC5314]
gi|46445393|gb|EAL04662.1| negative regulator of iron uptake genes [Candida albicans SC5314]
gi|46445589|gb|EAL04857.1| negative regulator of iron uptake genes [Candida albicans SC5314]
Length = 517
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 56/194 (28%), Positives = 77/194 (39%), Gaps = 75/194 (38%)
Query: 7 EGRECVNCGAISTPLWRR--DGT------------------------------------- 27
+G++C NCG TPLWRR DGT
Sbjct: 56 DGQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRPPNTIAVVKEEEGS 115
Query: 28 -----------GHYLCNACGLYH-------------KMNGMNRPLVKPSKRLTAT----- 58
G C C Y+ K + P+ K KR T+T
Sbjct: 116 CKGDGRCNGTGGSAACKGCPAYNNRIVAKKALEKSPKNDSSRAPIDKSLKRSTSTDATTE 175
Query: 59 --RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP-LAMR 115
L + C NCGT +T LWRR++ G +CNACGL+Y+LH +RP+ M+ RP + R
Sbjct: 176 DESSLAIACFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMK----RPTIKRR 231
Query: 116 KDGIQTRKRKPKKQ 129
K + +K K + Q
Sbjct: 232 KRNVSDKKSKDEVQ 245
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 33/81 (40%), Positives = 46/81 (56%), Gaps = 7/81 (8%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP---LAMRKDG 118
G C+NCGT T LWRR DG +CNACGLYY+ +N +RP+ ++ RP +A+ K+
Sbjct: 57 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLK----RPPNTIAVVKEE 112
Query: 119 IQTRKRKPKKQGGGGGGGSGG 139
+ K + G GG G
Sbjct: 113 EGSCKGDGRCNGTGGSAACKG 133
>gi|406699318|gb|EKD02524.1| iron regulator 1 [Trichosporon asahii var. asahii CBS 8904]
Length = 977
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/54 (55%), Positives = 39/54 (72%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
AT +G+ C NCGT T LWRR+ +G P CNACGLY+KLH V RP+AM+ ++
Sbjct: 342 ATTPVGMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPVAMKKTVIK 395
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/42 (59%), Positives = 30/42 (71%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+
Sbjct: 347 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPVA 388
>gi|401888062|gb|EJT52030.1| iron regulator 1 [Trichosporon asahii var. asahii CBS 2479]
Length = 977
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/54 (55%), Positives = 39/54 (72%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
AT +G+ C NCGT T LWRR+ +G P CNACGLY+KLH V RP+AM+ ++
Sbjct: 342 ATTPVGMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPVAMKKTVIK 395
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/42 (59%), Positives = 30/42 (71%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+
Sbjct: 347 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPVA 388
>gi|448097680|ref|XP_004198732.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
gi|359380154|emb|CCE82395.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
Length = 539
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 54/176 (30%), Positives = 74/176 (42%), Gaps = 61/176 (34%)
Query: 7 EGRECVNCGAISTPLWRR--DGT------GHYL--------------------------- 31
EG++C NCG TPLWRR DGT G YL
Sbjct: 21 EGQQCSNCGTTKTPLWRRAPDGTMICNACGLYLRSNSTNRPVNLKRPPNTIPIAKDEEGS 80
Query: 32 ------CNACGLYHKMNGM----NRPLVK---------PSKRLTATRR-----LGLCCTN 67
CN G G NR L+ PSK + + C N
Sbjct: 81 CKGDGRCNGTGGSSACKGCPAFNNRVLITKELEWAANAPSKAPGPVAEPKDDPMAIACYN 140
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CG+ +T LWRR++ G +CNACGLYY+LH +RP+ M+ ++ +++ IQ +K
Sbjct: 141 CGSTITPLWRRDDAGNTICNACGLYYRLHGSHRPIKMKSSTIK--RRKRNHIQIKK 194
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 32/81 (39%), Positives = 44/81 (54%), Gaps = 7/81 (8%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP---LAMRKDG 118
G C+NCGT T LWRR DG +CNACGLY + ++ NRP+ ++ RP + + KD
Sbjct: 22 GQQCSNCGTTKTPLWRRAPDGTMICNACGLYLRSNSTNRPVNLK----RPPNTIPIAKDE 77
Query: 119 IQTRKRKPKKQGGGGGGGSGG 139
+ K + G GG G
Sbjct: 78 EGSCKGDGRCNGTGGSSACKG 98
>gi|387593300|gb|EIJ88324.1| hypothetical protein NEQG_01768 [Nematocida parisii ERTm3]
gi|387595987|gb|EIJ93609.1| hypothetical protein NEPG_01181 [Nematocida parisii ERTm1]
Length = 297
Score = 73.6 bits (179), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 6/90 (6%)
Query: 9 RECVNCGAISTPLWRRDGTGHYL-CNACGLYHKMNGMNRPL-VKPSKRLTATR---RLGL 63
R C NCG +TP WRR + L CNACGLY +++G NRP V P + A + G+
Sbjct: 132 RVCTNCGTTTTPAWRRSTSNKILLCNACGLYQRLHGSNRPFSVTPDGKTKAIKNNIEKGV 191
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYY 93
C CG T LW+R N E +C++CGL Y
Sbjct: 192 C-RGCGVVQTPLWKRGNSNEWLCSSCGLLY 220
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 49 VKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPV-CNACGLYYKLHNVNRPLAM 104
K K +T + CTNCGT T WRR+ + + CNACGLY +LH NRP ++
Sbjct: 118 AKKEKSMTRKKSKFRVCTNCGTTTTPAWRRSTSNKILLCNACGLYQRLHGSNRPFSV 174
>gi|448101527|ref|XP_004199582.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
gi|359381004|emb|CCE81463.1| Piso0_002120 [Millerozyma farinosa CBS 7064]
Length = 546
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 50/176 (28%), Positives = 72/176 (40%), Gaps = 61/176 (34%)
Query: 7 EGRECVNCGAISTPLWRR--DGT------GHYL--------------------------- 31
EG++C NCG TPLWRR DGT G YL
Sbjct: 25 EGQQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNSTNRPVNLKRPPNTIPIAKDEEGS 84
Query: 32 ---------------CNACGLYHKM--------NGMNRPLVKPSKRLTATRR-LGLCCTN 67
C C ++ + N P P A + + C N
Sbjct: 85 CKGDGRCNGTGGSSACKGCPAFNNRVLITKELESAANTPSKAPGPEAEAKDDPMAIACFN 144
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CG+ +T LWRR++ G +CNACGLYY+LH +RP+ M+ ++ +++ IQ +K
Sbjct: 145 CGSTITPLWRRDDAGNTICNACGLYYRLHGSHRPIKMKSSTIK--RRKRNHIQIKK 198
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 32/81 (39%), Positives = 44/81 (54%), Gaps = 7/81 (8%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP---LAMRKDG 118
G C+NCGT T LWRR DG +CNACGLY + ++ NRP+ ++ RP + + KD
Sbjct: 26 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYLRSNSTNRPVNLK----RPPNTIPIAKDE 81
Query: 119 IQTRKRKPKKQGGGGGGGSGG 139
+ K + G GG G
Sbjct: 82 EGSCKGDGRCNGTGGSSACKG 102
>gi|150951390|ref|XP_001387707.2| activator of transcription of nitrogen-regulated genes
[Scheffersomyces stipitis CBS 6054]
gi|149388554|gb|EAZ63684.2| activator of transcription of nitrogen-regulated genes
[Scheffersomyces stipitis CBS 6054]
Length = 820
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 36/80 (45%), Positives = 46/80 (57%), Gaps = 12/80 (15%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G+ CTNC T+ T LWRRN G P+CNACGL+ KLH V +RPL+++ D I+
Sbjct: 590 GVSCTNCHTQTTPLWRRNPQGLPLCNACGLFLKLHGV----------VRPLSLKTDVIKK 639
Query: 122 RKR--KPKKQGGGGGGGSGG 139
R+R PKK G G
Sbjct: 640 RQRNTNPKKSISGSSKDKDG 659
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 23/41 (56%), Positives = 29/41 (70%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 590 GVSCTNCHTQTTPLWRRNPQGLPLCNACGLFLKLHGVVRPL 630
>gi|37625017|gb|AAQ96332.1| transcription factor GATA-1 [Raja eglanteria]
Length = 50
Score = 73.2 bits (178), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/50 (64%), Positives = 37/50 (74%)
Query: 38 YHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCN 87
YHKMNG NRPL++P KRL ++R G C NC T TTLWRRN G+PVCN
Sbjct: 1 YHKMNGQNRPLIRPKKRLIVSKRAGTLCANCHTSTTTLWRRNISGDPVCN 50
>gi|406602734|emb|CCH45692.1| Zinc finger protein [Wickerhamomyces ciferrii]
Length = 487
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 34/79 (43%), Positives = 50/79 (63%), Gaps = 13/79 (16%)
Query: 51 PSKRLTATRRL---GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHL 107
PS + + +L + CTNCGT +T LWRR+++G+ +CNACGLYYKLH ++
Sbjct: 163 PSSNVMGSEQLENVAIACTNCGTTVTPLWRRDDNGDTICNACGLYYKLHGLH-------- 214
Query: 108 AMRPLAMRKDGIQTRKRKP 126
RP+ M++ I+ RKR P
Sbjct: 215 --RPIKMKRGVIKRRKRNP 231
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 27/50 (54%), Positives = 34/50 (68%), Gaps = 2/50 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG TPLWRRD G +CNACGLY+K++G++RP+ KR RR
Sbjct: 180 CTNCGTTVTPLWRRDDNGDTICNACGLYYKLHGLHRPI--KMKRGVIKRR 227
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 44/82 (53%), Gaps = 5/82 (6%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
T+ C+NC T T LWRR DG +CNACGLYY+ +N +RP+ ++ RP +
Sbjct: 24 TQSTAQICSNCSTTKTPLWRRAPDGSLICNACGLYYRANNSHRPINLK----RPPHVVTV 79
Query: 118 GIQTRKRKPKKQGGGGGGGSGG 139
G Q +G G G+GG
Sbjct: 80 G-QNDSHSGTCKGDGRCNGTGG 100
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 20/38 (52%), Positives = 25/38 (65%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC TPLWRR G +CNACGLY++ N +RP+
Sbjct: 31 CSNCSTTKTPLWRRAPDGSLICNACGLYYRANNSHRPI 68
>gi|12641861|dbj|BAB21551.1| dGATAe [Drosophila melanogaster]
Length = 571
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 38/88 (43%), Positives = 45/88 (51%), Gaps = 7/88 (7%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG--MNRPLVKPSKRLTATRRLGLCCTN 67
+C NC P R+ Y C C + +M R + RR G+ C N
Sbjct: 487 KCENCHG---PFLRKGS--EYFCPNCPAFMRMAPRITQRQAKPKAAAAPNNRRNGVTCAN 541
Query: 68 CGTRMTTLWRRNNDGEPVCNACGLYYKL 95
C T TTLWRRNN+G PVCNACGLYYKL
Sbjct: 542 CQTNSTTLWRRNNEGNPVCNACGLYYKL 569
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 19/34 (55%), Positives = 24/34 (70%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKM 41
G C NC ST LWRR+ G+ +CNACGLY+K+
Sbjct: 536 GVTCANCQTNSTTLWRRNNEGNPVCNACGLYYKL 569
>gi|116266310|gb|ABJ91394.1| glutathione S-transferase/GATA-1 fusion protein [Cloning vector
pGATA]
Length = 359
Score = 72.8 bits (177), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 46/81 (56%), Positives = 52/81 (64%), Gaps = 10/81 (12%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
++R G CTNC T TTLWRRN G+PVCNACGLY+KLH VN RPL MRKD
Sbjct: 285 SKRAGTQCTNCQTTTTTLWRRNASGDPVCNACGLYFKLHQVN----------RPLTMRKD 334
Query: 118 GIQTRKRKPKKQGGGGGGGSG 138
GIQTR RK +G G +G
Sbjct: 335 GIQTRNRKASGKGKKKRGSNG 355
Score = 43.5 bits (101), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 23/29 (79%)
Query: 21 LWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
LWRR+ +G +CNACGLY K++ +NRPL
Sbjct: 302 LWRRNASGDPVCNACGLYFKLHQVNRPLT 330
>gi|190345835|gb|EDK37787.2| hypothetical protein PGUG_01885 [Meyerozyma guilliermondii ATCC
6260]
Length = 568
Score = 72.4 bits (176), Expect = 5e-11, Method: Composition-based stats.
Identities = 34/68 (50%), Positives = 43/68 (63%), Gaps = 3/68 (4%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK- 123
CTNC TR T LWRRN G+P+CNACGL+ KLH V RPL+++ ++ R G T+K
Sbjct: 418 CTNCHTRTTPLWRRNPQGQPLCNACGLFLKLHGVVRPLSLKTDVIKK-RQRGSGSTTKKP 476
Query: 124 -RKPKKQG 130
P K G
Sbjct: 477 TSTPSKDG 484
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 418 CTNCHTRTTPLWRRNPQGQPLCNACGLFLKLHGVVRPL 455
>gi|403174238|ref|XP_003333221.2| hypothetical protein PGTG_14141 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170888|gb|EFP88802.2| hypothetical protein PGTG_14141 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 419
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 37/93 (39%), Positives = 49/93 (52%), Gaps = 12/93 (12%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRL----------TAT 58
C NC TPLWRR LCNACG+++K++ +RP L K +K L T
Sbjct: 327 CFNCRGTQTPLWRRGPNDELLCNACGVFYKVHKKHRPATLSKYNKHLGSSTSATHSETGH 386
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGL 91
+ + CTNC T +WR+ DG +CNAC L
Sbjct: 387 KGPRIQCTNCDATATPMWRKAPDGSLLCNACAL 419
Score = 49.3 bits (116), Expect = 5e-04, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 26/40 (65%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM 104
C NC T LWRR + E +CNACG++YK+H +RP +
Sbjct: 327 CFNCRGTQTPLWRRGPNDELLCNACGVFYKVHKKHRPATL 366
Score = 42.0 bits (97), Expect = 0.079, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGL 37
+C NC A +TP+WR+ G LCNAC L
Sbjct: 392 QCTNCDATATPMWRKAPDGSLLCNACAL 419
>gi|63054447|ref|NP_588278.2| transcription factor Gaf1 [Schizosaccharomyces pombe 972h-]
gi|12644402|sp|Q10280.2|GAF1_SCHPO RecName: Full=Transcription factor gaf1; Short=Gaf-1
gi|157310519|emb|CAB40003.2| transcription factor Gaf1 [Schizosaccharomyces pombe]
Length = 855
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 46/76 (60%), Gaps = 16/76 (21%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC TR T LWRR+ DG+P+CNACGL+ K++ V +RPL+++ D I+ R R
Sbjct: 635 CTNCQTRTTPLWRRSPDGQPLCNACGLFMKINGV----------VRPLSLKTDVIKKRNR 684
Query: 125 ------KPKKQGGGGG 134
PK+ GG G
Sbjct: 685 GVGTSATPKQSGGRKG 700
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 27/38 (71%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR G LCNACGL+ K+NG+ RPL
Sbjct: 635 CTNCQTRTTPLWRRSPDGQPLCNACGLFMKINGVVRPL 672
>gi|402079663|gb|EJT74928.1| GATA factor SREP [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 581
Score = 72.0 bits (175), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 45/78 (57%), Gaps = 11/78 (14%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 265 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVH----------RPVTMKKSVIKRR 314
Query: 123 KRK-PKKQGGGGGGGSGG 139
KR P Q G G GG
Sbjct: 315 KRVLPASQQGSPAPGDGG 332
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 32/39 (82%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 267 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVT 305
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR+ G +CNACGLY K N RP ++
Sbjct: 109 GQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARPTNLK 152
Score = 51.6 bits (122), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 25/44 (56%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
Q G+ C NCG TPLWRR G +CNACGLY K RP
Sbjct: 105 QGQSGQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARP 148
>gi|108802600|gb|ABG21303.1| iron regulator 1 [Cryptococcus neoformans var. neoformans]
Length = 963
Score = 72.0 bits (175), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 38/54 (70%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
A +G+ C NCGT T LWRR+ +G P CNACGLY+KLH V RP+AM+ ++
Sbjct: 300 AATPVGMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPVAMKKTVIK 353
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/42 (59%), Positives = 30/42 (71%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPVA 346
>gi|58260324|ref|XP_567572.1| gata factor srep [Cryptococcus neoformans var. neoformans JEC21]
gi|134116222|ref|XP_773282.1| hypothetical protein CNBJ0600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50255904|gb|EAL18635.1| hypothetical protein CNBJ0600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229622|gb|AAW46055.1| gata factor srep, putative [Cryptococcus neoformans var. neoformans
JEC21]
Length = 1060
Score = 72.0 bits (175), Expect = 8e-11, Method: Composition-based stats.
Identities = 39/82 (47%), Positives = 47/82 (57%), Gaps = 11/82 (13%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
A +G+ C NCGT T LWRR+ +G P CNACGLY+KLH V RP+AM+K
Sbjct: 300 AATPVGMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVP----------RPVAMKK 349
Query: 117 DGIQTRKRKPKKQGGGGGGGSG 138
I+ RKR P G GG G
Sbjct: 350 TVIKRRKRVP-AVGSTSTGGRG 370
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL-GLCCT 66
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+ + +R+ + T
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPVAMKKTVIKRRKRVPAVGST 364
Query: 67 NCGTRMT 73
+ G R T
Sbjct: 365 STGGRGT 371
>gi|321263187|ref|XP_003196312.1| zinc finger protein regulator of siderophore biosynthesis
[Cryptococcus gattii WM276]
gi|317462787|gb|ADV24525.1| Zinc finger protein regulator of siderophore biosynthesis, putative
[Cryptococcus gattii WM276]
Length = 1015
Score = 72.0 bits (175), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%)
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+G+ C NCGT T LWRR+ +G P CNACGLY+KLH V RP+AM+ ++
Sbjct: 304 VGMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPVAMKKTVIK 353
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 42/70 (60%), Gaps = 9/70 (12%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKPSKRLTATRRL 61
G C NCG +TPLWRRD G CNACGLYHK++G+ RP ++K KR+ A +
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPVAMKKTVIKRRKRVPA---V 361
Query: 62 GLCCTNCGTR 71
G T+ G R
Sbjct: 362 GSTSTSTGGR 371
>gi|241949727|ref|XP_002417586.1| negative regulator of iron uptake genes, putative [Candida
dubliniensis CD36]
gi|223640924|emb|CAX45241.1| negative regulator of iron uptake genes, putative [Candida
dubliniensis CD36]
Length = 523
Score = 71.6 bits (174), Expect = 9e-11, Method: Composition-based stats.
Identities = 55/196 (28%), Positives = 77/196 (39%), Gaps = 74/196 (37%)
Query: 4 QFGEGRECVNCGAISTPLWRR--DGT---------------------------------- 27
Q +G++C NCG TPLWRR DGT
Sbjct: 49 QPTDGQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLKRPPNTIAVVKEE 108
Query: 28 --------------GHYLCNACGLYHK------------MNGMNRPLVKPSKRLTAT--- 58
G C C Y+ N P+ + KR T++
Sbjct: 109 EGSCKGDGRCNGTGGSAACKGCPAYNNRIVAKKTLEKSPKNDSRAPIDRSLKRSTSSDGA 168
Query: 59 ----RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP-LA 113
L + C NCGT +T LWRR++ G +CNACGL+Y+LH +RP+ M+ RP +
Sbjct: 169 TEDESSLAIACFNCGTTITPLWRRDDAGNTICNACGLFYRLHGSHRPIKMK----RPTIK 224
Query: 114 MRKDGIQTRKRKPKKQ 129
RK + +K K + Q
Sbjct: 225 RRKRNVSDKKSKDEVQ 240
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 33/81 (40%), Positives = 46/81 (56%), Gaps = 7/81 (8%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP---LAMRKDG 118
G C+NCGT T LWRR DG +CNACGLYY+ +N +RP+ ++ RP +A+ K+
Sbjct: 53 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNTHRPVNLK----RPPNTIAVVKEE 108
Query: 119 IQTRKRKPKKQGGGGGGGSGG 139
+ K + G GG G
Sbjct: 109 EGSCKGDGRCNGTGGSAACKG 129
>gi|389738594|gb|EIM79791.1| glucocorticoid receptor-like protein, partial [Stereum hirsutum
FP-91666 SS1]
Length = 163
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLT-----ATRRLGLCC 65
C+NCG TPLWR+ CNA GLY + ++ L + RL + + C
Sbjct: 1 CLNCGVTHTPLWRKGLNNELNCNAGGLYCQSAYLSPRLHHVTNRLQFLHFFISPAVTAKC 60
Query: 66 TNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
NC T T WR++++G+ VCN CGLY+KLH R ++M+ +R
Sbjct: 61 YNCNTTTTPPWRKDDEGKTVCNVCGLYFKLHRSPRLISMKSDVIR 105
Score = 43.9 bits (102), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 22/33 (66%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 42
+C NC +TP WR+D G +CN CGLY K++
Sbjct: 59 KCYNCNTTTTPPWRKDDEGKTVCNVCGLYFKLH 91
>gi|124111290|gb|ABM92043.1| GATA3 [Pan troglodytes]
Length = 35
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 31/35 (88%), Positives = 33/35 (94%)
Query: 21 LWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRL 55
LWRRDGTGHYLCNACGLYHKMNG NRPL+KP +RL
Sbjct: 1 LWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRL 35
Score = 40.0 bits (92), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 17/28 (60%), Positives = 22/28 (78%)
Query: 75 LWRRNNDGEPVCNACGLYYKLHNVNRPL 102
LWRR+ G +CNACGLY+K++ NRPL
Sbjct: 1 LWRRDGTGHYLCNACGLYHKMNGQNRPL 28
>gi|320581748|gb|EFW95967.1| GATA-type transcription factor [Ogataea parapolymorpha DL-1]
Length = 565
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/70 (44%), Positives = 45/70 (64%), Gaps = 10/70 (14%)
Query: 55 LTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAM 114
+ T + + CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL++
Sbjct: 384 IAGTEQGSIACTNCHTKTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSL 433
Query: 115 RKDGIQTRKR 124
+ D I+ R+R
Sbjct: 434 KTDVIKKRQR 443
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 394 CTNCHTKTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 431
>gi|62815898|emb|CAG27837.1| GATA-type transcription factor [Ogataea angusta]
gi|190360942|gb|ACE76877.1| GATA factor [Ogataea angusta]
Length = 565
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/70 (44%), Positives = 45/70 (64%), Gaps = 10/70 (14%)
Query: 55 LTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAM 114
+ T + + CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL++
Sbjct: 384 IAGTEQGSIACTNCHTKTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSL 433
Query: 115 RKDGIQTRKR 124
+ D I+ R+R
Sbjct: 434 KTDVIKKRQR 443
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 394 CTNCHTKTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 431
>gi|11182141|emb|CAC16182.1| putative GATA factor [Anopheles gambiae]
Length = 77
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/77 (50%), Positives = 43/77 (55%), Gaps = 9/77 (11%)
Query: 14 CGAISTPLWRRDGTGHYLCNACGLYHKMN-GMNRPLVKPSKRLTAT--------RRLGLC 64
CG+ TPLWRRD GH LCNAC LY + N G NRP + K RR G+
Sbjct: 1 CGSSDTPLWRRDIVGHTLCNACALYTRQNPGTNRPPNRSQKAKQTVKTPPAQGNRRSGVT 60
Query: 65 CTNCGTRMTTLWRRNND 81
C NC T TTLWRRNN
Sbjct: 61 CANCQTTTTTLWRRNNQ 77
>gi|336384681|gb|EGO25829.1| hypothetical protein SERLADRAFT_448749 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1944
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 713 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 762
Query: 125 KPKKQGGGGGGGS 137
G GS
Sbjct: 763 ASGAPNGSARKGS 775
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 713 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 750
>gi|254573912|ref|XP_002494065.1| Transcriptional activator of genes involved in nitrogen catabolite
repression [Komagataella pastoris GS115]
gi|238033864|emb|CAY71886.1| Transcriptional activator of genes involved in nitrogen catabolite
repression [Komagataella pastoris GS115]
gi|328354116|emb|CCA40513.1| Nitrogen catabolic enzyme regulatory protein [Komagataella pastoris
CBS 7435]
Length = 615
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 36/109 (33%), Positives = 58/109 (53%), Gaps = 24/109 (22%)
Query: 42 NGMNRPLVKPSKRLTATRR--------------LGLCCTNCGTRMTTLWRRNNDGEPVCN 87
+G+ RPL K+ T++R+ + + C+NC T+ T LWRR+ G+P+CN
Sbjct: 449 SGIRRPLSATKKKPTSSRKPKDDGMKLKETNNGIPISCSNCKTQTTPLWRRDPSGKPLCN 508
Query: 88 ACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQGGGGGGG 136
ACGL+ KLH ++RPL+++ D I+ R+R + G G
Sbjct: 509 ACGLFLKLHG----------SVRPLSLKTDVIKKRQRDKNNETKVNGTG 547
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD +G LCNACGL+ K++G RPL
Sbjct: 486 CSNCKTQTTPLWRRDPSGKPLCNACGLFLKLHGSVRPL 523
>gi|345571186|gb|EGX54001.1| hypothetical protein AOL_s00004g660 [Arthrobotrys oligospora ATCC
24927]
Length = 820
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 34/85 (40%), Positives = 45/85 (52%), Gaps = 10/85 (11%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR----------PLAM 114
CTNC T+ T LWRRN +G P+CNACGL+ KLH V RPL+++ ++ P+
Sbjct: 586 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPLSLKTDVIKKRNRGSGNSLPVGA 645
Query: 115 RKDGIQTRKRKPKKQGGGGGGGSGG 139
+ RKP Q SGG
Sbjct: 646 SPSRTKKTIRKPSSQNARASAASGG 670
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 25/45 (55%), Positives = 31/45 (68%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
Q G C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 579 QNGLPTTCTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 623
>gi|448515041|ref|XP_003867232.1| Gat1 zinc finger transcriptional regulator [Candida orthopsilosis
Co 90-125]
gi|380351571|emb|CCG21794.1| Gat1 zinc finger transcriptional regulator [Candida orthopsilosis]
Length = 761
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 27/44 (61%), Positives = 35/44 (79%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G CTNC T+ T LWRRN +G+P+CNACGL+ KLH V RPL+++
Sbjct: 606 GPSCTNCHTKTTPLWRRNPEGQPLCNACGLFLKLHGVTRPLSLK 649
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 23/41 (56%), Positives = 29/41 (70%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 606 GPSCTNCHTKTTPLWRRNPEGQPLCNACGLFLKLHGVTRPL 646
>gi|332029519|gb|EGI69408.1| GATA-binding factor C [Acromyrmex echinatior]
Length = 67
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/44 (72%), Positives = 34/44 (77%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVN 99
+A RR G C NC T TTLWRRN GEPVCNACGLYYKLHNV+
Sbjct: 21 SAARRAGTSCANCKTATTTLWRRNQSGEPVCNACGLYYKLHNVS 64
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 27/38 (71%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMN 45
G C NC +T LWRR+ +G +CNACGLY+K++ ++
Sbjct: 27 GTSCANCKTATTTLWRRNQSGEPVCNACGLYYKLHNVS 64
>gi|406698929|gb|EKD02150.1| transcriptional activator [Trichosporon asahii var. asahii CBS 8904]
Length = 1273
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 35/75 (46%), Positives = 46/75 (61%), Gaps = 11/75 (14%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 1148 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 1197
Query: 125 -KPKKQGGGGGGGSG 138
P K+ G G G
Sbjct: 1198 AAPTKESSSGRKGRG 1212
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1148 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1185
>gi|401889257|gb|EJT53195.1| transcriptional activator [Trichosporon asahii var. asahii CBS 2479]
Length = 1273
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 35/75 (46%), Positives = 46/75 (61%), Gaps = 11/75 (14%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 1148 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 1197
Query: 125 -KPKKQGGGGGGGSG 138
P K+ G G G
Sbjct: 1198 AAPTKESSSGRKGRG 1212
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1148 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1185
>gi|405122586|gb|AFR97352.1| iron regulator 1 [Cryptococcus neoformans var. grubii H99]
Length = 1019
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 39/82 (47%), Positives = 47/82 (57%), Gaps = 11/82 (13%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
A +G+ C NCGT T LWRR+ +G P CNACGLY+KLH V RP+AM+K
Sbjct: 300 AATPVGMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVP----------RPVAMKK 349
Query: 117 DGIQTRKRKPKKQGGGGGGGSG 138
I+ RKR P G GG G
Sbjct: 350 TVIKRRKRVP-AVGSTSTGGRG 370
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 48/148 (32%), Positives = 61/148 (41%), Gaps = 18/148 (12%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL-GLCCT 66
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+ + +R+ + T
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPVAMKKTVIKRRKRVPAVGST 364
Query: 67 NCGTRMTTLWRRNNDGEPVC----------NACGLYYKLHNVNRPLAMR-------HLAM 109
+ G R T + P A L K H + P R H
Sbjct: 365 STGGRGTNAELPSPASAPASVPTVTAPPPHVAPPLDDKAHRASPPFGHRAPQPHSEHRIN 424
Query: 110 RPLAMRKDGIQTRKRKPKKQGGGGGGGS 137
RPL G+ R KP G GS
Sbjct: 425 RPLGPEAYGLAGRYGKPSTPAGMNLPGS 452
>gi|336371932|gb|EGO00272.1| hypothetical protein SERLA73DRAFT_107315 [Serpula lacrymans var.
lacrymans S7.3]
Length = 2013
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 743 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 792
Query: 125 KPKKQGGGGGGGS 137
G GS
Sbjct: 793 ASGAPNGSARKGS 805
Score = 59.7 bits (143), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 743 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 780
>gi|108802602|gb|ABG21304.1| iron regulator 1 [Cryptococcus neoformans var. grubii]
Length = 952
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 39/82 (47%), Positives = 47/82 (57%), Gaps = 11/82 (13%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
A +G+ C NCGT T LWRR+ +G P CNACGLY+KLH V RP+AM+K
Sbjct: 300 AATPVGMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVP----------RPVAMKK 349
Query: 117 DGIQTRKRKPKKQGGGGGGGSG 138
I+ RKR P G GG G
Sbjct: 350 TVIKRRKRVP-AVGSTSTGGRG 370
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 48/148 (32%), Positives = 61/148 (41%), Gaps = 18/148 (12%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL-GLCCT 66
G C NCG +TPLWRRD G CNACGLYHK++G+ RP+ + +R+ + T
Sbjct: 305 GMSCRNCGTSTTPLWRRDEEGRPQCNACGLYHKLHGVPRPVAMKKTVIKRRKRVPAVGST 364
Query: 67 NCGTRMTTLWRRNNDGEPVC----------NACGLYYKLHNVNRPLAMR-------HLAM 109
+ G R T + P A L K H + P R H
Sbjct: 365 STGGRGTNAELPSPASAPASVPTVTAPPPHVAPPLDDKAHRASPPFGHRAPQPHSEHRIN 424
Query: 110 RPLAMRKDGIQTRKRKPKKQGGGGGGGS 137
RPL G+ R KP G GS
Sbjct: 425 RPLGPEAYGLAGRYGKPSTPAGMNLPGS 452
>gi|344231972|gb|EGV63851.1| hypothetical protein CANTEDRAFT_113880 [Candida tenuis ATCC 10573]
Length = 569
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 41/60 (68%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R+R
Sbjct: 384 CTNCHTKTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRQR 433
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 384 CTNCHTKTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 421
>gi|260941173|ref|XP_002614753.1| hypothetical protein CLUG_05531 [Clavispora lusitaniae ATCC 42720]
gi|238851939|gb|EEQ41403.1| hypothetical protein CLUG_05531 [Clavispora lusitaniae ATCC 42720]
Length = 603
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 39/60 (65%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC TR T LWRRN GEP+CNACGL+ KLH +RPL+++ D I+ R+R
Sbjct: 388 CTNCHTRTTPLWRRNPQGEPLCNACGLFLKLHG----------TVRPLSLKTDVIKKRQR 437
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 22/39 (56%), Positives = 28/39 (71%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
+C NC +TPLWRR+ G LCNACGL+ K++G RPL
Sbjct: 387 QCTNCHTRTTPLWRRNPQGEPLCNACGLFLKLHGTVRPL 425
>gi|384495933|gb|EIE86424.1| global nitrogen regulator protein [Rhizopus delemar RA 99-880]
Length = 617
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 39/60 (65%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNCGT T LWRRN +G P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 467 CTNCGTTTTPLWRRNPEGHPLCNACGLFLKLHGV----------VRPLSLKTDIIKKRNR 516
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 24/39 (61%), Positives = 31/39 (79%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
+C NCG +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 466 KCTNCGTTTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 504
>gi|320036931|gb|EFW18869.1| GATA transcriptional activator AreA [Coccidioides posadasii str.
Silveira]
Length = 760
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 38/98 (38%), Positives = 51/98 (52%), Gaps = 16/98 (16%)
Query: 42 NGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
N +RP + T C NC T+ T LWRRN +G+P+CNACGL+ KLH V
Sbjct: 511 NNPSRPASPGPAKATEQPNAPTTCANCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV--- 567
Query: 102 LAMRHLAMRPLAMRKDGIQTRKRKPKKQGGGGGGGSGG 139
+RPL+++ D I+ R R GGG +GG
Sbjct: 568 -------VRPLSLKTDVIKKRNR------GGGNNMAGG 592
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 534 CANCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 571
>gi|303324011|ref|XP_003071993.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
gi|240111703|gb|EER29848.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
Length = 760
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 38/98 (38%), Positives = 51/98 (52%), Gaps = 16/98 (16%)
Query: 42 NGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
N +RP + T C NC T+ T LWRRN +G+P+CNACGL+ KLH V
Sbjct: 511 NNPSRPASPGPAKATEQPNAPTTCANCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV--- 567
Query: 102 LAMRHLAMRPLAMRKDGIQTRKRKPKKQGGGGGGGSGG 139
+RPL+++ D I+ R R GGG +GG
Sbjct: 568 -------VRPLSLKTDVIKKRNR------GGGNNMAGG 592
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 534 CANCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 571
>gi|156844572|ref|XP_001645348.1| hypothetical protein Kpol_1058p27 [Vanderwaltozyma polyspora DSM
70294]
gi|156116009|gb|EDO17490.1| hypothetical protein Kpol_1058p27 [Vanderwaltozyma polyspora DSM
70294]
Length = 484
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 31/67 (46%), Positives = 41/67 (61%), Gaps = 10/67 (14%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NC T T LWRR+ G P+CNACGL+YKLH V MRPL+++ D I+ R
Sbjct: 398 MICDNCKTTNTPLWRRDPIGNPLCNACGLFYKLHGV----------MRPLSLKTDTIKKR 447
Query: 123 KRKPKKQ 129
+R K+
Sbjct: 448 QRYSNKR 454
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 33/85 (38%), Positives = 46/85 (54%), Gaps = 6/85 (7%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL------VKPSKRLTA 57
Q+ + C NC +TPLWRRD G+ LCNACGL++K++G+ RPL +K +R +
Sbjct: 393 QYNDDMICDNCKTTNTPLWRRDPIGNPLCNACGLFYKLHGVMRPLSLKTDTIKKRQRYSN 452
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDG 82
RR N T NN+G
Sbjct: 453 KRRRSHVIVNDKGSSTKTNSTNNNG 477
>gi|448507977|ref|XP_003865868.1| Gzf3 transcription factor [Candida orthopsilosis Co 90-125]
gi|380350206|emb|CCG20426.1| Gzf3 transcription factor [Candida orthopsilosis Co 90-125]
Length = 830
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 41/102 (40%), Positives = 56/102 (54%), Gaps = 19/102 (18%)
Query: 45 NRPLVKPSKRL----TATRRLGL---CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHN 97
N P VKP+ + T+T + L C NC T+ T LWRR+ G+ +CNACGL+ KLH
Sbjct: 197 NHPNVKPTSHVNNDGTSTSKANLSSPVCRNCKTQTTPLWRRDETGQVLCNACGLFLKLH- 255
Query: 98 VNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQGGGGGGGSGG 139
RP RP++++ D I++R R KQG G S G
Sbjct: 256 -GRP--------RPISLKTDTIKSRNR--VKQGAAGANSSKG 286
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 24/51 (47%), Positives = 33/51 (64%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NC +TPLWRRD TG LCNACGL+ K++G RP+ + + + R+
Sbjct: 224 CRNCKTQTTPLWRRDETGQVLCNACGLFLKLHGRPRPISLKTDTIKSRNRV 274
>gi|113206467|gb|ABI34463.1| AreA [Talaromyces marneffei]
Length = 883
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 43/68 (63%), Gaps = 15/68 (22%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 648 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 697
Query: 125 KPKKQGGG 132
GGG
Sbjct: 698 -----GGG 700
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 23/43 (53%), Positives = 29/43 (67%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 643 GAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 685
>gi|393217291|gb|EJD02780.1| glucocorticoid receptor-like protein [Fomitiporia mediterranea
MF3/22]
Length = 173
Score = 69.7 bits (169), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 37/47 (78%)
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
CTNC TR T LWRR+ +G+P+CNACGL+YKLH V RPL+++ ++
Sbjct: 54 VCTNCQTRNTPLWRRDPEGQPLCNACGLFYKLHGVTRPLSLKTDVIK 100
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 55 CTNCQTRNTPLWRRDPEGQPLCNACGLFYKLHGVTRPL 92
>gi|346325730|gb|EGX95327.1| Nitrogen regulatory protein areA [Cordyceps militaris CM01]
Length = 857
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 33/70 (47%), Positives = 43/70 (61%), Gaps = 10/70 (14%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 614 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 663
Query: 125 KPKKQGGGGG 134
GG G
Sbjct: 664 GSGNNAGGRG 673
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 23/43 (53%), Positives = 29/43 (67%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 609 GNPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 651
>gi|242809077|ref|XP_002485294.1| GATA transcriptional activator AreA [Talaromyces stipitatus ATCC
10500]
gi|218715919|gb|EED15341.1| GATA transcriptional activator AreA [Talaromyces stipitatus ATCC
10500]
Length = 863
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 43/68 (63%), Gaps = 15/68 (22%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 630 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 679
Query: 125 KPKKQGGG 132
GGG
Sbjct: 680 -----GGG 682
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 23/43 (53%), Positives = 29/43 (67%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 625 GAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 667
>gi|402217264|gb|EJT97345.1| hypothetical protein DACRYDRAFT_102444 [Dacryopinax sp. DJM-731 SS1]
Length = 1326
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 32/74 (43%), Positives = 44/74 (59%), Gaps = 10/74 (13%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ +G P+CNACGL++KLH V +RPL+++ D I+ R R
Sbjct: 1199 CTNCQTTNTPLWRRDAEGNPLCNACGLFFKLHGV----------VRPLSLKTDVIKKRNR 1248
Query: 125 KPKKQGGGGGGGSG 138
+ GG +G
Sbjct: 1249 TSGQPGGSATRKAG 1262
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G+ LCNACGL+ K++G+ RPL
Sbjct: 1199 CTNCQTTNTPLWRRDAEGNPLCNACGLFFKLHGVVRPL 1236
>gi|213402573|ref|XP_002172059.1| iron-sensing transcription factor 1 [Schizosaccharomyces japonicus
yFS275]
gi|212000106|gb|EEB05766.1| iron-sensing transcription factor 1 [Schizosaccharomyces japonicus
yFS275]
Length = 519
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
G C NCGT T LWRR+ G P+CNACGLYYK+H V+RP+ M+ ++
Sbjct: 177 GTYCQNCGTTTTPLWRRDESGNPICNACGLYYKIHGVHRPVTMKKAVIK 225
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/42 (57%), Positives = 34/42 (80%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
G C NCG +TPLWRRD +G+ +CNACGLY+K++G++RP+
Sbjct: 177 GTYCQNCGTTTTPLWRRDESGNPICNACGLYYKIHGVHRPVT 218
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
C NC +++T WRR D +CNACGLY+K + RP+
Sbjct: 11 VCANCKSQITAQWRRGPDNSILCNACGLYFKTRDAPRPV 49
Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 22/42 (52%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
E + C NC + T WRR LCNACGLY K RP+
Sbjct: 8 EPQVCANCKSQITAQWRRGPDNSILCNACGLYFKTRDAPRPV 49
>gi|354547100|emb|CCE43833.1| hypothetical protein CPAR2_500590 [Candida parapsilosis]
Length = 769
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 26/44 (59%), Positives = 35/44 (79%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NC T+ T LWRRN +G+P+CNACGL+ KLH V RPL+++
Sbjct: 596 GQTCSNCQTKTTPLWRRNPEGQPLCNACGLFLKLHGVTRPLSLK 639
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 23/41 (56%), Positives = 30/41 (73%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G+ C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 596 GQTCSNCQTKTTPLWRRNPEGQPLCNACGLFLKLHGVTRPL 636
>gi|393245125|gb|EJD52636.1| hypothetical protein AURDEDRAFT_158369 [Auricularia delicata
TFB-10046 SS5]
Length = 1111
Score = 69.3 bits (168), Expect = 4e-10, Method: Composition-based stats.
Identities = 27/41 (65%), Positives = 34/41 (82%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
CTNC T T LWRR+ DG+P+CNACGL+YKLH V RPL+++
Sbjct: 1033 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGVVRPLSLK 1073
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 24/43 (55%), Positives = 31/43 (72%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G+ C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1028 GDVTVCTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGVVRPL 1070
>gi|444323739|ref|XP_004182510.1| hypothetical protein TBLA_0I03360 [Tetrapisispora blattae CBS 6284]
gi|387515557|emb|CCH62991.1| hypothetical protein TBLA_0I03360 [Tetrapisispora blattae CBS 6284]
Length = 1061
Score = 69.3 bits (168), Expect = 4e-10, Method: Composition-based stats.
Identities = 26/47 (55%), Positives = 35/47 (74%)
Query: 59 RRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
R+ + C NC T T LWRR+NDG +CNACGL+ KLH ++RPL+M+
Sbjct: 417 RKQQIVCGNCKTTKTPLWRRDNDGNTLCNACGLFQKLHGISRPLSMK 463
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/53 (50%), Positives = 35/53 (66%), Gaps = 3/53 (5%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPS---KRLTATRR 60
C NC TPLWRRD G+ LCNACGL+ K++G++RPL + KR A +R
Sbjct: 423 CGNCKTTKTPLWRRDNDGNTLCNACGLFQKLHGISRPLSMKTDVIKRRNAKKR 475
>gi|212537481|ref|XP_002148896.1| GATA transcriptional activator AreA [Talaromyces marneffei ATCC
18224]
gi|210068638|gb|EEA22729.1| GATA transcriptional activator AreA [Talaromyces marneffei ATCC
18224]
Length = 854
Score = 69.3 bits (168), Expect = 4e-10, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 43/68 (63%), Gaps = 15/68 (22%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 619 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 668
Query: 125 KPKKQGGG 132
GGG
Sbjct: 669 -----GGG 671
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 23/43 (53%), Positives = 29/43 (67%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 614 GAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 656
>gi|344305512|gb|EGW35744.1| hypothetical protein SPAPADRAFT_58947 [Spathaspora passalidarum
NRRL Y-27907]
Length = 355
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 43/63 (68%), Gaps = 10/63 (15%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G+ C+NC TR T LWRR+ +G+P+CNACGL+ KLH V +RPL+++ D I+
Sbjct: 272 GVSCSNCHTRTTPLWRRDPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKK 321
Query: 122 RKR 124
R+R
Sbjct: 322 RQR 324
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC +TPLWRRD G LCNACGL+ K++G+ RPL
Sbjct: 272 GVSCSNCHTRTTPLWRRDPEGQPLCNACGLFLKLHGVVRPL 312
>gi|367045274|ref|XP_003653017.1| hypothetical protein THITE_2114956 [Thielavia terrestris NRRL 8126]
gi|347000279|gb|AEO66681.1| hypothetical protein THITE_2114956 [Thielavia terrestris NRRL 8126]
Length = 564
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 34/70 (48%), Positives = 42/70 (60%), Gaps = 10/70 (14%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 262 IACQNCGTTVTPLWRRDEAGHTICNACGLYYKLHGVH----------RPVTMKKSIIKRR 311
Query: 123 KRKPKKQGGG 132
KR GGG
Sbjct: 312 KRVIPASGGG 321
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 35/51 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 264 CQNCGTTVTPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKSIIKRRKRV 314
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 31/70 (44%), Positives = 42/70 (60%), Gaps = 6/70 (8%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G C+NCGT T LWRR+ GE +CNACGLY K N RP+ ++ RP ++ +G++
Sbjct: 104 GQICSNCGTTQTPLWRRSPQGETICNACGLYLKARNAARPINLK----RPPSVTTNGVRQ 159
Query: 122 R--KRKPKKQ 129
K PK Q
Sbjct: 160 SLVKLSPKAQ 169
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 25/41 (60%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G+ C NCG TPLWRR G +CNACGLY K RP+
Sbjct: 104 GQICSNCGTTQTPLWRRSPQGETICNACGLYLKARNAARPI 144
>gi|389740007|gb|EIM81199.1| hypothetical protein STEHIDRAFT_142426 [Stereum hirsutum FP-91666
SS1]
Length = 1696
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 1553 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 1602
Query: 125 KPKKQGGGGGGGS 137
G GS
Sbjct: 1603 ASGASAANGRKGS 1615
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1553 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1590
>gi|336263677|ref|XP_003346618.1| hypothetical protein SMAC_04791 [Sordaria macrospora k-hell]
Length = 542
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 34/71 (47%), Positives = 42/71 (59%), Gaps = 10/71 (14%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 234 IACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVH----------RPVTMKKAIIKRR 283
Query: 123 KRKPKKQGGGG 133
KR GG G
Sbjct: 284 KRVIPAAGGEG 294
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 35/51 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 236 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKAIIKRRKRV 286
>gi|353242644|emb|CCA74270.1| hypothetical protein PIIN_08223 [Piriformospora indica DSM 11827]
Length = 1358
Score = 68.9 bits (167), Expect = 5e-10, Method: Composition-based stats.
Identities = 26/41 (63%), Positives = 34/41 (82%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
CTNC T T LWRRN +G+P+CNACGL++KLH V RPL+++
Sbjct: 1177 CTNCSTTTTPLWRRNPEGQPLCNACGLFFKLHGVTRPLSLK 1217
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 37/64 (57%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGT 70
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL + + R G TN
Sbjct: 1177 CTNCSTTTTPLWRRNPEGQPLCNACGLFFKLHGVTRPLSLKTDVIKKRNRNGATLTNPSR 1236
Query: 71 RMTT 74
+ TT
Sbjct: 1237 KSTT 1240
>gi|50312349|ref|XP_456208.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49645344|emb|CAG98916.1| KLLA0F25300p [Kluyveromyces lactis]
Length = 692
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 41/64 (64%), Gaps = 10/64 (15%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T+ T LWRR+ G P+CNACGL+ KLH V +RPL+++KD I+ R+R
Sbjct: 471 CSNCHTKTTPLWRRDPQGNPLCNACGLFLKLHGV----------VRPLSLKKDVIKKRQR 520
Query: 125 KPKK 128
K
Sbjct: 521 SSNK 524
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 30/39 (76%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
+C NC +TPLWRRD G+ LCNACGL+ K++G+ RPL
Sbjct: 470 QCSNCHTKTTPLWRRDPQGNPLCNACGLFLKLHGVVRPL 508
>gi|340517662|gb|EGR47905.1| Zn-finger gata type domain-containing protein [Trichoderma reesei
QM6a]
Length = 554
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 48/80 (60%), Gaps = 14/80 (17%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR------PLAMRK 116
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP+ M+ ++ P A +
Sbjct: 251 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKATIKRRKRVIPAAQDE 310
Query: 117 DG--------IQTRKRKPKK 128
+G Q++++ P++
Sbjct: 311 EGEEAMEGVETQSQEKTPER 330
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ K+ T RR
Sbjct: 253 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTM--KKATIKRR 300
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 28/41 (68%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
C+NCGT T LWRR+ G +CNACGLY K N RP +++
Sbjct: 95 CSNCGTTRTPLWRRSPQGATICNACGLYLKARNAARPTSLK 135
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 28/50 (56%), Gaps = 2/50 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLTAT 58
C NCG TPLWRR G +CNACGLY K RP L KP + +T
Sbjct: 95 CSNCGTTRTPLWRRSPQGATICNACGLYLKARNAARPTSLKKPPNLVPST 144
>gi|71020337|ref|XP_760399.1| hypothetical protein UM04252.1 [Ustilago maydis 521]
gi|46100068|gb|EAK85301.1| hypothetical protein UM04252.1 [Ustilago maydis 521]
Length = 3893
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 35/88 (39%), Positives = 49/88 (55%), Gaps = 1/88 (1%)
Query: 51 PSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
PS AT C+NC T T LWRR+ +G+P+CNACGL+ KLH V RPL+++ ++
Sbjct: 1583 PSASSMATAEPPTVCSNCHTTKTPLWRRDPEGQPLCNACGLFLKLHGVVRPLSLKTDVIK 1642
Query: 111 PLAMRKDGIQTRKRKPKKQGGGGGGGSG 138
R G Q + + G GGG +
Sbjct: 1643 K-RNRASGAQRADARARASIGSGGGANA 1669
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 27/38 (71%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC TPLWRRD G LCNACGL+ K++G+ RPL
Sbjct: 1597 CSNCHTTKTPLWRRDPEGQPLCNACGLFLKLHGVVRPL 1634
>gi|294657093|ref|XP_459410.2| DEHA2E01826p [Debaryomyces hansenii CBS767]
gi|199432441|emb|CAG87621.2| DEHA2E01826p [Debaryomyces hansenii CBS767]
Length = 636
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 34/80 (42%), Positives = 45/80 (56%), Gaps = 14/80 (17%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ CTNC T T LWRR+ G+P+CNACGL+ KLH V +RPL+++ D I+ R
Sbjct: 455 ISCTNCNTTATPLWRRDPKGKPLCNACGLFLKLHGV----------VRPLSLKTDVIKKR 504
Query: 123 KR----KPKKQGGGGGGGSG 138
+R KK G G G
Sbjct: 505 QRGSNTSSKKSVSGPTDGDG 524
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL+ K++G+ RPL
Sbjct: 457 CTNCNTTATPLWRRDPKGKPLCNACGLFLKLHGVVRPL 494
>gi|170044815|ref|XP_001850028.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867946|gb|EDS31329.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 119
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 33/49 (67%), Positives = 36/49 (73%), Gaps = 3/49 (6%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM 104
+A RR G C NC T TTLWRRN GEPVCNACGLYYKLHN P++M
Sbjct: 9 SAARRAGTSCANCKTTTTTLWRRNQGGEPVCNACGLYYKLHN---PVSM 54
Score = 43.9 bits (102), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 24/35 (68%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMN 42
G C NC +T LWRR+ G +CNACGLY+K++
Sbjct: 15 GTSCANCKTTTTTLWRRNQGGEPVCNACGLYYKLH 49
>gi|380090512|emb|CCC11809.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 554
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 34/71 (47%), Positives = 42/71 (59%), Gaps = 10/71 (14%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 246 IACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVH----------RPVTMKKAIIKRR 295
Query: 123 KRKPKKQGGGG 133
KR GG G
Sbjct: 296 KRVIPAAGGEG 306
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 35/51 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 248 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKAIIKRRKRV 298
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 24/44 (54%), Positives = 28/44 (63%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR+ G +CNACGLY K N RP +R
Sbjct: 82 GQVCSNCGTTHTPLWRRSPLGAIICNACGLYLKARNAARPANIR 125
Score = 51.6 bits (122), Expect = 9e-05, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 24/40 (60%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G+ C NCG TPLWRR G +CNACGLY K RP
Sbjct: 82 GQVCSNCGTTHTPLWRRSPLGAIICNACGLYLKARNAARP 121
>gi|8096353|dbj|BAA96108.1| transcription factor ScGATA-6 [Schizophyllum commune]
Length = 1075
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 45/77 (58%), Gaps = 10/77 (12%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ CTNC T T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R
Sbjct: 955 ILCTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKR 1004
Query: 123 KRKPKKQGGGGGGGSGG 139
R GG S G
Sbjct: 1005 NRASGAPNGGNRKNSTG 1021
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 957 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 994
>gi|392927094|ref|NP_001257105.1| Protein ELT-4, isoform a [Caenorhabditis elegans]
gi|22265829|emb|CAD44111.1| Protein ELT-4, isoform a [Caenorhabditis elegans]
Length = 72
Score = 68.9 bits (167), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 38/51 (74%)
Query: 55 LTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
L A+ R L C+NC TTLWRR +G+PVCNACGLY+KLH+V RP+ M+
Sbjct: 6 LDASHRKRLVCSNCNGTNTTLWRRKAEGDPVCNACGLYFKLHHVTRPIPMK 56
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 28/48 (58%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
+D + C NC +T LWRR G +CNACGLY K++ + RP+
Sbjct: 6 LDASHRKRLVCSNCNGTNTTLWRRKAEGDPVCNACGLYFKLHHVTRPI 53
>gi|258569170|ref|XP_002585329.1| nitrogen regulatory protein areA [Uncinocarpus reesii 1704]
gi|237906775|gb|EEP81176.1| nitrogen regulatory protein areA [Uncinocarpus reesii 1704]
Length = 1107
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 34/75 (45%), Positives = 44/75 (58%), Gaps = 15/75 (20%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 877 CANCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 926
Query: 125 KPKKQGGGGGGGSGG 139
GGG S G
Sbjct: 927 -----GGGNNTTSAG 936
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 32/83 (38%), Positives = 45/83 (54%), Gaps = 3/83 (3%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL-VKPSKRLTATRRLGLCCTNCG 69
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL +K R G T+ G
Sbjct: 877 CANCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRNRGGGNNTTSAG 936
Query: 70 TRMTT--LWRRNNDGEPVCNACG 90
T+ + R+ + +P N G
Sbjct: 937 ATRTSKKISRKQSVQQPTSNPVG 959
>gi|149234449|ref|XP_001523104.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146453213|gb|EDK47469.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1093
Score = 68.6 bits (166), Expect = 7e-10, Method: Composition-based stats.
Identities = 29/63 (46%), Positives = 41/63 (65%), Gaps = 10/63 (15%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G+ C+NC T+ T LWRRN G+P+CNACGL+ KLH +RPL+++ D I+
Sbjct: 769 GVSCSNCHTKTTPLWRRNPQGQPLCNACGLFLKLHG----------TVRPLSLKTDVIKK 818
Query: 122 RKR 124
R+R
Sbjct: 819 RQR 821
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 23/41 (56%), Positives = 28/41 (68%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC +TPLWRR+ G LCNACGL+ K++G RPL
Sbjct: 769 GVSCSNCHTKTTPLWRRNPQGQPLCNACGLFLKLHGTVRPL 809
>gi|389634415|ref|XP_003714860.1| GATA factor SREP [Magnaporthe oryzae 70-15]
gi|351647193|gb|EHA55053.1| GATA factor SREP [Magnaporthe oryzae 70-15]
Length = 539
Score = 68.6 bits (166), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 35/48 (72%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP+ M+ ++
Sbjct: 265 IACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKAVIK 312
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 31/39 (79%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 267 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVT 305
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G C+NCGT T LWRR+ G +CNACGLY K N RP ++ RPL + G T
Sbjct: 111 GQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARPTNLK----RPLNIVASG--T 164
Query: 122 RKRKPKKQGGGGGGGS 137
+ K G G GS
Sbjct: 165 PRAADKSAGKGAQSGS 180
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 25/44 (56%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
Q G+ C NCG TPLWRR G +CNACGLY K RP
Sbjct: 107 QTPSGQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARP 150
>gi|302680356|ref|XP_003029860.1| transcription factor scgata-6 [Schizophyllum commune H4-8]
gi|300103550|gb|EFI94957.1| transcription factor scgata-6 [Schizophyllum commune H4-8]
Length = 1075
Score = 68.6 bits (166), Expect = 7e-10, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 45/77 (58%), Gaps = 10/77 (12%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ CTNC T T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R
Sbjct: 955 ILCTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKR 1004
Query: 123 KRKPKKQGGGGGGGSGG 139
R GG S G
Sbjct: 1005 NRASGAPNGGNRKNSTG 1021
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 957 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 994
>gi|302894399|ref|XP_003046080.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256727007|gb|EEU40367.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 559
Score = 68.6 bits (166), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 35/48 (72%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP+ M+ ++
Sbjct: 256 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTMKKSTIK 303
Score = 65.1 bits (157), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ K+ T RR
Sbjct: 258 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTM--KKSTIKRR 305
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 29/44 (65%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR+ G +CNACGLY K N RP +++
Sbjct: 98 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNAARPTSLK 141
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 24/40 (60%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G+ C NCG TPLWRR G +CNACGLY K RP
Sbjct: 98 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNAARP 137
>gi|440639377|gb|ELR09296.1| hypothetical protein GMDG_03864 [Geomyces destructans 20631-21]
Length = 555
Score = 68.6 bits (166), Expect = 7e-10, Method: Composition-based stats.
Identities = 33/70 (47%), Positives = 41/70 (58%), Gaps = 10/70 (14%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 268 VACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVH----------RPVTMKKSVIKRR 317
Query: 123 KRKPKKQGGG 132
KR GG
Sbjct: 318 KRVVPAAAGG 327
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 36/51 (70%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 270 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKSVIKRRKRV 320
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 24/44 (54%), Positives = 29/44 (65%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR+ G +CNACGLY K N +RP M+
Sbjct: 111 GQICSNCGTTSTPLWRRSPQGATICNACGLYQKARNASRPTMMK 154
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 23/40 (57%), Positives = 26/40 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G+ C NCG STPLWRR G +CNACGLY K +RP
Sbjct: 111 GQICSNCGTTSTPLWRRSPQGATICNACGLYQKARNASRP 150
>gi|440467530|gb|ELQ36746.1| GATA factor SREP [Magnaporthe oryzae Y34]
gi|440488667|gb|ELQ68382.1| GATA factor SREP [Magnaporthe oryzae P131]
Length = 539
Score = 68.6 bits (166), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 35/48 (72%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP+ M+ ++
Sbjct: 265 IACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKAVIK 312
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 31/39 (79%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 267 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVT 305
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G C+NCGT T LWRR+ G +CNACGLY K N RP ++ RPL + G T
Sbjct: 111 GQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARPTNLK----RPLNIVASG--T 164
Query: 122 RKRKPKKQGGGGGGGS 137
+ K G G GS
Sbjct: 165 PRAADKSAGKGAQSGS 180
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 25/44 (56%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
Q G+ C NCG TPLWRR G +CNACGLY K RP
Sbjct: 107 QTPSGQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNAARP 150
>gi|395332895|gb|EJF65273.1| hypothetical protein DICSQDRAFT_132832 [Dichomitus squalens LYAD-421
SS1]
Length = 1227
Score = 68.6 bits (166), Expect = 7e-10, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 41/60 (68%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T +T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 1100 CTNCQTTITPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 1149
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1100 CTNCQTTITPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1137
>gi|164423316|ref|XP_963796.2| hypothetical protein NCU09068 [Neurospora crassa OR74A]
gi|157070042|gb|EAA34560.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 308
Score = 68.6 bits (166), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 31/61 (50%), Positives = 40/61 (65%), Gaps = 10/61 (16%)
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CTNC T+ T LWRRN DG+P+CNACGL+ KLH V +RPL+++ D I+ R
Sbjct: 14 TCTNCFTQTTPLWRRNPDGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRN 63
Query: 124 R 124
R
Sbjct: 64 R 64
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 15 CTNCFTQTTPLWRRNPDGQPLCNACGLFLKLHGVVRPL 52
>gi|67903360|ref|XP_681936.1| AREA_EMENI Nitrogen regulatory protein areA [Aspergillus nidulans
FGSC A4]
gi|1351972|sp|P17429.2|AREA_EMENI RecName: Full=Nitrogen regulatory protein areA
gi|1154625|emb|CAA36731.1| areA [Emericella nidulans]
gi|40741511|gb|EAA60701.1| AREA_EMENI Nitrogen regulatory protein areA [Aspergillus nidulans
FGSC A4]
gi|259483114|tpe|CBF78221.1| TPA: Nitrogen regulatory protein areA
[Source:UniProtKB/Swiss-Prot;Acc:P17429] [Aspergillus
nidulans FGSC A4]
Length = 876
Score = 68.6 bits (166), Expect = 8e-10, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 673 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 722
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 673 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 710
>gi|322694341|gb|EFY86173.1| GATA transcription factor [Metarhizium acridum CQMa 102]
Length = 523
Score = 68.6 bits (166), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 35/48 (72%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP+ M+ ++
Sbjct: 228 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKPTIK 275
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ K+ T RR
Sbjct: 230 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTM--KKPTIKRR 277
>gi|366991961|ref|XP_003675746.1| hypothetical protein NCAS_0C03920 [Naumovozyma castellii CBS 4309]
gi|342301611|emb|CCC69382.1| hypothetical protein NCAS_0C03920 [Naumovozyma castellii CBS 4309]
Length = 459
Score = 68.6 bits (166), Expect = 9e-10, Method: Composition-based stats.
Identities = 30/71 (42%), Positives = 41/71 (57%), Gaps = 10/71 (14%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NCGT T LWR++ +G +CNACGL+ KLH V MRPL+++ D I+ R+R
Sbjct: 346 CSNCGTGTTPLWRKDANGNSLCNACGLFLKLHGV----------MRPLSLKTDVIKKRQR 395
Query: 125 KPKKQGGGGGG 135
K G
Sbjct: 396 NKKTSTANNNG 406
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 30/38 (78%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NCG +TPLWR+D G+ LCNACGL+ K++G+ RPL
Sbjct: 346 CSNCGTGTTPLWRKDANGNSLCNACGLFLKLHGVMRPL 383
>gi|321251473|ref|XP_003192077.1| transcriptional activator [Cryptococcus gattii WM276]
gi|317458545|gb|ADV20290.1| Transcriptional activator, putative [Cryptococcus gattii WM276]
Length = 1290
Score = 68.6 bits (166), Expect = 9e-10, Method: Composition-based stats.
Identities = 31/62 (50%), Positives = 41/62 (66%), Gaps = 10/62 (16%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ CTNC T T LWRR+ DG+P+CNACGL+YKLH V +RPL+++ D I+ R
Sbjct: 1181 IMCTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKR 1230
Query: 123 KR 124
R
Sbjct: 1231 NR 1232
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1183 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGVVRPL 1220
>gi|119467139|ref|XP_001257376.1| GATA transcriptional activator AreA [Neosartorya fischeri NRRL 181]
gi|119405528|gb|EAW15479.1| GATA transcriptional activator AreA [Neosartorya fischeri NRRL 181]
Length = 882
Score = 68.6 bits (166), Expect = 9e-10, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 670 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 719
Score = 55.5 bits (132), Expect = 8e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 670 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 707
>gi|389646733|ref|XP_003720998.1| nitrogen regulatory protein NUT1 [Magnaporthe oryzae 70-15]
gi|2494690|sp|Q01168.1|NUT1_MAGO7 RecName: Full=Nitrogen regulatory protein NUT1
gi|1401346|gb|AAB03415.1| nitrogen regulatory protein [Magnaporthe grisea]
gi|86196459|gb|EAQ71097.1| hypothetical protein MGCH7_ch7g504 [Magnaporthe oryzae 70-15]
gi|351638390|gb|EHA46255.1| nitrogen regulatory protein NUT1 [Magnaporthe oryzae 70-15]
gi|440466924|gb|ELQ36165.1| nitrogen regulatory protein NUT1 [Magnaporthe oryzae Y34]
gi|440482142|gb|ELQ62657.1| nitrogen regulatory protein NUT1 [Magnaporthe oryzae P131]
Length = 956
Score = 68.6 bits (166), Expect = 9e-10, Method: Composition-based stats.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 663 CTNCATQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 712
Query: 125 KPKKQGGGGGGGS 137
G GS
Sbjct: 713 GSGSNVPGATSGS 725
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 663 CTNCATQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 700
>gi|310792185|gb|EFQ27712.1| GATA zinc finger [Glomerella graminicola M1.001]
Length = 592
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 35/48 (72%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP+ M+ ++
Sbjct: 287 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKSIIK 334
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 32/39 (82%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 289 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVT 327
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 36/60 (60%)
Query: 46 RPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
RP P+ ++ + G C+NCGT T LWRR+ G +CNACGLY K N +RP ++
Sbjct: 103 RPSPAPTATGSSVGQTGQVCSNCGTTRTPLWRRSPQGSTICNACGLYQKARNASRPTNLK 162
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLTATRR 60
G+ C NCG TPLWRR G +CNACGLY K +RP L KP + A+ R
Sbjct: 119 GQVCSNCGTTRTPLWRRSPQGSTICNACGLYQKARNASRPTNLKKPPHLVAASPR 173
>gi|408399802|gb|EKJ78894.1| hypothetical protein FPSE_00936 [Fusarium pseudograminearum CS3096]
Length = 557
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 35/48 (72%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP+ M+ ++
Sbjct: 258 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTMKKSTIK 305
Score = 65.1 bits (157), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ K+ T RR
Sbjct: 260 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTM--KKSTIKRR 307
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR+ G +CNACGLY K N +RP +++
Sbjct: 101 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLK 144
Score = 52.4 bits (124), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 26/42 (61%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
+G+ C NCG TPLWRR G +CNACGLY K +RP
Sbjct: 99 SQGQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRP 140
>gi|46136099|ref|XP_389741.1| hypothetical protein FG09565.1 [Gibberella zeae PH-1]
Length = 557
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 35/48 (72%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP+ M+ ++
Sbjct: 258 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTMKKSTIK 305
Score = 65.1 bits (157), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ K+ T RR
Sbjct: 260 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTM--KKSTIKRR 307
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR+ G +CNACGLY K N +RP +++
Sbjct: 101 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLK 144
Score = 52.4 bits (124), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 26/42 (61%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
+G+ C NCG TPLWRR G +CNACGLY K +RP
Sbjct: 99 SQGQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRP 140
>gi|405117554|gb|AFR92329.1| transcriptional activator [Cryptococcus neoformans var. grubii H99]
Length = 1277
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ DG+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 1173 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 1222
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 25/52 (48%), Positives = 33/52 (63%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
C NC +TPLWRRD G LCNACGL++K++G+ RPL + + R G
Sbjct: 1173 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGVVRPLSLKTDVIKKRNRAG 1224
>gi|169621604|ref|XP_001804212.1| hypothetical protein SNOG_14013 [Phaeosphaeria nodorum SN15]
gi|111057518|gb|EAT78638.1| hypothetical protein SNOG_14013 [Phaeosphaeria nodorum SN15]
Length = 511
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 40/92 (43%), Positives = 51/92 (55%), Gaps = 12/92 (13%)
Query: 33 NACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLY 92
+ CG NG N P+ A+ C NCGT +T LWRR++ G +CNACGLY
Sbjct: 243 SGCGSTDGANGNNATTGLPNG--NASTSAIPACQNCGTTITPLWRRDDAGHIICNACGLY 300
Query: 93 YKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
YKLHN +H RP+AM+K I+ RKR
Sbjct: 301 YKLHN-------KH---RPVAMKKQEIKRRKR 322
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 24/51 (47%), Positives = 34/51 (66%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++ +RP+ + + +R+
Sbjct: 273 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHNKHRPVAMKKQEIKRRKRI 323
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/44 (52%), Positives = 31/44 (70%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NC T T LWRR+ GE +CNACGLY+K N +RP+ ++
Sbjct: 110 GQVCSNCKTTQTPLWRRSPTGETICNACGLYFKARNQHRPVNLK 153
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 26/41 (63%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G+ C NC TPLWRR TG +CNACGLY K +RP+
Sbjct: 110 GQVCSNCKTTQTPLWRRSPTGETICNACGLYFKARNQHRPV 150
>gi|85105503|ref|XP_961978.1| hypothetical protein NCU07728 [Neurospora crassa OR74A]
gi|3552028|gb|AAC64946.1| siderophore regulation protein [Neurospora crassa]
gi|28923567|gb|EAA32742.1| predicted protein [Neurospora crassa OR74A]
Length = 587
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 33/69 (47%), Positives = 41/69 (59%), Gaps = 10/69 (14%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 280 IACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVH----------RPVTMKKAIIKRR 329
Query: 123 KRKPKKQGG 131
KR GG
Sbjct: 330 KRVIPAAGG 338
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 35/51 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 282 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKAIIKRRKRV 332
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 38/69 (55%), Gaps = 6/69 (8%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ- 120
G C+NCGT T LWRR+ G +CNACGLY K N RP +R RP ++ ++
Sbjct: 115 GQVCSNCGTTHTPLWRRSPQGAIICNACGLYLKARNAARPANIR----RPPSVMASNVRQ 170
Query: 121 -TRKRKPKK 128
K PKK
Sbjct: 171 AAAKLSPKK 179
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 24/40 (60%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G+ C NCG TPLWRR G +CNACGLY K RP
Sbjct: 115 GQVCSNCGTTHTPLWRRSPQGAIICNACGLYLKARNAARP 154
>gi|345320418|ref|XP_001516151.2| PREDICTED: erythroid transcription factor-like [Ornithorhynchus
anatinus]
Length = 223
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/31 (90%), Positives = 29/31 (93%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGL 37
E RECVNCGA +TPLWRRDGTGHYLCNACGL
Sbjct: 193 EARECVNCGATATPLWRRDGTGHYLCNACGL 223
Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 24/44 (54%), Gaps = 4/44 (9%)
Query: 52 SKRLTATRRLGLC----CTNCGTRMTTLWRRNNDGEPVCNACGL 91
S +L T L C C NCG T LWRR+ G +CNACGL
Sbjct: 180 SPKLRGTLALSPCEARECVNCGATATPLWRRDGTGHYLCNACGL 223
>gi|58258269|ref|XP_566547.1| transcriptional activator [Cryptococcus neoformans var. neoformans
JEC21]
gi|57222684|gb|AAW40728.1| transcriptional activator, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1290
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ DG+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 1186 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 1235
Score = 58.5 bits (140), Expect = 7e-07, Method: Composition-based stats.
Identities = 25/52 (48%), Positives = 33/52 (63%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
C NC +TPLWRRD G LCNACGL++K++G+ RPL + + R G
Sbjct: 1186 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGVVRPLSLKTDVIKKRNRAG 1237
>gi|322712581|gb|EFZ04154.1| GATA transcription factor [Metarhizium anisopliae ARSEF 23]
Length = 554
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 35/48 (72%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP+ M+ ++
Sbjct: 259 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKPTIK 306
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ K+ T RR
Sbjct: 261 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTM--KKPTIKRR 308
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 29/44 (65%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR+ G +CNACGLY K N RP +++
Sbjct: 100 GQVCSNCGTTRTPLWRRSPQGATICNACGLYQKARNTARPTSLK 143
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 29/52 (55%), Gaps = 2/52 (3%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLTA 57
G+ C NCG TPLWRR G +CNACGLY K RP L KP + A
Sbjct: 100 GQVCSNCGTTRTPLWRRSPQGATICNACGLYQKARNTARPTSLKKPPNVVAA 151
>gi|134106329|ref|XP_778175.1| hypothetical protein CNBA1750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260878|gb|EAL23528.1| hypothetical protein CNBA1750 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1290
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ DG+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 1186 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 1235
Score = 58.5 bits (140), Expect = 7e-07, Method: Composition-based stats.
Identities = 25/52 (48%), Positives = 33/52 (63%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLG 62
C NC +TPLWRRD G LCNACGL++K++G+ RPL + + R G
Sbjct: 1186 CTNCQTTNTPLWRRDPDGQPLCNACGLFYKLHGVVRPLSLKTDVIKKRNRAG 1237
>gi|345570715|gb|EGX53536.1| hypothetical protein AOL_s00006g402 [Arthrobotrys oligospora ATCC
24927]
Length = 984
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 25/37 (67%), Positives = 31/37 (83%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG TPLWRRD +GH +CNACGLYHK++G++RP
Sbjct: 333 CQNCGTTITPLWRRDESGHTICNACGLYHKLHGVHRP 369
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 37/60 (61%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NCGT +T LWRR+ G +CNACGLY+KLH V+ RP M+K I+ RKR
Sbjct: 333 CQNCGTTITPLWRRDESGHTICNACGLYHKLHGVH----------RPETMKKSVIKRRKR 382
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 23/44 (52%), Positives = 28/44 (63%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR +G +CNACGLY K N RP ++
Sbjct: 149 GQVCSNCGTTRTPLWRRAPNGLTICNACGLYLKARNAARPSTLK 192
Score = 53.1 bits (126), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 25/41 (60%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
+G+ C NCG TPLWRR G +CNACGLY K RP
Sbjct: 148 KGQVCSNCGTTRTPLWRRAPNGLTICNACGLYLKARNAARP 188
>gi|296424766|ref|XP_002841917.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295638169|emb|CAZ86108.1| unnamed protein product [Tuber melanosporum]
Length = 658
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 29/53 (54%), Positives = 38/53 (71%), Gaps = 6/53 (11%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKPSKRLTA 57
C NCG TPLWRRD +GH +CNACGLYHK++G++RP ++K KR+ A
Sbjct: 353 CQNCGTTITPLWRRDESGHTICNACGLYHKLHGVHRPEAMKKSVIKRRKRVVA 405
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 31/62 (50%), Positives = 39/62 (62%), Gaps = 10/62 (16%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLY+KLH V+ RP AM+K I+ R
Sbjct: 351 VACQNCGTTITPLWRRDESGHTICNACGLYHKLHGVH----------RPEAMKKSVIKRR 400
Query: 123 KR 124
KR
Sbjct: 401 KR 402
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 23/44 (52%), Positives = 29/44 (65%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NC T T LWRR DG+ +CNACGLY K N +RP ++
Sbjct: 201 GQVCSNCRTTRTPLWRRAPDGQTICNACGLYLKARNQSRPTNLK 244
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 21/40 (52%), Positives = 24/40 (60%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G+ C NC TPLWRR G +CNACGLY K +RP
Sbjct: 201 GQVCSNCRTTRTPLWRRAPDGQTICNACGLYLKARNQSRP 240
>gi|336470856|gb|EGO59017.1| hypothetical protein NEUTE1DRAFT_59972 [Neurospora tetrasperma FGSC
2508]
gi|350291924|gb|EGZ73119.1| siderophore regulation protein [Neurospora tetrasperma FGSC 2509]
Length = 587
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 33/69 (47%), Positives = 41/69 (59%), Gaps = 10/69 (14%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 280 IACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVH----------RPVTMKKAIIKRR 329
Query: 123 KRKPKKQGG 131
KR GG
Sbjct: 330 KRVIPAAGG 338
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 35/51 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 282 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKAIIKRRKRV 332
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 24/44 (54%), Positives = 28/44 (63%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR+ G +CNACGLY K N RP +R
Sbjct: 115 GQVCSNCGTTHTPLWRRSPQGAIICNACGLYLKARNAARPANIR 158
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 24/40 (60%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G+ C NCG TPLWRR G +CNACGLY K RP
Sbjct: 115 GQVCSNCGTTHTPLWRRSPQGAIICNACGLYLKARNAARP 154
>gi|50551201|ref|XP_503074.1| YALI0D20482p [Yarrowia lipolytica]
gi|49648942|emb|CAG81266.1| YALI0D20482p [Yarrowia lipolytica CLIB122]
Length = 238
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 42/73 (57%), Gaps = 10/73 (13%)
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CTNC T+ T LWRRN +GEP+CNACGL+ KLH V +RPL+++ D I+ R
Sbjct: 86 SCTNCHTQTTPLWRRNPEGEPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRN 135
Query: 124 RKPKKQGGGGGGG 136
R G G
Sbjct: 136 RTNGTNASSSGTG 148
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 87 CTNCHTQTTPLWRRNPEGEPLCNACGLFLKLHGVVRPL 124
>gi|121704154|ref|XP_001270341.1| GATA transcriptional activator AreA [Aspergillus clavatus NRRL 1]
gi|119398485|gb|EAW08915.1| GATA transcriptional activator AreA [Aspergillus clavatus NRRL 1]
Length = 879
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 669 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 718
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 669 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 706
>gi|154292043|ref|XP_001546599.1| hypothetical protein BC1G_14396 [Botryotinia fuckeliana B05.10]
gi|347838109|emb|CCD52681.1| bir1, transcription factor Zn, GATA [Botryotinia fuckeliana]
Length = 604
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 34/69 (49%), Positives = 42/69 (60%), Gaps = 11/69 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP + M+K I+ R
Sbjct: 276 VACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP----------VTMKKSIIKRR 325
Query: 123 KR-KPKKQG 130
KR P QG
Sbjct: 326 KRVVPAVQG 334
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 36/51 (70%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 278 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKSIIKRRKRV 328
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 28/41 (68%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
C+NCGT T LWRR+ G +CNACGLY K N +RP ++
Sbjct: 119 CSNCGTTRTPLWRRSPQGATICNACGLYQKARNASRPTNLK 159
Score = 52.8 bits (125), Expect = 4e-05, Method: Composition-based stats.
Identities = 22/44 (50%), Positives = 25/44 (56%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
Q + C NCG TPLWRR G +CNACGLY K +RP
Sbjct: 112 QIQSNQICSNCGTTRTPLWRRSPQGATICNACGLYQKARNASRP 155
>gi|336264503|ref|XP_003347028.1| hypothetical protein SMAC_05227 [Sordaria macrospora k-hell]
gi|380093121|emb|CCC09358.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 936
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN DG+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 683 CTNCFTQTTPLWRRNPDGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 732
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 43/142 (30%), Positives = 59/142 (41%), Gaps = 14/142 (9%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR---LGLCCTN 67
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL + + R L
Sbjct: 683 CTNCFTQTTPLWRRNPDGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRNRGSGASLPVGG 742
Query: 68 CGTR------MTTLWRRNNDGEPVCN-----ACGLYYKLHNVNRPLAMRHLAMRPLAMRK 116
TR M+ R+N+ N A + L NR ++ H + P +
Sbjct: 743 TSTRSKKSASMSAAARKNSTLSVTSNANNQPAAQVATPLAQQNRASSVNHESESPASGPA 802
Query: 117 DGIQTRKRKPKKQGGGGGGGSG 138
G T P G G SG
Sbjct: 803 SGGNTAGSTPTSYHGSAGSSSG 824
>gi|358394095|gb|EHK43496.1| hypothetical protein TRIATDRAFT_174504, partial [Trichoderma
atroviride IMI 206040]
Length = 567
Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 3/78 (3%)
Query: 36 GLYHKMNGMNRPLVKPSKRLTATRR---LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLY 92
G + ++ N P+ L A R + + C NCGT +T LWRR+ G +CNACGLY
Sbjct: 237 GCHGRVETKNEPVPIDVNALQAQDRDTTVIIACQNCGTTITPLWRRDEGGHTICNACGLY 296
Query: 93 YKLHNVNRPLAMRHLAMR 110
YKLH V+RP+ M+ ++
Sbjct: 297 YKLHGVHRPVTMKKATIK 314
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 36/50 (72%), Gaps = 2/50 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG TPLWRRD GH +CNACGLY+K++G++RP+ K+ T RR
Sbjct: 269 CQNCGTTITPLWRRDEGGHTICNACGLYYKLHGVHRPVTM--KKATIKRR 316
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 29/44 (65%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NC T T LWRR+ G +CNACGLY K N +RP +++
Sbjct: 109 GQICSNCNTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLK 152
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLTA 57
G+ C NC TPLWRR G +CNACGLY K +RP L KP ++A
Sbjct: 109 GQICSNCNTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKKPPNLVSA 160
>gi|25136582|gb|AAN65464.1| major nitrogen regulatory protein [Glomerella lindemuthiana]
Length = 971
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 6/76 (7%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP------LAMRKDG 118
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V RPL+++ ++ ++ G
Sbjct: 698 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRNRGSGASLPVGG 757
Query: 119 IQTRKRKPKKQGGGGG 134
TR +K G G
Sbjct: 758 TSTRSKKNAANASGPG 773
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 698 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 735
>gi|225561125|gb|EEH09406.1| GATA transcriptional activator AreA [Ajellomyces capsulatus G186AR]
Length = 951
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 722 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDIIKKRNR 771
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 722 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 759
>gi|12082800|gb|AAG48616.1|AF312694_1 nitrogen response factor NRF1 [Passalora fulva]
Length = 918
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 33/73 (45%), Positives = 44/73 (60%), Gaps = 10/73 (13%)
Query: 52 SKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP 111
S RL + CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RP
Sbjct: 660 SPRLGDNSGVPTSCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRP 709
Query: 112 LAMRKDGIQTRKR 124
L+++ D I+ R R
Sbjct: 710 LSLKTDVIKKRNR 722
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 673 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 710
>gi|240280318|gb|EER43822.1| nitrogen regulatory protein areA [Ajellomyces capsulatus H143]
gi|325096614|gb|EGC49924.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 951
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 722 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDIIKKRNR 771
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 722 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 759
>gi|294658476|ref|XP_002770789.1| DEHA2F10384p [Debaryomyces hansenii CBS767]
gi|202953159|emb|CAR66314.1| DEHA2F10384p [Debaryomyces hansenii CBS767]
Length = 475
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 33/74 (44%), Positives = 44/74 (59%), Gaps = 7/74 (9%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI--- 119
L CTNC T+ T LWR++N G+ +CNACGL+YKLH V RPL+ R ++ D +
Sbjct: 225 LQCTNCNTKTTPLWRKSNKGDLLCNACGLFYKLHGVLRPLSQPD---RKSSVSNDTLTST 281
Query: 120 -QTRKRKPKKQGGG 132
+T KP GG
Sbjct: 282 NKTESSKPNSNMGG 295
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 22/49 (44%), Positives = 35/49 (71%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTAT 58
+C NC +TPLWR+ G LCNACGL++K++G+ RPL +P ++ + +
Sbjct: 226 QCTNCNTKTTPLWRKSNKGDLLCNACGLFYKLHGVLRPLSQPDRKSSVS 274
>gi|171690816|ref|XP_001910333.1| hypothetical protein [Podospora anserina S mat+]
gi|170945356|emb|CAP71468.1| unnamed protein product [Podospora anserina S mat+]
Length = 536
Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 35/48 (72%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP+ M+ ++
Sbjct: 255 IACQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKSVIK 302
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 31/39 (79%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
C NCG TPLWRRD GH +CNACGLY+K++G++RP+
Sbjct: 257 CQNCGTTITPLWRRDEAGHTICNACGLYYKLHGVHRPVT 295
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR+ G +CNACGLY K N +RP+ ++
Sbjct: 98 GQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNTHRPVNLK 141
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLTATRR 60
G+ C NCG TPLWRR G +CNACGLY K +RP L +P + +TR+
Sbjct: 98 GQVCSNCGTTQTPLWRRSPQGATICNACGLYLKARNTHRPVNLKRPPNVIPSTRQ 152
>gi|308473360|ref|XP_003098905.1| CRE-EGL-18 protein [Caenorhabditis remanei]
gi|308268044|gb|EFP11997.1| CRE-EGL-18 protein [Caenorhabditis remanei]
Length = 419
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 39/64 (60%), Gaps = 10/64 (15%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T TT WRR+++G VCN CGLYY+LH V RP+ MRK+ IQ R R
Sbjct: 308 CSNCQTDKTTAWRRDSEGRLVCNPCGLYYRLHKVR----------RPIEMRKNHIQQRYR 357
Query: 125 KPKK 128
+ K
Sbjct: 358 RKNK 361
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 17/38 (44%), Positives = 24/38 (63%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC T WRRD G +CN CGLY++++ + RP+
Sbjct: 308 CSNCQTDKTTAWRRDSEGRLVCNPCGLYYRLHKVRRPI 345
>gi|116193523|ref|XP_001222574.1| hypothetical protein CHGG_06479 [Chaetomium globosum CBS 148.51]
gi|88182392|gb|EAQ89860.1| hypothetical protein CHGG_06479 [Chaetomium globosum CBS 148.51]
Length = 981
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 33/74 (44%), Positives = 45/74 (60%), Gaps = 6/74 (8%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPL------AMRKDG 118
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V RPL+++ ++ ++ G
Sbjct: 705 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRNRGSGSSLPVGG 764
Query: 119 IQTRKRKPKKQGGG 132
TR RK GG
Sbjct: 765 SSTRTRKNASSSGG 778
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 705 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 742
>gi|396465058|ref|XP_003837137.1| hypothetical protein LEMA_P033710.1 [Leptosphaeria maculans JN3]
gi|312213695|emb|CBX93697.1| hypothetical protein LEMA_P033710.1 [Leptosphaeria maculans JN3]
Length = 511
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 39/60 (65%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NCGT +T LWRR++ G +CNACGLYYKLH + RP+AM+K I+ RKR
Sbjct: 271 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTH----------RPIAMKKQEIKRRKR 320
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 35/51 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++G +RP+ + + +R+
Sbjct: 271 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTHRPIAMKKQEIKRRKRI 321
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 25/49 (51%), Positives = 32/49 (65%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
AT G C+NC T T LWRR+ GE +CNACGLY K N +RP+ ++
Sbjct: 104 ATPVAGQICSNCRTTQTPLWRRSPTGETLCNACGLYMKARNQSRPVNLK 152
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G+ C NC TPLWRR TG LCNACGLY K +RP+
Sbjct: 109 GQICSNCRTTQTPLWRRSPTGETLCNACGLYMKARNQSRPV 149
>gi|417079326|gb|AFX60070.1| GATA binding protein 1, partial [Mesocricetus auratus]
Length = 212
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/32 (87%), Positives = 29/32 (90%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLY 38
E RECVNCGA +TPLWRRD TGHYLCNACGLY
Sbjct: 181 EARECVNCGATATPLWRRDRTGHYLCNACGLY 212
Score = 42.4 bits (98), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 17/28 (60%), Positives = 19/28 (67%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLY 92
C NCG T LWRR+ G +CNACGLY
Sbjct: 185 CVNCGATATPLWRRDRTGHYLCNACGLY 212
>gi|350637200|gb|EHA25558.1| hypothetical protein ASPNIDRAFT_120062 [Aspergillus niger ATCC
1015]
Length = 880
Score = 67.8 bits (164), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 672 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 721
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 672 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 709
>gi|70984747|ref|XP_747880.1| GATA transcriptional activator AreA [Aspergillus fumigatus Af293]
gi|66845507|gb|EAL85842.1| GATA transcriptional activator AreA [Aspergillus fumigatus Af293]
Length = 881
Score = 67.8 bits (164), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 671 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 720
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 671 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 708
>gi|409040728|gb|EKM50215.1| hypothetical protein PHACADRAFT_264823 [Phanerochaete carnosa
HHB-10118-sp]
Length = 661
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/73 (46%), Positives = 42/73 (57%), Gaps = 10/73 (13%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
L C NCGT T LWRR++ G +CNACGLY+KLH + RP +M+K I+ R
Sbjct: 292 LSCANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTH----------RPNSMKKTVIKRR 341
Query: 123 KRKPKKQGGGGGG 135
KR P GG G
Sbjct: 342 KRVPAASGGSPSG 354
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/37 (70%), Positives = 27/37 (72%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCGA STPLWRRDG G +CNACGLY K M RP
Sbjct: 92 CANCGAHSTPLWRRDGEGKAVCNACGLYWKHKNMPRP 128
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/54 (50%), Positives = 36/54 (66%), Gaps = 6/54 (11%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKPSKRLTAT 58
C NCG +TPLWRRD G+ +CNACGLY K++G +RP ++K KR+ A
Sbjct: 294 CANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRPNSMKKTVIKRRKRVPAA 347
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 38/79 (48%)
Query: 26 GTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPV 85
G+G Y + ++ L + S C NCG T LWRR+ +G+ V
Sbjct: 53 GSGPYPIDPALREKSTESIDPTLAQASNSSPTIAPAKPPCANCGAHSTPLWRRDGEGKAV 112
Query: 86 CNACGLYYKLHNVNRPLAM 104
CNACGLY+K N+ RP +
Sbjct: 113 CNACGLYWKHKNMPRPSTL 131
>gi|358371133|dbj|GAA87742.1| regulator of nitrogen metabolite repression [Aspergillus kawachii
IFO 4308]
Length = 881
Score = 67.8 bits (164), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 674 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 723
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 674 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 711
>gi|159122664|gb|EDP47785.1| GATA transcriptional activator AreA [Aspergillus fumigatus A1163]
Length = 881
Score = 67.8 bits (164), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 671 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 720
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 671 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 708
>gi|2133281|pir||S51493 major nitrogen regulation protein - Penicillium chrysogenum
(strain Q176)
Length = 862
Score = 67.8 bits (164), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 662 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 711
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 662 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 699
>gi|408389698|gb|EKJ69132.1| hypothetical protein FPSE_10693 [Fusarium pseudograminearum CS3096]
Length = 949
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 671 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 720
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 671 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 708
>gi|400602130|gb|EJP69755.1| siderophore regulation protein [Beauveria bassiana ARSEF 2860]
Length = 532
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 37/48 (77%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C+NCGT +T LWRR+ +G +CNACGLYY+LH V+RP+ M+ ++
Sbjct: 238 IACSNCGTTITPLWRRDGEGNMICNACGLYYRLHGVHRPVTMKKATIK 285
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 40/57 (70%), Gaps = 2/57 (3%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
C NCG TPLWRRDG G+ +CNACGLY++++G++RP+ K+ T RR + +N
Sbjct: 240 CSNCGTTITPLWRRDGEGNMICNACGLYYRLHGVHRPVTM--KKATIKRRKRVIPSN 294
Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 30/49 (61%)
Query: 57 ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
A+ G C+NCGT T LWRR+ G +CNACGLY + N RP ++
Sbjct: 79 ASSSHGQICSNCGTTETPLWRRSPQGATICNACGLYLRARNSARPTNLK 127
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 34/62 (54%), Gaps = 4/62 (6%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLTATRRLGLCC 65
G+ C NCG TPLWRR G +CNACGLY + RP L KP K ++ L L
Sbjct: 84 GQICSNCGTTETPLWRRSPQGATICNACGLYLRARNSARPTNLKKPPKVVST--ELPLAS 141
Query: 66 TN 67
T+
Sbjct: 142 TS 143
>gi|67515789|ref|XP_657780.1| hypothetical protein AN0176.2 [Aspergillus nidulans FGSC A4]
gi|4585213|gb|AAD25328.1|AF095898_1 siderophore biosynthesis repressor SREA [Emericella nidulans]
gi|40746893|gb|EAA66049.1| hypothetical protein AN0176.2 [Aspergillus nidulans FGSC A4]
gi|259489617|tpe|CBF90036.1| TPA: Putative uncharacterized proteinSiderophore biosynthesis
repressor SREA ; [Source:UniProtKB/TrEMBL;Acc:Q9Y754]
[Aspergillus nidulans FGSC A4]
Length = 549
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 31/62 (50%), Positives = 39/62 (62%), Gaps = 10/62 (16%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ +G P+CNACGLYYKLH + RP M+K I+ R
Sbjct: 249 VACQNCGTTVTPLWRRDENGHPICNACGLYYKLHG----------SYRPTTMKKTIIKRR 298
Query: 123 KR 124
KR
Sbjct: 299 KR 300
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 31/61 (50%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 41 MNGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNR 100
MNG + P K T LG C+NCGT+ T LWRR+ G +CNACGLY K NV R
Sbjct: 83 MNGHHVEKTSP-KSQKDTSFLGHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVAR 141
Query: 101 P 101
P
Sbjct: 142 P 142
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 103 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 142
>gi|392589458|gb|EIW78788.1| hypothetical protein CONPUDRAFT_106827 [Coniophora puteana
RWD-64-598 SS2]
Length = 763
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/73 (45%), Positives = 41/73 (56%), Gaps = 10/73 (13%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
LCC NC T T LWRR++ G +CNACGLY+KLH + RP +M+K I+
Sbjct: 318 ALCCANCSTSTTPLWRRDDVGNNICNACGLYFKLHGTH----------RPNSMKKTVIKR 367
Query: 122 RKRKPKKQGGGGG 134
RKR P G G
Sbjct: 368 RKRVPAAAGMSSG 380
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 26/53 (49%), Positives = 35/53 (66%), Gaps = 6/53 (11%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKPSKRLTA 57
C NC +TPLWRRD G+ +CNACGLY K++G +RP ++K KR+ A
Sbjct: 321 CANCSTSTTPLWRRDDVGNNICNACGLYFKLHGTHRPNSMKKTVIKRRKRVPA 373
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 29/40 (72%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM 104
C NCGT T LWRR+++G+ +CNACGLY K V RP ++
Sbjct: 146 CNNCGTAETPLWRRDSEGKTICNACGLYLKSRKVARPPSL 185
Score = 52.8 bits (125), Expect = 5e-05, Method: Composition-based stats.
Identities = 22/37 (59%), Positives = 24/37 (64%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG TPLWRRD G +CNACGLY K + RP
Sbjct: 146 CNNCGTAETPLWRRDSEGKTICNACGLYLKSRKVARP 182
>gi|166214957|sp|P19212.2|NIT2_NEUCR RecName: Full=Nitrogen catabolic enzyme regulatory protein
Length = 1036
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN DG+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 743 CTNCFTQTTPLWRRNPDGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 792
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 743 CTNCFTQTTPLWRRNPDGQPLCNACGLFLKLHGVVRPL 780
>gi|336463275|gb|EGO51515.1| Nitrogen catabolic enzyme regulatory protein [Neurospora
tetrasperma FGSC 2508]
gi|350297522|gb|EGZ78499.1| nitrogen catabolic enzyme regulatory protein [Neurospora
tetrasperma FGSC 2509]
Length = 1033
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN DG+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 740 CTNCFTQTTPLWRRNPDGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 789
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 740 CTNCFTQTTPLWRRNPDGQPLCNACGLFLKLHGVVRPL 777
>gi|1107471|gb|AAB03891.1| nitrogen catabolic enzyme regulatory protein [Neurospora crassa]
Length = 1036
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN DG+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 743 CTNCFTQTTPLWRRNPDGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 792
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 743 CTNCFTQTTPLWRRNPDGQPLCNACGLFLKLHGVVRPL 780
>gi|261189233|ref|XP_002621028.1| GATA transcriptional activator AreA [Ajellomyces dermatitidis
SLH14081]
gi|239591813|gb|EEQ74394.1| GATA transcriptional activator AreA [Ajellomyces dermatitidis
SLH14081]
Length = 970
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/80 (41%), Positives = 45/80 (56%), Gaps = 10/80 (12%)
Query: 45 NRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM 104
+RP R + CTNC T+ T LWRRN +G+P+CNACGL+ KLH V
Sbjct: 721 SRPASPGGARQSEPSAAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV------ 774
Query: 105 RHLAMRPLAMRKDGIQTRKR 124
+RPL+++ D I+ R R
Sbjct: 775 ----VRPLSLKTDVIKKRNR 790
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 741 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 778
>gi|3182912|sp|O13412.1|AREA_ASPNG RecName: Full=Nitrogen regulatory protein areA
gi|2462907|emb|CAA57524.1| areA [Aspergillus niger]
gi|28412451|emb|CAA68196.1| areA [Aspergillus niger]
Length = 882
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 676 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 725
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 676 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 713
>gi|320590499|gb|EFX02942.1| gata transcriptional activator [Grosmannia clavigera kw1407]
Length = 1048
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/70 (47%), Positives = 43/70 (61%), Gaps = 10/70 (14%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 759 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 808
Query: 125 KPKKQGGGGG 134
Q GG
Sbjct: 809 GSGAQLPVGG 818
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 759 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 796
>gi|330921388|ref|XP_003299406.1| hypothetical protein PTT_10383 [Pyrenophora teres f. teres 0-1]
gi|311326936|gb|EFQ92491.1| hypothetical protein PTT_10383 [Pyrenophora teres f. teres 0-1]
Length = 465
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 39/60 (65%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NCGT +T LWRR++ G +CNACGLYYKLH + RP+AM+K I+ RKR
Sbjct: 232 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTH----------RPVAMKKQEIKRRKR 281
Score = 65.5 bits (158), Expect = 7e-09, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 35/51 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++G +RP+ + + +R+
Sbjct: 232 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTHRPVAMKKQEIKRRKRV 282
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/44 (54%), Positives = 30/44 (68%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NC T T LWRR+ GE VCNACGLY K N +RP+ ++
Sbjct: 79 GQVCSNCKTTQTPLWRRSPSGETVCNACGLYMKARNQSRPVNLK 122
Score = 52.8 bits (125), Expect = 5e-05, Method: Composition-based stats.
Identities = 21/41 (51%), Positives = 26/41 (63%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G+ C NC TPLWRR +G +CNACGLY K +RP+
Sbjct: 79 GQVCSNCKTTQTPLWRRSPSGETVCNACGLYMKARNQSRPV 119
>gi|149246816|ref|XP_001527833.1| hypothetical protein LELG_00353 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146447787|gb|EDK42175.1| hypothetical protein LELG_00353 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 694
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/59 (52%), Positives = 42/59 (71%), Gaps = 3/59 (5%)
Query: 53 KRLT---ATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLA 108
+RLT T + L CTNC T+ T LWR++N+G+ +CNACGL+YKLH V RPL ++ A
Sbjct: 376 RRLTINSTTPKKILQCTNCETKTTPLWRKSNNGDLLCNACGLFYKLHGVLRPLNNKNKA 434
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 22/44 (50%), Positives = 31/44 (70%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSK 53
+C NC +TPLWR+ G LCNACGL++K++G+ RPL +K
Sbjct: 390 QCTNCETKTTPLWRKSNNGDLLCNACGLFYKLHGVLRPLNNKNK 433
>gi|327354121|gb|EGE82978.1| GATA transcriptional activator AreA [Ajellomyces dermatitidis ATCC
18188]
Length = 976
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/80 (41%), Positives = 45/80 (56%), Gaps = 10/80 (12%)
Query: 45 NRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM 104
+RP R + CTNC T+ T LWRRN +G+P+CNACGL+ KLH V
Sbjct: 727 SRPASPGGARQSEPSAAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV------ 780
Query: 105 RHLAMRPLAMRKDGIQTRKR 124
+RPL+++ D I+ R R
Sbjct: 781 ----VRPLSLKTDVIKKRNR 796
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 747 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 784
>gi|239614730|gb|EEQ91717.1| GATA transcriptional activator AreA [Ajellomyces dermatitidis ER-3]
Length = 970
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/80 (41%), Positives = 45/80 (56%), Gaps = 10/80 (12%)
Query: 45 NRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM 104
+RP R + CTNC T+ T LWRRN +G+P+CNACGL+ KLH V
Sbjct: 721 SRPASPGGARQSEPSAAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV------ 774
Query: 105 RHLAMRPLAMRKDGIQTRKR 124
+RPL+++ D I+ R R
Sbjct: 775 ----VRPLSLKTDVIKKRNR 790
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 741 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 778
>gi|299751251|ref|XP_002911612.1| transcription factor ScGATA-6 [Coprinopsis cinerea okayama7#130]
gi|298409288|gb|EFI28118.1| transcription factor ScGATA-6 [Coprinopsis cinerea okayama7#130]
Length = 1080
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/60 (48%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ +G+P+CNACGL++KLH V +RPL+++ D I+ R R
Sbjct: 952 CTNCQTTNTPLWRRDPEGQPLCNACGLFFKLHGV----------VRPLSLKTDVIKKRNR 1001
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 36/66 (54%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCG 69
+C NC +TPLWRRD G LCNACGL+ K++G+ RPL + + R
Sbjct: 951 QCTNCQTTNTPLWRRDPEGQPLCNACGLFFKLHGVVRPLSLKTDVIKKRNRTSGTAPGSN 1010
Query: 70 TRMTTL 75
+R TL
Sbjct: 1011 SRKQTL 1016
>gi|122892555|gb|ABM67327.1| GATA4 [Hylobates klossii]
Length = 42
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 34/41 (82%)
Query: 56 TATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLH 96
+A+RR+GL C NC T TTLWRRN +GEPVCNACGLY KLH
Sbjct: 1 SASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLH 41
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNG 43
G C NC +T LWRR+ G +CNACGLY K++G
Sbjct: 7 GLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHG 42
>gi|302419453|ref|XP_003007557.1| nitrogen regulatory protein areA [Verticillium albo-atrum VaMs.102]
gi|261353208|gb|EEY15636.1| nitrogen regulatory protein areA [Verticillium albo-atrum VaMs.102]
Length = 983
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 32/85 (37%), Positives = 44/85 (51%), Gaps = 11/85 (12%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR-----------HLAMRPLA 113
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V RPL+++ + P
Sbjct: 746 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRNRNSQVTTPPAV 805
Query: 114 MRKDGIQTRKRKPKKQGGGGGGGSG 138
+R + P GG +G
Sbjct: 806 VRAGSVNNEGESPASGPASGGNTAG 830
Score = 53.1 bits (126), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 746 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 783
>gi|5002306|gb|AAD37409.1|AF148539_1 major nitrogen regulatory protein [Aspergillus parasiticus]
Length = 866
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 664 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 713
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 664 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 701
>gi|449298668|gb|EMC94683.1| hypothetical protein BAUCODRAFT_35909 [Baudoinia compniacensis UAMH
10762]
Length = 936
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 34/74 (45%), Positives = 43/74 (58%), Gaps = 18/74 (24%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 677 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 726
Query: 125 KPKKQGGGGGGGSG 138
G G SG
Sbjct: 727 --------GSGNSG 732
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 677 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 714
>gi|12802699|gb|AAK08066.1|AF320305_1 AreA [Aspergillus oryzae]
Length = 853
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 652 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 701
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 652 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 689
>gi|2494687|sp|Q01582.1|AREA_PENCH RecName: Full=Nitrogen regulatory protein areA; Short=Nitrogen
regulator nre
gi|437210|gb|AAA83400.1| NRE [Penicillium chrysogenum]
Length = 725
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 525 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 574
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 525 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 562
>gi|169764959|ref|XP_001816951.1| nitrogen regulatory protein areA [Aspergillus oryzae RIB40]
gi|3182913|sp|O13415.1|AREA_ASPOR RecName: Full=Nitrogen regulatory protein areA
gi|2632173|emb|CAA05776.1| areA [Aspergillus oryzae]
gi|83764805|dbj|BAE54949.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 866
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 664 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 713
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 664 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 701
>gi|46128513|ref|XP_388810.1| AREA_GIBFU Nitrogen regulatory protein areA [Gibberella zeae PH-1]
Length = 964
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 686 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 735
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 686 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 723
>gi|392593705|gb|EIW83030.1| hypothetical protein CONPUDRAFT_81125 [Coniophora puteana RWD-64-598
SS2]
Length = 1103
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 973 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 1022
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 973 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1010
>gi|391863417|gb|EIT72728.1| GATA transcription factor [Aspergillus oryzae 3.042]
Length = 866
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 664 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 713
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 664 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 701
>gi|296413330|ref|XP_002836367.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295630184|emb|CAZ80558.1| unnamed protein product [Tuber melanosporum]
Length = 930
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/73 (45%), Positives = 42/73 (57%), Gaps = 10/73 (13%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 699 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 748
Query: 125 KPKKQGGGGGGGS 137
GG S
Sbjct: 749 GSGSSLPVGGSAS 761
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 699 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 736
>gi|255710611|ref|XP_002551589.1| KLTH0A03036p [Lachancea thermotolerans]
gi|238932966|emb|CAR21147.1| KLTH0A03036p [Lachancea thermotolerans CBS 6340]
Length = 720
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 40/64 (62%), Gaps = 10/64 (15%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRR+ G P+CNACGL+ KLH V +RPL+++ D I+ R+R
Sbjct: 461 CTNCHTKTTPLWRRDPQGNPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRQR 510
Query: 125 KPKK 128
K
Sbjct: 511 SSNK 514
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/39 (58%), Positives = 30/39 (76%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
+C NC +TPLWRRD G+ LCNACGL+ K++G+ RPL
Sbjct: 460 KCTNCHTKTTPLWRRDPQGNPLCNACGLFLKLHGVVRPL 498
>gi|157113433|ref|XP_001657827.1| GATA transcription factor (GATAb) [Aedes aegypti]
gi|108877757|gb|EAT41982.1| AAEL006447-PA [Aedes aegypti]
Length = 770
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/53 (58%), Positives = 36/53 (67%), Gaps = 10/53 (18%)
Query: 75 LWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPK 127
+WRRN GE VCNACGLY+KLH VNRP H MR+D I TR+R+PK
Sbjct: 609 IWRRNIRGEMVCNACGLYFKLHGVNRP----H------TMRRDTIHTRRRRPK 651
Score = 45.8 bits (107), Expect = 0.006, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 25/40 (62%)
Query: 21 LWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
+WRR+ G +CNACGLY K++G+NRP + RR
Sbjct: 609 IWRRNIRGEMVCNACGLYFKLHGVNRPHTMRRDTIHTRRR 648
>gi|340521468|gb|EGR51702.1| zinc finger protein [Trichoderma reesei QM6a]
Length = 429
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 32/72 (44%), Positives = 42/72 (58%), Gaps = 4/72 (5%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAM----RPLAMRKDGIQ 120
C NCGT T LWRR+ G +CNACGL+ KLH RP++++ + R MR D
Sbjct: 11 CQNCGTSTTPLWRRDEFGSVLCNACGLFLKLHGRPRPISLKTDVIKSRNRVKTMRPDLAP 70
Query: 121 TRKRKPKKQGGG 132
RKR P++Q G
Sbjct: 71 KRKRLPQQQQHG 82
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 27/75 (36%), Positives = 41/75 (54%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGT 70
C NCG +TPLWRRD G LCNACGL+ K++G RP+ + + + R+ +
Sbjct: 11 CQNCGTSTTPLWRRDEFGSVLCNACGLFLKLHGRPRPISLKTDVIKSRNRVKTMRPDLAP 70
Query: 71 RMTTLWRRNNDGEPV 85
+ L ++ G P+
Sbjct: 71 KRKRLPQQQQHGFPL 85
>gi|378730997|gb|EHY57456.1| GATA-binding protein [Exophiala dermatitidis NIH/UT8656]
Length = 943
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 696 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 745
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 24/45 (53%), Positives = 30/45 (66%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
Q G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 689 QNGVPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 733
>gi|451993853|gb|EMD86325.1| hypothetical protein COCHEDRAFT_109473 [Cochliobolus heterostrophus
C5]
Length = 520
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 39/60 (65%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NCGT +T LWRR++ G +CNACGLYYKLH + RP+AM+K I+ RKR
Sbjct: 287 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTH----------RPVAMKKQEIKRRKR 336
Score = 65.5 bits (158), Expect = 7e-09, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 35/51 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++G +RP+ + + +R+
Sbjct: 287 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTHRPVAMKKQEIKRRKRI 337
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 24/44 (54%), Positives = 30/44 (68%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NC T T LWRR+ GE VCNACGLY K N +RP+ ++
Sbjct: 128 GQVCSNCKTTQTPLWRRSPAGETVCNACGLYMKARNQSRPVNLK 171
Score = 52.0 bits (123), Expect = 8e-05, Method: Composition-based stats.
Identities = 21/41 (51%), Positives = 25/41 (60%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G+ C NC TPLWRR G +CNACGLY K +RP+
Sbjct: 128 GQVCSNCKTTQTPLWRRSPAGETVCNACGLYMKARNQSRPV 168
>gi|302652995|ref|XP_003018335.1| siderophore transcription factor SreA [Trichophyton verrucosum HKI
0517]
gi|291181965|gb|EFE37690.1| siderophore transcription factor SreA [Trichophyton verrucosum HKI
0517]
Length = 545
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 35/46 (76%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
C NCGT +T LWRR++ G +CNACGLYY+LH +RP+AM+ ++
Sbjct: 253 CQNCGTTLTPLWRRDDQGNTICNACGLYYRLHGSHRPVAMKKTVIK 298
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 27/61 (44%), Positives = 38/61 (62%), Gaps = 6/61 (9%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKPSKRLTATR 59
G + C NCG TPLWRRD G+ +CNACGLY++++G +RP ++K KR+
Sbjct: 248 GMPKACQNCGTTLTPLWRRDDQGNTICNACGLYYRLHGSHRPVAMKKTVIKRRKRVVPAL 307
Query: 60 R 60
R
Sbjct: 308 R 308
>gi|367022150|ref|XP_003660360.1| hypothetical protein MYCTH_2298578 [Myceliophthora thermophila ATCC
42464]
gi|347007627|gb|AEO55115.1| hypothetical protein MYCTH_2298578 [Myceliophthora thermophila ATCC
42464]
Length = 553
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/70 (47%), Positives = 41/70 (58%), Gaps = 10/70 (14%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 254 IACQNCGTTVTPLWRRDEAGHTICNACGLYYKLHGVH----------RPVTMKKSIIKRR 303
Query: 123 KRKPKKQGGG 132
KR GG
Sbjct: 304 KRVIPATGGA 313
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 35/51 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 256 CQNCGTTVTPLWRRDEAGHTICNACGLYYKLHGVHRPVTMKKSIIKRRKRV 306
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 32/70 (45%), Positives = 41/70 (58%), Gaps = 6/70 (8%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G C+NCGT T LWRR+ GE +CNACGLY K N RP+ ++ RP A+ +G +
Sbjct: 95 GQVCSNCGTTFTPLWRRSPQGEIICNACGLYLKTRNAPRPINLK----RPPAVPGNGSRQ 150
Query: 122 R--KRKPKKQ 129
K PK Q
Sbjct: 151 SPVKLSPKAQ 160
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 24/45 (53%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 5 FGE-GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
FG+ G+ C NCG TPLWRR G +CNACGLY K RP+
Sbjct: 91 FGQSGQVCSNCGTTFTPLWRRSPQGEIICNACGLYLKTRNAPRPI 135
>gi|302921056|ref|XP_003053207.1| hypothetical protein NECHADRAFT_59326 [Nectria haematococca mpVI
77-13-4]
gi|256734147|gb|EEU47494.1| hypothetical protein NECHADRAFT_59326 [Nectria haematococca mpVI
77-13-4]
Length = 897
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 32/73 (43%), Positives = 43/73 (58%), Gaps = 10/73 (13%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 619 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 668
Query: 125 KPKKQGGGGGGGS 137
GG +
Sbjct: 669 GSGSNVSVGGSST 681
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 619 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 656
>gi|403419010|emb|CCM05710.1| predicted protein [Fibroporia radiculosa]
Length = 1271
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 1137 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 1186
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 1137 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 1174
>gi|119172876|ref|XP_001238983.1| hypothetical protein CIMG_10005 [Coccidioides immitis RS]
Length = 752
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/83 (39%), Positives = 45/83 (54%), Gaps = 10/83 (12%)
Query: 42 NGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
N +RP + T C NC T+ T LWRRN +G+P+CNACGL+ KLH V
Sbjct: 502 NNPSRPASPGPAKATEQPNAPTTCANCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV--- 558
Query: 102 LAMRHLAMRPLAMRKDGIQTRKR 124
+RPL+++ D I+ R R
Sbjct: 559 -------VRPLSLKTDVIKKRNR 574
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 525 CANCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 562
>gi|451856759|gb|EMD70050.1| hypothetical protein COCSADRAFT_216437 [Cochliobolus sativus
ND90Pr]
Length = 507
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 39/60 (65%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NCGT +T LWRR++ G +CNACGLYYKLH + RP+AM+K I+ RKR
Sbjct: 274 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTH----------RPVAMKKQEIKRRKR 323
Score = 65.5 bits (158), Expect = 8e-09, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 35/51 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++G +RP+ + + +R+
Sbjct: 274 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTHRPVAMKKQEIKRRKRI 324
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 24/44 (54%), Positives = 30/44 (68%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NC T T LWRR+ GE VCNACGLY K N +RP+ ++
Sbjct: 116 GQICSNCKTTQTPLWRRSPAGETVCNACGLYMKARNQSRPVNLK 159
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 21/41 (51%), Positives = 25/41 (60%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G+ C NC TPLWRR G +CNACGLY K +RP+
Sbjct: 116 GQICSNCKTTQTPLWRRSPAGETVCNACGLYMKARNQSRPV 156
>gi|392869191|gb|EAS27676.2| hypothetical protein CIMG_10005 [Coccidioides immitis RS]
Length = 762
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/83 (39%), Positives = 45/83 (54%), Gaps = 10/83 (12%)
Query: 42 NGMNRPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
N +RP + T C NC T+ T LWRRN +G+P+CNACGL+ KLH V
Sbjct: 512 NNPSRPASPGPAKATEQPNAPTTCANCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV--- 568
Query: 102 LAMRHLAMRPLAMRKDGIQTRKR 124
+RPL+++ D I+ R R
Sbjct: 569 -------VRPLSLKTDVIKKRNR 584
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 535 CANCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 572
>gi|145228733|ref|XP_001388675.1| GATA factor SREP [Aspergillus niger CBS 513.88]
gi|134054767|emb|CAK43607.1| unnamed protein product [Aspergillus niger]
gi|350637901|gb|EHA26257.1| hypothetical protein ASPNIDRAFT_52040 [Aspergillus niger ATCC 1015]
Length = 569
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 37/60 (61%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NCGT +T LWRR+ +G P+CNACGLYYKLH RP M+K I+ RKR
Sbjct: 267 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHG----------CYRPTTMKKTIIKRRKR 316
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 26/44 (59%), Positives = 30/44 (68%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
T LG C+NCGT+ T LWRR+ G +CNACGLY K NV RP
Sbjct: 104 TSFLGHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 147
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 108 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 147
>gi|308486717|ref|XP_003105555.1| CRE-ELT-3 protein [Caenorhabditis remanei]
gi|308255521|gb|EFO99473.1| CRE-ELT-3 protein [Caenorhabditis remanei]
Length = 323
Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 52/92 (56%), Gaps = 15/92 (16%)
Query: 47 PLV---KP-SKRLTATR-RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
PLV KP KR+ A + C+NC TR TTLWRRN +G CNAC LY++ +N
Sbjct: 227 PLVAEPKPIKKRMAAVQCHQNSICSNCKTRETTLWRRNGEGGVECNACNLYFRKNN---- 282
Query: 102 LAMRHLAMRPLAMRKDGIQTRKRKPKKQGGGG 133
RPL++RKDGI R R+P+ + G
Sbjct: 283 ------RKRPLSLRKDGIMKRNRRPRNESPGA 308
Score = 44.3 bits (103), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 22/38 (57%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC T LWRR+G G CNAC LY + N RPL
Sbjct: 250 CSNCKTRETTLWRRNGEGGVECNACNLYFRKNNRKRPL 287
>gi|295666031|ref|XP_002793566.1| nitrogen regulatory protein areA [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226277860|gb|EEH33426.1| nitrogen regulatory protein areA [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 992
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 763 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 812
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 24/45 (53%), Positives = 30/45 (66%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
Q G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 756 QSGAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 800
>gi|452847941|gb|EME49873.1| nitrogen response factor-like protein [Dothistroma septosporum
NZE10]
Length = 938
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 693 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 742
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 693 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 730
>gi|2494688|sp|Q92269.1|NRFA_PENUR RecName: Full=Nitrogen regulatory protein nrfA
gi|1654230|gb|AAB17740.1| nitrogen regulatory factor NRFA [Penicillium urticae]
Length = 865
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 665 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 714
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 665 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 702
>gi|407924861|gb|EKG17886.1| Zinc finger GATA-type protein [Macrophomina phaseolina MS6]
Length = 923
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 39/60 (65%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 690 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 739
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 25/45 (55%), Positives = 31/45 (68%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
Q G C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 683 QNGAPTTCTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 727
>gi|358372181|dbj|GAA88786.1| siderophore transcription factor SreA [Aspergillus kawachii IFO
4308]
Length = 568
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 43/78 (55%), Gaps = 10/78 (12%)
Query: 47 PLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRH 106
P V +A + + C NCGT +T LWRR+ +G P+CNACGLYYKLH
Sbjct: 248 PDVPGKSGASAEGNMLVSCQNCGTTVTPLWRRDENGHPICNACGLYYKLHG--------- 298
Query: 107 LAMRPLAMRKDGIQTRKR 124
RP M+K I+ RKR
Sbjct: 299 -CYRPTTMKKTIIKRRKR 315
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 26/44 (59%), Positives = 30/44 (68%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
T LG C+NCGT+ T LWRR+ G +CNACGLY K NV RP
Sbjct: 103 TSFLGHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 146
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 107 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 146
>gi|225683771|gb|EEH22055.1| nitrogen regulatory protein areA [Paracoccidioides brasiliensis
Pb03]
Length = 995
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 766 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 815
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 24/45 (53%), Positives = 30/45 (66%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
Q G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 759 QSGAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 803
>gi|115386818|ref|XP_001209950.1| nitrogen regulatory protein areA [Aspergillus terreus NIH2624]
gi|114190948|gb|EAU32648.1| nitrogen regulatory protein areA [Aspergillus terreus NIH2624]
Length = 817
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 616 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 665
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 616 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 653
>gi|322701406|gb|EFY93156.1| nitrogen response regulator [Metarhizium acridum CQMa 102]
Length = 961
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 680 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 729
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 23/43 (53%), Positives = 29/43 (67%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 675 GAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 717
>gi|390516511|emb|CCE73649.1| GATA-type transcription factor, iron regulation [Fusarium
fujikuroi]
Length = 555
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 31/62 (50%), Positives = 39/62 (62%), Gaps = 10/62 (16%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 255 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVH----------RPMTMKKSTIKRR 304
Query: 123 KR 124
KR
Sbjct: 305 KR 306
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 36/51 (70%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 257 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTMKKSTIKRRKRI 307
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 29/70 (41%), Positives = 38/70 (54%), Gaps = 3/70 (4%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G C+NCGT T LWRR+ G +CNACGLY K N +RP +++ P + D +T
Sbjct: 99 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLKR---PPNVVPADPPRT 155
Query: 122 RKRKPKKQGG 131
KP G
Sbjct: 156 TAPKPSAVAG 165
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 26/41 (63%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
+G+ C NCG TPLWRR G +CNACGLY K +RP
Sbjct: 98 QGQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRP 138
>gi|226293134|gb|EEH48554.1| nitrogen regulatory protein areA [Paracoccidioides brasiliensis
Pb18]
Length = 993
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 764 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 813
Score = 55.5 bits (132), Expect = 6e-06, Method: Composition-based stats.
Identities = 24/45 (53%), Positives = 30/45 (66%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
Q G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 757 QSGAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 801
>gi|429854610|gb|ELA29612.1| nitrogen response regulator [Colletotrichum gloeosporioides Nara
gc5]
Length = 912
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 32/75 (42%), Positives = 45/75 (60%), Gaps = 6/75 (8%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRP------LAMRKDG 118
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V RPL+++ ++ ++ G
Sbjct: 665 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRNRGSGASLPVGG 724
Query: 119 IQTRKRKPKKQGGGG 133
TR +K G G
Sbjct: 725 TSTRSKKNAANSGVG 739
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 3/46 (6%)
Query: 6 GEGRE---CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
GEG C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 657 GEGNTPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 702
>gi|50552360|ref|XP_503590.1| YALI0E05555p [Yarrowia lipolytica]
gi|49649459|emb|CAG79171.1| YALI0E05555p [Yarrowia lipolytica CLIB122]
Length = 805
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 10/66 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR++ G +CNACGLYY+LH V+ RP+ M+K+ I+ R
Sbjct: 395 IACQNCGTTITPLWRRDDSGSTICNACGLYYRLHGVH----------RPVRMKKNMIKRR 444
Query: 123 KRKPKK 128
KR +K
Sbjct: 445 KRVLEK 450
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/50 (50%), Positives = 36/50 (72%), Gaps = 1/50 (2%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG TPLWRRD +G +CNACGLY++++G++RP V+ K + R+
Sbjct: 397 CQNCGTTITPLWRRDDSGSTICNACGLYYRLHGVHRP-VRMKKNMIKRRK 445
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/51 (47%), Positives = 32/51 (62%)
Query: 51 PSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
P ++ A+ C+NCGT T LWRR +G +CNACGLY K N++RP
Sbjct: 214 PKRKKKASVITAQVCSNCGTTRTPLWRRAPNGATICNACGLYLKARNISRP 264
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 25/48 (52%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLT 56
C NCG TPLWRR G +CNACGLY K ++RP L KP T
Sbjct: 228 CSNCGTTRTPLWRRAPNGATICNACGLYLKARNISRPTHLEKPPATAT 275
>gi|302496671|ref|XP_003010336.1| siderophore transcription factor SreA [Arthroderma benhamiae CBS
112371]
gi|291173879|gb|EFE29696.1| siderophore transcription factor SreA [Arthroderma benhamiae CBS
112371]
Length = 542
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 35/46 (76%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
C NCGT +T LWRR++ G +CNACGLYY+LH +RP+AM+ ++
Sbjct: 250 CQNCGTTLTPLWRRDDQGNTICNACGLYYRLHGSHRPVAMKKTVIK 295
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 36/56 (64%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
G + C NCG TPLWRRD G+ +CNACGLY++++G +RP+ + +R+
Sbjct: 245 GMPKACQNCGTTLTPLWRRDDQGNTICNACGLYYRLHGSHRPVAMKKTVIKRRKRV 300
>gi|217337439|gb|ACK43087.1| major nitrogen regulator element [Tuber borchii]
Length = 931
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 33/73 (45%), Positives = 42/73 (57%), Gaps = 10/73 (13%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 700 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 749
Query: 125 KPKKQGGGGGGGS 137
GG S
Sbjct: 750 GSGSSLPVGGSAS 762
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 700 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 737
>gi|119479079|ref|XP_001259568.1| siderophore transcription factor SreA [Neosartorya fischeri NRRL
181]
gi|119407722|gb|EAW17671.1| siderophore transcription factor SreA [Neosartorya fischeri NRRL
181]
Length = 546
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/62 (50%), Positives = 38/62 (61%), Gaps = 10/62 (16%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ +G P+CNACGLYYKLH RP M+K I+ R
Sbjct: 251 VSCQNCGTTVTPLWRRDENGHPICNACGLYYKLHGC----------YRPTTMKKSVIKRR 300
Query: 123 KR 124
KR
Sbjct: 301 KR 302
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 28/39 (71%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 253 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPTT 291
Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 30/44 (68%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
T LG C+NCGT+ T LWRR+ G +CNACGLY K NV RP
Sbjct: 96 TSFLGHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 139
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 139
>gi|384489884|gb|EIE81106.1| hypothetical protein RO3G_05811 [Rhizopus delemar RA 99-880]
Length = 234
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/67 (46%), Positives = 45/67 (67%), Gaps = 6/67 (8%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ--TR 122
C NC T LWRR++DG +CNACGLYYKLH+V+RPL+++ R + R+ +Q
Sbjct: 19 CYNCDATTTPLWRRDDDGNTICNACGLYYKLHHVHRPLSLK----RNVIHRRKRVQMIQH 74
Query: 123 KRKPKKQ 129
+R P++Q
Sbjct: 75 RRIPEEQ 81
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 23/38 (60%), Positives = 31/38 (81%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC A +TPLWRRD G+ +CNACGLY+K++ ++RPL
Sbjct: 19 CYNCDATTTPLWRRDDDGNTICNACGLYYKLHHVHRPL 56
>gi|342881901|gb|EGU82684.1| hypothetical protein FOXB_06796 [Fusarium oxysporum Fo5176]
Length = 555
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 31/62 (50%), Positives = 39/62 (62%), Gaps = 10/62 (16%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 255 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVH----------RPMTMKKSTIKRR 304
Query: 123 KR 124
KR
Sbjct: 305 KR 306
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 36/51 (70%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 257 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPMTMKKSTIKRRKRI 307
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 23/44 (52%), Positives = 30/44 (68%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR+ G +CNACGLY K N +RP +++
Sbjct: 99 GQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRPTSLK 142
Score = 52.4 bits (124), Expect = 7e-05, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 26/41 (63%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
+G+ C NCG TPLWRR G +CNACGLY K +RP
Sbjct: 98 QGQVCSNCGTTRTPLWRRSPQGATICNACGLYLKARNASRP 138
>gi|452987950|gb|EME87705.1| GLN3p transcription factor [Pseudocercospora fijiensis CIRAD86]
Length = 876
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 32/84 (38%), Positives = 45/84 (53%), Gaps = 10/84 (11%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR----------PLAM 114
CTNC T+ T LWRRN +G P+CNACGL+ KLH V RPL+++ ++ P+
Sbjct: 630 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPLSLKTDVIKKRNRGSGNSVPVGT 689
Query: 115 RKDGIQTRKRKPKKQGGGGGGGSG 138
+ ++ Q G G SG
Sbjct: 690 SRSKKAASRKNSVAQPGSGQASSG 713
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 630 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 667
>gi|189206315|ref|XP_001939492.1| GATA factor SREP [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187975585|gb|EDU42211.1| GATA factor SREP [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 443
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 31/60 (51%), Positives = 39/60 (65%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C NCGT +T LWRR++ G +CNACGLYYKLH + RP+AM+K I+ RKR
Sbjct: 210 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTH----------RPVAMKKQEIKRRKR 259
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 35/51 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD GH +CNACGLY+K++G +RP+ + + +R+
Sbjct: 210 CQNCGTTITPLWRRDDAGHIICNACGLYYKLHGTHRPVAMKKQEIKRRKRV 260
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 24/44 (54%), Positives = 30/44 (68%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NC T T LWRR+ GE VCNACGLY K N +RP+ ++
Sbjct: 76 GQVCSNCKTTQTPLWRRSPSGETVCNACGLYMKARNQSRPVNLK 119
Score = 52.8 bits (125), Expect = 4e-05, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 33/54 (61%), Gaps = 2/54 (3%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
G+ C NC TPLWRR +G +CNACGLY K +RP+ KR T+T+ L
Sbjct: 76 GQVCSNCKTTQTPLWRRSPSGETVCNACGLYMKARNQSRPV--NLKRNTSTQIL 127
>gi|156049633|ref|XP_001590783.1| hypothetical protein SS1G_08523 [Sclerotinia sclerotiorum 1980]
gi|154692922|gb|EDN92660.1| hypothetical protein SS1G_08523 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 749
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 34/69 (49%), Positives = 42/69 (60%), Gaps = 11/69 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 273 VACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVH----------RPVTMKKSIIKRR 322
Query: 123 KR-KPKKQG 130
KR P QG
Sbjct: 323 KRVVPAVQG 331
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 36/51 (70%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 275 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKSIIKRRKRV 325
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 29/70 (41%), Positives = 35/70 (50%), Gaps = 4/70 (5%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQT 121
G C+NCGT T LWRR+ G +CNACGLY K N +RP ++ RP G Q
Sbjct: 113 GQICSNCGTTRTPLWRRSPQGATICNACGLYQKARNASRPTNLK----RPPTTAPIGHQG 168
Query: 122 RKRKPKKQGG 131
R G
Sbjct: 169 DSRASPSASG 178
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 23/44 (52%), Positives = 26/44 (59%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
Q G+ C NCG TPLWRR G +CNACGLY K +RP
Sbjct: 109 QIQTGQICSNCGTTRTPLWRRSPQGATICNACGLYQKARNASRP 152
>gi|453088528|gb|EMF16568.1| hypothetical protein SEPMUDRAFT_34698 [Mycosphaerella populorum
SO2202]
Length = 957
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 39/60 (65%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 714 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 763
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ GH LCNACGL+ K++G+ RPL
Sbjct: 714 CTNCFTQTTPLWRRNPEGHPLCNACGLFLKLHGVVRPL 751
>gi|346323174|gb|EGX92772.1| GATA factor SREP [Cordyceps militaris CM01]
Length = 533
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 58/111 (52%), Gaps = 11/111 (9%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCGT 70
C NCG TPLWRRDG GH +CNACGLY++++G++RP+ K+ T RR + +N
Sbjct: 242 CQNCGTTITPLWRRDGNGHVICNACGLYYRLHGVHRPVTM--KKATIKRRKRVIPSNQEE 299
Query: 71 R----MTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
+ R+N+ P G ++N L +RH P+ M D
Sbjct: 300 EGEDGDVAMDSRSNETTP---ERGTVNDDGSIN--LGIRHKPEDPVPMDLD 345
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 36/48 (75%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ +G +CNACGLYY+LH V+RP+ M+ ++
Sbjct: 240 IACQNCGTTITPLWRRDGNGHVICNACGLYYRLHGVHRPVTMKKATIK 287
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLTA 57
C NCG TPLWRR G +CNACGLY + RP L KP K ++A
Sbjct: 87 CSNCGTTETPLWRRSPQGATICNACGLYLRARNSARPTNLKKPPKVVSA 135
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 38/74 (51%), Gaps = 6/74 (8%)
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRH----LAMRP--LAMRKD 117
C+NCGT T LWRR+ G +CNACGLY + N RP ++ ++ P + K
Sbjct: 86 VCSNCGTTETPLWRRSPQGATICNACGLYLRARNSARPTNLKKPPKVVSAEPPRASTPKA 145
Query: 118 GIQTRKRKPKKQGG 131
G T+ PK G
Sbjct: 146 GGSTQDSTPKVAGA 159
>gi|312081468|ref|XP_003143041.1| GATA zinc finger family protein [Loa loa]
gi|307761798|gb|EFO21032.1| GATA zinc finger family protein [Loa loa]
Length = 366
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 40/72 (55%), Gaps = 10/72 (13%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
C+NC T TT WRR+ G+ VCNACGLYY+LH N RP+ MRKD IQ R R
Sbjct: 251 CSNCLTTKTTAWRRDQTGKLVCNACGLYYRLHRTN----------RPVHMRKDIIQQRFR 300
Query: 125 KPKKQGGGGGGG 136
+ K+
Sbjct: 301 RRVKEEDAASAN 312
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 26/38 (68%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC T WRRD TG +CNACGLY++++ NRP+
Sbjct: 251 CSNCLTTKTTAWRRDQTGKLVCNACGLYYRLHRTNRPV 288
>gi|242220478|ref|XP_002476005.1| hypothetical transcription factor [Postia placenta Mad-698-R]
gi|220724794|gb|EED78814.1| hypothetical transcription factor [Postia placenta Mad-698-R]
Length = 1062
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 931 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 980
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 931 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 968
>gi|426197491|gb|EKV47418.1| hypothetical protein AGABI2DRAFT_221510 [Agaricus bisporus var.
bisporus H97]
Length = 894
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T T LWRR+ +G+P+CNACGL+YKLH V +RPL+++ D I+ R R
Sbjct: 763 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGV----------VRPLSLKTDVIKKRNR 812
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 23/38 (60%), Positives = 29/38 (76%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRRD G LCNACGL++K++G+ RPL
Sbjct: 763 CTNCQTTNTPLWRRDPEGQPLCNACGLFYKLHGVVRPL 800
>gi|367042150|ref|XP_003651455.1| hypothetical protein THITE_37279 [Thielavia terrestris NRRL 8126]
gi|346998717|gb|AEO65119.1| hypothetical protein THITE_37279 [Thielavia terrestris NRRL 8126]
Length = 988
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 34/87 (39%), Positives = 47/87 (54%), Gaps = 23/87 (26%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 722 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 771
Query: 125 -------------KPKKQGGGGGGGSG 138
+ +K G G G+
Sbjct: 772 GSGTGLGVGGTSTRSRKNAGSSGSGTA 798
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 722 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 759
>gi|190348546|gb|EDK41015.2| hypothetical protein PGUG_05113 [Meyerozyma guilliermondii ATCC
6260]
Length = 429
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 44/78 (56%), Gaps = 11/78 (14%)
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM---------RHLAMRP 111
+ + C NCGT +T LWRR++ G +CNACGLYYKLH +RP+ M RHL
Sbjct: 134 MAIACYNCGTTITPLWRRDDAGNTICNACGLYYKLHKSHRPVKMKRTTIKRRKRHLNETT 193
Query: 112 LAMRKDGIQTRKRKPKKQ 129
L+ + R KPKK
Sbjct: 194 LSTASE--NNRIEKPKKS 209
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 37/66 (56%), Gaps = 2/66 (3%)
Query: 1 MDFQFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLTAT 58
M G++C NCG TPLWRR G +CNACGLY++ N +RP L +P + +
Sbjct: 1 MSVDALPGQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNSHRPVNLKRPPNTIPVS 60
Query: 59 RRLGLC 64
+ G C
Sbjct: 61 SKQGSC 66
Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 34/50 (68%), Gaps = 2/50 (4%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRR 60
C NCG TPLWRRD G+ +CNACGLY+K++ +RP+ KR T RR
Sbjct: 138 CYNCGTTITPLWRRDDAGNTICNACGLYYKLHKSHRPVKM--KRTTIKRR 185
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 32/44 (72%)
Query: 62 GLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
G C+NCGT T LWRR DG +CNACGLYY+ +N +RP+ ++
Sbjct: 8 GQQCSNCGTTKTPLWRRAPDGTLICNACGLYYRSNNSHRPVNLK 51
>gi|212529674|ref|XP_002144994.1| siderophore transcription factor SreA [Talaromyces marneffei ATCC
18224]
gi|210074392|gb|EEA28479.1| siderophore transcription factor SreA [Talaromyces marneffei ATCC
18224]
Length = 485
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 39/65 (60%), Gaps = 10/65 (15%)
Query: 60 RLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGI 119
L + C NCGT +T LWRR+ +G P+CNACGLYYKLH RP M+K I
Sbjct: 245 ELLVACQNCGTTVTPLWRRDENGHPICNACGLYYKLHGC----------YRPTTMKKSII 294
Query: 120 QTRKR 124
+ RKR
Sbjct: 295 KRRKR 299
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 35/56 (62%), Gaps = 6/56 (10%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKPSKRLTATRR 60
C NCG TPLWRRD GH +CNACGLY+K++G RP ++K KR+ R
Sbjct: 250 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPTTMKKSIIKRRKRVVPAMR 305
Score = 58.5 bits (140), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 31/44 (70%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
T LG C+NCGT+ T LWRR+ G +CNACGLY K NV+RP
Sbjct: 96 TAFLGHSCSNCGTKSTPLWRRSPTGATICNACGLYLKARNVHRP 139
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 27/40 (67%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NCG STPLWRR TG +CNACGLY K ++RP
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGATICNACGLYLKARNVHRP 139
>gi|425771786|gb|EKV10222.1| GATA transcriptional activator AreA [Penicillium digitatum Pd1]
gi|425777069|gb|EKV15259.1| GATA transcriptional activator AreA [Penicillium digitatum PHI26]
Length = 873
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 673 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 722
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 673 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 710
>gi|115401494|ref|XP_001216335.1| GATA factor SREP [Aspergillus terreus NIH2624]
gi|114190276|gb|EAU31976.1| GATA factor SREP [Aspergillus terreus NIH2624]
Length = 536
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 35/50 (70%)
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
L + C NCGT +T LWRR+ +G P+CNACGLYYKLH RP M+ ++
Sbjct: 243 LLVSCQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPSTMKKTIIK 292
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 43/83 (51%), Gaps = 1/83 (1%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAM-RHLAMRPLAMRK 116
T LG C+NCGT+ T LWRR+ G +CNACGLY K NV+RP R A +
Sbjct: 89 TSFLGHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVSRPTKRNRTQASQEPGPSP 148
Query: 117 DGIQTRKRKPKKQGGGGGGGSGG 139
D + + P GG G S G
Sbjct: 149 DQLGPSQSAPIPTTEGGCGSSKG 171
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 247 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRP 283
Score = 55.5 bits (132), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 35/66 (53%), Gaps = 2/66 (3%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTN 67
G C NCG STPLWRR TG +CNACGLY K ++RP + R A++ G
Sbjct: 93 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVSRPTKR--NRTQASQEPGPSPDQ 150
Query: 68 CGTRMT 73
G +
Sbjct: 151 LGPSQS 156
>gi|121713620|ref|XP_001274421.1| siderophore transcription factor SreA [Aspergillus clavatus NRRL 1]
gi|119402574|gb|EAW12995.1| siderophore transcription factor SreA [Aspergillus clavatus NRRL 1]
Length = 561
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 34/48 (70%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ +G P+CNACGLYYKLH RP M+ ++
Sbjct: 259 VACQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPTTMKKTIIK 306
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/63 (49%), Positives = 36/63 (57%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
T LG C+NCGT+ T LWRR+ G +CNACGLY K NV RP L P A +
Sbjct: 106 TSFLGHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTKRNRLEPGPEAAAQQ 165
Query: 118 GIQ 120
G Q
Sbjct: 166 GPQ 168
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 35/56 (62%), Gaps = 6/56 (10%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKPSKRLTATRR 60
C NCG TPLWRRD GH +CNACGLY+K++G RP ++K KR+ R
Sbjct: 261 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPTTMKKTIIKRRKRVVPALR 316
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 110 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 149
>gi|169775323|ref|XP_001822129.1| GATA factor SREP [Aspergillus oryzae RIB40]
gi|83769992|dbj|BAE60127.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 566
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 34/48 (70%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ +G P+CNACGLYYKLH RP M+ ++
Sbjct: 264 VSCQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPTTMKKTIIK 311
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 41/86 (47%), Gaps = 4/86 (4%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
T LG C+NCGT+ T LWRR+ G +CNACGLY K NV RP + P +
Sbjct: 102 TSFLGHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTKRNRVQTSPETTQPP 161
Query: 118 GIQTR----KRKPKKQGGGGGGGSGG 139
+ P GGG GS G
Sbjct: 162 SNPSHPPHDSTAPSSHEGGGCHGSKG 187
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 106 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 145
>gi|354545181|emb|CCE41908.1| hypothetical protein CPAR2_804580 [Candida parapsilosis]
Length = 692
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/48 (58%), Positives = 35/48 (72%)
Query: 55 LTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
L R+ L CTNC TR T LWR++N+G+ +CNACGL+YKLH RPL
Sbjct: 342 LNNGRKKILQCTNCETRTTPLWRKSNNGDLLCNACGLFYKLHGTLRPL 389
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 21/39 (53%), Positives = 28/39 (71%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
+C NC +TPLWR+ G LCNACGL++K++G RPL
Sbjct: 351 QCTNCETRTTPLWRKSNNGDLLCNACGLFYKLHGTLRPL 389
>gi|3182920|sp|P78688.1|AREA_GIBFU RecName: Full=Nitrogen regulatory protein areA
gi|1834451|emb|CAA71897.1| AREA [Fusarium fujikuroi]
Length = 971
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 694 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 743
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 44/148 (29%), Positives = 63/148 (42%), Gaps = 20/148 (13%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL------VKPSKRLTATR-RLGL 63
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL +K R + T +G
Sbjct: 694 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPLSLKTDVIKKRNRGSGTNVPVGG 753
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVN------------RPLAMRHLAMRP 111
T +TL R N + A K ++ N +P + + + P
Sbjct: 754 SSTRSKKTASTLNSRKNSTLSMSTATANSTKPNSSNPTPRVTTPPATSQPPSSKDVD-SP 812
Query: 112 LAMRKDGIQTRKRKPKKQGGGGGGGSGG 139
++ G T P GG G SG
Sbjct: 813 VSGTTSGANTAGSTPNSHFGGPGPSSGA 840
>gi|391873006|gb|EIT82081.1| GATA transcription factor [Aspergillus oryzae 3.042]
Length = 566
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 34/48 (70%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ +G P+CNACGLYYKLH RP M+ ++
Sbjct: 264 VSCQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPTTMKKTIIK 311
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 41/86 (47%), Gaps = 4/86 (4%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
T LG C+NCGT+ T LWRR+ G +CNACGLY K NV RP + P +
Sbjct: 102 TSFLGHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTKRNRVQTSPETTQPP 161
Query: 118 GIQTR----KRKPKKQGGGGGGGSGG 139
+ P GGG GS G
Sbjct: 162 SNPSHPPHDSTAPSSHEGGGCHGSKG 187
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 106 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 145
>gi|299745564|ref|XP_001831801.2| hypothetical protein CC1G_05900 [Coprinopsis cinerea okayama7#130]
gi|298406644|gb|EAU89984.2| hypothetical protein CC1G_05900 [Coprinopsis cinerea okayama7#130]
Length = 500
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 50/102 (49%), Gaps = 12/102 (11%)
Query: 34 ACGLYHKMNGMNRPLVKPSKRLTATRRL--GLCCTNCGTRMTTLWRRNNDGEPVCNACGL 91
A ++ + + P+ R R L C NCGT T LWRR++ G +CNACGL
Sbjct: 86 ASPAANESDASTSAVAAPNARTKGARPAVGALSCANCGTSTTPLWRRDDVGNNICNACGL 145
Query: 92 YYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKRKPKKQGGGG 133
Y+KLH + RP +M+K I+ RKR P G G
Sbjct: 146 YFKLHGTH----------RPNSMKKTVIKRRKRVPAAPGSAG 177
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 36/53 (67%), Gaps = 6/53 (11%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKPSKRLTA 57
C NCG +TPLWRRD G+ +CNACGLY K++G +RP ++K KR+ A
Sbjct: 119 CANCGTSTTPLWRRDDVGNNICNACGLYFKLHGTHRPNSMKKTVIKRRKRVPA 171
>gi|242762467|ref|XP_002340383.1| siderophore transcription factor SreA [Talaromyces stipitatus ATCC
10500]
gi|218723579|gb|EED22996.1| siderophore transcription factor SreA [Talaromyces stipitatus ATCC
10500]
Length = 493
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 32/64 (50%), Positives = 39/64 (60%), Gaps = 10/64 (15%)
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
L + C NCGT +T LWRR+ +G P+CNACGLYYKLH RP M+K I+
Sbjct: 252 LLVACQNCGTTVTPLWRRDENGHPICNACGLYYKLHGC----------YRPTTMKKSIIK 301
Query: 121 TRKR 124
RKR
Sbjct: 302 RRKR 305
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 35/56 (62%), Gaps = 6/56 (10%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP------LVKPSKRLTATRR 60
C NCG TPLWRRD GH +CNACGLY+K++G RP ++K KR+ R
Sbjct: 256 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPTTMKKSIIKRRKRVVPAMR 311
Score = 58.5 bits (140), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 31/44 (70%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
T LG C+NCGT+ T LWRR+ G +CNACGLY K NV+RP
Sbjct: 99 TAFLGHSCSNCGTKSTPLWRRSPTGATICNACGLYLKARNVHRP 142
Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 27/40 (67%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NCG STPLWRR TG +CNACGLY K ++RP
Sbjct: 103 GHSCSNCGTKSTPLWRRSPTGATICNACGLYLKARNVHRP 142
>gi|397530026|gb|AFO53621.1| nitrogen regulatory protein AreA [Beauveria bassiana]
Length = 455
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 40/61 (65%), Gaps = 10/61 (16%)
Query: 64 CCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRK 123
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R
Sbjct: 210 TCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRN 259
Query: 124 R 124
R
Sbjct: 260 R 260
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 29/43 (67%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 206 GNPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 248
>gi|5902804|sp|O13508.2|AREA_PENRO RecName: Full=Nitrogen regulatory protein areA; AltName:
Full=Nitrogen regulator nmc
gi|3702623|emb|CAA04815.1| nitrogen regulator [Penicillium roqueforti]
Length = 860
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 660 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 709
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 660 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 697
>gi|238878432|gb|EEQ42070.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 470
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 26/40 (65%), Positives = 32/40 (80%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPL 102
L CTNC TR T LWR+ N+G+ +CNACGL+YKLH V RP+
Sbjct: 252 LQCTNCQTRTTPLWRKANNGDLLCNACGLFYKLHGVLRPI 291
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 26/71 (36%), Positives = 40/71 (56%)
Query: 10 ECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRLGLCCTNCG 69
+C NC +TPLWR+ G LCNACGL++K++G+ RP+ KR R + T+
Sbjct: 253 QCTNCQTRTTPLWRKANNGDLLCNACGLFYKLHGVLRPINGDKKRKYTKRNDKMISTDNT 312
Query: 70 TRMTTLWRRNN 80
L ++N+
Sbjct: 313 NIFAGLSQQNH 323
>gi|380489847|emb|CCF36425.1| GATA zinc finger [Colletotrichum higginsianum]
Length = 591
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 33/70 (47%), Positives = 41/70 (58%), Gaps = 10/70 (14%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+ RP+ M+K I+ R
Sbjct: 287 IACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVH----------RPVTMKKSIIKRR 336
Query: 123 KRKPKKQGGG 132
KR GG
Sbjct: 337 KRVIPSSFGG 346
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 36/51 (70%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKPSKRLTATRRL 61
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+ + +R+
Sbjct: 289 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVTMKKSIIKRRKRV 339
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/60 (43%), Positives = 37/60 (61%)
Query: 46 RPLVKPSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
RP P+ +++ + G C+NCGT T LWRR+ G +CNACGLY K N +RP ++
Sbjct: 103 RPSPAPTTASSSSGQTGQVCSNCGTTRTPLWRRSPQGTTICNACGLYQKARNASRPTNLK 162
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP--LVKPSKRLTATRR 60
G+ C NCG TPLWRR G +CNACGLY K +RP L KP + A R
Sbjct: 119 GQVCSNCGTTRTPLWRRSPQGTTICNACGLYQKARNASRPTNLKKPPHLVAAAPR 173
>gi|238496045|ref|XP_002379258.1| siderophore transcription factor SreA [Aspergillus flavus NRRL3357]
gi|220694138|gb|EED50482.1| siderophore transcription factor SreA [Aspergillus flavus NRRL3357]
Length = 559
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 34/48 (70%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMR 110
+ C NCGT +T LWRR+ +G P+CNACGLYYKLH RP M+ ++
Sbjct: 257 VSCQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPTTMKKTIIK 304
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 41/86 (47%), Gaps = 4/86 (4%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKD 117
T LG C+NCGT+ T LWRR+ G +CNACGLY K NV RP + P +
Sbjct: 95 TSFLGHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARPTKRNRVQTSPETTQPP 154
Query: 118 GIQTR----KRKPKKQGGGGGGGSGG 139
+ P GGG GS G
Sbjct: 155 SNPSHPPHDSTAPSSHEGGGCHGSKG 180
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 99 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 138
>gi|303312759|ref|XP_003066391.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
gi|240106053|gb|EER24246.1| GATA family transcription factor [Coccidioides posadasii C735 delta
SOWgp]
gi|320032224|gb|EFW14179.1| GATA factor SREP [Coccidioides posadasii str. Silveira]
Length = 584
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 40/64 (62%), Gaps = 10/64 (15%)
Query: 61 LGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQ 120
+ C NCGT +T LWRR++ G P+CNACGLY++LH RP+AM+K I+
Sbjct: 264 VPTACQNCGTTVTPLWRRDDQGHPICNACGLYFRLHGC----------ARPVAMKKSIIK 313
Query: 121 TRKR 124
RKR
Sbjct: 314 RRKR 317
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 32/52 (61%), Gaps = 3/52 (5%)
Query: 1 MDFQFGEG---RECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
+D EG C NCG TPLWRRD GH +CNACGLY +++G RP+
Sbjct: 255 LDVPNQEGGVPTACQNCGTTVTPLWRRDDQGHPICNACGLYFRLHGCARPVA 306
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/65 (46%), Positives = 35/65 (53%), Gaps = 3/65 (4%)
Query: 40 KMNGMNRPLVK---PSKRLTATRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLH 96
+ N NRP P R LG C NCGT+ T LWRR +G +CNACGLY K
Sbjct: 81 ESNDENRPNASRQSPKSRGKEQVFLGHSCVNCGTKRTPLWRRAPNGSTICNACGLYLKAR 140
Query: 97 NVNRP 101
N +RP
Sbjct: 141 NADRP 145
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 26/44 (59%)
Query: 4 QFGEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
Q G CVNCG TPLWRR G +CNACGLY K +RP
Sbjct: 102 QVFLGHSCVNCGTKRTPLWRRAPNGSTICNACGLYLKARNADRP 145
>gi|70997557|ref|XP_753523.1| siderophore transcription factor SreA [Aspergillus fumigatus Af293]
gi|66851159|gb|EAL91485.1| siderophore transcription factor SreA [Aspergillus fumigatus Af293]
gi|159126746|gb|EDP51862.1| siderophore transcription factor SreA [Aspergillus fumigatus A1163]
Length = 546
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/62 (50%), Positives = 38/62 (61%), Gaps = 10/62 (16%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ +G P+CNACGLYYKLH RP M+K I+ R
Sbjct: 251 VSCQNCGTTVTPLWRRDENGHPICNACGLYYKLHGC----------YRPTTMKKSIIKRR 300
Query: 123 KR 124
KR
Sbjct: 301 KR 302
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 28/39 (71%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
C NCG TPLWRRD GH +CNACGLY+K++G RP
Sbjct: 253 CQNCGTTVTPLWRRDENGHPICNACGLYYKLHGCYRPTT 291
Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 30/44 (68%)
Query: 58 TRRLGLCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRP 101
T LG C+NCGT+ T LWRR+ G +CNACGLY K NV RP
Sbjct: 96 TSFLGHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 139
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 26/40 (65%)
Query: 8 GRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
G C NCG STPLWRR TG +CNACGLY K + RP
Sbjct: 100 GHSCSNCGTKSTPLWRRSPTGAMICNACGLYLKARNVARP 139
>gi|400600701|gb|EJP68369.1| Zinc finger, GATA-type [Beauveria bassiana ARSEF 2860]
Length = 838
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 594 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 643
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 23/43 (53%), Positives = 29/43 (67%)
Query: 6 GEGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
G C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 589 GNPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 631
>gi|358385091|gb|EHK22688.1| hypothetical protein TRIVIDRAFT_113059, partial [Trichoderma virens
Gv29-8]
Length = 950
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 685 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 734
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 685 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 722
>gi|310791188|gb|EFQ26717.1| GATA zinc finger [Glomerella graminicola M1.001]
Length = 956
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 35/86 (40%), Positives = 46/86 (53%), Gaps = 23/86 (26%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 679 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 728
Query: 125 -------------KPKKQGGGGGGGS 137
+ KK G GS
Sbjct: 729 GSGASLPVGGTSTRSKKSAGASASGS 754
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 3/46 (6%)
Query: 6 GEGRE---CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
GEG C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 671 GEGNAPTTCTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 716
>gi|13620173|emb|CAC36427.1| GATA transcription factor [Botryotinia fuckeliana]
Length = 388
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 42/70 (60%), Gaps = 11/70 (15%)
Query: 63 LCCTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTR 122
+ C NCGT +T LWRR+ G +CNACGLYYKLH V+RP + M+K I+ R
Sbjct: 275 VACQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRP----------VTMKKSIIKRR 324
Query: 123 KR-KPKKQGG 131
KR P QG
Sbjct: 325 KRVVPAVQGS 334
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 32/39 (82%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLV 49
C NCG TPLWRRD +GH +CNACGLY+K++G++RP+
Sbjct: 277 CQNCGTTITPLWRRDESGHTICNACGLYYKLHGVHRPVT 315
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 26/39 (66%)
Query: 67 NCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMR 105
NCGT T LWRR+ G +CNACGLY K N +RP ++
Sbjct: 120 NCGTTRTPLWRRSPQGATICNACGLYQKARNASRPTNLK 158
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%)
Query: 7 EGRECVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRP 47
+ + NCG TPLWRR G +CNACGLY K +RP
Sbjct: 114 QSNQICNCGTTRTPLWRRSPQGATICNACGLYQKARNASRP 154
>gi|89078359|gb|ABD60578.1| GATA transcription factor FNR1 [Fusarium oxysporum f. sp.
lycopersici]
Length = 906
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 40/60 (66%), Gaps = 10/60 (16%)
Query: 65 CTNCGTRMTTLWRRNNDGEPVCNACGLYYKLHNVNRPLAMRHLAMRPLAMRKDGIQTRKR 124
CTNC T+ T LWRRN +G+P+CNACGL+ KLH V +RPL+++ D I+ R R
Sbjct: 629 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGV----------VRPLSLKTDVIKKRNR 678
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CVNCGAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPL 48
C NC +TPLWRR+ G LCNACGL+ K++G+ RPL
Sbjct: 629 CTNCFTQTTPLWRRNPEGQPLCNACGLFLKLHGVVRPL 666
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.140 0.475
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,553,589,504
Number of Sequences: 23463169
Number of extensions: 102452503
Number of successful extensions: 793129
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1630
Number of HSP's successfully gapped in prelim test: 1168
Number of HSP's that attempted gapping in prelim test: 784183
Number of HSP's gapped (non-prelim): 6645
length of query: 139
length of database: 8,064,228,071
effective HSP length: 104
effective length of query: 35
effective length of database: 9,919,025,791
effective search space: 347165902685
effective search space used: 347165902685
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 71 (32.0 bits)