BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 033001
         (129 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9FJ10|GAT16_ARATH GATA transcription factor 16 OS=Arabidopsis thaliana GN=GATA16 PE=2
           SV=1
          Length = 139

 Score =  107 bits (268), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 63/140 (45%), Positives = 83/140 (59%), Gaps = 22/140 (15%)

Query: 1   MDVKTKRREAEEMMKSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRK 60
           +D +T +  AE+M++     + N+  K+C DC T++TPLWRGGP GP+SLCNACGIR RK
Sbjct: 11  VDSETMKTRAEDMIEQNNT-SVNDKKKTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRK 69

Query: 61  TKKLALLGRDKGRAQKRKRKYSSNNNNNKGATKLGISLKAGLMAVG-----------SDM 109
                   R  G    +K K SS+   N+   K G SLK  LM +G             +
Sbjct: 70  K-------RRGGTEDNKKLKKSSSGGGNR---KFGESLKQSLMDLGIRKRSTVEKQRQKL 119

Query: 110 GEEEQAAILLMSLSYGCLYA 129
           GEEEQAA+LLM+LSYG +YA
Sbjct: 120 GEEEQAAVLLMALSYGSVYA 139


>sp|Q8LG10|GAT15_ARATH GATA transcription factor 15 OS=Arabidopsis thaliana GN=GATA15 PE=2
           SV=2
          Length = 149

 Score = 94.4 bits (233), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 56/119 (47%), Positives = 70/119 (58%), Gaps = 26/119 (21%)

Query: 27  KSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKLALLGRDKGRAQKRKRKYSSNNN 86
           KSC  C T++TPLWRGGPAGP+SLCNACGIR RK ++  +  R +      K+K S N N
Sbjct: 41  KSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSED-----KKKKSHNRN 95

Query: 87  NNKGATKLGISLKAGLMAVGSD---------------MGEEEQAAILLMSLSYG-CLYA 129
                 K G SLK  LM +G +               +GEEEQAA+LLM+LSY   +YA
Sbjct: 96  -----PKFGDSLKQRLMELGREVMMQRSTAENQRRNKLGEEEQAAVLLMALSYASSVYA 149


>sp|Q9LIB5|GAT17_ARATH GATA transcription factor 17 OS=Arabidopsis thaliana GN=GATA17 PE=2
           SV=1
          Length = 190

 Score = 83.2 bits (204), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 56/152 (36%), Positives = 77/152 (50%), Gaps = 46/152 (30%)

Query: 24  EMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKLALLGRDKGRAQKRK----- 78
           +  ++C+DC T RTPLWRGGPAGP+SLCNACGI+ RK ++ AL  R + + + RK     
Sbjct: 39  DTKRTCVDCGTIRTPLWRGGPAGPKSLCNACGIKSRKKRQAALGMRSEEKKKNRKSNCNN 98

Query: 79  ---------RKYSSN----------------NNN-------NKGATK-LGISLKAGLMA- 104
                    +KY  N                NN        NKG +K L +  K  +M  
Sbjct: 99  DLNLDHRNAKKYKINIVDDGKIDIDDDPKICNNKRSSSSSSNKGVSKFLDLGFKVPVMKR 158

Query: 105 -------VGSDMGEEEQAAILLMSLSYGCLYA 129
                  +   +GEEE+AA+LLM+LS   +YA
Sbjct: 159 SAVEKKRLWRKLGEEERAAVLLMALSCSSVYA 190


>sp|Q8LC59|GAT23_ARATH GATA transcription factor 23 OS=Arabidopsis thaliana GN=GATA23
          PE=2 SV=2
          Length = 120

 Score = 69.7 bits (169), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 29/48 (60%), Positives = 37/48 (77%)

Query: 29 CIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKLALLGRDKGRAQK 76
          C +C TT+TP+WRGGP GP+SLCNACGIR+RK ++  LLG    R+ K
Sbjct: 28 CSECKTTKTPMWRGGPTGPKSLCNACGIRHRKQRRSELLGIHIIRSHK 75


>sp|Q5HZ36|GAT21_ARATH GATA transcription factor 21 OS=Arabidopsis thaliana GN=GATA21 PE=1
           SV=2
          Length = 398

 Score = 65.5 bits (158), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 26/38 (68%), Positives = 31/38 (81%)

Query: 23  NEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRK 60
           N + + C DC+TT+TPLWR GP GP+SLCNACGIR RK
Sbjct: 226 NGVIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRK 263


>sp|Q6QPM2|GAT19_ARATH GATA transcription factor 19 OS=Arabidopsis thaliana GN=GATA19 PE=2
           SV=2
          Length = 211

 Score = 63.2 bits (152), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 26/49 (53%), Positives = 35/49 (71%)

Query: 23  NEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKLALLGRDK 71
           N + + C +C TT TPLWR GP GP+SLCNACGIR++K ++ A   R+ 
Sbjct: 71  NLLARRCANCDTTSTPLWRNGPRGPKSLCNACGIRFKKEERRASTARNS 119


>sp|Q9ZPX0|GAT20_ARATH GATA transcription factor 20 OS=Arabidopsis thaliana GN=GATA20 PE=2
           SV=2
          Length = 208

 Score = 62.0 bits (149), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 25/46 (54%), Positives = 33/46 (71%)

Query: 20  GTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKLA 65
           G  + + + C  C TT TPLWR GP GP+SLCNACGIR++K ++ A
Sbjct: 85  GVAHSLPRRCASCDTTSTPLWRNGPKGPKSLCNACGIRFKKEERRA 130


>sp|Q9SZI6|GAT22_ARATH Putative GATA transcription factor 22 OS=Arabidopsis thaliana
           GN=GATA22 PE=2 SV=1
          Length = 352

 Score = 62.0 bits (149), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 23/29 (79%), Positives = 26/29 (89%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIR 57
           C DC+TT+TPLWR GP GP+SLCNACGIR
Sbjct: 201 CSDCNTTKTPLWRSGPRGPKSLCNACGIR 229


>sp|Q8LC79|GAT18_ARATH GATA transcription factor 18 OS=Arabidopsis thaliana GN=GATA18 PE=2
           SV=2
          Length = 295

 Score = 59.7 bits (143), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 29/35 (82%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKK 63
           C +C TT TPLWR GP GP+SLCNACGIR++K ++
Sbjct: 154 CANCDTTSTPLWRNGPRGPKSLCNACGIRFKKEER 188


>sp|Q54NM5|GTAL_DICDI GATA zinc finger domain-containing protein 12 OS=Dictyostelium
           discoideum GN=gtaL PE=4 SV=1
          Length = 640

 Score = 58.9 bits (141), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 33/49 (67%)

Query: 11  EEMMKSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           E M+++        +++ C++C T+ TP WR GP G ++LCNACGIRYR
Sbjct: 488 ENMIRAQTKKQKKTISRVCVNCKTSDTPEWRRGPQGAKTLCNACGIRYR 536


>sp|Q6DBP8|GAT11_ARATH GATA transcription factor 11 OS=Arabidopsis thaliana GN=GATA11 PE=2
           SV=1
          Length = 303

 Score = 58.5 bits (140), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 20/33 (60%), Positives = 27/33 (81%)

Query: 27  KSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           + C  C TT+TP WR GP+GP++LCNACG+R+R
Sbjct: 220 RKCTHCETTKTPQWREGPSGPKTLCNACGVRFR 252


>sp|Q75JZ0|GTAH_DICDI GATA zinc finger domain-containing protein 8 OS=Dictyostelium
           discoideum GN=gtaH PE=4 SV=1
          Length = 519

 Score = 56.6 bits (135), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 33/60 (55%), Gaps = 11/60 (18%)

Query: 4   KTKRREAEEMMKSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKK 63
           + KRREA  ++           N  C +C TT TP WR GP G +SLCNACG+ Y K  K
Sbjct: 448 REKRREASRLL-----------NNVCRNCKTTETPEWRKGPDGTKSLCNACGLHYAKNVK 496


>sp|P69781|GAT12_ARATH GATA transcription factor 12 OS=Arabidopsis thaliana GN=GATA12 PE=2
           SV=1
          Length = 331

 Score = 56.6 bits (135), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 23/46 (50%), Positives = 32/46 (69%), Gaps = 2/46 (4%)

Query: 14  MKSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           + SP +G   E  + C+ C T +TP WR GP GP++LCNACG+RY+
Sbjct: 208 VSSPESGGAEE--RRCLHCATDKTPQWRTGPMGPKTLCNACGVRYK 251


>sp|Q55GK0|GTAE_DICDI GATA zinc finger domain-containing protein 5 OS=Dictyostelium
           discoideum GN=gtaE PE=4 SV=1
          Length = 952

 Score = 55.8 bits (133), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 21/36 (58%), Positives = 26/36 (72%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKL 64
           C  C+T+ TP WR GP GP +LCNACG+ Y K +KL
Sbjct: 241 CYQCNTSNTPEWRKGPEGPATLCNACGLAYAKKQKL 276


>sp|O82632|GATA9_ARATH GATA transcription factor 9 OS=Arabidopsis thaliana GN=GATA9 PE=2
           SV=1
          Length = 308

 Score = 55.8 bits (133), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 25/31 (80%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           C+ C T +TP WR GP GP++LCNACG+RY+
Sbjct: 199 CLHCATEKTPQWRTGPMGPKTLCNACGVRYK 229


>sp|Q8LAU9|GATA1_ARATH GATA transcription factor 1 OS=Arabidopsis thaliana GN=GATA1 PE=2
           SV=2
          Length = 274

 Score = 55.5 bits (132), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 26/35 (74%)

Query: 25  MNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           M + C  C   +TP WR GPAGP++LCNACG+RY+
Sbjct: 192 MGRKCQHCGAEKTPQWRAGPAGPKTLCNACGVRYK 226


>sp|Q9SV30|GATA8_ARATH GATA transcription factor 8 OS=Arabidopsis thaliana GN=GATA8 PE=2
           SV=1
          Length = 322

 Score = 55.1 bits (131), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 26/33 (78%)

Query: 27  KSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           + C+ C  T+TP WR GP GP++LCNACG+RY+
Sbjct: 229 RKCMHCEVTKTPQWRLGPMGPKTLCNACGVRYK 261


>sp|P40209|GAT2_YEAST Protein GAT2 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
           S288c) GN=GAT2 PE=4 SV=1
          Length = 560

 Score = 55.1 bits (131), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 26/61 (42%), Positives = 32/61 (52%), Gaps = 7/61 (11%)

Query: 10  AEEMMKSPPAGTFNEMNKS-------CIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTK 62
           A  +  + PA   +E N +       C  C  T TP WR GP G R+LCNACG+ YRK  
Sbjct: 446 AAAVSTTTPAANSDEKNPNAKKIIEFCFHCGETETPEWRKGPYGTRTLCNACGLFYRKVT 505

Query: 63  K 63
           K
Sbjct: 506 K 506


>sp|Q00858|CGPB_FUSSO Cutinase gene palindrome-binding protein OS=Fusarium solani subsp.
           pisi PE=2 SV=1
          Length = 457

 Score = 54.7 bits (130), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 27/35 (77%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKK 63
           C DC T  +P WR GP+GP++LCNACG+R+ K +K
Sbjct: 402 CTDCGTLDSPEWRKGPSGPKTLCNACGLRWAKKEK 436


>sp|P52172|SRP_DROME Box A-binding factor OS=Drosophila melanogaster GN=srp PE=1 SV=2
          Length = 1264

 Score = 54.7 bits (130), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 27/53 (50%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 28  SCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKLALLGRDKGRAQKRKRK 80
           SC +CHTT T LWR  PAG   +CNACG+ Y+       L   K   QKRKRK
Sbjct: 802 SCSNCHTTHTSLWRRNPAG-EPVCNACGLYYKLHSVPRPLTMKKDTIQKRKRK 853


>sp|Q9M1U2|GAT14_ARATH GATA transcription factor 14 OS=Arabidopsis thaliana GN=GATA14 PE=2
           SV=1
          Length = 204

 Score = 53.9 bits (128), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 23/38 (60%), Positives = 27/38 (71%)

Query: 22  FNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           F   +KSC  C T +TPLWR GP G  +LCNACG+RYR
Sbjct: 110 FGITDKSCSHCGTRKTPLWREGPRGAGTLCNACGMRYR 147


>sp|Q8VZP4|GAT10_ARATH GATA transcription factor 10 OS=Arabidopsis thaliana GN=GATA10 PE=2
           SV=1
          Length = 308

 Score = 53.9 bits (128), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 24/31 (77%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           C  C T  TP WR GP+GP++LCNACG+R++
Sbjct: 220 CTHCETITTPQWRQGPSGPKTLCNACGVRFK 250


>sp|Q9LT45|GAT29_ARATH GATA transcription factor 29 OS=Arabidopsis thaliana GN=GATA29 PE=2
           SV=1
          Length = 208

 Score = 53.5 bits (127), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 19/34 (55%), Positives = 28/34 (82%)

Query: 30  IDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKK 63
           ++C+   TP+WR GP GP+SLCNACGI++RK ++
Sbjct: 162 MNCNALNTPMWRRGPLGPKSLCNACGIKFRKEEE 195


>sp|Q54KX0|GTAN_DICDI GATA zinc finger domain-containing protein 14 OS=Dictyostelium
           discoideum GN=gtaN PE=4 SV=1
          Length = 953

 Score = 53.1 bits (126), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 22/35 (62%), Positives = 25/35 (71%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKK 63
           C  C TT+TP WR GPAG +SLCNACG+ Y K  K
Sbjct: 893 CTSCGTTQTPEWRKGPAGGKSLCNACGLHYAKLMK 927


>sp|Q01371|WC1_NEUCR White collar 1 protein OS=Neurospora crassa (strain ATCC 24698 /
           74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=wc-1
           PE=2 SV=2
          Length = 1167

 Score = 52.8 bits (125), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 26/36 (72%)

Query: 25  MNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRK 60
           M + C +CHT  TP WR GP+G R LCN+CG+R+ K
Sbjct: 930 MVRDCANCHTRNTPEWRRGPSGNRDLCNSCGLRWAK 965


>sp|Q9SKN6|GAT13_ARATH Putative GATA transcription factor 13 OS=Arabidopsis thaliana
           GN=GATA13 PE=3 SV=2
          Length = 291

 Score = 52.8 bits (125), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 20/31 (64%), Positives = 23/31 (74%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           C  C TT TP WR GP G ++LCNACGIR+R
Sbjct: 193 CTHCETTTTPQWREGPNGRKTLCNACGIRFR 223


>sp|Q8L4M6|GATA3_ARATH GATA transcription factor 3 OS=Arabidopsis thaliana GN=GATA3 PE=2
           SV=2
          Length = 269

 Score = 52.4 bits (124), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 29/50 (58%)

Query: 10  AEEMMKSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           A E ++     T     + C  C T  TP WR GP GP++LCNACG+R++
Sbjct: 163 ATEQLRKKKQETVLVFQRRCSHCGTNNTPQWRTGPVGPKTLCNACGVRFK 212


>sp|P78714|WC2_NEUCR White collar 2 protein OS=Neurospora crassa (strain ATCC 24698 /
           74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=wc-2
           PE=1 SV=1
          Length = 530

 Score = 52.4 bits (124), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 18/30 (60%), Positives = 24/30 (80%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRY 58
           C DC T  +P WR GP+GP++LCNACG+R+
Sbjct: 468 CTDCGTLDSPEWRKGPSGPKTLCNACGLRW 497


>sp|O49743|GATA4_ARATH GATA transcription factor 4 OS=Arabidopsis thaliana GN=GATA4 PE=2
           SV=1
          Length = 240

 Score = 52.0 bits (123), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 24/31 (77%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           C  C + +TP WR GP GP++LCNACG+RY+
Sbjct: 160 CTHCASEKTPQWRTGPLGPKTLCNACGVRYK 190


>sp|Q9FH57|GATA5_ARATH GATA transcription factor 5 OS=Arabidopsis thaliana GN=GATA5 PE=2
           SV=1
          Length = 339

 Score = 51.6 bits (122), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 6/58 (10%)

Query: 4   KTKRREAEEMMKSPPAGTFNEMN--KSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           K K+R AE +     +G   ++   + C  C   +TP WR GP G ++LCNACG+RY+
Sbjct: 228 KHKKRSAESVF----SGELQQLQPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYK 281


>sp|O49741|GATA2_ARATH GATA transcription factor 2 OS=Arabidopsis thaliana GN=GATA2 PE=2
           SV=1
          Length = 264

 Score = 51.2 bits (121), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 24/31 (77%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           C  C + +TP WR GP GP++LCNACG+R++
Sbjct: 181 CTHCASEKTPQWRTGPLGPKTLCNACGVRFK 211


>sp|Q5KSV0|GTAK_DICDI GATA zinc finger domain-containing protein 11 OS=Dictyostelium
           discoideum GN=gtaK PE=2 SV=1
          Length = 650

 Score = 50.8 bits (120), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 20/34 (58%), Positives = 24/34 (70%)

Query: 27  KSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRK 60
           K C  C TT +P WR GPAG +SLCNACG+ + K
Sbjct: 520 KQCTSCGTTSSPEWRKGPAGNQSLCNACGLYFAK 553


>sp|Q550D5|GTAA_DICDI Transcription factor stalky OS=Dictyostelium discoideum GN=stkA
           PE=1 SV=1
          Length = 872

 Score = 50.8 bits (120), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 21/52 (40%), Positives = 33/52 (63%)

Query: 27  KSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKLALLGRDKGRAQKRK 78
           +SC  C +++TP WR GP+G  SLCNACGI++R   K  +    + +  ++K
Sbjct: 292 RSCEFCGSSQTPTWRRGPSGKGSLCNACGIKWRLKGKDGIFKPSQKQQNRQK 343


>sp|Q55C49|GTAG_DICDI GATA zinc finger domain-containing protein 7 OS=Dictyostelium
           discoideum GN=gtaG PE=4 SV=1
          Length = 1006

 Score = 50.8 bits (120), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/60 (41%), Positives = 32/60 (53%), Gaps = 9/60 (15%)

Query: 4   KTKRREAEEMMKSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKK 63
           KT RR    + KS            C +C T  TP WR GP+GP +LCNACG+ Y K ++
Sbjct: 826 KTHRRRPANIDKS---------TLYCHNCGTKNTPEWRRGPSGPATLCNACGLAYAKKQR 876


>sp|Q54TE3|GTAJ_DICDI GATA zinc finger domain-containing protein 10 OS=Dictyostelium
           discoideum GN=gtaJ PE=4 SV=1
          Length = 714

 Score = 50.4 bits (119), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 29/86 (33%), Positives = 41/86 (47%), Gaps = 9/86 (10%)

Query: 4   KTKRREAEEMMKSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKK 63
           K +RR    M  S         N  C  C  T TP WR GP G  +LCNACG+ Y K++K
Sbjct: 613 KPQRRRRRTMYSS-------RRNLKCHYCEVTETPEWRRGPDGDHTLCNACGLHYAKSQK 665

Query: 64  LALLGRDKGRAQKRKRKYSSNNNNNK 89
              L R+K   ++++ +      N +
Sbjct: 666 --KLAREKELEKQKELEREKERENTR 689


>sp|Q54HA4|GTAO_DICDI GATA zinc finger domain-containing protein 15 (Fragment)
           OS=Dictyostelium discoideum GN=gtaO PE=4 SV=1
          Length = 511

 Score = 50.4 bits (119), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 32/57 (56%)

Query: 7   RREAEEMMKSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKK 63
           RR  +  +K+    + N     C  C T  +P WR GP G +SLCNACG+ Y KTKK
Sbjct: 431 RRNRKCTIKTKTLQSSNSEEIVCQACGTRASPEWRKGPDGFKSLCNACGLYYAKTKK 487


>sp|B0G188|GTAP_DICDI GATA zinc finger domain-containing protein 16 OS=Dictyostelium
           discoideum GN=gtaP PE=4 SV=1
          Length = 695

 Score = 50.1 bits (118), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%), Gaps = 11/64 (17%)

Query: 28  SCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKT----KKLALLGRDKG-------RAQK 76
           SC  C  T TP WR GP G ++LCNACG+ + K+    K+  LL    G       +AQK
Sbjct: 478 SCHTCGVTNTPEWRRGPNGAKTLCNACGLAWAKSVKSEKQKELLANSTGVNITEPKKAQK 537

Query: 77  RKRK 80
           RK++
Sbjct: 538 RKKE 541


>sp|P43574|GAT1_YEAST Transcriptional regulatory protein GAT1 OS=Saccharomyces cerevisiae
           (strain ATCC 204508 / S288c) GN=GAT1 PE=1 SV=1
          Length = 510

 Score = 49.3 bits (116), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 26/72 (36%), Positives = 37/72 (51%), Gaps = 1/72 (1%)

Query: 16  SPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKLALLGRDKGRAQ 75
           +PP+ T +  +  C +C T+ TPLWR  P G   LCNACG+  +       L       +
Sbjct: 297 TPPSNTSSNPDIKCSNCTTSTTPLWRKDPKG-LPLCNACGLFLKLHGVTRPLSLKTDIIK 355

Query: 76  KRKRKYSSNNNN 87
           KR+R  +  NNN
Sbjct: 356 KRQRSSTKINNN 367


>sp|Q75JZ1|GTAC_DICDI GATA zinc finger domain-containing protein 3 OS=Dictyostelium
           discoideum GN=gtaC PE=4 SV=1
          Length = 587

 Score = 49.3 bits (116), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 20/35 (57%), Positives = 23/35 (65%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKK 63
           CI C T  TP WR GP G ++LCNACG+ Y K  K
Sbjct: 500 CIFCGTMETPEWRKGPGGHKTLCNACGLHYAKNIK 534


>sp|Q9SD38|GATA6_ARATH GATA transcription factor 6 OS=Arabidopsis thaliana GN=GATA6 PE=2
           SV=1
          Length = 312

 Score = 48.5 bits (114), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 25/38 (65%)

Query: 27  KSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKL 64
           + C  C   +TP WR GP G ++LCNACG+RY+  + L
Sbjct: 221 RQCGHCGVQKTPQWRAGPLGAKTLCNACGVRYKSGRLL 258


>sp|Q54TM6|GTAI_DICDI GATA zinc finger domain-containing protein 9 OS=Dictyostelium
           discoideum GN=gtaI PE=4 SV=1
          Length = 536

 Score = 48.5 bits (114), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 22/41 (53%), Positives = 25/41 (60%)

Query: 23  NEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKK 63
           N  +  C  C TT TP WR GP G +SLCNACG+ Y K  K
Sbjct: 473 NHTSLFCRHCGTTDTPEWRRGPDGRKSLCNACGLHYSKLVK 513


>sp|O65515|GATA7_ARATH GATA transcription factor 7 OS=Arabidopsis thaliana GN=GATA7 PE=2
           SV=1
          Length = 238

 Score = 47.4 bits (111), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 27/42 (64%)

Query: 23  NEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKL 64
            ++ + C  C   +TP WR GP G ++LCNACG+R++  + L
Sbjct: 160 QQLRRCCSHCGVQKTPQWRMGPLGAKTLCNACGVRFKSGRLL 201


>sp|Q55EQ0|GTAF_DICDI GATA zinc finger domain-containing protein 6 OS=Dictyostelium
           discoideum GN=gtaF PE=4 SV=1
          Length = 623

 Score = 45.8 bits (107), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 5/50 (10%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKLALLGRDKGRAQKRK 78
           C  C  T+T  WR GP G +SLCNACGIR+       ++ ++K  A K K
Sbjct: 320 CHSCGETQTSQWRRGPDGCKSLCNACGIRFAN-----IVSKEKALAVKEK 364


>sp|Q5PP38|GAT27_ARATH GATA transcription factor 27 OS=Arabidopsis thaliana GN=GATA27
          PE=2 SV=1
          Length = 470

 Score = 44.7 bits (104), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 19/36 (52%), Positives = 20/36 (55%)

Query: 29 CIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKL 64
          C  C  T TPLWR GP     LCNACG R+R    L
Sbjct: 7  CYHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSL 42


>sp|Q8W4H1|GAT26_ARATH GATA transcription factor 26 OS=Arabidopsis thaliana GN=GATA26
          PE=2 SV=1
          Length = 510

 Score = 44.3 bits (103), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 19/36 (52%), Positives = 20/36 (55%)

Query: 29 CIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKL 64
          C  C  T TPLWR GP     LCNACG R+R    L
Sbjct: 7  CYHCGVTNTPLWRNGPPEKPVLCNACGSRWRTKGTL 42


>sp|Q07928|GAT3_YEAST Protein GAT3 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
           S288c) GN=GAT3 PE=4 SV=1
          Length = 141

 Score = 44.3 bits (103), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 33/64 (51%), Gaps = 12/64 (18%)

Query: 25  MNKSCIDCHTTRT-PLWRGGPAGPRSLCNACGIRYRKTKKLALLGRD---------KGRA 74
           + + C  C   +T P WR GP G  +LCNACG+ YRK     + G+D         KG +
Sbjct: 68  VTRRCPQCAVIKTSPQWREGPDGEVTLCNACGLFYRKI--FLVFGKDLAKRYFNEIKGVS 125

Query: 75  QKRK 78
            KRK
Sbjct: 126 VKRK 129


>sp|Q1WG82|ZGLP1_MOUSE GATA-type zinc finger protein 1 OS=Mus musculus GN=Zglp1 PE=2 SV=1
          Length = 266

 Score = 44.3 bits (103), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 20/32 (62%), Positives = 21/32 (65%), Gaps = 1/32 (3%)

Query: 29  CIDCHTTRTPLWRGGPAGPRSLCNACGIRYRK 60
           C  C T RTPLWR    G   LCNACGIRY+K
Sbjct: 197 CASCRTQRTPLWRDAEDG-TPLCNACGIRYKK 227


>sp|P0C6A0|ZGLP1_HUMAN GATA-type zinc finger protein 1 OS=Homo sapiens GN=ZGLP1 PE=2 SV=1
          Length = 271

 Score = 43.9 bits (102), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 19  AGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRK 60
           AG+     + C  C T RTPLWR    G   LCNACGIRY+K
Sbjct: 196 AGSEALEPRRCASCRTQRTPLWRDAEDG-TPLCNACGIRYKK 236


>sp|P23825|GATA3_CHICK GATA-binding factor 3 OS=Gallus gallus GN=GATA3 PE=2 SV=1
          Length = 444

 Score = 43.9 bits (102), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 37/80 (46%), Gaps = 1/80 (1%)

Query: 15  KSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKLALLGRDKGRA 74
           KS P    +   + C++C  T TPLWR    G   LCNACG+ ++   +   L + K R 
Sbjct: 250 KSRPKARSSTEGRECVNCGATSTPLWRRDGTG-HYLCNACGLYHKMNGQNRPLIKPKRRL 308

Query: 75  QKRKRKYSSNNNNNKGATKL 94
              +R  +S  N     T L
Sbjct: 309 SAARRAGTSCANCQTTTTTL 328



 Score = 37.7 bits (86), Expect = 0.024,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 30/57 (52%), Gaps = 11/57 (19%)

Query: 3   VKTKRREAEEMMKSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           +K KRR    +  +  AGT      SC +C TT T LWR    G   +CNACG+ Y+
Sbjct: 302 IKPKRR----LSAARRAGT------SCANCQTTTTTLWRRNANGD-PVCNACGLYYK 347


>sp|P23772|GATA3_MOUSE Trans-acting T-cell-specific transcription factor GATA-3 OS=Mus
           musculus GN=Gata3 PE=1 SV=1
          Length = 443

 Score = 43.5 bits (101), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 37/80 (46%), Gaps = 1/80 (1%)

Query: 15  KSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYRKTKKLALLGRDKGRA 74
           KS P    +   + C++C  T TPLWR    G   LCNACG+ ++   +   L + K R 
Sbjct: 249 KSRPKARSSTEGRECVNCGATSTPLWRRDGTG-HYLCNACGLYHKMNGQNRPLIKPKRRL 307

Query: 75  QKRKRKYSSNNNNNKGATKL 94
              +R  +S  N     T L
Sbjct: 308 SAARRAGTSCANCQTTTTTL 327



 Score = 37.0 bits (84), Expect = 0.032,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 30/57 (52%), Gaps = 11/57 (19%)

Query: 3   VKTKRREAEEMMKSPPAGTFNEMNKSCIDCHTTRTPLWRGGPAGPRSLCNACGIRYR 59
           +K KRR    +  +  AGT      SC +C TT T LWR    G   +CNACG+ Y+
Sbjct: 301 IKPKRR----LSAARRAGT------SCANCQTTTTTLWRRNANGD-PVCNACGLYYK 346


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.316    0.130    0.388 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 47,434,386
Number of Sequences: 539616
Number of extensions: 1829899
Number of successful extensions: 6442
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 93
Number of HSP's successfully gapped in prelim test: 49
Number of HSP's that attempted gapping in prelim test: 6197
Number of HSP's gapped (non-prelim): 211
length of query: 129
length of database: 191,569,459
effective HSP length: 95
effective length of query: 34
effective length of database: 140,305,939
effective search space: 4770401926
effective search space used: 4770401926
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 55 (25.8 bits)