BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 030971
(168 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q8LG10|GAT15_ARATH GATA transcription factor 15 OS=Arabidopsis thaliana GN=GATA15 PE=2
SV=2
Length = 149
Score = 103 bits (258), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 76/154 (49%), Gaps = 52/154 (33%)
Query: 9 KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDKKKDKTTNAAAAA 68
K C C T+KTPLWRGGPAGPKSLCNACGIR RKK+R + RS+ KK K+ N
Sbjct: 41 KSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTL--ISNRSEDKKKKSHNRNPK- 97
Query: 69 ASASAKVVTEKADGTGNASCCVNGLGDSFKMRLMALGQEVLLQRSSPSSSSPSTTSKSSP 128
GDS K RLM LG+EV++QRS+ +
Sbjct: 98 ------------------------FGDSLKQRLMELGREVMMQRSTAEN----------- 122
Query: 129 SLFLPSVVAKKQRCQRKRKLKEEEQAAFSLMALS 162
QR+ KL EEEQAA LMALS
Sbjct: 123 --------------QRRNKLGEEEQAAVLLMALS 142
>sp|Q9FJ10|GAT16_ARATH GATA transcription factor 16 OS=Arabidopsis thaliana GN=GATA16 PE=2
SV=1
Length = 139
Score = 88.2 bits (217), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/172 (37%), Positives = 82/172 (47%), Gaps = 60/172 (34%)
Query: 1 MNQNDSNL----KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDK 56
+ QN++++ K C DC T+KTPLWRGGP GPKSLCNACGIR RKK+R
Sbjct: 24 IEQNNTSVNDKKKTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKRR----------- 72
Query: 57 KKDKTTNAAAAAASASAKVVTEKADGTGNASCCVNGLGDSFKMRLMALGQEVLLQRSSPS 116
+ K + + + G GN G+S K LM LG + +RS+
Sbjct: 73 -----------GGTEDNKKLKKSSSGGGNRK-----FGESLKQSLMDLG---IRKRST-- 111
Query: 117 SSSPSTTSKSSPSLFLPSVVAKKQRCQRKRKLKEEEQAAFSLMALSCGFVFA 168
+KQR +KL EEEQAA LMALS G V+A
Sbjct: 112 --------------------VEKQR----QKLGEEEQAAVLLMALSYGSVYA 139
>sp|Q8LC59|GAT23_ARATH GATA transcription factor 23 OS=Arabidopsis thaliana GN=GATA23
PE=2 SV=2
Length = 120
Score = 75.9 bits (185), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 37/45 (82%)
Query: 1 MNQNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKR 45
M + ++ C++CKTTKTP+WRGGP GPKSLCNACGIR RK++R
Sbjct: 18 MKEEKGTIRCCSECKTTKTPMWRGGPTGPKSLCNACGIRHRKQRR 62
>sp|Q5HZ36|GAT21_ARATH GATA transcription factor 21 OS=Arabidopsis thaliana GN=GATA21 PE=1
SV=2
Length = 398
Score = 69.3 bits (168), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 28/39 (71%), Positives = 32/39 (82%)
Query: 4 NDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRK 42
N+ ++ C+DC TTKTPLWR GP GPKSLCNACGIR RK
Sbjct: 225 NNGVIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRK 263
>sp|Q9SZI6|GAT22_ARATH Putative GATA transcription factor 22 OS=Arabidopsis thaliana
GN=GATA22 PE=2 SV=1
Length = 352
Score = 68.6 bits (166), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 27/36 (75%), Positives = 30/36 (83%)
Query: 4 NDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIR 39
ND ++ C+DC TTKTPLWR GP GPKSLCNACGIR
Sbjct: 194 NDCVIRICSDCNTTKTPLWRSGPRGPKSLCNACGIR 229
>sp|Q6QPM2|GAT19_ARATH GATA transcription factor 19 OS=Arabidopsis thaliana GN=GATA19 PE=2
SV=2
Length = 211
Score = 67.0 bits (162), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 30/59 (50%), Positives = 39/59 (66%), Gaps = 7/59 (11%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDKKKDKTTNAAAAAA 69
C +C TT TPLWR GP GPKSLCNACGIRF+K++R R+ ++ T+ + AA
Sbjct: 77 CANCDTTSTPLWRNGPRGPKSLCNACGIRFKKEER-------RASTARNSTSGGGSTAA 128
>sp|Q6DBP8|GAT11_ARATH GATA transcription factor 11 OS=Arabidopsis thaliana GN=GATA11 PE=2
SV=1
Length = 303
Score = 67.0 bits (162), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 27/45 (60%), Positives = 35/45 (77%)
Query: 4 NDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFE 48
+D ++ CT C+TTKTP WR GP+GPK+LCNACG+RFR + V E
Sbjct: 215 SDGIVRKCTHCETTKTPQWREGPSGPKTLCNACGVRFRSGRLVPE 259
>sp|Q9ZPX0|GAT20_ARATH GATA transcription factor 20 OS=Arabidopsis thaliana GN=GATA20 PE=2
SV=2
Length = 208
Score = 66.6 bits (161), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 26/35 (74%), Positives = 29/35 (82%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKR 45
C C TT TPLWR GP GPKSLCNACGIRF+K++R
Sbjct: 94 CASCDTTSTPLWRNGPKGPKSLCNACGIRFKKEER 128
>sp|Q8LC79|GAT18_ARATH GATA transcription factor 18 OS=Arabidopsis thaliana GN=GATA18 PE=2
SV=2
Length = 295
Score = 66.6 bits (161), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 26/35 (74%), Positives = 30/35 (85%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKR 45
C +C TT TPLWR GP GPKSLCNACGIRF+K++R
Sbjct: 154 CANCDTTSTPLWRNGPRGPKSLCNACGIRFKKEER 188
>sp|Q8VZP4|GAT10_ARATH GATA transcription factor 10 OS=Arabidopsis thaliana GN=GATA10 PE=2
SV=1
Length = 308
Score = 62.4 bits (150), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 35/47 (74%)
Query: 2 NQNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFE 48
+++D ++ CT C+T TP WR GP+GPK+LCNACG+RF+ + V E
Sbjct: 211 SKSDGIVRICTHCETITTPQWRQGPSGPKTLCNACGVRFKSGRLVPE 257
>sp|Q9LIB5|GAT17_ARATH GATA transcription factor 17 OS=Arabidopsis thaliana GN=GATA17 PE=2
SV=1
Length = 190
Score = 62.4 bits (150), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 81/175 (46%), Gaps = 41/175 (23%)
Query: 9 KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDKKKDKTTNAAAAA 68
+ C DC T +TPLWRGGPAGPKSLCNAC G+K R ++ AA
Sbjct: 42 RTCVDCGTIRTPLWRGGPAGPKSLCNAC------------GIKSRKKRQ-------AALG 82
Query: 69 ASASAKVVTEKADGTGNASCCVNGLG----DSFKMRLMALGQEVLLQRSSP----SSSSP 120
+ K K++ C N L ++ K ++ + + P + S
Sbjct: 83 MRSEEKKKNRKSN-------CNNDLNLDHRNAKKYKINIVDDGKIDIDDDPKICNNKRSS 135
Query: 121 STTSKSSPSLFLP----SVVAKKQRCQRKR---KLKEEEQAAFSLMALSCGFVFA 168
S++S S FL V K+ ++KR KL EEE+AA LMALSC V+A
Sbjct: 136 SSSSNKGVSKFLDLGFKVPVMKRSAVEKKRLWRKLGEEERAAVLLMALSCSSVYA 190
>sp|Q9SKN6|GAT13_ARATH Putative GATA transcription factor 13 OS=Arabidopsis thaliana
GN=GATA13 PE=3 SV=2
Length = 291
Score = 60.8 bits (146), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 25/38 (65%), Positives = 28/38 (73%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFE 48
CT C+TT TP WR GP G K+LCNACGIRFR + V E
Sbjct: 193 CTHCETTTTPQWREGPNGRKTLCNACGIRFRSGRLVLE 230
>sp|O49741|GATA2_ARATH GATA transcription factor 2 OS=Arabidopsis thaliana GN=GATA2 PE=2
SV=1
Length = 264
Score = 59.7 bits (143), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 30/41 (73%)
Query: 8 LKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFE 48
++ CT C + KTP WR GP GPK+LCNACG+RF+ + V E
Sbjct: 178 MRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFKSGRLVPE 218
>sp|Q00858|CGPB_FUSSO Cutinase gene palindrome-binding protein OS=Fusarium solani subsp.
pisi PE=2 SV=1
Length = 457
Score = 58.9 bits (141), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 22/35 (62%), Positives = 29/35 (82%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKR 45
CTDC T +P WR GP+GPK+LCNACG+R+ KK++
Sbjct: 402 CTDCGTLDSPEWRKGPSGPKTLCNACGLRWAKKEK 436
>sp|Q9LT45|GAT29_ARATH GATA transcription factor 29 OS=Arabidopsis thaliana GN=GATA29 PE=2
SV=1
Length = 208
Score = 58.2 bits (139), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 30/39 (76%), Gaps = 2/39 (5%)
Query: 8 LKYCTD--CKTTKTPLWRGGPAGPKSLCNACGIRFRKKK 44
+K CT+ C TP+WR GP GPKSLCNACGI+FRK++
Sbjct: 156 MKKCTNMNCNALNTPMWRRGPLGPKSLCNACGIKFRKEE 194
>sp|Q9SV30|GATA8_ARATH GATA transcription factor 8 OS=Arabidopsis thaliana GN=GATA8 PE=2
SV=1
Length = 322
Score = 58.2 bits (139), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 30/40 (75%), Gaps = 1/40 (2%)
Query: 8 LKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVF 47
L+ C C+ TKTP WR GP GPK+LCNACG+R+ K R+F
Sbjct: 228 LRKCMHCEVTKTPQWRLGPMGPKTLCNACGVRY-KSGRLF 266
>sp|O82632|GATA9_ARATH GATA transcription factor 9 OS=Arabidopsis thaliana GN=GATA9 PE=2
SV=1
Length = 308
Score = 57.4 bits (137), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 28/40 (70%)
Query: 9 KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFE 48
+ C C T KTP WR GP GPK+LCNACG+R++ + V E
Sbjct: 197 RRCLHCATEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 236
>sp|P69781|GAT12_ARATH GATA transcription factor 12 OS=Arabidopsis thaliana GN=GATA12 PE=2
SV=1
Length = 331
Score = 57.4 bits (137), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 27/38 (71%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFE 48
C C T KTP WR GP GPK+LCNACG+R++ + V E
Sbjct: 221 CLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 258
>sp|O49743|GATA4_ARATH GATA transcription factor 4 OS=Arabidopsis thaliana GN=GATA4 PE=2
SV=1
Length = 240
Score = 57.0 bits (136), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFE 48
CT C + KTP WR GP GPK+LCNACG+R++ + V E
Sbjct: 160 CTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVPE 197
>sp|Q8LAU9|GATA1_ARATH GATA transcription factor 1 OS=Arabidopsis thaliana GN=GATA1 PE=2
SV=2
Length = 274
Score = 57.0 bits (136), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 11/84 (13%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDKKKDKTTNAAAAAAS 70
C C KTP WR GPAGPK+LCNACG+R++ + V E + + T A ++
Sbjct: 196 CQHCGAEKTPQWRAGPAGPKTLCNACGVRYKSGRLVPE------YRPANSPTFTAELHSN 249
Query: 71 ASAKVV-----TEKADGTGNASCC 89
+ K+V + DG G+ C
Sbjct: 250 SHRKIVEMRKQYQSGDGDGDRKDC 273
>sp|Q55C49|GTAG_DICDI GATA zinc finger domain-containing protein 7 OS=Dictyostelium
discoideum GN=gtaG PE=4 SV=1
Length = 1006
Score = 55.8 bits (133), Expect = 1e-07, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 29/41 (70%)
Query: 5 DSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKR 45
D + YC +C T TP WR GP+GP +LCNACG+ + KK+R
Sbjct: 836 DKSTLYCHNCGTKNTPEWRRGPSGPATLCNACGLAYAKKQR 876
>sp|Q8L4M6|GATA3_ARATH GATA transcription factor 3 OS=Arabidopsis thaliana GN=GATA3 PE=2
SV=2
Length = 269
Score = 55.8 bits (133), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 20/33 (60%), Positives = 25/33 (75%)
Query: 9 KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFR 41
+ C+ C T TP WR GP GPK+LCNACG+RF+
Sbjct: 180 RRCSHCGTNNTPQWRTGPVGPKTLCNACGVRFK 212
>sp|Q75JZ0|GTAH_DICDI GATA zinc finger domain-containing protein 8 OS=Dictyostelium
discoideum GN=gtaH PE=4 SV=1
Length = 519
Score = 55.8 bits (133), Expect = 1e-07, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKK-KRVFEGLKKRSD--KKKDKTT 62
C +CKTT+TP WR GP G KSLCNACG+ + K KR GL ++ KK D T+
Sbjct: 462 CRNCKTTETPEWRKGPDGTKSLCNACGLHYAKNVKREAAGLHHLNEVGKKVDLTS 516
>sp|P78714|WC2_NEUCR White collar 2 protein OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=wc-2
PE=1 SV=1
Length = 530
Score = 55.5 bits (132), Expect = 2e-07, Method: Composition-based stats.
Identities = 20/30 (66%), Positives = 25/30 (83%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRF 40
CTDC T +P WR GP+GPK+LCNACG+R+
Sbjct: 468 CTDCGTLDSPEWRKGPSGPKTLCNACGLRW 497
>sp|Q54NM5|GTAL_DICDI GATA zinc finger domain-containing protein 12 OS=Dictyostelium
discoideum GN=gtaL PE=4 SV=1
Length = 640
Score = 54.7 bits (130), Expect = 2e-07, Method: Composition-based stats.
Identities = 22/43 (51%), Positives = 31/43 (72%)
Query: 3 QNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKR 45
Q + + C +CKT+ TP WR GP G K+LCNACGIR+R +++
Sbjct: 498 QKKTISRVCVNCKTSDTPEWRRGPQGAKTLCNACGIRYRLQQQ 540
>sp|Q54KX0|GTAN_DICDI GATA zinc finger domain-containing protein 14 OS=Dictyostelium
discoideum GN=gtaN PE=4 SV=1
Length = 953
Score = 53.1 bits (126), Expect = 7e-07, Method: Composition-based stats.
Identities = 22/33 (66%), Positives = 26/33 (78%)
Query: 10 YCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRK 42
+CT C TT+TP WR GPAG KSLCNACG+ + K
Sbjct: 892 FCTSCGTTQTPEWRKGPAGGKSLCNACGLHYAK 924
>sp|Q9M1U2|GAT14_ARATH GATA transcription factor 14 OS=Arabidopsis thaliana GN=GATA14 PE=2
SV=1
Length = 204
Score = 53.1 bits (126), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 25/33 (75%)
Query: 9 KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFR 41
K C+ C T KTPLWR GP G +LCNACG+R+R
Sbjct: 115 KSCSHCGTRKTPLWREGPRGAGTLCNACGMRYR 147
>sp|Q5KSV0|GTAK_DICDI GATA zinc finger domain-containing protein 11 OS=Dictyostelium
discoideum GN=gtaK PE=2 SV=1
Length = 650
Score = 52.8 bits (125), Expect = 9e-07, Method: Composition-based stats.
Identities = 24/38 (63%), Positives = 27/38 (71%)
Query: 8 LKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKR 45
LK CT C TT +P WR GPAG +SLCNACG+ F K R
Sbjct: 519 LKQCTSCGTTSSPEWRKGPAGNQSLCNACGLYFAKLVR 556
>sp|Q55GK0|GTAE_DICDI GATA zinc finger domain-containing protein 5 OS=Dictyostelium
discoideum GN=gtaE PE=4 SV=1
Length = 952
Score = 52.0 bits (123), Expect = 2e-06, Method: Composition-based stats.
Identities = 22/45 (48%), Positives = 31/45 (68%), Gaps = 1/45 (2%)
Query: 2 NQNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRV 46
N + +LK C C T+ TP WR GP GP +LCNACG+ + KK+++
Sbjct: 233 NIDKDSLK-CYQCNTSNTPEWRKGPEGPATLCNACGLAYAKKQKL 276
>sp|Q550D5|GTAA_DICDI Transcription factor stalky OS=Dictyostelium discoideum GN=stkA
PE=1 SV=1
Length = 872
Score = 52.0 bits (123), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/52 (44%), Positives = 37/52 (71%), Gaps = 2/52 (3%)
Query: 9 KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDKKKDK 60
+ C C +++TP WR GP+G SLCNACGI++R K + +G+ K S K++++
Sbjct: 292 RSCEFCGSSQTPTWRRGPSGKGSLCNACGIKWRLKGK--DGIFKPSQKQQNR 341
>sp|P18494|GLN3_YEAST Nitrogen regulatory protein GLN3 OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=GLN3 PE=1 SV=2
Length = 730
Score = 51.6 bits (122), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 41/78 (52%), Gaps = 9/78 (11%)
Query: 3 QNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGL-------KKRSD 55
QN L C +CKT KTPLWR P G +LCNACG+ F+K L KKR
Sbjct: 298 QNKKPLIQCFNCKTFKTPLWRRSPEG-NTLCNACGL-FQKLHGTMRPLSLKSDVIKKRIS 355
Query: 56 KKKDKTTNAAAAAASASA 73
KK+ K T+ A + SA
Sbjct: 356 KKRAKQTDPNIAQNTPSA 373
>sp|Q9FH57|GATA5_ARATH GATA transcription factor 5 OS=Arabidopsis thaliana GN=GATA5 PE=2
SV=1
Length = 339
Score = 51.2 bits (121), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFR 41
C+ C KTP WR GP G K+LCNACG+R++
Sbjct: 251 CSHCGVQKTPQWRAGPMGAKTLCNACGVRYK 281
>sp|Q54TM6|GTAI_DICDI GATA zinc finger domain-containing protein 9 OS=Dictyostelium
discoideum GN=gtaI PE=4 SV=1
Length = 536
Score = 51.2 bits (121), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 20/33 (60%), Positives = 23/33 (69%)
Query: 10 YCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRK 42
+C C TT TP WR GP G KSLCNACG+ + K
Sbjct: 478 FCRHCGTTDTPEWRRGPDGRKSLCNACGLHYSK 510
>sp|O65515|GATA7_ARATH GATA transcription factor 7 OS=Arabidopsis thaliana GN=GATA7 PE=2
SV=1
Length = 238
Score = 51.2 bits (121), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 23/31 (74%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFR 41
C+ C KTP WR GP G K+LCNACG+RF+
Sbjct: 166 CSHCGVQKTPQWRMGPLGAKTLCNACGVRFK 196
>sp|P40209|GAT2_YEAST Protein GAT2 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=GAT2 PE=4 SV=1
Length = 560
Score = 51.2 bits (121), Expect = 3e-06, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 28/40 (70%)
Query: 8 LKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVF 47
+++C C T+TP WR GP G ++LCNACG+ +RK + F
Sbjct: 469 IEFCFHCGETETPEWRKGPYGTRTLCNACGLFYRKVTKKF 508
>sp|P17429|AREA_EMENI Nitrogen regulatory protein areA OS=Emericella nidulans (strain
FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139)
GN=areA PE=1 SV=2
Length = 876
Score = 50.4 bits (119), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 38/65 (58%), Gaps = 2/65 (3%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDKKKDKTTNAAAAAAS 70
CT+C T TPLWR P G + LCNACG+ F K V L ++D K + N+A + A
Sbjct: 673 CTNCFTQTTPLWRRNPEG-QPLCNACGL-FLKLHGVVRPLSLKTDVIKKRNRNSANSLAV 730
Query: 71 ASAKV 75
S++V
Sbjct: 731 GSSRV 735
>sp|Q9SD38|GATA6_ARATH GATA transcription factor 6 OS=Arabidopsis thaliana GN=GATA6 PE=2
SV=1
Length = 312
Score = 50.4 bits (119), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 23/33 (69%)
Query: 9 KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFR 41
+ C C KTP WR GP G K+LCNACG+R++
Sbjct: 221 RQCGHCGVQKTPQWRAGPLGAKTLCNACGVRYK 253
>sp|Q55EQ0|GTAF_DICDI GATA zinc finger domain-containing protein 6 OS=Dictyostelium
discoideum GN=gtaF PE=4 SV=1
Length = 623
Score = 48.5 bits (114), Expect = 2e-05, Method: Composition-based stats.
Identities = 21/31 (67%), Positives = 22/31 (70%)
Query: 10 YCTDCKTTKTPLWRGGPAGPKSLCNACGIRF 40
YC C T+T WR GP G KSLCNACGIRF
Sbjct: 319 YCHSCGETQTSQWRRGPDGCKSLCNACGIRF 349
>sp|Q75JZ1|GTAC_DICDI GATA zinc finger domain-containing protein 3 OS=Dictyostelium
discoideum GN=gtaC PE=4 SV=1
Length = 587
Score = 48.1 bits (113), Expect = 3e-05, Method: Composition-based stats.
Identities = 19/33 (57%), Positives = 23/33 (69%)
Query: 10 YCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRK 42
YC C T +TP WR GP G K+LCNACG+ + K
Sbjct: 499 YCIFCGTMETPEWRKGPGGHKTLCNACGLHYAK 531
>sp|Q8W4H1|GAT26_ARATH GATA transcription factor 26 OS=Arabidopsis thaliana GN=GATA26
PE=2 SV=1
Length = 510
Score = 48.1 bits (113), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRV--FEGLKKRSD 55
C C T TPLWR GP LCNACG R+R K + + L R+D
Sbjct: 7 CYHCGVTNTPLWRNGPPEKPVLCNACGSRWRTKGTLVNYTPLHARAD 53
>sp|Q01371|WC1_NEUCR White collar 1 protein OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=wc-1
PE=2 SV=2
Length = 1167
Score = 48.1 bits (113), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 26/36 (72%)
Query: 8 LKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKK 43
++ C +C T TP WR GP+G + LCN+CG+R+ K+
Sbjct: 931 VRDCANCHTRNTPEWRRGPSGNRDLCNSCGLRWAKQ 966
>sp|Q54TE3|GTAJ_DICDI GATA zinc finger domain-containing protein 10 OS=Dictyostelium
discoideum GN=gtaJ PE=4 SV=1
Length = 714
Score = 47.4 bits (111), Expect = 4e-05, Method: Composition-based stats.
Identities = 21/45 (46%), Positives = 29/45 (64%), Gaps = 1/45 (2%)
Query: 1 MNQNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKR 45
M + NLK C C+ T+TP WR GP G +LCNACG+ + K ++
Sbjct: 622 MYSSRRNLK-CHYCEVTETPEWRRGPDGDHTLCNACGLHYAKSQK 665
>sp|Q5PP38|GAT27_ARATH GATA transcription factor 27 OS=Arabidopsis thaliana GN=GATA27
PE=2 SV=1
Length = 470
Score = 47.4 bits (111), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 20/33 (60%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKK 43
C C T TPLWR GP LCNACG R+R K
Sbjct: 7 CYHCGVTSTPLWRNGPPEKPVLCNACGSRWRTK 39
>sp|P78688|AREA_GIBFU Nitrogen regulatory protein areA OS=Gibberella fujikuroi GN=AREA
PE=3 SV=1
Length = 971
Score = 46.6 bits (109), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 42/78 (53%), Gaps = 7/78 (8%)
Query: 2 NQNDSNLKY-CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSD--KKK 58
N ND N CT+C T TPLWR P G + LCNACG+ F K V L ++D KK+
Sbjct: 684 NGNDGNAPTTCTNCFTQTTPLWRRNPEG-QPLCNACGL-FLKLHGVVRPLSLKTDVIKKR 741
Query: 59 DK--TTNAAAAAASASAK 74
++ TN +S +K
Sbjct: 742 NRGSGTNVPVGGSSTRSK 759
>sp|Q01168|NUT1_MAGO7 Nitrogen regulatory protein NUT1 OS=Magnaporthe oryzae (strain
70-15 / ATCC MYA-4617 / FGSC 8958) GN=NUT1 PE=4 SV=1
Length = 956
Score = 45.8 bits (107), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 41/75 (54%), Gaps = 4/75 (5%)
Query: 2 NQNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSD--KKKD 59
NQ CT+C T TPLWR P G + LCNACG+ F K V L ++D KK++
Sbjct: 654 NQGGDAPTTCTNCATQTTPLWRRNPEG-QPLCNACGL-FLKLHGVVRPLSLKTDVIKKRN 711
Query: 60 KTTNAAAAAASASAK 74
+ + + A++ ++
Sbjct: 712 RGSGSNVPGATSGSR 726
>sp|Q10280|GAF1_SCHPO Transcription factor gaf1 OS=Schizosaccharomyces pombe (strain 972
/ ATCC 24843) GN=gaf1 PE=1 SV=2
Length = 855
Score = 45.8 bits (107), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 33/52 (63%), Gaps = 4/52 (7%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSD--KKKDK 60
CT+C+T TPLWR P G + LCNACG+ F K V L ++D KK+++
Sbjct: 635 CTNCQTRTTPLWRRSPDG-QPLCNACGL-FMKINGVVRPLSLKTDVIKKRNR 684
>sp|O13415|AREA_ASPOR Nitrogen regulatory protein areA OS=Aspergillus oryzae (strain ATCC
42149 / RIB 40) GN=areA PE=4 SV=1
Length = 866
Score = 45.8 bits (107), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 4/82 (4%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSD--KKKDKTTNAAAAA 68
CT+C T TPLWR P G + LCNACG+ F K V L ++D KK+++++ + A
Sbjct: 664 CTNCFTQTTPLWRRNPEG-QPLCNACGL-FLKLHGVVRPLSLKTDVIKKRNRSSANSLAV 721
Query: 69 ASASAKVVTEKADGTGNASCCV 90
++ A T + + AS
Sbjct: 722 GTSRASKKTARKNSVQQASVTT 743
>sp|O13412|AREA_ASPNG Nitrogen regulatory protein areA OS=Aspergillus niger GN=areA PE=4
SV=1
Length = 882
Score = 45.4 bits (106), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 41/67 (61%), Gaps = 5/67 (7%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSD--KKKDKTT-NAAAA 67
CT+C T TPLWR P G + LCNACG+ F K V L ++D KK+++++ N+ A
Sbjct: 676 CTNCFTQTTPLWRRNPEG-QPLCNACGL-FLKLHGVVRPLSLKTDVIKKRNRSSANSLAV 733
Query: 68 AASASAK 74
S ++K
Sbjct: 734 GTSRASK 740
>sp|Q01582|AREA_PENCH Nitrogen regulatory protein areA OS=Penicillium chrysogenum GN=AREA
PE=4 SV=1
Length = 725
Score = 45.1 bits (105), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 34/54 (62%), Gaps = 4/54 (7%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSD--KKKDKTT 62
CT+C T TPLWR P G + LCNACG+ F K V L ++D KK+++++
Sbjct: 525 CTNCFTQTTPLWRRNPEG-QPLCNACGL-FLKLHGVVRPLSLKTDVIKKRNRSS 576
>sp|B0G188|GTAP_DICDI GATA zinc finger domain-containing protein 16 OS=Dictyostelium
discoideum GN=gtaP PE=4 SV=1
Length = 695
Score = 45.1 bits (105), Expect = 2e-04, Method: Composition-based stats.
Identities = 24/71 (33%), Positives = 34/71 (47%), Gaps = 8/71 (11%)
Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDKKKDKTTNAAAAAAS 70
C C T TP WR GP G K+LCNACG+ + K +S+K+K+ N+ +
Sbjct: 479 CHTCGVTNTPEWRRGPNGAKTLCNACGLAWAK--------SVKSEKQKELLANSTGVNIT 530
Query: 71 ASAKVVTEKAD 81
K K +
Sbjct: 531 EPKKAQKRKKE 541
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.314 0.125 0.361
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 58,444,672
Number of Sequences: 539616
Number of extensions: 2167494
Number of successful extensions: 13028
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 87
Number of HSP's successfully gapped in prelim test: 119
Number of HSP's that attempted gapping in prelim test: 12707
Number of HSP's gapped (non-prelim): 353
length of query: 168
length of database: 191,569,459
effective HSP length: 109
effective length of query: 59
effective length of database: 132,751,315
effective search space: 7832327585
effective search space used: 7832327585
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (26.6 bits)