BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 021193
(316 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9FH57|GATA5_ARATH GATA transcription factor 5 OS=Arabidopsis thaliana GN=GATA5 PE=2
SV=1
Length = 339
Score = 187 bits (474), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 189/346 (54%), Gaps = 49/346 (14%)
Query: 5 MEARAFKPSLRRELSCCLKSTQQVFFDDIPCVSNED-FSVDDL-----------LDFSNG 52
ME A K S+R+E++ LK+T V+ + + + ++ FSVDD F++
Sbjct: 1 MEQAALKSSVRKEMA--LKTTSPVYEEFLAVTTAQNGFSVDDFSVDDLLDLSNDDVFADE 58
Query: 53 DFEDGSVDDKDYFSSPDPVDD-DNNSNSGSFSS---EQSLLTNEFVEPVDDFAELEWVSQ 108
+ + + + SS +P DD D S FS SL T+E P DD A LEW+S
Sbjct: 59 ETDLKAQHEMVRVSSEEPNDDGDALRRSSDFSGCDDFGSLPTSELSLPADDLANLEWLSH 118
Query: 109 FVDDSSCSELSLLYPNYVERTRSEPDGKPV---SNKTSTNPTTTTSPCFPLRVPSKARTK 165
FV+DS +E S PN P KP ++ T CF VP+KAR+K
Sbjct: 119 FVEDS-FTEYS--GPNLT----GTPTEKPAWLTGDRKHPVTAVTEETCFKSPVPAKARSK 171
Query: 166 RTRRSGWAWSSGSPLSTESTISSSSSTSCLIFTDSVQNIEWFSGFD--EPVA-------- 215
R R WS GS S+ + S S+S+S S + WFSG + EPV
Sbjct: 172 RNRNGLKVWSLGSSSSSGPSSSGSTSSS-----SSGPSSPWFSGAELLEPVVTSERPPFP 226
Query: 216 KKLKKKPAVQSGGGLFQ-----RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLF 270
KK KK+ A G Q R+CSHC QKTPQWR GP+G KTLCNACGVRYKSGRL
Sbjct: 227 KKHKKRSAESVFSGELQQLQPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLL 286
Query: 271 PEYRPACSPTFSVDMHSNSHRKVLEMRRKKE-SAGPDVGLSHMVQS 315
PEYRPACSPTFS ++HSN HRKV+EMRRKKE ++ + GL+ +VQS
Sbjct: 287 PEYRPACSPTFSSELHSNHHRKVIEMRRKKEPTSDNETGLNQLVQS 332
>sp|Q9SD38|GATA6_ARATH GATA transcription factor 6 OS=Arabidopsis thaliana GN=GATA6 PE=2
SV=1
Length = 312
Score = 175 bits (444), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 127/295 (43%), Positives = 160/295 (54%), Gaps = 27/295 (9%)
Query: 37 SNEDFSVDDLLDFSNGDFEDGS-VDDKDYF--SSPDPVDDDNNSNSGSFSSEQSLLTNEF 93
+ +DFSVDDLLDFS + +D V+D+ V D+N + + S T+
Sbjct: 24 NGDDFSVDDLLDFSKEEEDDDVLVEDEAELKVQRKRGVSDENTLHRSNDFSTADFHTSGL 83
Query: 94 VEPVDDFAELEWVSQFVDDSSCSELSLLYPNYVERTRSEPDGKPV---SNKTSTNPTTTT 150
P+DD AELEW+S FVDDSS + S P KPV N+
Sbjct: 84 SVPMDDIAELEWLSNFVDDSSFTPYSA------------PTNKPVWLTGNRRHLVQPVKE 131
Query: 151 SPCFPLRVPS-KARTKRTRRSGWAWSSGSPLSTESTISSSSSTSCLIFTDSVQNIEWFSG 209
CF + P+ K R KR R WS GS T+S+ SS++S+S S +
Sbjct: 132 ETCFKSQHPAVKTRPKRARTGVRVWSHGSQSLTDSSSSSTTSSSSSPRPSSPLWLASGQF 191
Query: 210 FDEPVAKKLKKKPAVQSGGGLFQ------RRCSHCQTQKTPQWRTGPLGPKTLCNACGVR 263
DEP+ K KKK ++ G R+C HC QKTPQWR GPLG KTLCNACGVR
Sbjct: 192 LDEPMTKTQKKKKVWKNAGQTQTQTQTQTRQCGHCGVQKTPQWRAGPLGAKTLCNACGVR 251
Query: 264 YKSGRLFPEYRPACSPTFSVDMHSNSHRKVLEMRRKKES--AGPDVGLSHMVQSF 316
YKSGRL PEYRPACSPTFS ++HSN H KV+EMRRKKE+ + GL+ VQ+
Sbjct: 252 YKSGRLLPEYRPACSPTFSSELHSNHHSKVIEMRRKKETSDGAEETGLNQPVQTV 306
>sp|P69781|GAT12_ARATH GATA transcription factor 12 OS=Arabidopsis thaliana GN=GATA12 PE=2
SV=1
Length = 331
Score = 174 bits (442), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 123/290 (42%), Positives = 158/290 (54%), Gaps = 43/290 (14%)
Query: 40 DFSVDDLL-DFSNGDFEDGSV----------DDKDYFSSPD-PVDDDNNSNSGSFSSEQS 87
DF+VDDLL DFSN D E+ V D FS+ D P + + SFS +
Sbjct: 13 DFAVDDLLVDFSNDDDEENDVVADSTTTTTITDSSNFSAADLPSFHGDVQDGTSFSGDLC 72
Query: 88 LLTNEFVEPVDDFA-ELEWVSQFVDDSSCSELSLLYPNYVERTRSEPDGKPVSNKTSTNP 146
+ P DD A ELEW+S VD+S E + + +S PD P S+ S
Sbjct: 73 I-------PSDDLADELEWLSNIVDESLSPE-DVHKLELISGFKSRPD--PKSDTGSPEN 122
Query: 147 TTTTSPCFP--LRVPSKARTKRTRRSGWAWSS---------GSPLSTESTISS----SSS 191
++SP F + VP+KAR+KR+R + W+S SP + E+ +SS S
Sbjct: 123 PNSSSPIFTTDVSVPAKARSKRSRAAACNWASRGLLKETFYDSPFTGETILSSQQHLSPP 182
Query: 192 TSCLIFTDSVQNIEWFSGFDEPVAKKLKKKPAVQSGGGLFQRRCSHCQTQKTPQWRTGPL 251
TS + + + G + KK + GG +RRC HC T KTPQWRTGP+
Sbjct: 183 TSPPLLMAPLGKKQAVDG-----GHRRKKDVSSPESGGAEERRCLHCATDKTPQWRTGPM 237
Query: 252 GPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHRKVLEMRRKKE 301
GPKTLCNACGVRYKSGRL PEYRPA SPTF + HSNSHRKV+E+RR+KE
Sbjct: 238 GPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQKE 287
>sp|Q8L4M6|GATA3_ARATH GATA transcription factor 3 OS=Arabidopsis thaliana GN=GATA3 PE=2
SV=2
Length = 269
Score = 172 bits (437), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 125/310 (40%), Positives = 159/310 (51%), Gaps = 58/310 (18%)
Query: 1 MEFCMEARAFKPSLRRELSCCLKSTQQVFFDDIPCVSN--EDFSVDDLLDFSNGDFEDGS 58
ME EARA K SLR E + LK Q + +D+ S+ EDFSV+ LDFS G
Sbjct: 1 MELWTEARALKASLRGESTISLKHHQVIVSEDLSRTSSLPEDFSVECFLDFSEGQ----K 56
Query: 59 VDDKDYFSSPDPVDDDNNSNSGSFSSEQSLLTNEFVEPVDDFAELEWVSQFVDDSSCSEL 118
++++ S + + + FSS+ + P +D ELEWVS+ VDD S E+
Sbjct: 57 EEEEEVVSVSSSQEQEEQEHDCVFSSQPCIFDQLPSLPDEDVEELEWVSRVVDDCSSPEV 116
Query: 119 SLLYPNYVERTRSEPDGKPVSNKTSTNPTTTTSPCFPLRVPSKARTKRTRRSGWAWSSGS 178
SLL T T P F R+P K RTKR+R S
Sbjct: 117 SLLLTQ----------------------THKTKPSFS-RIPVKPRTKRSRNS-------- 145
Query: 179 PLSTESTISSSSSTSCLIFTDSVQNIEWFSGFDEPVAKKLKKKPAVQSGGGLFQRRCSHC 238
+ S L+ T+ ++L+KK Q +FQRRCSHC
Sbjct: 146 --------LTGSRVWPLVSTN----------HQHAATEQLRKKK--QETVLVFQRRCSHC 185
Query: 239 QTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHRKVLEMRR 298
T TPQWRTGP+GPKTLCNACGVR+KSGRL PEYRPA SPTFS ++HSN HRKVLE+R+
Sbjct: 186 GTNNTPQWRTGPVGPKTLCNACGVRFKSGRLCPEYRPADSPTFSNEIHSNLHRKVLELRK 245
Query: 299 KKESAGPDVG 308
KE G + G
Sbjct: 246 SKE-LGEETG 254
>sp|O82632|GATA9_ARATH GATA transcription factor 9 OS=Arabidopsis thaliana GN=GATA9 PE=2
SV=1
Length = 308
Score = 169 bits (427), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 109/269 (40%), Positives = 150/269 (55%), Gaps = 29/269 (10%)
Query: 41 FSVDDLLDFSN--GDFEDG--SVDDKDYFSSPDPVDDDNNSNSGSFSSEQSLLTNEFVEP 96
F VDDLLDFSN G+ +DG ++ D S+ D +SNS S ++ + ++ ++ P
Sbjct: 18 FVVDDLLDFSNDDGEVDDGLNTLPDSSTLSTGTLTD---SSNSSSLFTDGTGFSDLYI-P 73
Query: 97 VDDFAELEWVSQFVDDSSCSELSLLYPNYVERTRSEPDGKPVSNKTSTNPTTTTSPC--- 153
DD AELEW+S FV++S E + + G +++ P
Sbjct: 74 NDDIAELEWLSNFVEESFAGEDQDKLHLFSGLKNPQTTGSTLTHLIKPEPELDHQFIDID 133
Query: 154 -FPLRVPSKARTKRTRRSGWAWSSGSPLSTESTISSSSSTSCLIFTDSVQNIEWFSGFDE 212
+ VP+KAR+KR+R + W+S +++ S T+ V+ ++ D
Sbjct: 134 ESNVAVPAKARSKRSRSAASTWAS-----RLLSLADSDETNPKKKQRRVKEQDFAGDMDV 188
Query: 213 PVAKKLKKKPAVQSGGGLFQRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPE 272
+SGGG RRC HC T+KTPQWRTGP+GPKTLCNACGVRYKSGRL PE
Sbjct: 189 DCG---------ESGGG---RRCLHCATEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 236
Query: 273 YRPACSPTFSVDMHSNSHRKVLEMRRKKE 301
YRPA SPTF + HSNSHRKV+E+RR+KE
Sbjct: 237 YRPASSPTFVMARHSNSHRKVMELRRQKE 265
>sp|O65515|GATA7_ARATH GATA transcription factor 7 OS=Arabidopsis thaliana GN=GATA7 PE=2
SV=1
Length = 238
Score = 156 bits (395), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 112/279 (40%), Positives = 136/279 (48%), Gaps = 75/279 (26%)
Query: 40 DFSVDDLLDFSNGD--FEDGSVDDKDYFSSPDPVDDDNNSNSGSFSSE-------QSLLT 90
DFSVDDLLD SN D E S K+ D+ SFS + + LL+
Sbjct: 10 DFSVDDLLDLSNADTSLESSSSQRKE--------DEQEREKFKSFSDQSTRLSPPEDLLS 61
Query: 91 NEFVEPVDDFAELEWVSQFVDDS-SCSELSLLYPNYVERTRSEPDGKPVSNKTSTNPTTT 149
PV D +LEW+S FV+DS S S +S +P NP +
Sbjct: 62 FPGDAPVGDLEDLEWLSNFVEDSFSESYISSDFP--------------------VNPVAS 101
Query: 150 TS---PCFPLRVPSKARTKRTRRSGWAWSSGSPLSTESTISSSSSTSCLIFTDSVQNIEW 206
C VP K R+KR R +G WS SP ST +
Sbjct: 102 VEVRRQC----VPVKPRSKRRRTNGRIWSMESPSPLLSTAVARR---------------- 141
Query: 207 FSGFDEPVAKKLKKKPAVQSGGGLFQRR-----CSHCQTQKTPQWRTGPLGPKTLCNACG 261
KK ++ S GG+ Q++ CSHC QKTPQWR GPLG KTLCNACG
Sbjct: 142 ---------KKRGRQKVDASYGGVVQQQQLRRCCSHCGVQKTPQWRMGPLGAKTLCNACG 192
Query: 262 VRYKSGRLFPEYRPACSPTFSVDMHSNSHRKVLEMRRKK 300
VR+KSGRL PEYRPACSPTF+ ++HSNSHRKVLE+R K
Sbjct: 193 VRFKSGRLLPEYRPACSPTFTNEIHSNSHRKVLELRLMK 231
>sp|O49741|GATA2_ARATH GATA transcription factor 2 OS=Arabidopsis thaliana GN=GATA2 PE=2
SV=1
Length = 264
Score = 147 bits (371), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 93/234 (39%), Positives = 120/234 (51%), Gaps = 30/234 (12%)
Query: 68 PDPVDDDNNSNSGSFSSEQSLLTNEFVEPVDDFAELEWVSQFVDDSSCSELSLLYPNYVE 127
P P + + + S++ ++ P DD A LEW+SQFVDDS + +
Sbjct: 44 PPPQNPSFHHHHLPSSADHHSFLHDICVPSDDAAHLEWLSQFVDDS-----------FAD 92
Query: 128 RTRSEPDGKPVSNKTSTNPTTTTSPCFPLRVPSKARTKRTRRSGWAWSSGSPLSTESTIS 187
+ G S KT T+ P K R+KR+R + SP+ ES
Sbjct: 93 FPANPLGGTMTSVKTETS------------FPGKPRSKRSRAPAPFAGTWSPMPLESEHQ 140
Query: 188 SSSSTSCLIFTDSVQNIEWFSGFDEPVAKKLKKKPAVQSGGGLFQRRCSHCQTQKTPQWR 247
S + E G + GGG+ RRC+HC ++KTPQWR
Sbjct: 141 QLHSAAKF-----KPKKEQSGGGGGGGGRHQSSSSETTEGGGM--RRCTHCASEKTPQWR 193
Query: 248 TGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHRKVLEMRRKKE 301
TGPLGPKTLCNACGVR+KSGRL PEYRPA SPTF + HSNSHRKV+E+RR+KE
Sbjct: 194 TGPLGPKTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKE 247
>sp|O49743|GATA4_ARATH GATA transcription factor 4 OS=Arabidopsis thaliana GN=GATA4 PE=2
SV=1
Length = 240
Score = 139 bits (351), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 116/217 (53%), Gaps = 57/217 (26%)
Query: 89 LTNEFVEPVDDFAELEWVSQFVDDS----SCSELSLLYPNYVERTRSEPDGKPVSNKTST 144
T++ P DD A LEW+S+FVDDS + L++ + T GKP S ++
Sbjct: 63 FTHDLCVPSDDAAHLEWLSRFVDDSFSDFPANPLTMTVRPEISFT-----GKPRSRRSRA 117
Query: 145 NPTTTTSPCFPLRVPSKARTKRTRRSGWAWSSGSPLSTESTISSSSSTSCLIFTDSVQNI 204
PS A T WA P+S ES + S +
Sbjct: 118 P------------APSVAGT-------WA-----PMS-ESELCHSVA------------- 139
Query: 205 EWFSGFDEPVAKKLKKKPAVQSGGGLFQRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
+P KK+ +V + G RRC+HC ++KTPQWRTGPLGPKTLCNACGVRY
Sbjct: 140 -------KPKPKKVYNAESVTADGA---RRCTHCASEKTPQWRTGPLGPKTLCNACGVRY 189
Query: 265 KSGRLFPEYRPACSPTFSVDMHSNSHRKVLEMRRKKE 301
KSGRL PEYRPA SPTF + HSNSHRKV+E+RR+KE
Sbjct: 190 KSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKE 226
>sp|Q8LAU9|GATA1_ARATH GATA transcription factor 1 OS=Arabidopsis thaliana GN=GATA1 PE=2
SV=2
Length = 274
Score = 129 bits (323), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 55/88 (62%), Positives = 67/88 (76%)
Query: 219 KKKPAVQSGGGLFQRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACS 278
KK V + + R+C HC +KTPQWR GP GPKTLCNACGVRYKSGRL PEYRPA S
Sbjct: 180 KKTMTVAAAALIMGRKCQHCGAEKTPQWRAGPAGPKTLCNACGVRYKSGRLVPEYRPANS 239
Query: 279 PTFSVDMHSNSHRKVLEMRRKKESAGPD 306
PTF+ ++HSNSHRK++EMR++ +S D
Sbjct: 240 PTFTAELHSNSHRKIVEMRKQYQSGDGD 267
>sp|Q9SV30|GATA8_ARATH GATA transcription factor 8 OS=Arabidopsis thaliana GN=GATA8 PE=2
SV=1
Length = 322
Score = 126 bits (317), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 54/72 (75%), Positives = 61/72 (84%)
Query: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHRK 292
R+C HC+ KTPQWR GP+GPKTLCNACGVRYKSGRLFPEYRPA SPTF+ +HSNSH+K
Sbjct: 229 RKCMHCEVTKTPQWRLGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFTPALHSNSHKK 288
Query: 293 VLEMRRKKESAG 304
V EMR K+ S G
Sbjct: 289 VAEMRNKRCSDG 300
>sp|Q6DBP8|GAT11_ARATH GATA transcription factor 11 OS=Arabidopsis thaliana GN=GATA11 PE=2
SV=1
Length = 303
Score = 122 bits (305), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 51/67 (76%), Positives = 59/67 (88%)
Query: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHRK 292
R+C+HC+T KTPQWR GP GPKTLCNACGVR++SGRL PEYRPA SPTF +HSNSHRK
Sbjct: 220 RKCTHCETTKTPQWREGPSGPKTLCNACGVRFRSGRLVPEYRPASSPTFIPAVHSNSHRK 279
Query: 293 VLEMRRK 299
++EMRRK
Sbjct: 280 IIEMRRK 286
>sp|Q8VZP4|GAT10_ARATH GATA transcription factor 10 OS=Arabidopsis thaliana GN=GATA10 PE=2
SV=1
Length = 308
Score = 117 bits (293), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 50/67 (74%), Positives = 57/67 (85%)
Query: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHRK 292
R C+HC+T TPQWR GP GPKTLCNACGVR+KSGRL PEYRPA SPTF +HSNSHRK
Sbjct: 218 RICTHCETITTPQWRQGPSGPKTLCNACGVRFKSGRLVPEYRPASSPTFIPSVHSNSHRK 277
Query: 293 VLEMRRK 299
++EMR+K
Sbjct: 278 IIEMRKK 284
>sp|Q9M1U2|GAT14_ARATH GATA transcription factor 14 OS=Arabidopsis thaliana GN=GATA14 PE=2
SV=1
Length = 204
Score = 112 bits (279), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 47/78 (60%), Positives = 62/78 (79%)
Query: 229 GLFQRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSN 288
G+ + CSHC T+KTP WR GP G TLCNACG+RY++GRL PEYRPA SP F ++HSN
Sbjct: 111 GITDKSCSHCGTRKTPLWREGPRGAGTLCNACGMRYRTGRLLPEYRPASSPDFKPNVHSN 170
Query: 289 SHRKVLEMRRKKESAGPD 306
HRKV+E+RR+++S+ P+
Sbjct: 171 FHRKVMEIRRERKSSPPN 188
>sp|Q9SKN6|GAT13_ARATH Putative GATA transcription factor 13 OS=Arabidopsis thaliana
GN=GATA13 PE=3 SV=2
Length = 291
Score = 101 bits (252), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 42/65 (64%), Positives = 51/65 (78%)
Query: 235 CSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHRKVL 294
C+HC+T TPQWR GP G KTLCNACG+R++SGRL EYRPA SPTF +HSN H+K++
Sbjct: 193 CTHCETTTTPQWREGPNGRKTLCNACGIRFRSGRLVLEYRPAASPTFIPTVHSNLHKKII 252
Query: 295 EMRRK 299
MR K
Sbjct: 253 YMRMK 257
>sp|Q6QPM2|GAT19_ARATH GATA transcription factor 19 OS=Arabidopsis thaliana GN=GATA19 PE=2
SV=2
Length = 211
Score = 61.2 bits (147), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 28/36 (77%)
Query: 230 LFQRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYK 265
L RRC++C T TP WR GP GPK+LCNACG+R+K
Sbjct: 72 LLARRCANCDTTSTPLWRNGPRGPKSLCNACGIRFK 107
>sp|Q8LC79|GAT18_ARATH GATA transcription factor 18 OS=Arabidopsis thaliana GN=GATA18 PE=2
SV=2
Length = 295
Score = 60.5 bits (145), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 28/36 (77%)
Query: 230 LFQRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYK 265
L RRC++C T TP WR GP GPK+LCNACG+R+K
Sbjct: 149 LLARRCANCDTTSTPLWRNGPRGPKSLCNACGIRFK 184
>sp|Q55C49|GTAG_DICDI GATA zinc finger domain-containing protein 7 OS=Dictyostelium
discoideum GN=gtaG PE=4 SV=1
Length = 1006
Score = 59.7 bits (143), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 38/63 (60%), Gaps = 8/63 (12%)
Query: 207 FSGFDE-----PVAKKLKKKPAVQSGGGLFQRRCSHCQTQKTPQWRTGPLGPKTLCNACG 261
F+ +D+ PV K +++PA L+ C +C T+ TP+WR GP GP TLCNACG
Sbjct: 812 FTYYDQSFPHPPVKKTHRRRPANIDKSTLY---CHNCGTKNTPEWRRGPSGPATLCNACG 868
Query: 262 VRY 264
+ Y
Sbjct: 869 LAY 871
>sp|Q9ZPX0|GAT20_ARATH GATA transcription factor 20 OS=Arabidopsis thaliana GN=GATA20 PE=2
SV=2
Length = 208
Score = 58.9 bits (141), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 22/33 (66%), Positives = 26/33 (78%)
Query: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYK 265
RRC+ C T TP WR GP GPK+LCNACG+R+K
Sbjct: 92 RRCASCDTTSTPLWRNGPKGPKSLCNACGIRFK 124
>sp|Q8LC59|GAT23_ARATH GATA transcription factor 23 OS=Arabidopsis thaliana GN=GATA23 PE=2
SV=2
Length = 120
Score = 58.5 bits (140), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 28/36 (77%)
Query: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGR 268
R CS C+T KTP WR GP GPK+LCNACG+R++ R
Sbjct: 26 RCCSECKTTKTPMWRGGPTGPKSLCNACGIRHRKQR 61
>sp|Q54NM5|GTAL_DICDI GATA zinc finger domain-containing protein 12 OS=Dictyostelium
discoideum GN=gtaL PE=4 SV=1
Length = 640
Score = 57.8 bits (138), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 26/33 (78%)
Query: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYK 265
R C +C+T TP+WR GP G KTLCNACG+RY+
Sbjct: 504 RVCVNCKTSDTPEWRRGPQGAKTLCNACGIRYR 536
>sp|Q9SZI6|GAT22_ARATH Putative GATA transcription factor 22 OS=Arabidopsis thaliana
GN=GATA22 PE=2 SV=1
Length = 352
Score = 57.4 bits (137), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 22/31 (70%), Positives = 25/31 (80%)
Query: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVR 263
R CS C T KTP WR+GP GPK+LCNACG+R
Sbjct: 199 RICSDCNTTKTPLWRSGPRGPKSLCNACGIR 229
>sp|Q5HZ36|GAT21_ARATH GATA transcription factor 21 OS=Arabidopsis thaliana GN=GATA21 PE=1
SV=2
Length = 398
Score = 57.0 bits (136), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 22/33 (66%), Positives = 26/33 (78%)
Query: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYK 265
R CS C T KTP WR+GP GPK+LCNACG+R +
Sbjct: 230 RVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQR 262
>sp|Q9FJ10|GAT16_ARATH GATA transcription factor 16 OS=Arabidopsis thaliana GN=GATA16 PE=2
SV=1
Length = 139
Score = 56.2 bits (134), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 28/37 (75%)
Query: 232 QRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGR 268
++ C+ C T KTP WR GP+GPK+LCNACG+R + R
Sbjct: 35 KKTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKR 71
>sp|Q8LG10|GAT15_ARATH GATA transcription factor 15 OS=Arabidopsis thaliana GN=GATA15 PE=2
SV=2
Length = 149
Score = 55.5 bits (132), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 27/37 (72%)
Query: 232 QRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGR 268
++ C+ C T KTP WR GP GPK+LCNACG+R + R
Sbjct: 40 KKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKR 76
>sp|Q54KX0|GTAN_DICDI GATA zinc finger domain-containing protein 14 OS=Dictyostelium
discoideum GN=gtaN PE=4 SV=1
Length = 953
Score = 54.7 bits (130), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 33/53 (62%), Gaps = 8/53 (15%)
Query: 212 EPVAKKLKKKPAVQSGGGLFQRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
EP +K+KKK LF C+ C T +TP+WR GP G K+LCNACG+ Y
Sbjct: 878 EPPVEKVKKK-----ADTLF---CTSCGTTQTPEWRKGPAGGKSLCNACGLHY 922
>sp|Q75JZ0|GTAH_DICDI GATA zinc finger domain-containing protein 8 OS=Dictyostelium
discoideum GN=gtaH PE=4 SV=1
Length = 519
Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 34/54 (62%), Gaps = 4/54 (7%)
Query: 211 DEPVAKKLKKKPAVQSGGGLFQRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
D+ K+ K++ A + L C +C+T +TP+WR GP G K+LCNACG+ Y
Sbjct: 442 DKNAEKREKRREASR----LLNNVCRNCKTTETPEWRKGPDGTKSLCNACGLHY 491
>sp|Q55GK0|GTAE_DICDI GATA zinc finger domain-containing protein 5 OS=Dictyostelium
discoideum GN=gtaE PE=4 SV=1
Length = 952
Score = 53.5 bits (127), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 22/31 (70%)
Query: 234 RCSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
+C C T TP+WR GP GP TLCNACG+ Y
Sbjct: 240 KCYQCNTSNTPEWRKGPEGPATLCNACGLAY 270
>sp|Q00858|CGPB_FUSSO Cutinase gene palindrome-binding protein OS=Fusarium solani subsp.
pisi PE=2 SV=1
Length = 457
Score = 52.4 bits (124), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 24/30 (80%)
Query: 235 CSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
C+ C T +P+WR GP GPKTLCNACG+R+
Sbjct: 402 CTDCGTLDSPEWRKGPSGPKTLCNACGLRW 431
>sp|Q9LIB5|GAT17_ARATH GATA transcription factor 17 OS=Arabidopsis thaliana GN=GATA17 PE=2
SV=1
Length = 190
Score = 52.0 bits (123), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 26/37 (70%)
Query: 232 QRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGR 268
+R C C T +TP WR GP GPK+LCNACG++ + R
Sbjct: 41 KRTCVDCGTIRTPLWRGGPAGPKSLCNACGIKSRKKR 77
>sp|P78714|WC2_NEUCR White collar 2 protein OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=wc-2
PE=1 SV=1
Length = 530
Score = 51.6 bits (122), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 24/30 (80%)
Query: 235 CSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
C+ C T +P+WR GP GPKTLCNACG+R+
Sbjct: 468 CTDCGTLDSPEWRKGPSGPKTLCNACGLRW 497
>sp|Q01371|WC1_NEUCR White collar 1 protein OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=wc-1
PE=2 SV=2
Length = 1167
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 31/41 (75%), Gaps = 2/41 (4%)
Query: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRY--KSGRLFP 271
R C++C T+ TP+WR GP G + LCN+CG+R+ ++GR+ P
Sbjct: 932 RDCANCHTRNTPEWRRGPSGNRDLCNSCGLRWAKQTGRVSP 972
>sp|P40209|GAT2_YEAST Protein GAT2 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=GAT2 PE=4 SV=1
Length = 560
Score = 51.2 bits (121), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 27/49 (55%), Gaps = 9/49 (18%)
Query: 235 CSHCQTQKTPQWRTGPLGPKTLCNACGVRY---------KSGRLFPEYR 274
C HC +TP+WR GP G +TLCNACG+ Y KS L YR
Sbjct: 472 CFHCGETETPEWRKGPYGTRTLCNACGLFYRKVTKKFGSKSSNLLLRYR 520
>sp|Q9LT45|GAT29_ARATH GATA transcription factor 29 OS=Arabidopsis thaliana GN=GATA29 PE=2
SV=1
Length = 208
Score = 51.2 bits (121), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 27/36 (75%), Gaps = 2/36 (5%)
Query: 232 QRRCSH--CQTQKTPQWRTGPLGPKTLCNACGVRYK 265
++C++ C TP WR GPLGPK+LCNACG++++
Sbjct: 156 MKKCTNMNCNALNTPMWRRGPLGPKSLCNACGIKFR 191
>sp|Q54TM6|GTAI_DICDI GATA zinc finger domain-containing protein 9 OS=Dictyostelium
discoideum GN=gtaI PE=4 SV=1
Length = 536
Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 235 CSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
C HC T TP+WR GP G K+LCNACG+ Y
Sbjct: 479 CRHCGTTDTPEWRRGPDGRKSLCNACGLHY 508
>sp|Q75JZ1|GTAC_DICDI GATA zinc finger domain-containing protein 3 OS=Dictyostelium
discoideum GN=gtaC PE=4 SV=1
Length = 587
Score = 50.1 bits (118), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 235 CSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
C C T +TP+WR GP G KTLCNACG+ Y
Sbjct: 500 CIFCGTMETPEWRKGPGGHKTLCNACGLHY 529
>sp|Q54TE3|GTAJ_DICDI GATA zinc finger domain-containing protein 10 OS=Dictyostelium
discoideum GN=gtaJ PE=4 SV=1
Length = 714
Score = 49.3 bits (116), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%)
Query: 234 RCSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
+C +C+ +TP+WR GP G TLCNACG+ Y
Sbjct: 630 KCHYCEVTETPEWRRGPDGDHTLCNACGLHY 660
>sp|Q1WG82|ZGLP1_MOUSE GATA-type zinc finger protein 1 OS=Mus musculus GN=Zglp1 PE=2 SV=1
Length = 266
Score = 48.1 bits (113), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 40/71 (56%), Gaps = 6/71 (8%)
Query: 200 SVQNIEWFSGFDEPVAKKLKKKP--AVQSGG---GLFQRRCSHCQTQKTPQWRTGPLGPK 254
S+Q + +S ++KL P A+ S G L RRC+ C+TQ+TP WR G
Sbjct: 157 SLQIVPTYSLPGRSCSQKLPASPSKALASPGSSEALGPRRCASCRTQRTPLWRDAEDG-T 215
Query: 255 TLCNACGVRYK 265
LCNACG+RYK
Sbjct: 216 PLCNACGIRYK 226
>sp|Q07928|GAT3_YEAST Protein GAT3 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=GAT3 PE=4 SV=1
Length = 141
Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 224 VQSGGGLFQRRCSHCQTQKT-PQWRTGPLGPKTLCNACGVRYK 265
VQ G+ RRC C KT PQWR GP G TLCNACG+ Y+
Sbjct: 62 VQKRKGV-TRRCPQCAVIKTSPQWREGPDGEVTLCNACGLFYR 103
>sp|B0G188|GTAP_DICDI GATA zinc finger domain-containing protein 16 OS=Dictyostelium
discoideum GN=gtaP PE=4 SV=1
Length = 695
Score = 46.6 bits (109), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 215 AKKLKKKPAVQSGGGLFQRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
AKK ++ ++ GG + C C TP+WR GP G KTLCNACG+ +
Sbjct: 461 AKKPARRRKMRYGG--TELSCHTCGVTNTPEWRRGPNGAKTLCNACGLAW 508
>sp|P0C6A0|ZGLP1_HUMAN GATA-type zinc finger protein 1 OS=Homo sapiens GN=ZGLP1 PE=2 SV=1
Length = 271
Score = 45.8 bits (107), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 29/53 (54%), Gaps = 10/53 (18%)
Query: 222 PAVQSGG---------GLFQRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYK 265
PA GG L RRC+ C+TQ+TP WR G LCNACG+RYK
Sbjct: 184 PAAHPGGTEAHSAGSEALEPRRCASCRTQRTPLWRDAEDG-TPLCNACGIRYK 235
>sp|Q55EQ0|GTAF_DICDI GATA zinc finger domain-containing protein 6 OS=Dictyostelium
discoideum GN=gtaF PE=4 SV=1
Length = 623
Score = 45.8 bits (107), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 235 CSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
C C +T QWR GP G K+LCNACG+R+
Sbjct: 320 CHSCGETQTSQWRRGPDGCKSLCNACGIRF 349
>sp|Q8W4H1|GAT26_ARATH GATA transcription factor 26 OS=Arabidopsis thaliana GN=GATA26 PE=2
SV=1
Length = 510
Score = 45.4 bits (106), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 21/41 (51%)
Query: 235 CSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRP 275
C HC TP WR GP LCNACG R+++ Y P
Sbjct: 7 CYHCGVTNTPLWRNGPPEKPVLCNACGSRWRTKGTLVNYTP 47
>sp|Q550D5|GTAA_DICDI Transcription factor stalky OS=Dictyostelium discoideum GN=stkA
PE=1 SV=1
Length = 872
Score = 45.4 bits (106), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYK 265
R C C + +TP WR GP G +LCNACG++++
Sbjct: 292 RSCEFCGSSQTPTWRRGPSGKGSLCNACGIKWR 324
>sp|Q5KSV0|GTAK_DICDI GATA zinc finger domain-containing protein 11 OS=Dictyostelium
discoideum GN=gtaK PE=2 SV=1
Length = 650
Score = 45.1 bits (105), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 24/32 (75%)
Query: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
++C+ C T +P+WR GP G ++LCNACG+ +
Sbjct: 520 KQCTSCGTTSSPEWRKGPAGNQSLCNACGLYF 551
>sp|Q54HA4|GTAO_DICDI GATA zinc finger domain-containing protein 15 (Fragment)
OS=Dictyostelium discoideum GN=gtaO PE=4 SV=1
Length = 511
Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 22/30 (73%)
Query: 235 CSHCQTQKTPQWRTGPLGPKTLCNACGVRY 264
C C T+ +P+WR GP G K+LCNACG+ Y
Sbjct: 453 CQACGTRASPEWRKGPDGFKSLCNACGLYY 482
>sp|Q5PP38|GAT27_ARATH GATA transcription factor 27 OS=Arabidopsis thaliana GN=GATA27 PE=2
SV=1
Length = 470
Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 25/57 (43%)
Query: 235 CSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHR 291
C HC TP WR GP LCNACG R+++ Y P + + HR
Sbjct: 7 CYHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLVNYTPLHARAEGDETEIEDHR 63
>sp|Q54WY0|GTAR_DICDI GATA zinc finger domain-containing protein 18 OS=Dictyostelium
discoideum GN=gtaR PE=4 SV=1
Length = 237
Score = 44.3 bits (103), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 31/51 (60%), Gaps = 8/51 (15%)
Query: 215 AKKLKKKPAVQSGGGLFQRRCSHCQTQKTPQWRTGPLGPKT--LCNACGVR 263
+KKLK + GG CS C+TQ+TP WR G G KT LCNACG++
Sbjct: 171 SKKLKPMAKPRPGG------CSICKTQETPYWRKGKDGDKTVYLCNACGLQ 215
>sp|Q8H1G0|GAT28_ARATH GATA transcription factor 28 OS=Arabidopsis thaliana GN=GATA28 PE=2
SV=1
Length = 302
Score = 43.1 bits (100), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 235 CSHCQT--QKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACSPT 280
C HC + TP R GP GP+TLCNACG+ + + F + A T
Sbjct: 223 CRHCGIGEKSTPMMRRGPAGPRTLCNACGLMWANKGAFRDLSKASPQT 270
>sp|Q8GXL7|GAT24_ARATH GATA transcription factor 24 OS=Arabidopsis thaliana GN=GATA24 PE=2
SV=2
Length = 297
Score = 43.1 bits (100), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 22/32 (68%), Gaps = 2/32 (6%)
Query: 235 CSHCQT--QKTPQWRTGPLGPKTLCNACGVRY 264
C HC T + TP R GP GP+TLCNACG+ +
Sbjct: 219 CRHCGTSEKSTPMMRRGPDGPRTLCNACGLMW 250
>sp|P18494|GLN3_YEAST Nitrogen regulatory protein GLN3 OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=GLN3 PE=1 SV=2
Length = 730
Score = 42.7 bits (99), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 41/83 (49%), Gaps = 20/83 (24%)
Query: 220 KKPAVQSGGGLFQRRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACSP 279
KKP +Q C +C+T KTP WR P G TLCNACG+ K L RP
Sbjct: 300 KKPLIQ---------CFNCKTFKTPLWRRSPEG-NTLCNACGLFQK---LHGTMRP---- 342
Query: 280 TFSVDMHSNSHRKVLEMRRKKES 302
+ + S+ +K + +R K++
Sbjct: 343 ---LSLKSDVIKKRISKKRAKQT 362
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.315 0.130 0.397
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 123,716,558
Number of Sequences: 539616
Number of extensions: 5376866
Number of successful extensions: 18340
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 85
Number of HSP's successfully gapped in prelim test: 180
Number of HSP's that attempted gapping in prelim test: 17477
Number of HSP's gapped (non-prelim): 836
length of query: 316
length of database: 191,569,459
effective HSP length: 117
effective length of query: 199
effective length of database: 128,434,387
effective search space: 25558443013
effective search space used: 25558443013
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (28.1 bits)