BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019919
(334 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P69781|GAT12_ARATH GATA transcription factor 12 OS=Arabidopsis thaliana GN=GATA12 PE=2
SV=1
Length = 331
Score = 249 bits (635), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 162/339 (47%), Positives = 201/339 (59%), Gaps = 63/339 (18%)
Query: 39 NFTVEDLL-DFSNDDAIMNDGGFFENVAANSTDSSTVTSNSAVSGGENNFPANFSGCRSS 97
+F V+DLL DFSNDD ND V A+ST ++T+T +S S + P+ +
Sbjct: 13 DFAVDDLLVDFSNDDDEEND------VVADSTTTTTITDSSNFSAA--DLPSFHGDVQDG 64
Query: 98 NSFADSQFCGELCVPYDDLA-ELEWLSNFVEDSFSMDQTLQKSDLEFLSGSKSL-TPESS 155
SF+ G+LC+P DDLA ELEWLSN V++S S + + LE +SG KS P+S
Sbjct: 65 TSFS-----GDLCIPSDDLADELEWLSNIVDESLSPEDVHK---LELISGFKSRPDPKSD 116
Query: 156 SSSTRLEPVSPKASNPVFLPETPLPGKARSKRPRAAPCDWSTRLL--------------- 200
+ S P +P +S+P+F + +P KARSKR RAA C+W++R L
Sbjct: 117 TGS----PENPNSSSPIFTTDVSVPAKARSKRSRAAACNWASRGLLKETFYDSPFTGETI 172
Query: 201 -----HVSP------------KGPSVE---RESPNPLMHTESGDSVRKCLHCASEKTPQW 240
H+SP K +V+ R + G R+CLHCA++KTPQW
Sbjct: 173 LSSQQHLSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAEERRCLHCATDKTPQW 232
Query: 241 RTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVSAKHSNSHRKVMELRRQKEMQRAP 300
RTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFV AKHSNSHRKVMELRRQKEM RA
Sbjct: 233 RTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQKEMSRAH 292
Query: 301 QQQFLGQRS-----IFGVSNGADDYLIHHPSGPDFRHMI 334
+ IF VS+ DDYLIHH GPDFR +I
Sbjct: 293 HEFIHHHHGTDTAMIFDVSSDGDDYLIHHNVGPDFRQLI 331
>sp|O82632|GATA9_ARATH GATA transcription factor 9 OS=Arabidopsis thaliana GN=GATA9 PE=2
SV=1
Length = 308
Score = 236 bits (602), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 186/311 (59%), Gaps = 36/311 (11%)
Query: 40 FTVEDLLDFSNDDAIMNDGGFFENVAANSTDSSTVTSNSAVSGGENNFPANFSGCRSSNS 99
F V+DLLDFSNDD ++DG N+ S+ S ++ N SS+
Sbjct: 18 FVVDDLLDFSNDDGEVDDG-------LNTLPDSSTLSTGTLTDSSN----------SSSL 60
Query: 100 FADSQFCGELCVPYDDLAELEWLSNFVEDSFSMDQTLQKSDLEFLSGSKSLTPESSSSST 159
F D +L +P DD+AELEWLSNFVE+SF+ + + L SG K+ S+ +
Sbjct: 61 FTDGTGFSDLYIPNDDIAELEWLSNFVEESFAGED---QDKLHLFSGLKNPQTTGSTLTH 117
Query: 160 RLEPVSPKASNPVFLPET--PLPGKARSKRPRAAPCDWSTRLLHVS--------PKGPSV 209
++P + + E+ +P KARSKR R+A W++RLL ++ K V
Sbjct: 118 LIKPEPELDHQFIDIDESNVAVPAKARSKRSRSAASTWASRLLSLADSDETNPKKKQRRV 177
Query: 210 ERESPNPLMHTESGDSV--RKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEY 267
+ + M + G+S R+CLHCA+EKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEY
Sbjct: 178 KEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEY 237
Query: 268 RPAASPTFVSAKHSNSHRKVMELRRQKEMQ-RAPQQQFLGQRSIFGVSNGADDYLIHHPS 326
RPA+SPTFV A+HSNSHRKVMELRRQKEM+ Q + + + + +D+L+H+ +
Sbjct: 238 RPASSPTFVMARHSNSHRKVMELRRQKEMRDEHLLSQLRCENLLMDIRSNGEDFLMHNNT 297
Query: 327 ---GPDFRHMI 334
PDFRH+I
Sbjct: 298 NHVAPDFRHLI 308
>sp|O49743|GATA4_ARATH GATA transcription factor 4 OS=Arabidopsis thaliana GN=GATA4 PE=2
SV=1
Length = 240
Score = 176 bits (446), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 104/205 (50%), Positives = 121/205 (59%), Gaps = 47/205 (22%)
Query: 103 SQFCGELCVPYDDLAELEWLSNFVEDSFSMDQTLQKSDLEFLSGSKSLTPESSSSSTRLE 162
+ F +LCVP DD A LEWLS FV+DSFS +F + ++T
Sbjct: 61 TDFTHDLCVPSDDAAHLEWLSRFVDDSFS----------DFPANPLTMT----------- 99
Query: 163 PVSPKASNPVFLPETPLPGKARSKRPRA-APC---DWS----TRLLHVSPKGPSVERESP 214
PE GK RS+R RA AP W+ + L H SV + P
Sbjct: 100 ----------VRPEISFTGKPRSRRSRAPAPSVAGTWAPMSESELCH------SVAKPKP 143
Query: 215 NPLMHTES--GDSVRKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAAS 272
+ + ES D R+C HCASEKTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+S
Sbjct: 144 KKVYNAESVTADGARRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPASS 203
Query: 273 PTFVSAKHSNSHRKVMELRRQKEMQ 297
PTFV +HSNSHRKVMELRRQKE Q
Sbjct: 204 PTFVLTQHSNSHRKVMELRRQKEQQ 228
>sp|O49741|GATA2_ARATH GATA transcription factor 2 OS=Arabidopsis thaliana GN=GATA2 PE=2
SV=1
Length = 264
Score = 174 bits (442), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 122/218 (55%), Gaps = 48/218 (22%)
Query: 105 FCGELCVPYDDLAELEWLSNFVEDSFSMDQTLQKSDLEFLSGSKSLTPESSSSSTRLEPV 164
F ++CVP DD A LEWLS FV+DSF+ +F + T S +
Sbjct: 65 FLHDICVPSDDAAHLEWLSQFVDDSFA----------DFPANPLGGTMTSVKT------- 107
Query: 165 SPKASNPVFLPETPLPGKARSKRPRA----------APCDWSTRLLHVSPKGPSVERESP 214
ET PGK RSKR RA P + + LH + K + +S
Sbjct: 108 -----------ETSFPGKPRSKRSRAPAPFAGTWSPMPLESEHQQLHSAAKFKPKKEQSG 156
Query: 215 NPLM----------HTESGDSVRKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLV 264
T G +R+C HCASEKTPQWRTGP+GPKTLCNACGVR+KSGRLV
Sbjct: 157 GGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFKSGRLV 216
Query: 265 PEYRPAASPTFVSAKHSNSHRKVMELRRQKEMQRAPQQ 302
PEYRPA+SPTFV +HSNSHRKVMELRRQKE+ R PQQ
Sbjct: 217 PEYRPASSPTFVLTQHSNSHRKVMELRRQKEVMRQPQQ 254
>sp|Q9FH57|GATA5_ARATH GATA transcription factor 5 OS=Arabidopsis thaliana GN=GATA5 PE=2
SV=1
Length = 339
Score = 154 bits (389), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 107/248 (43%), Positives = 129/248 (52%), Gaps = 58/248 (23%)
Query: 89 ANFSGCRSSNSFADSQFCGELCVPYDDLAELEWLSNFVEDSFSMDQTLQKSDLEF----L 144
++FSGC S S EL +P DDLA LEWLS+FVEDSF+ E+ L
Sbjct: 87 SDFSGCDDFGSLPTS----ELSLPADDLANLEWLSHFVEDSFT----------EYSGPNL 132
Query: 145 SGSKSLTPESSSSSTRLEPVSPKASNPVFLPETPLPGKARSKRPRAAPCDWSTRLLHVSP 204
+G+ + P + + R PV+ F ++P+P KARSKR R WS S
Sbjct: 133 TGTPTEKP-AWLTGDRKHPVTAVTEETCF--KSPVPAKARSKRNRNGLKVWSLGSSSSSG 189
Query: 205 KG------------------------PSVERESPN-PLMHTE-SGDSV-----------R 227
P V E P P H + S +SV R
Sbjct: 190 PSSSGSTSSSSSGPSSPWFSGAELLEPVVTSERPPFPKKHKKRSAESVFSGELQQLQPQR 249
Query: 228 KCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVSAKHSNSHRKV 287
KC HC +KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF S HSN HRKV
Sbjct: 250 KCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKV 309
Query: 288 MELRRQKE 295
+E+RR+KE
Sbjct: 310 IEMRRKKE 317
>sp|O65515|GATA7_ARATH GATA transcription factor 7 OS=Arabidopsis thaliana GN=GATA7 PE=2
SV=1
Length = 238
Score = 140 bits (352), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 104/281 (37%), Positives = 131/281 (46%), Gaps = 84/281 (29%)
Query: 39 NFTVEDLLDFSNDDAIMNDGG--FFENVAANSTDSSTVTSNSAVSGGEN--NFPANFSGC 94
+F+V+DLLD SN D + E+ S ++ +S E+ +FP +
Sbjct: 10 DFSVDDLLDLSNADTSLESSSSQRKEDEQEREKFKSFSDQSTRLSPPEDLLSFPGD---- 65
Query: 95 RSSNSFADSQFCGELCVPYDDLAELEWLSNFVEDSFSMDQTLQKSDLEFLSGSKSLTPES 154
P DL +LEWLSNFVEDSFS ++ SD
Sbjct: 66 ----------------APVGDLEDLEWLSNFVEDSFS--ESYISSDF------------- 94
Query: 155 SSSSTRLEPVSPKASNPVFLPETPLPGKARSKRPRAAPCDWSTRLLHVSPKGPSVERESP 214
PV+P AS V + +P K RSKR R WS ESP
Sbjct: 95 --------PVNPVAS--VEVRRQCVPVKPRSKRRRTNGRIWSM--------------ESP 130
Query: 215 NPLMHTESG--------------------DSVRKCL-HCASEKTPQWRTGPMGPKTLCNA 253
+PL+ T +R+C HC +KTPQWR GP+G KTLCNA
Sbjct: 131 SPLLSTAVARRKKRGRQKVDASYGGVVQQQQLRRCCSHCGVQKTPQWRMGPLGAKTLCNA 190
Query: 254 CGVRYKSGRLVPEYRPAASPTFVSAKHSNSHRKVMELRRQK 294
CGVR+KSGRL+PEYRPA SPTF + HSNSHRKV+ELR K
Sbjct: 191 CGVRFKSGRLLPEYRPACSPTFTNEIHSNSHRKVLELRLMK 231
>sp|Q6DBP8|GAT11_ARATH GATA transcription factor 11 OS=Arabidopsis thaliana GN=GATA11 PE=2
SV=1
Length = 303
Score = 128 bits (322), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/130 (51%), Positives = 85/130 (65%), Gaps = 19/130 (14%)
Query: 175 PETPLPGKARSKRPRAAPCDWST-------RLLHVSPKGPSVERESPNPLMHTESGDSVR 227
PE PGK S+ C +S+ R +H++ + S E+ N S VR
Sbjct: 173 PEKSTPGKPESE------CYFSSEQHAKKKRKIHLTTRTVSSTLEASN------SDGIVR 220
Query: 228 KCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVSAKHSNSHRKV 287
KC HC + KTPQWR GP GPKTLCNACGVR++SGRLVPEYRPA+SPTF+ A HSNSHRK+
Sbjct: 221 KCTHCETTKTPQWREGPSGPKTLCNACGVRFRSGRLVPEYRPASSPTFIPAVHSNSHRKI 280
Query: 288 MELRRQKEMQ 297
+E+RR+ + Q
Sbjct: 281 IEMRRKDDEQ 290
>sp|Q8LAU9|GATA1_ARATH GATA transcription factor 1 OS=Arabidopsis thaliana GN=GATA1 PE=2
SV=2
Length = 274
Score = 125 bits (314), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 53/67 (79%), Positives = 59/67 (88%)
Query: 227 RKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVSAKHSNSHRK 286
RKC HC +EKTPQWR GP GPKTLCNACGVRYKSGRLVPEYRPA SPTF + HSNSHRK
Sbjct: 194 RKCQHCGAEKTPQWRAGPAGPKTLCNACGVRYKSGRLVPEYRPANSPTFTAELHSNSHRK 253
Query: 287 VMELRRQ 293
++E+R+Q
Sbjct: 254 IVEMRKQ 260
>sp|Q9SV30|GATA8_ARATH GATA transcription factor 8 OS=Arabidopsis thaliana GN=GATA8 PE=2
SV=1
Length = 322
Score = 125 bits (314), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 53/69 (76%), Positives = 59/69 (85%)
Query: 226 VRKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVSAKHSNSHR 285
+RKC+HC KTPQWR GPMGPKTLCNACGVRYKSGRL PEYRPAASPTF A HSNSH+
Sbjct: 228 LRKCMHCEVTKTPQWRLGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFTPALHSNSHK 287
Query: 286 KVMELRRQK 294
KV E+R ++
Sbjct: 288 KVAEMRNKR 296
>sp|Q8L4M6|GATA3_ARATH GATA transcription factor 3 OS=Arabidopsis thaliana GN=GATA3 PE=2
SV=2
Length = 269
Score = 125 bits (313), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 77/190 (40%), Positives = 100/190 (52%), Gaps = 37/190 (19%)
Query: 111 VPYDDLAELEWLSNFVEDSFSMDQTLQKSDLEFLSGSKSLTPESSSSSTRLEPVSPKASN 170
+P +D+ ELEW+S V+D S PE S T+ P S
Sbjct: 93 LPDEDVEELEWVSRVVDDCSS--------------------PEVSLLLTQTHKTKPSFSR 132
Query: 171 PVFLPETPLPGKARSKRPRAAPCDWSTRLLHVSPKGPSVERESPNPLMHTESGDSV---- 226
+P K R+KR R S V P + + + + + ++V
Sbjct: 133 --------IPVKPRTKRSRN-----SLTGSRVWPLVSTNHQHAATEQLRKKKQETVLVFQ 179
Query: 227 RKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVSAKHSNSHRK 286
R+C HC + TPQWRTGP+GPKTLCNACGVR+KSGRL PEYRPA SPTF + HSN HRK
Sbjct: 180 RRCSHCGTNNTPQWRTGPVGPKTLCNACGVRFKSGRLCPEYRPADSPTFSNEIHSNLHRK 239
Query: 287 VMELRRQKEM 296
V+ELR+ KE+
Sbjct: 240 VLELRKSKEL 249
>sp|Q9SD38|GATA6_ARATH GATA transcription factor 6 OS=Arabidopsis thaliana GN=GATA6 PE=2
SV=1
Length = 312
Score = 122 bits (306), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 53/81 (65%), Positives = 62/81 (76%)
Query: 227 RKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVSAKHSNSHRK 286
R+C HC +KTPQWR GP+G KTLCNACGVRYKSGRL+PEYRPA SPTF S HSN H K
Sbjct: 221 RQCGHCGVQKTPQWRAGPLGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHSK 280
Query: 287 VMELRRQKEMQRAPQQQFLGQ 307
V+E+RR+KE ++ L Q
Sbjct: 281 VIEMRRKKETSDGAEETGLNQ 301
>sp|Q8VZP4|GAT10_ARATH GATA transcription factor 10 OS=Arabidopsis thaliana GN=GATA10 PE=2
SV=1
Length = 308
Score = 119 bits (298), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 51/79 (64%), Positives = 64/79 (81%)
Query: 217 LMHTESGDSVRKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFV 276
L ++S VR C HC + TPQWR GP GPKTLCNACGVR+KSGRLVPEYRPA+SPTF+
Sbjct: 208 LESSKSDGIVRICTHCETITTPQWRQGPSGPKTLCNACGVRFKSGRLVPEYRPASSPTFI 267
Query: 277 SAKHSNSHRKVMELRRQKE 295
+ HSNSHRK++E+R++ +
Sbjct: 268 PSVHSNSHRKIIEMRKKDD 286
>sp|Q9M1U2|GAT14_ARATH GATA transcription factor 14 OS=Arabidopsis thaliana GN=GATA14 PE=2
SV=1
Length = 204
Score = 109 bits (272), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 47/84 (55%), Positives = 63/84 (75%)
Query: 212 ESPNPLMHTESGDSVRKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAA 271
+SP+ L ++ G + + C HC + KTP WR GP G TLCNACG+RY++GRL+PEYRPA+
Sbjct: 100 KSPSDLFDSKFGITDKSCSHCGTRKTPLWREGPRGAGTLCNACGMRYRTGRLLPEYRPAS 159
Query: 272 SPTFVSAKHSNSHRKVMELRRQKE 295
SP F HSN HRKVME+RR+++
Sbjct: 160 SPDFKPNVHSNFHRKVMEIRRERK 183
>sp|Q9SKN6|GAT13_ARATH Putative GATA transcription factor 13 OS=Arabidopsis thaliana
GN=GATA13 PE=3 SV=2
Length = 291
Score = 102 bits (253), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 42/69 (60%), Positives = 52/69 (75%)
Query: 229 CLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVSAKHSNSHRKVM 288
C HC + TPQWR GP G KTLCNACG+R++SGRLV EYRPAASPTF+ HSN H+K++
Sbjct: 193 CTHCETTTTPQWREGPNGRKTLCNACGIRFRSGRLVLEYRPAASPTFIPTVHSNLHKKII 252
Query: 289 ELRRQKEMQ 297
+R + Q
Sbjct: 253 YMRMKDNDQ 261
>sp|Q54NM5|GTAL_DICDI GATA zinc finger domain-containing protein 12 OS=Dictyostelium
discoideum GN=gtaL PE=4 SV=1
Length = 640
Score = 58.9 bits (141), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 20/33 (60%), Positives = 26/33 (78%)
Query: 227 RKCLHCASEKTPQWRTGPMGPKTLCNACGVRYK 259
R C++C + TP+WR GP G KTLCNACG+RY+
Sbjct: 504 RVCVNCKTSDTPEWRRGPQGAKTLCNACGIRYR 536
>sp|Q9FJ10|GAT16_ARATH GATA transcription factor 16 OS=Arabidopsis thaliana GN=GATA16 PE=2
SV=1
Length = 139
Score = 58.2 bits (139), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 37/69 (53%)
Query: 194 DWSTRLLHVSPKGPSVERESPNPLMHTESGDSVRKCLHCASEKTPQWRTGPMGPKTLCNA 253
D S ++L V + E +T D + C C + KTP WR GP+GPK+LCNA
Sbjct: 3 DHSEKVLLVDSETMKTRAEDMIEQNNTSVNDKKKTCADCGTSKTPLWRGGPVGPKSLCNA 62
Query: 254 CGVRYKSGR 262
CG+R + R
Sbjct: 63 CGIRNRKKR 71
>sp|Q8LC59|GAT23_ARATH GATA transcription factor 23 OS=Arabidopsis thaliana GN=GATA23 PE=2
SV=2
Length = 120
Score = 57.4 bits (137), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 31/45 (68%), Gaps = 1/45 (2%)
Query: 218 MHTESGDSVRKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGR 262
M E G ++R C C + KTP WR GP GPK+LCNACG+R++ R
Sbjct: 18 MKEEKG-TIRCCSECKTTKTPMWRGGPTGPKSLCNACGIRHRKQR 61
>sp|P40209|GAT2_YEAST Protein GAT2 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=GAT2 PE=4 SV=1
Length = 560
Score = 56.6 bits (135), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 37/76 (48%), Gaps = 13/76 (17%)
Query: 202 VSPKGPSVERESPNPLMHTESGDSVRKCLHCASEKTPQWRTGPMGPKTLCNACGVRY--- 258
VS P+ + NP + + C HC +TP+WR GP G +TLCNACG+ Y
Sbjct: 449 VSTTTPAANSDEKNP----NAKKIIEFCFHCGETETPEWRKGPYGTRTLCNACGLFYRKV 504
Query: 259 ------KSGRLVPEYR 268
KS L+ YR
Sbjct: 505 TKKFGSKSSNLLLRYR 520
>sp|Q6QPM2|GAT19_ARATH GATA transcription factor 19 OS=Arabidopsis thaliana GN=GATA19 PE=2
SV=2
Length = 211
Score = 56.2 bits (134), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 20/34 (58%), Positives = 26/34 (76%)
Query: 226 VRKCLHCASEKTPQWRTGPMGPKTLCNACGVRYK 259
R+C +C + TP WR GP GPK+LCNACG+R+K
Sbjct: 74 ARRCANCDTTSTPLWRNGPRGPKSLCNACGIRFK 107
>sp|Q9ZPX0|GAT20_ARATH GATA transcription factor 20 OS=Arabidopsis thaliana GN=GATA20 PE=2
SV=2
Length = 208
Score = 56.2 bits (134), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 20/33 (60%), Positives = 25/33 (75%)
Query: 227 RKCLHCASEKTPQWRTGPMGPKTLCNACGVRYK 259
R+C C + TP WR GP GPK+LCNACG+R+K
Sbjct: 92 RRCASCDTTSTPLWRNGPKGPKSLCNACGIRFK 124
>sp|Q9SZI6|GAT22_ARATH Putative GATA transcription factor 22 OS=Arabidopsis thaliana
GN=GATA22 PE=2 SV=1
Length = 352
Score = 56.2 bits (134), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 25/32 (78%)
Query: 226 VRKCLHCASEKTPQWRTGPMGPKTLCNACGVR 257
+R C C + KTP WR+GP GPK+LCNACG+R
Sbjct: 198 IRICSDCNTTKTPLWRSGPRGPKSLCNACGIR 229
>sp|Q5HZ36|GAT21_ARATH GATA transcription factor 21 OS=Arabidopsis thaliana GN=GATA21 PE=1
SV=2
Length = 398
Score = 55.8 bits (133), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 20/34 (58%), Positives = 26/34 (76%)
Query: 226 VRKCLHCASEKTPQWRTGPMGPKTLCNACGVRYK 259
+R C C + KTP WR+GP GPK+LCNACG+R +
Sbjct: 229 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQR 262
>sp|Q8LG10|GAT15_ARATH GATA transcription factor 15 OS=Arabidopsis thaliana GN=GATA15 PE=2
SV=2
Length = 149
Score = 55.5 bits (132), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 43/88 (48%), Gaps = 25/88 (28%)
Query: 227 RKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVSA----KHSN 282
+ C C + KTP WR GP GPK+LCNACG+R + R T +S K
Sbjct: 41 KSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKR----------RTLISNRSEDKKKK 90
Query: 283 SH-----------RKVMELRRQKEMQRA 299
SH +++MEL R+ MQR+
Sbjct: 91 SHNRNPKFGDSLKQRLMELGREVMMQRS 118
>sp|Q8LC79|GAT18_ARATH GATA transcription factor 18 OS=Arabidopsis thaliana GN=GATA18 PE=2
SV=2
Length = 295
Score = 55.5 bits (132), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 20/34 (58%), Positives = 26/34 (76%)
Query: 226 VRKCLHCASEKTPQWRTGPMGPKTLCNACGVRYK 259
R+C +C + TP WR GP GPK+LCNACG+R+K
Sbjct: 151 ARRCANCDTTSTPLWRNGPRGPKSLCNACGIRFK 184
>sp|Q9LIB5|GAT17_ARATH GATA transcription factor 17 OS=Arabidopsis thaliana GN=GATA17 PE=2
SV=1
Length = 190
Score = 54.7 bits (130), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 28/39 (71%)
Query: 224 DSVRKCLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGR 262
D+ R C+ C + +TP WR GP GPK+LCNACG++ + R
Sbjct: 39 DTKRTCVDCGTIRTPLWRGGPAGPKSLCNACGIKSRKKR 77
>sp|Q55GK0|GTAE_DICDI GATA zinc finger domain-containing protein 5 OS=Dictyostelium
discoideum GN=gtaE PE=4 SV=1
Length = 952
Score = 54.3 bits (129), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 22/31 (70%)
Query: 228 KCLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
KC C + TP+WR GP GP TLCNACG+ Y
Sbjct: 240 KCYQCNTSNTPEWRKGPEGPATLCNACGLAY 270
>sp|Q55C49|GTAG_DICDI GATA zinc finger domain-containing protein 7 OS=Dictyostelium
discoideum GN=gtaG PE=4 SV=1
Length = 1006
Score = 53.1 bits (126), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 24/34 (70%)
Query: 225 SVRKCLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
S C +C ++ TP+WR GP GP TLCNACG+ Y
Sbjct: 838 STLYCHNCGTKNTPEWRRGPSGPATLCNACGLAY 871
>sp|Q01371|WC1_NEUCR White collar 1 protein OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=wc-1
PE=2 SV=2
Length = 1167
Score = 52.8 bits (125), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 39/71 (54%), Gaps = 9/71 (12%)
Query: 226 VRKCLHCASEKTPQWRTGPMGPKTLCNACGVRY--KSGRLVPEYRPAASPTFVSAKHSNS 283
VR C +C + TP+WR GP G + LCN+CG+R+ ++GR+ P +K SNS
Sbjct: 931 VRDCANCHTRNTPEWRRGPSGNRDLCNSCGLRWAKQTGRVSPRTSSRGGNGDSMSKKSNS 990
Query: 284 -------HRKV 287
HR+V
Sbjct: 991 PSHSSPLHREV 1001
>sp|Q9LT45|GAT29_ARATH GATA transcription factor 29 OS=Arabidopsis thaliana GN=GATA29 PE=2
SV=1
Length = 208
Score = 52.8 bits (125), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 30/39 (76%), Gaps = 2/39 (5%)
Query: 224 DSVRKC--LHCASEKTPQWRTGPMGPKTLCNACGVRYKS 260
+ ++KC ++C + TP WR GP+GPK+LCNACG++++
Sbjct: 154 EGMKKCTNMNCNALNTPMWRRGPLGPKSLCNACGIKFRK 192
>sp|Q00858|CGPB_FUSSO Cutinase gene palindrome-binding protein OS=Fusarium solani subsp.
pisi PE=2 SV=1
Length = 457
Score = 52.8 bits (125), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 23/30 (76%)
Query: 229 CLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
C C + +P+WR GP GPKTLCNACG+R+
Sbjct: 402 CTDCGTLDSPEWRKGPSGPKTLCNACGLRW 431
>sp|Q75JZ1|GTAC_DICDI GATA zinc finger domain-containing protein 3 OS=Dictyostelium
discoideum GN=gtaC PE=4 SV=1
Length = 587
Score = 52.0 bits (123), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 23/30 (76%)
Query: 229 CLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
C+ C + +TP+WR GP G KTLCNACG+ Y
Sbjct: 500 CIFCGTMETPEWRKGPGGHKTLCNACGLHY 529
>sp|P78714|WC2_NEUCR White collar 2 protein OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=wc-2
PE=1 SV=1
Length = 530
Score = 51.6 bits (122), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 23/30 (76%)
Query: 229 CLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
C C + +P+WR GP GPKTLCNACG+R+
Sbjct: 468 CTDCGTLDSPEWRKGPSGPKTLCNACGLRW 497
>sp|Q54TM6|GTAI_DICDI GATA zinc finger domain-containing protein 9 OS=Dictyostelium
discoideum GN=gtaI PE=4 SV=1
Length = 536
Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 229 CLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
C HC + TP+WR GP G K+LCNACG+ Y
Sbjct: 479 CRHCGTTDTPEWRRGPDGRKSLCNACGLHY 508
>sp|Q54KX0|GTAN_DICDI GATA zinc finger domain-containing protein 14 OS=Dictyostelium
discoideum GN=gtaN PE=4 SV=1
Length = 953
Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 22/30 (73%)
Query: 229 CLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
C C + +TP+WR GP G K+LCNACG+ Y
Sbjct: 893 CTSCGTTQTPEWRKGPAGGKSLCNACGLHY 922
>sp|Q75JZ0|GTAH_DICDI GATA zinc finger domain-containing protein 8 OS=Dictyostelium
discoideum GN=gtaH PE=4 SV=1
Length = 519
Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 23/30 (76%)
Query: 229 CLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
C +C + +TP+WR GP G K+LCNACG+ Y
Sbjct: 462 CRNCKTTETPEWRKGPDGTKSLCNACGLHY 491
>sp|Q8W4H1|GAT26_ARATH GATA transcription factor 26 OS=Arabidopsis thaliana GN=GATA26 PE=2
SV=1
Length = 510
Score = 50.1 bits (118), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 47/104 (45%), Gaps = 12/104 (11%)
Query: 229 CLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVSAKHSNSHRKVM 288
C HC TP WR GP LCNACG R+++ + Y P + ++ H +
Sbjct: 7 CYHCGVTNTPLWRNGPPEKPVLCNACGSRWRTKGTLVNYTPLHAR--ADGDENDDHHRFQ 64
Query: 289 ELR------RQKEMQ----RAPQQQFLGQRSIFGVSNGADDYLI 322
++ + KE++ +A Q+ + +R +F S G +I
Sbjct: 65 RMKSISLGNKNKEIKMLKRKAIQENIIIKRPVFEFSYGLKAAVI 108
>sp|Q54TE3|GTAJ_DICDI GATA zinc finger domain-containing protein 10 OS=Dictyostelium
discoideum GN=gtaJ PE=4 SV=1
Length = 714
Score = 49.7 bits (117), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 228 KCLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
KC +C +TP+WR GP G TLCNACG+ Y
Sbjct: 630 KCHYCEVTETPEWRRGPDGDHTLCNACGLHY 660
>sp|Q550D5|GTAA_DICDI Transcription factor stalky OS=Dictyostelium discoideum GN=stkA
PE=1 SV=1
Length = 872
Score = 48.9 bits (115), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 23/33 (69%)
Query: 227 RKCLHCASEKTPQWRTGPMGPKTLCNACGVRYK 259
R C C S +TP WR GP G +LCNACG++++
Sbjct: 292 RSCEFCGSSQTPTWRRGPSGKGSLCNACGIKWR 324
>sp|Q55EQ0|GTAF_DICDI GATA zinc finger domain-containing protein 6 OS=Dictyostelium
discoideum GN=gtaF PE=4 SV=1
Length = 623
Score = 48.1 bits (113), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 33/63 (52%), Gaps = 1/63 (1%)
Query: 196 STRLLHVSPKGPSVERESPNPLMHTESGDSVRKCLHCASEKTPQWRTGPMGPKTLCNACG 255
ST + +S G S + P DS+ C C +T QWR GP G K+LCNACG
Sbjct: 288 STSIKIISEDGVSNNKRKRRPRAPAPFLDSLY-CHSCGETQTSQWRRGPDGCKSLCNACG 346
Query: 256 VRY 258
+R+
Sbjct: 347 IRF 349
>sp|Q07928|GAT3_YEAST Protein GAT3 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=GAT3 PE=4 SV=1
Length = 141
Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/39 (58%), Positives = 26/39 (66%), Gaps = 1/39 (2%)
Query: 227 RKCLHCASEKT-PQWRTGPMGPKTLCNACGVRYKSGRLV 264
R+C CA KT PQWR GP G TLCNACG+ Y+ LV
Sbjct: 70 RRCPQCAVIKTSPQWREGPDGEVTLCNACGLFYRKIFLV 108
>sp|B0G188|GTAP_DICDI GATA zinc finger domain-containing protein 16 OS=Dictyostelium
discoideum GN=gtaP PE=4 SV=1
Length = 695
Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 22/36 (61%)
Query: 223 GDSVRKCLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
G + C C TP+WR GP G KTLCNACG+ +
Sbjct: 473 GGTELSCHTCGVTNTPEWRRGPNGAKTLCNACGLAW 508
>sp|Q5PP38|GAT27_ARATH GATA transcription factor 27 OS=Arabidopsis thaliana GN=GATA27 PE=2
SV=1
Length = 470
Score = 46.6 bits (109), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 39/94 (41%), Gaps = 6/94 (6%)
Query: 229 CLHCASEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVSAKHSNSHR--- 285
C HC TP WR GP LCNACG R+++ + Y P + HR
Sbjct: 7 CYHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLVNYTPLHARAEGDETEIEDHRTQT 66
Query: 286 ---KVMELRRQKEMQRAPQQQFLGQRSIFGVSNG 316
K M L ++ ++ Q+ F +R+ G
Sbjct: 67 VMIKGMSLNKKIPKRKPYQENFTVKRANLEFHTG 100
>sp|Q5KSV0|GTAK_DICDI GATA zinc finger domain-containing protein 11 OS=Dictyostelium
discoideum GN=gtaK PE=2 SV=1
Length = 650
Score = 45.8 bits (107), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 14/33 (42%), Positives = 24/33 (72%)
Query: 226 VRKCLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
+++C C + +P+WR GP G ++LCNACG+ +
Sbjct: 519 LKQCTSCGTTSSPEWRKGPAGNQSLCNACGLYF 551
>sp|Q54HA4|GTAO_DICDI GATA zinc finger domain-containing protein 15 (Fragment)
OS=Dictyostelium discoideum GN=gtaO PE=4 SV=1
Length = 511
Score = 44.3 bits (103), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 21/30 (70%)
Query: 229 CLHCASEKTPQWRTGPMGPKTLCNACGVRY 258
C C + +P+WR GP G K+LCNACG+ Y
Sbjct: 453 CQACGTRASPEWRKGPDGFKSLCNACGLYY 482
>sp|Q8H1G0|GAT28_ARATH GATA transcription factor 28 OS=Arabidopsis thaliana GN=GATA28 PE=2
SV=1
Length = 302
Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 21/32 (65%), Gaps = 2/32 (6%)
Query: 229 CLHCA--SEKTPQWRTGPMGPKTLCNACGVRY 258
C HC + TP R GP GP+TLCNACG+ +
Sbjct: 223 CRHCGIGEKSTPMMRRGPAGPRTLCNACGLMW 254
>sp|Q8GXL7|GAT24_ARATH GATA transcription factor 24 OS=Arabidopsis thaliana GN=GATA24 PE=2
SV=2
Length = 297
Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 23/32 (71%), Gaps = 2/32 (6%)
Query: 229 CLHCA-SEK-TPQWRTGPMGPKTLCNACGVRY 258
C HC SEK TP R GP GP+TLCNACG+ +
Sbjct: 219 CRHCGTSEKSTPMMRRGPDGPRTLCNACGLMW 250
>sp|Q01168|NUT1_MAGO7 Nitrogen regulatory protein NUT1 OS=Magnaporthe oryzae (strain
70-15 / ATCC MYA-4617 / FGSC 8958) GN=NUT1 PE=4 SV=1
Length = 956
Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 221 ESGDSVRKCLHCASEKTPQWRTGPMGPKTLCNACGVRYK 259
+ GD+ C +CA++ TP WR P G + LCNACG+ K
Sbjct: 655 QGGDAPTTCTNCATQTTPLWRRNPEG-QPLCNACGLFLK 692
>sp|Q9LRH6|GAT25_ARATH GATA transcription factor 25 OS=Arabidopsis thaliana GN=GATA25 PE=2
SV=2
Length = 309
Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%), Gaps = 2/30 (6%)
Query: 229 CLHC--ASEKTPQWRTGPMGPKTLCNACGV 256
C HC +S+ TP R GP GP+TLCNACG+
Sbjct: 214 CTHCGISSKCTPMMRRGPSGPRTLCNACGL 243
>sp|P18494|GLN3_YEAST Nitrogen regulatory protein GLN3 OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=GLN3 PE=1 SV=2
Length = 730
Score = 42.0 bits (97), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
Query: 228 KCLHCASEKTPQWRTGPMGPKTLCNACGVRYK 259
+C +C + KTP WR P G TLCNACG+ K
Sbjct: 305 QCFNCKTFKTPLWRRSPEG-NTLCNACGLFQK 335
>sp|P40569|GAT4_YEAST Protein GAT4 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=GAT4 PE=4 SV=1
Length = 121
Score = 42.0 bits (97), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 227 RKCLHCASEKTP-QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFV-SAKHSNSH 284
R C C KT QWR GP G LCNACG+ ++ +L+ + AA+ ++ K + +
Sbjct: 51 RTCGQCGEIKTSLQWREGPNGAACLCNACGLFFR--KLILRFGRAAAKRYMEQIKGTGTK 108
Query: 285 RKV 287
R++
Sbjct: 109 RRI 111
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.314 0.129 0.388
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 134,817,093
Number of Sequences: 539616
Number of extensions: 5826131
Number of successful extensions: 13967
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 61
Number of HSP's successfully gapped in prelim test: 124
Number of HSP's that attempted gapping in prelim test: 13695
Number of HSP's gapped (non-prelim): 307
length of query: 334
length of database: 191,569,459
effective HSP length: 118
effective length of query: 216
effective length of database: 127,894,771
effective search space: 27625270536
effective search space used: 27625270536
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 61 (28.1 bits)