BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>021461
MFAMNPQPLQARPYIDTEEHDVAQTPIPIQNGSKQGDRYDEPEEVEDEAGASSVNRKSND
RGGSSVQSSTSTRTSELTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPSTVPSSAFAQP
QNIMSGGSASNGSKLSQRIASLVRFREKRKERSFEKKIRYSCRKEVAQRMQRKNGQFTSS
KATFNIASANSNPSNGSAPPESVSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWAN
KGTLRDLTKGARNICFEQHELETSSDIKPATTEAENSYANQDEQGSPHETKPAPLDPQNH
SMRSNEQVWQFF

High Scoring Gene Products

Symbol      Full name                                      Information                             P value
ZML1        AT3G21175                                      protein from Arabidopsis thaliana       1.5e-39
ZML2        AT1G51600                                      protein from Arabidopsis thaliana       1.2e-35
GATA1       GATA transcription factor 1                    protein from Arabidopsis thaliana       1.1e-06
stkA        GATA zinc finger domain-containing protein 1   gene from Dictyostelium discoideum      8.5e-06
GATA16      AT5G49300                                      protein from Arabidopsis thaliana       9.8e-06
GATA3       GATA transcription factor 3                    protein from Arabidopsis thaliana       0.00024
gtaL        GATA zinc finger domain-containing protein 12  gene from Dictyostelium discoideum      0.00028
AT4G16141   -                                              protein from Arabidopsis thaliana       0.00037
gtaE        GATA zinc finger domain-containing protein 5   gene from Dictyostelium discoideum      0.00046
gtaG        GATA zinc finger domain-containing protein 7   gene from Dictyostelium discoideum      0.00049
GATA4       AT3G60530                                      protein from Arabidopsis thaliana       0.00053
gtaJ        GATA zinc finger domain-containing protein 10  gene from Dictyostelium discoideum      0.00054
GATA23      AT5G26930                                      protein from Arabidopsis thaliana       0.00057
orf19.1577  -                                              gene_product from Candida albicans      0.00078
CaO19.1577  Putative uncharacterized protein               protein from Candida albicans SC5314    0.00078
GATA19      AT4G36620                                      protein from Arabidopsis thaliana       0.00082
GATA10      AT1G08000                                      protein from Arabidopsis thaliana       0.00092


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  021461
        (312 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:505006360 - symbol:ZML1 "ZIM-like 1" species:3...   401  1.5e-39   2
TAIR|locus:2017582 - symbol:ZML2 "ZIM-LIKE 2" species:370...   385  1.2e-35   1
TAIR|locus:2076191 - symbol:GATA1 "GATA transcription fac...   134  1.1e-06   1
DICTYBASE|DDB_G0277147 - symbol:stkA "GATA zinc finger do...   134  8.5e-06   1
TAIR|locus:2155919 - symbol:GATA16 "GATA transcription fa...   108  9.8e-06   1
TAIR|locus:2139594 - symbol:GATA3 "GATA transcription fac...   114  0.00024   1
DICTYBASE|DDB_G0285139 - symbol:gtaL "GATA zinc finger do...   119  0.00028   1
TAIR|locus:504955441 - symbol:AT4G16141 species:3702 "Ara...   109  0.00037   1
DICTYBASE|DDB_G0267640 - symbol:gtaE "GATA zinc finger do...   119  0.00046   1
DICTYBASE|DDB_G0270756 - symbol:gtaG "GATA zinc finger do...   119  0.00049   1
TAIR|locus:2103346 - symbol:GATA4 "GATA transcription fac...   110  0.00053   1
DICTYBASE|DDB_G0281829 - symbol:gtaJ "GATA zinc finger do...   117  0.00054   1
TAIR|locus:2148558 - symbol:GATA23 "GATA transcription fa...    92  0.00057   1
CGD|CAL0005605 - symbol:orf19.1577 species:5476 "Candida ...   113  0.00078   1
UNIPROTKB|Q5ALK1 - symbol:CaO19.1577 "Putative uncharacte...   113  0.00078   1
TAIR|locus:2115195 - symbol:GATA19 "GATA transcription fa...   107  0.00082   1
TAIR|locus:2205090 - symbol:GATA10 "GATA transcription fa...   110  0.00092   1
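
The rows of the hit table above follow a fixed layout (database, accession, symbol, truncated description, then score, P(N) and N), so they can be pulled apart with a regular expression. The following is a minimal sketch, assuming rows look exactly like the ones in this report; the names `HIT_ROW` and `parse_hit_row` are hypothetical and not part of any BLAST tool.

```python
import re

# Hypothetical pattern for one row of the
# "Sequences producing High-scoring Segment Pairs" table in this report.
HIT_ROW = re.compile(
    r"^(?P<db>[^|\s]+)\|(?P<acc>\S+) - symbol:(?P<symbol>\S+)\s.*?"
    r"\s+(?P<score>\d+)\s+(?P<p>[0-9.eE+-]+)\s+(?P<n>\d+)\s*$"
)


def parse_hit_row(line):
    """Return a dict for a hit row, or None if the line is not a hit row."""
    m = HIT_ROW.match(line)
    if m is None:
        return None
    return {
        "db": m.group("db"),          # source database, e.g. TAIR
        "acc": m.group("acc"),        # accession within that database
        "symbol": m.group("symbol"),  # gene symbol
        "score": int(m.group("score")),  # raw high score
        "p": float(m.group("p")),        # smallest sum probability P(N)
        "n": int(m.group("n")),          # number of HSPs used for P(N)
    }


# Two rows copied from the table above.
rows = [
    'TAIR|locus:505006360 - symbol:ZML1 "ZIM-like 1" species:3...   401  1.5e-39   2',
    'TAIR|locus:504955441 - symbol:AT4G16141 species:3702 "Ara...   109  0.00037   1',
]
hits = [parse_hit_row(r) for r in rows]
for hit in hits:
    print(hit["symbol"], hit["score"], hit["p"], hit["n"])
```

The description column is truncated in the table itself, so this sketch does not try to recover it; the full annotation is in the per-hit records further down.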


>TAIR|locus:505006360
            symbol:ZML1 "ZIM-like 1" species:3702 "Arabidopsis
            thaliana" [GO:0003700 "sequence-specific DNA binding transcription
            factor activity" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] InterPro:IPR000679 InterPro:IPR010402
            InterPro:IPR013088 InterPro:IPR018467 Pfam:PF00320 Pfam:PF09425
            PROSITE:PS00344 PROSITE:PS50114 PROSITE:PS51017 GO:GO:0005634
            EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0046872 EMBL:AB023045
            GO:GO:0043565 GO:GO:0008270 Gene3D:3.30.50.10 GO:GO:0003700
            GO:GO:0006351 EMBL:AB119060 EMBL:AY042817 EMBL:AY064628
            EMBL:AY085109 EMBL:AK118169 IPI:IPI00528960 IPI:IPI00530061
            RefSeq:NP_566676.1 RefSeq:NP_850618.1 UniGene:At.20286
            ProteinModelPortal:Q8GXL7 SMR:Q8GXL7 IntAct:Q8GXL7
            EnsemblPlants:AT3G21175.1 GeneID:821670 KEGG:ath:AT3G21175
            GeneFarm:3917 TAIR:At3g21175 eggNOG:NOG303027 HOGENOM:HOG000238783
            InParanoid:Q8GXL7 OMA:NGRMHIG PhylomeDB:Q8GXL7
            ProtClustDB:CLSN2688624 Genevestigator:Q8GXL7 GermOnline:AT3G21175
            InterPro:IPR010399 Pfam:PF06200 SMART:SM00979 PROSITE:PS51320
            Uniprot:Q8GXL7
        Length = 297

 Score = 401 (146.2 bits), Expect = 1.5e-39, Sum P(2) = 1.5e-39
 Identities = 89/184 (48%), Positives = 114/184 (61%)

Query:    76 ELTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPSTVPSSAFAQPQNIMSGGSASNGSKL 135
             +LT++++G+VYVF  V+P KVQA+LLLLG  ++P T+P++  +  QN    G +    +L
Sbjct:    79 QLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNRVLGLSGTPQRL 138

Query:   136 S--QRIASLVXXXXXXXXXXXXXXIRYSCRKEVAQRMQRKNGQFTSSKATFNIXXXXXXX 193
             S  QR+ASL+              IRY+ RKEVA RMQRK GQFTS+K++ N        
Sbjct:   139 SVPQRLASLLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKSS-NDDSGSTGS 197

Query:   194 XXXXXXXXXVSR--------ICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLR 245
                      V          +C+HCG SEK TP MRRGP GPRTLCNACGLMWANKGTLR
Sbjct:   198 DWGSNQSWAVEGTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTLR 257

Query:   246 DLTK 249
             DL+K
Sbjct:   258 DLSK 261

 Score = 37 (18.1 bits), Expect = 1.5e-39, Sum P(2) = 1.5e-39
 Identities = 7/15 (46%), Positives = 9/15 (60%)

Query:   292 PAPLDPQNHSMRSNE 306
             P P  PQ+ S+  NE
Sbjct:   263 PPPQTPQHLSLNKNE 277


>TAIR|locus:2017582
            symbol:ZML2 "ZIM-LIKE 2" species:3702 "Arabidopsis
            thaliana" [GO:0003700 "sequence-specific DNA binding transcription
            factor activity" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] InterPro:IPR000679 InterPro:IPR010402
            InterPro:IPR013088 Pfam:PF00320 Pfam:PF06203 PROSITE:PS00344
            PROSITE:PS50114 PROSITE:PS51017 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0046872 EMBL:AC025294
            GO:GO:0043565 GO:GO:0008270 Gene3D:3.30.50.10 GO:GO:0003700
            GO:GO:0006351 EMBL:AC024261 eggNOG:NOG303027 HOGENOM:HOG000238783
            ProtClustDB:CLSN2688624 InterPro:IPR010399 Pfam:PF06200
            PROSITE:PS51320 EMBL:AB119061 EMBL:AY045906 EMBL:AY150395
            IPI:IPI00526986 PIR:F96554 RefSeq:NP_564593.1 RefSeq:NP_974002.1
            UniGene:At.26180 UniGene:At.37784 ProteinModelPortal:Q8H1G0
            SMR:Q8H1G0 IntAct:Q8H1G0 PaxDb:Q8H1G0 EnsemblPlants:AT1G51600.1
            EnsemblPlants:AT1G51600.2 GeneID:841585 KEGG:ath:AT1G51600
            GeneFarm:3915 TAIR:At1g51600 InParanoid:Q8H1G0 OMA:NNDEAAS
            PhylomeDB:Q8H1G0 Genevestigator:Q8H1G0 GermOnline:AT1G51600
            Uniprot:Q8H1G0
        Length = 302

 Score = 385 (140.6 bits), Expect = 1.2e-35, P = 1.2e-35
 Identities = 91/208 (43%), Positives = 115/208 (55%)

Query:    76 ELTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPSTVPSSAFAQPQN--IMSGGSASNGS 133
             +LT++++G+VYVF +V P KVQA+LLLLG  ++P   P    +  QN  + S        
Sbjct:    83 QLTLSFQGQVYVFDSVLPEKVQAVLLLLGGRELPQAAPPGLGSPHQNNRVSSLPGTPQRF 142

Query:   134 KLSQRIASLVXXXXXXXXXXXXXXIRYSCRKEVAQRMQRKNGQFTSSKATFNIXXXXXXX 193
              + QR+ASLV              IRY+ RKEVA RMQR  GQFTS+K+  +        
Sbjct:   143 SIPQRLASLVRFREKRKGRNFDKKIRYTVRKEVALRMQRNKGQFTSAKSNNDEAASAGSS 202

Query:   194 XXXXXXXXXVSR-------ICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRD 246
                       S         C+HCGI EK TP MRRGPAGPRTLCNACGLMWANKG  RD
Sbjct:   203 WGSNQTWAIESSEAQHQEISCRHCGIGEKSTPMMRRGPAGPRTLCNACGLMWANKGAFRD 262

Query:   247 LTKG----ARNICFEQHE---LETSSDI 267
             L+K     A+N+   ++E   LET   I
Sbjct:   263 LSKASPQTAQNLPLNKNEDANLETDHQI 290


>TAIR|locus:2076191
            symbol:GATA1 "GATA transcription factor 1" species:3702
            "Arabidopsis thaliana" [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] [GO:0044212 "transcription regulatory region DNA
            binding" evidence=IDA] [GO:0007623 "circadian rhythm" evidence=IEP]
            InterPro:IPR000679 InterPro:IPR013088 Pfam:PF00320 PROSITE:PS00344
            PROSITE:PS50114 SMART:SM00401 GO:GO:0005634 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0046872 GO:GO:0007623 GO:GO:0043565
            GO:GO:0008270 Gene3D:3.30.50.10 GO:GO:0003700 GO:GO:0006351
            GO:GO:0044212 EMBL:AP001297 HOGENOM:HOG000238267 EMBL:Y13648
            EMBL:AY087597 IPI:IPI00531604 PIR:T52103 RefSeq:NP_189047.1
            UniGene:At.24370 ProteinModelPortal:Q8LAU9 SMR:Q8LAU9 STRING:Q8LAU9
            EnsemblPlants:AT3G24050.1 GeneID:821990 KEGG:ath:AT3G24050
            GeneFarm:3839 TAIR:At3g24050 eggNOG:NOG284625 InParanoid:Q8LAU9
            OMA:MEMESFM PhylomeDB:Q8LAU9 ProtClustDB:CLSN2713932
            Genevestigator:Q8LAU9 GermOnline:AT3G24050 Uniprot:Q8LAU9
        Length = 274

 Score = 134 (52.2 bits), Expect = 1.1e-06, P = 1.1e-06
 Identities = 35/83 (42%), Positives = 46/83 (55%)

Query:   203 VSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLTKGARNICFEQHELE 262
             + R CQHCG +EK TP  R GPAGP+TLCNACG+ + + G L    + A +  F   EL 
Sbjct:   192 MGRKCQHCG-AEK-TPQWRAGPAGPKTLCNACGVRYKS-GRLVPEYRPANSPTFTA-ELH 247

Query:   263 TSSDIKPATTEAENSYANQDEQG 285
             ++S  K    E    Y + D  G
Sbjct:   248 SNSHRK--IVEMRKQYQSGDGDG 268


>DICTYBASE|DDB_G0277147
            symbol:stkA "GATA zinc finger domain-containing
            protein 1" species:44689 "Dictyostelium discoideum" [GO:0045595
            "regulation of cell differentiation" evidence=IMP] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA;IMP]
            [GO:0005634 "nucleus" evidence=IEA;IDA] [GO:0003700
            "sequence-specific DNA binding transcription factor activity"
            evidence=IEA;IMP] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] InterPro:IPR000679
            InterPro:IPR013088 Pfam:PF00320 PROSITE:PS00344 PROSITE:PS50114
            SMART:SM00401 dictyBase:DDB_G0277147 GO:GO:0005634 GO:GO:0045595
            GO:GO:0046872 GO:GO:0043565 GO:GO:0008270 Gene3D:3.30.50.10
            GenomeReviews:CM000151_GR GO:GO:0003700 GO:GO:0006351
            EMBL:AAFI02000019 eggNOG:NOG239843 EMBL:U68754 RefSeq:XP_642681.1
            HSSP:P17679 ProteinModelPortal:Q550D5 STRING:Q550D5
            EnsemblProtists:DDB0185187 GeneID:8620870 KEGG:ddi:DDB_G0277147
            OMA:QQTINQH Uniprot:Q550D5
        Length = 872

 Score = 134 (52.2 bits), Expect = 8.5e-06, P = 8.5e-06
 Identities = 35/104 (33%), Positives = 52/104 (50%)

Query:   205 RICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRD-LTKGARNICFEQHELET 263
             R C+ CG S+  TP  RRGP+G  +LCNACG+ W  KG  +D + K +      Q +   
Sbjct:   292 RSCEFCGSSQ--TPTWRRGPSGKGSLCNACGIKWRLKG--KDGIFKPS------QKQQNR 341

Query:   264 SSDIKPATTEAENSYANQDEQGSPHETKPAPLDPQNHSMRSNEQ 307
                I  A  + +    NQ +Q  P + +  P  PQ  + + N+Q
Sbjct:   342 QKPIMSAQKQPKQQQ-NQPQQQQPQQPQQ-PQQPQQQNQQQNQQ 383


>TAIR|locus:2155919
            symbol:GATA16 "GATA transcription factor 16" species:3702
            "Arabidopsis thaliana" [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] InterPro:IPR000679 InterPro:IPR013088 Pfam:PF00320
            PROSITE:PS00344 PROSITE:PS50114 SMART:SM00401 GO:GO:0005634
            EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0046872 GO:GO:0043565
            GO:GO:0008270 Gene3D:3.30.50.10 GO:GO:0003700 GO:GO:0006351
            EMBL:AB016872 eggNOG:NOG70483 HOGENOM:HOG000237836 EMBL:BT029338
            IPI:IPI00537074 RefSeq:NP_199741.1 UniGene:At.55451
            ProteinModelPortal:Q9FJ10 SMR:Q9FJ10 PaxDb:Q9FJ10 PRIDE:Q9FJ10
            EnsemblPlants:AT5G49300.1 GeneID:834990 KEGG:ath:AT5G49300
            GeneFarm:3913 TAIR:At5g49300 InParanoid:Q9FJ10 OMA:ACTECHT
            PhylomeDB:Q9FJ10 ProtClustDB:CLSN2916480 Genevestigator:Q9FJ10
            Uniprot:Q9FJ10
        Length = 139

 Score = 108 (43.1 bits), Expect = 9.8e-06, P = 9.8e-06
 Identities = 30/93 (32%), Positives = 46/93 (49%)

Query:   205 RICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANK---GT-----LRDLTKGARNICF 256
             + C  CG S+  TP  R GP GP++LCNACG+    K   GT     L+  + G  N  F
Sbjct:    36 KTCADCGTSK--TPLWRGGPVGPKSLCNACGIRNRKKRRGGTEDNKKLKKSSSGGGNRKF 93

Query:   257 EQHELETSSDI---KPATTEAENSYANQDEQGS 286
              +   ++  D+   K +T E +     ++EQ +
Sbjct:    94 GESLKQSLMDLGIRKRSTVEKQRQKLGEEEQAA 126


>TAIR|locus:2139594
            symbol:GATA3 "GATA transcription factor 3" species:3702
            "Arabidopsis thaliana" [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0003700 "sequence-specific DNA binding transcription factor
            activity" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM;IEA;IDA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] [GO:0045893 "positive regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0005730 "nucleolus" evidence=IDA]
            [GO:0007623 "circadian rhythm" evidence=IEP] InterPro:IPR000679
            InterPro:IPR013088 InterPro:IPR016679 Pfam:PF00320
            PIRSF:PIRSF016992 PROSITE:PS00344 PROSITE:PS50114 SMART:SM00401
            GO:GO:0045893 EMBL:CP002687 GenomeReviews:CT486007_GR GO:GO:0005730
            GO:GO:0046872 GO:GO:0007623 GO:GO:0043565 GO:GO:0008270
            Gene3D:3.30.50.10 GO:GO:0003700 GO:GO:0006351 EMBL:AL161586
            EMBL:AL023094 eggNOG:NOG70483 HOGENOM:HOG000238267 EMBL:Y13650
            EMBL:AY099790 EMBL:AY128907 IPI:IPI00520450 PIR:H85408 PIR:T05288
            RefSeq:NP_001031789.1 RefSeq:NP_195194.1 UniGene:At.24640
            UniGene:At.65454 UniGene:At.70827 ProteinModelPortal:Q8L4M6
            SMR:Q8L4M6 EnsemblPlants:AT4G34680.1 EnsemblPlants:AT4G34680.2
            GeneID:829620 KEGG:ath:AT4G34680 GeneFarm:3903 TAIR:At4g34680
            InParanoid:Q8L4M6 OMA:WTEARAL PhylomeDB:Q8L4M6
            ProtClustDB:CLSN2685915 Genevestigator:Q8L4M6 GermOnline:AT4G34680
            Uniprot:Q8L4M6
        Length = 269

 Score = 114 (45.2 bits), Expect = 0.00024, P = 0.00024
 Identities = 32/91 (35%), Positives = 43/91 (47%)

Query:   205 RICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLTKGARNICFEQHELETS 264
             R C HCG +   TP  R GP GP+TLCNACG+ + + G L    + A +  F     E  
Sbjct:   180 RRCSHCGTNN--TPQWRTGPVGPKTLCNACGVRFKS-GRLCPEYRPADSPTFSN---EIH 233

Query:   265 SDIKPATTEAENSYANQDEQGSPHETKPAPL 295
             S++     E   S    +E G    TK  P+
Sbjct:   234 SNLHRKVLELRKSKELGEETGEA-STKSDPV 263


>DICTYBASE|DDB_G0285139
            symbol:gtaL "GATA zinc finger domain-containing
            protein 12" species:44689 "Dictyostelium discoideum" [GO:0043565
            "sequence-specific DNA binding" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0003700 "sequence-specific DNA
            binding transcription factor activity" evidence=IEA] [GO:0046872
            "metal ion binding" evidence=IEA] InterPro:IPR000679
            InterPro:IPR013088 Pfam:PF00320 PROSITE:PS00344 PROSITE:PS50114
            SMART:SM00401 dictyBase:DDB_G0285139 GenomeReviews:CM000153_GR
            GO:GO:0046872 GO:GO:0043565 GO:GO:0008270 Gene3D:3.30.50.10
            GO:GO:0003700 eggNOG:COG5641 EMBL:AAFI02000074 HSSP:P17678
            RefSeq:XP_639891.1 ProteinModelPortal:Q54NM5
            EnsemblProtists:DDB0220466 GeneID:8624962 KEGG:ddi:DDB_G0285139
            OMA:LTENMIR Uniprot:Q54NM5
        Length = 640

 Score = 119 (46.9 bits), Expect = 0.00028, P = 0.00028
 Identities = 34/119 (28%), Positives = 58/119 (48%)

Query:   203 VSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMW---------ANKGTLRDLTKGARN 253
             +SR+C +C  S+  TP  RRGP G +TLCNACG+ +         +N  + R+ +     
Sbjct:   502 ISRVCVNCKTSD--TPEWRRGPQGAKTLCNACGIRYRLQQQQVPQSNLNSPRE-SYAVIP 558

Query:   254 ICFEQHELETS----SDIKPATTEAENSYANQDEQGSPHETKPAPLDPQNHSMRSNEQV 308
              C E  + +TS    ++I  +TT    +   Q +Q  P +  P P+      +  ++Q+
Sbjct:   559 TCDENIKQQTSQNSTTNINSSTTTTTATIQQQQQQNIPQQF-PQPIQSPLKMLNIDQQI 616


>TAIR|locus:504955441
            symbol:AT4G16141 species:3702 "Arabidopsis thaliana"
            [GO:0003700 "sequence-specific DNA binding transcription factor
            activity" evidence=IEA] [GO:0005634 "nucleus" evidence=ISM]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0043565 "sequence-specific DNA binding" evidence=IEA]
            InterPro:IPR000679 InterPro:IPR013088 Pfam:PF00320 PROSITE:PS00344
            PROSITE:PS50114 EMBL:CP002687 GenomeReviews:CT486007_GR
            GO:GO:0043565 GO:GO:0008270 Gene3D:3.30.50.10 GO:GO:0003700
            HOGENOM:HOG000237836 ProtClustDB:CLSN2684177 HSSP:P17679
            EMBL:AK119021 IPI:IPI00534756 RefSeq:NP_680707.4 UniGene:At.44271
            ProteinModelPortal:Q8GW81 SMR:Q8GW81 EnsemblPlants:AT4G16141.1
            GeneID:827301 KEGG:ath:AT4G16141 TAIR:At4g16141 eggNOG:NOG326708
            InParanoid:Q8GW81 OMA:DVDNGNC PhylomeDB:Q8GW81
            Genevestigator:Q8GW81 Uniprot:Q8GW81
        Length = 197

 Score = 109 (43.4 bits), Expect = 0.00037, P = 0.00037
 Identities = 34/115 (29%), Positives = 50/115 (43%)

Query:   205 RICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANK-----GTLRD----LTKGARNIC 255
             + C  CG S   TP  R GPAGP++LCNACG+    K     G  +D     +K   N+ 
Sbjct:    37 KTCVDCGTSR--TPLWRGGPAGPKSLCNACGIKSRKKRQAALGIRQDDIKIKSKSNNNLG 94

Query:   256 FEQHELETSSDIKPATTEAENSYAN--QDEQGSPHETK-PAPLDPQNHSMRSNEQ 307
              E   ++T    +P   +         +  +G P   K     DP+N S  +N +
Sbjct:    95 LESRNVKTGKG-EPVNVKIAKCEPGIVKIAKGEPGNVKNKIKRDPENSSSSNNNK 148


>DICTYBASE|DDB_G0267640
            symbol:gtaE "GATA zinc finger domain-containing
            protein 5" species:44689 "Dictyostelium discoideum" [GO:0043565
            "sequence-specific DNA binding" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0003700 "sequence-specific DNA
            binding transcription factor activity" evidence=IEA] [GO:0046872
            "metal ion binding" evidence=IEA] InterPro:IPR000679
            InterPro:IPR013088 Pfam:PF00320 PROSITE:PS00344 PROSITE:PS50114
            SMART:SM00401 dictyBase:DDB_G0267640 GenomeReviews:CM000150_GR
            GO:GO:0046872 GO:GO:0043565 GO:GO:0008270 Gene3D:3.30.50.10
            EMBL:AAFI02000003 GO:GO:0003700 eggNOG:NOG70483 RefSeq:XP_647184.1
            ProteinModelPortal:Q55GK0 EnsemblProtists:DDB0220471 GeneID:8615988
            KEGG:ddi:DDB_G0267640 Uniprot:Q55GK0
        Length = 952

 Score = 119 (46.9 bits), Expect = 0.00046, P = 0.00046
 Identities = 33/84 (39%), Positives = 41/84 (48%)

Query:   207 CQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLTKGARNICFEQHELETSSD 266
             C  C  S   TP  R+GP GP TLCNACGL +A K   + LTK   NI F Q     +++
Sbjct:   241 CYQCNTSN--TPEWRKGPEGPATLCNACGLAYAKK---QKLTKN--NIKFNQ-STNVNNN 292

Query:   267 IKPATTEAENSYANQDEQGSPHET 290
                  T+      NQ   G P+ T
Sbjct:   293 TNNTITQLNQQINNQ---GLPNLT 313


>DICTYBASE|DDB_G0270756
            symbol:gtaG "GATA zinc finger domain-containing
            protein 7" species:44689 "Dictyostelium discoideum" [GO:0030587
            "sorocarp development" evidence=IMP] [GO:0043565 "sequence-specific
            DNA binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0003700 "sequence-specific DNA
            binding transcription factor activity" evidence=IEA] [GO:0046872
            "metal ion binding" evidence=IEA] InterPro:IPR000679
            InterPro:IPR013088 Pfam:PF00320 PROSITE:PS00344 PROSITE:PS50114
            SMART:SM00401 dictyBase:DDB_G0270756 EMBL:AAFI02000005
            GenomeReviews:CM000150_GR GO:GO:0046872 GO:GO:0043565 GO:GO:0008270
            Gene3D:3.30.50.10 GO:GO:0003700 GO:GO:0030587 eggNOG:COG5641
            HSSP:P17679 RefSeq:XP_646632.1 ProteinModelPortal:Q55C49
            EnsemblProtists:DDB0220467 GeneID:8617604 KEGG:ddi:DDB_G0270756
            OMA:RPANIDK Uniprot:Q55C49
        Length = 1006

 Score = 119 (46.9 bits), Expect = 0.00049, P = 0.00049
 Identities = 32/104 (30%), Positives = 48/104 (46%)

Query:   207 CQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKG-----TLRDLTKGARNICFEQHEL 261
             C +CG   K TP  RRGP+GP TLCNACGL +A K       L  L   + +  + +  +
Sbjct:   842 CHNCGT--KNTPEWRRGPSGPATLCNACGLAYAKKQREEETNLHKLLLHSNSYSYHRGNM 899

Query:   262 ETSSDIKPATTEAENSYANQDEQGSPHETKPAPLDPQNHSMRSN 305
                S + P+     N+ AN     +P+    +     + S  S+
Sbjct:   900 -LESYVTPSLLPLFNTAANVPYLNTPNNASSSSSSSSSSSSSSS 942


>TAIR|locus:2103346
            symbol:GATA4 "GATA transcription factor 4" species:3702
            "Arabidopsis thaliana" [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0003700 "sequence-specific DNA binding transcription factor
            activity" evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=ISM;IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0043565 "sequence-specific DNA binding" evidence=IEA]
            [GO:0045893 "positive regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0009416 "response to light stimulus"
            evidence=IEP] InterPro:IPR000679 InterPro:IPR013088
            InterPro:IPR016679 Pfam:PF00320 PIRSF:PIRSF016992 PROSITE:PS00344
            PROSITE:PS50114 SMART:SM00401 GO:GO:0005634 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0045893 GO:GO:0046872 GO:GO:0043565
            GO:GO:0008270 Gene3D:3.30.50.10 GO:GO:0003700 GO:GO:0006351
            GO:GO:0009416 EMBL:AL138646 HOGENOM:HOG000238267
            ProtClustDB:CLSN2683327 EMBL:Y13651 EMBL:AF378881 EMBL:AY039532
            EMBL:AY050476 IPI:IPI00521010 PIR:T47864 RefSeq:NP_191612.1
            UniGene:At.20781 ProteinModelPortal:O49743 SMR:O49743 PRIDE:O49743
            EnsemblPlants:AT3G60530.1 GeneID:825224 KEGG:ath:AT3G60530
            GeneFarm:3882 TAIR:At3g60530 eggNOG:NOG239843 InParanoid:O49743
            OMA:ESELCHS PhylomeDB:O49743 Genevestigator:O49743
            GermOnline:AT3G60530 Uniprot:O49743
        Length = 240

 Score = 110 (43.8 bits), Expect = 0.00053, P = 0.00053
 Identities = 21/35 (60%), Positives = 25/35 (71%)

Query:   204 SRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMW 238
             +R C HC  SEK TP  R GP GP+TLCNACG+ +
Sbjct:   157 ARRCTHCA-SEK-TPQWRTGPLGPKTLCNACGVRY 189


>DICTYBASE|DDB_G0281829
            symbol:gtaJ "GATA zinc finger domain-containing
            protein 10" species:44689 "Dictyostelium discoideum" [GO:0043565
            "sequence-specific DNA binding" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0003700 "sequence-specific DNA
            binding transcription factor activity" evidence=IEA] [GO:0046872
            "metal ion binding" evidence=IEA] InterPro:IPR000679
            InterPro:IPR013088 Pfam:PF00320 PROSITE:PS00344 PROSITE:PS50114
            SMART:SM00401 dictyBase:DDB_G0281829 GenomeReviews:CM000152_GR
            GO:GO:0046872 GO:GO:0043565 GO:GO:0008270 Gene3D:3.30.50.10
            GO:GO:0003700 EMBL:AAFI02000043 eggNOG:NOG275546 HSSP:P17679
            RefSeq:XP_640446.1 ProteinModelPortal:Q54TE3
            EnsemblProtists:DDB0220473 GeneID:8623259 KEGG:ddi:DDB_G0281829
            OMA:VHAEYQQ Uniprot:Q54TE3
        Length = 714

 Score = 117 (46.2 bits), Expect = 0.00054, P = 0.00054
 Identities = 32/93 (34%), Positives = 48/93 (51%)

Query:   207 CQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLTKGARNICFEQHELETSSD 266
             C +C ++E  TP  RRGP G  TLCNACGL +A         K  + +  E+ ELE   +
Sbjct:   631 CHYCEVTE--TPEWRRGPDGDHTLCNACGLHYA---------KSQKKLAREK-ELEKQKE 678

Query:   267 IKPATTEAENSYANQDEQGSPHETKPAPLDPQN 299
             ++    E EN+  +  +    ++T  AP + QN
Sbjct:   679 LE-REKERENTRKHSIDFMLMNDTSSAPTNSQN 710


>TAIR|locus:2148558
            symbol:GATA23 "GATA transcription factor 23" species:3702
            "Arabidopsis thaliana" [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] [GO:0009416 "response to light stimulus"
            evidence=IEP] [GO:0048527 "lateral root development" evidence=IMP]
            InterPro:IPR000679 InterPro:IPR013088 Pfam:PF00320 PROSITE:PS00344
            PROSITE:PS50114 SMART:SM00401 GO:GO:0005634 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0046872 GO:GO:0043565 GO:GO:0008270
            Gene3D:3.30.50.10 GO:GO:0003700 GO:GO:0006351 GO:GO:0048527
            GO:GO:0009416 EMBL:AF007270 HOGENOM:HOG000237836 EMBL:DQ446989
            EMBL:DQ653310 EMBL:BT024789 EMBL:AY086778 IPI:IPI00517378
            PIR:T01770 RefSeq:NP_198045.1 UniGene:At.30837
            ProteinModelPortal:Q8LC59 SMR:Q8LC59 EnsemblPlants:AT5G26930.1
            GeneID:832751 KEGG:ath:AT5G26930 GeneFarm:3912 TAIR:At5g26930
            eggNOG:NOG243746 InParanoid:Q8LC59 OMA:HGGVAVK PhylomeDB:Q8LC59
            ProtClustDB:CLSN2916567 Genevestigator:Q8LC59 Uniprot:Q8LC59
        Length = 120

 Score = 92 (37.4 bits), Expect = 0.00057, P = 0.00057
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query:   205 RICQHCGISEKLTPAMRRGPAGPRTLCNACGL 236
             R C  C  ++  TP  R GP GP++LCNACG+
Sbjct:    26 RCCSECKTTK--TPMWRGGPTGPKSLCNACGI 55


>CGD|CAL0005605
            symbol:orf19.1577 species:5476 "Candida albicans" [GO:0005575
            "cellular_component" evidence=ND] InterPro:IPR000679
            InterPro:IPR013088 Pfam:PF00320 PROSITE:PS00344 PROSITE:PS50114
            SMART:SM00401 CGD:CAL0005605 GO:GO:0043565 GO:GO:0008270
            Gene3D:3.30.50.10 GO:GO:0003700 EMBL:AACQ01000008 EMBL:AACQ01000007
            eggNOG:COG5641 RefSeq:XP_722478.1 RefSeq:XP_722619.1
            ProteinModelPortal:Q5ALK1 SMR:Q5ALK1 GeneID:3635819 GeneID:3635932
            KEGG:cal:CaO19.1577 KEGG:cal:CaO19.9150 Uniprot:Q5ALK1
        Length = 442

 Score = 113 (44.8 bits), Expect = 0.00078, P = 0.00078
 Identities = 25/46 (54%), Positives = 28/46 (60%)

Query:   207 CQHCGISEKLTPAMRRGPAGPRTLCNACGLMWAN---KGTLRDLTK 249
             CQHC   E  TP  RRGP G RTLCNACGL ++    K  LR+  K
Sbjct:   382 CQHCCSQE--TPEWRRGPEGSRTLCNACGLFYSKLIKKYGLREADK 425


>UNIPROTKB|Q5ALK1
            symbol:CaO19.1577 "Putative uncharacterized protein"
            species:237561 "Candida albicans SC5314" [GO:0005575
            "cellular_component" evidence=ND] InterPro:IPR000679
            InterPro:IPR013088 Pfam:PF00320 PROSITE:PS00344 PROSITE:PS50114
            SMART:SM00401 CGD:CAL0005605 GO:GO:0043565 GO:GO:0008270
            Gene3D:3.30.50.10 GO:GO:0003700 EMBL:AACQ01000008 EMBL:AACQ01000007
            eggNOG:COG5641 RefSeq:XP_722478.1 RefSeq:XP_722619.1
            ProteinModelPortal:Q5ALK1 SMR:Q5ALK1 GeneID:3635819 GeneID:3635932
            KEGG:cal:CaO19.1577 KEGG:cal:CaO19.9150 Uniprot:Q5ALK1
        Length = 442

 Score = 113 (44.8 bits), Expect = 0.00078, P = 0.00078
 Identities = 25/46 (54%), Positives = 28/46 (60%)

Query:   207 CQHCGISEKLTPAMRRGPAGPRTLCNACGLMWAN---KGTLRDLTK 249
             CQHC   E  TP  RRGP G RTLCNACGL ++    K  LR+  K
Sbjct:   382 CQHCCSQE--TPEWRRGPEGSRTLCNACGLFYSKLIKKYGLREADK 425


>TAIR|locus:2115195
            symbol:GATA19 "GATA transcription factor 19" species:3702
            "Arabidopsis thaliana" [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] InterPro:IPR000679 InterPro:IPR013088 Pfam:PF00320
            PROSITE:PS00344 PROSITE:PS50114 SMART:SM00401 GO:GO:0005634
            EMBL:CP002687 GenomeReviews:CT486007_GR GO:GO:0046872 GO:GO:0043565
            GO:GO:0008270 Gene3D:3.30.50.10 GO:GO:0003700 GO:GO:0006351
            EMBL:AL161589 EMBL:Z99708 HOGENOM:HOG000238145 EMBL:BT029506
            EMBL:AY530746 IPI:IPI00527696 PIR:D85432 RefSeq:NP_195380.1
            UniGene:At.54634 ProteinModelPortal:Q6QPM2 SMR:Q6QPM2
            EnsemblPlants:AT4G36620.1 GeneID:829814 KEGG:ath:AT4G36620
            GeneFarm:3911 TAIR:At4g36620 eggNOG:NOG241947 InParanoid:Q6QPM2
            OMA:ANNEYSY PhylomeDB:Q6QPM2 ProtClustDB:CLSN2915913
            ArrayExpress:Q6QPM2 Genevestigator:Q6QPM2 GermOnline:AT4G36620
            Uniprot:Q6QPM2
        Length = 211

 Score = 107 (42.7 bits), Expect = 0.00082, P = 0.00082
 Identities = 29/103 (28%), Positives = 45/103 (43%)

Query:   203 VSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANK----GTLRDLTKGARNICFEQ 258
             ++R C +C  +   TP  R GP GP++LCNACG+ +  +     T R+ T G  +     
Sbjct:    73 LARRCANCDTTS--TPLWRNGPRGPKSLCNACGIRFKKEERRASTARNSTSGGGSTAAGV 130

Query:   259 HELETSSDIKPATTEAENSYANQDEQGSPHETKPAPL-DPQNH 300
               L+  +          N YA+       H T+  P   P N+
Sbjct:   131 PTLDHQASANYYYNN-NNQYASSSPWHHQHNTQRVPYYSPANN 172


>TAIR|locus:2205090
            symbol:GATA10 "GATA transcription factor 10" species:3702
            "Arabidopsis thaliana" [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] InterPro:IPR000679 InterPro:IPR013088 Pfam:PF00320
            PROSITE:PS00344 PROSITE:PS50114 SMART:SM00401 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0046872 GO:GO:0043565
            GO:GO:0008270 Gene3D:3.30.50.10 GO:GO:0003700 GO:GO:0006351
            EMBL:AC026875 EMBL:AY063953 EMBL:AY096723 IPI:IPI00543273
            RefSeq:NP_172278.1 RefSeq:NP_973790.1 UniGene:At.27130
            ProteinModelPortal:Q8VZP4 SMR:Q8VZP4 EnsemblPlants:AT1G08000.1
            EnsemblPlants:AT1G08000.2 GeneID:837315 KEGG:ath:AT1G08000
            GeneFarm:3904 TAIR:At1g08000 eggNOG:NOG70483 HOGENOM:HOG000238267
            InParanoid:Q8VZP4 PhylomeDB:Q8VZP4 ProtClustDB:CLSN2682769
            Genevestigator:Q8VZP4 GermOnline:AT1G08000 Uniprot:Q8VZP4
        Length = 308

 Score = 110 (43.8 bits), Expect = 0.00093, P = 0.00092
 Identities = 22/62 (35%), Positives = 30/62 (48%)

Query:   176 QFTSSKATFNIXXXXXXXXXXXXXXXXVSRICQHCGISEKLT-PAMRRGPAGPRTLCNAC 234
             Q    K   ++                + RIC HC   E +T P  R+GP+GP+TLCNAC
Sbjct:   189 QHAKKKRKIHLITHTESSTLESSKSDGIVRICTHC---ETITTPQWRQGPSGPKTLCNAC 245

Query:   235 GL 236
             G+
Sbjct:   246 GV 247
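
The percentages on the Identities and Positives lines in the alignments above appear to be truncated to whole numbers rather than rounded (for example, 114/184 is about 61.96% but is reported as 61%). This is inferred from the figures in this report, not taken from the program's documentation. A minimal sketch of that arithmetic, with the hypothetical helper name `blast_percent`:

```python
def blast_percent(matches, aligned):
    """Integer percentage of matches over aligned columns, truncated."""
    return (100 * matches) // aligned


# (matches, aligned length, percentage as printed in this report)
examples = [
    (89, 184, 48),   # ZML1 identities
    (114, 184, 61),  # ZML1 positives
    (35, 104, 33),   # stkA identities
    (28, 46, 60),    # orf19.1577 positives
]
for matches, aligned, reported in examples:
    print(matches, aligned, blast_percent(matches, aligned), reported)
```

Truncation matters when comparing these figures with percentages recomputed by other tools, which may round to the nearest integer instead.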


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.313   0.128   0.378    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      312       270   0.00097  114 3  11 23  0.40    34
                                                     32  0.40    37
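
The bit scores printed next to each raw score in the alignments can be reproduced from the parameter table above with the standard Karlin-Altschul normalization, S' = (lambda * S - ln K) / ln 2. The sketch below assumes the values on the `Q=9,R=2` row (Lambda 0.244, K 0.0300) are the ones applied; that choice is inferred from the fact that it matches the reported numbers, and the function name `bit_score` is hypothetical.

```python
import math

# Values from the "Q=9,R=2" row of the parameter table (assumed to be
# the ones used for the reported bit scores).
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, bit score as printed in this report)
reported = [(401, 146.2), (385, 140.6), (134, 52.2), (37, 18.1)]
for raw, bits in reported:
    print(raw, round(bit_score(raw), 1), bits)
```

Using the `BLOSUM62` row instead (Lambda 0.313, K 0.128) gives about 184 bits for a raw score of 401, which does not match the 146.2 bits shown for ZML1.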


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  17
  No. of states in DFA:  605 (64 KB)
  Total size of DFA:  208 KB (2116 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  22.12u 0.15s 22.27t   Elapsed:  00:00:01
  Total cpu time:  22.12u 0.15s 22.27t   Elapsed:  00:00:01
  Start:  Mon May 20 20:22:07 2013   End:  Mon May 20 20:22:08 2013
