Citrus Sinensis ID: 024771


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260--
MESLDLQVCCIDDLLDFNINDDECGKPTKRPRNALSSVNRNGCDFDVFEAGDDTDRLFPECAEEELEWLSNFPTVETFVDISSNPNILKQQSPNSVLENSNSSSSTSTNGSTITNGNNNSNSIIMNCCGNLRVPVRARSKLRTRCRRELLNQEAWWGSVHGSVKAAKPVVSKVIIGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPTFSSELHSNSHRKVVEMRRQKQMMGIELGVLGVKPVDKG
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHcccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHcccccccccccccccccccccccHHHHHHHHHHHHHccccccccccccccccc
cccccccHEEHHHHcccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHcccccccccccccccHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEccccccccccccccccccccHHHcccEEEcccccccccccccccccccccccccHHHHHHHHHHccccccccccccccccccc
mesldlqvcciddlldfninddecgkptkrprnalssvnrngcdfdvfeagddtdrlfPECAEEELEwlsnfptvetfvdissnpnilkqqspnsvlensnsssststngstitngnnnsnsIIMNccgnlrvpvrarSKLRTRCRRELLNQEAWWGSVHGSVKAAKPVVSKVIIGrkcqhcgaektpqwragpmgpktlcnacgvrfksgrlvpeyrpansptfsselhsnsHRKVVEMRRQKQMMGielgvlgvkpvdkg
MESLDLQVCCIDDLLdfninddecgkptkrprnalssvnrngCDFDVFEAGDDTDRLFPECAEEELEWLSNFPTVETFVDISSNPNILKQQSPNSVLENSNSSSSTSTNGSTITNGNNNSNSIIMNCCGNLRVPVRARSKLRTRCRRELlnqeawwgsvhgsvkaAKPVVSKVIIGRKCQHCGAEktpqwragpmgPKTLCNACGVRFKSGRLVPEYRPansptfsselhsnshRKVVEMRRQKQmmgielgvlgvkpvdkg
MESLDLQVCCiddlldfninddECGKPTKRPRNALSSVNRNGCDFDVFEAGDDTDRLFPECAEEELEWLSNFPTVETFVDISSNPNILKQQSPNSVLEnsnsssststngstitngnnnsnsiiMNCCGNLRVPVRARSKLRTRCRRELLNQEAWWGSVHGsvkaakpvvskvIIGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPTFSSELHSNSHRKVVEMRRQKQMMGIELGVLGVKPVDKG
****DLQVCCIDDLLDFNINDDE***************NRNGCDFDVFEAGDDTDRLFPECAEEELEWLSNFPTVETFVDI****************************************SIIMNCCGNLRVPVRARSKLRTRCRRELLNQEAWWGSVHGSVKAAKPVVSKVIIGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLV************************************LGV*********
****DLQVCCIDDLLDFNIND***********************************LFPECAEEELEWLSNFP***********************************************************************************************************HCGAEKTPQWRAGPMGPKTLCNACGVR*******************************************************
MESLDLQVCCIDDLLDFNINDDECGKPTKRPRNALSSVNRNGCDFDVFEAGDDTDRLFPECAEEELEWLSNFPTVETFVDISSNPNILKQ*********************TITNGNNNSNSIIMNCCGNLRVPVRARSKLRTRCRRELLNQEAWWGSVHGSVKAAKPVVSKVIIGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPTFSSELHSNSHRKVVEMRRQKQMMGIELGVLGVKPVDKG
***LDLQVCCIDDLLDFNI**********************************TDRLFPECAEEELEWLSNFPTV****************************************************************************************************GRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPTFSSE**********************************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MESLDLQVCCIDDLLDFNINDDECGKPTKRPRNALSSVNRNGCDFDVFEAGDDTDRLFPECAEEELEWLSNFPTVETFVDISSNPNILKQQSPNSVLENSNSSSSTSTNGSTITNGNNNSNSIIMNCCGNLRVPVRARSKLRTRCRRELLNQEAWWGSVHGSVKAAKPVVSKVIIGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPTFSSELHSNSHRKVVEMRRQKQMMGIELGVLGVKPVDKG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query262 2.2.26 [Sep-21-2011]
Q8LAU9274 GATA transcription factor yes no 0.870 0.832 0.448 5e-44
O82632308 GATA transcription factor no no 0.847 0.720 0.403 5e-33
Q9FH57339 GATA transcription factor no no 0.263 0.203 0.840 8e-31
Q8L4M6269 GATA transcription factor no no 0.484 0.472 0.510 2e-30
P69781331 GATA transcription factor no no 0.637 0.504 0.422 5e-30
O49741264 GATA transcription factor no no 0.270 0.268 0.788 2e-29
Q9SD38312 GATA transcription factor no no 0.263 0.221 0.797 2e-29
O49743240 GATA transcription factor no no 0.267 0.291 0.771 1e-28
Q6DBP8303 GATA transcription factor no no 0.534 0.462 0.420 4e-28
O65515238 GATA transcription factor no no 0.522 0.575 0.392 2e-27
>sp|Q8LAU9|GATA1_ARATH GATA transcription factor 1 OS=Arabidopsis thaliana GN=GATA1 PE=2 SV=2 Back     alignment and function desciption
 Score =  177 bits (450), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 118/263 (44%), Positives = 160/263 (60%), Gaps = 35/263 (13%)

Query: 11  IDDLLDFNI--NDDECGKPTKRPRNALSSVN--RNGCDFDVFEAGDDTDRLFPECAEEEL 66
           +DDLL+F++   +++  + T+ PRN        R    F +F   D          EE+L
Sbjct: 7   MDDLLNFSVPEEEEDDDEHTQPPRNITRRKTGLRPTDSFGLFNTDD-----LGVVEEEDL 61

Query: 67  EWLSN---FPTVETFVDISSNPNI------------LKQQSPNSVLENSNSSSSTSTNGS 111
           EW+SN   FP +ETFV +  + +             +KQ SP SVLE S+ SS+T+T+ S
Sbjct: 62  EWISNKNAFPVIETFVGVLPSEHFPITSLLEREATEVKQLSPVSVLETSSHSSTTTTSNS 121

Query: 112 TITNGNNNSNSI------IMNCCGNLRVPVRARSKLRTRCRRELLNQEAWWGSVHGSVKA 165
           +  +  + + +       IM+CC   + P +ARSK R         +  W G+  G ++ 
Sbjct: 122 SGGSNGSTAVATTTTTPTIMSCCVGFKAPAKARSKRRRT--GRRDLRVLWTGNEQGGIQK 179

Query: 166 AKPV---VSKVIIGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANS 222
            K +    + +I+GRKCQHCGAEKTPQWRAGP GPKTLCNACGVR+KSGRLVPEYRPANS
Sbjct: 180 KKTMTVAAAALIMGRKCQHCGAEKTPQWRAGPAGPKTLCNACGVRYKSGRLVPEYRPANS 239

Query: 223 PTFSSELHSNSHRKVVEMRRQKQ 245
           PTF++ELHSNSHRK+VEMR+Q Q
Sbjct: 240 PTFTAELHSNSHRKIVEMRKQYQ 262




Transcriptional activator that specifically binds 5'-GATA-3' or 5'-GAT-3' motifs within gene promoters. May be involved in the regulation of some light-responsive genes.
Arabidopsis thaliana (taxid: 3702)
>sp|O82632|GATA9_ARATH GATA transcription factor 9 OS=Arabidopsis thaliana GN=GATA9 PE=2 SV=1 Back     alignment and function description
>sp|Q9FH57|GATA5_ARATH GATA transcription factor 5 OS=Arabidopsis thaliana GN=GATA5 PE=2 SV=1 Back     alignment and function description
>sp|Q8L4M6|GATA3_ARATH GATA transcription factor 3 OS=Arabidopsis thaliana GN=GATA3 PE=2 SV=2 Back     alignment and function description
>sp|P69781|GAT12_ARATH GATA transcription factor 12 OS=Arabidopsis thaliana GN=GATA12 PE=2 SV=1 Back     alignment and function description
>sp|O49741|GATA2_ARATH GATA transcription factor 2 OS=Arabidopsis thaliana GN=GATA2 PE=2 SV=1 Back     alignment and function description
>sp|Q9SD38|GATA6_ARATH GATA transcription factor 6 OS=Arabidopsis thaliana GN=GATA6 PE=2 SV=1 Back     alignment and function description
>sp|O49743|GATA4_ARATH GATA transcription factor 4 OS=Arabidopsis thaliana GN=GATA4 PE=2 SV=1 Back     alignment and function description
>sp|Q6DBP8|GAT11_ARATH GATA transcription factor 11 OS=Arabidopsis thaliana GN=GATA11 PE=2 SV=1 Back     alignment and function description
>sp|O65515|GATA7_ARATH GATA transcription factor 7 OS=Arabidopsis thaliana GN=GATA7 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query262
224057660258 predicted protein [Populus trichocarpa] 0.927 0.941 0.563 5e-62
255554246205 GATA transcription factor, putative [Ric 0.740 0.946 0.642 3e-56
449465254287 PREDICTED: GATA transcription factor 1-l 0.694 0.634 0.544 7e-50
225433393251 PREDICTED: GATA transcription factor 1-l 0.870 0.908 0.513 3e-49
37572445289 AG-motif binding protein-2 [Nicotiana ta 0.694 0.629 0.535 1e-45
15229571274 GATA transcription factor 1 [Arabidopsis 0.870 0.832 0.448 3e-42
21593190268 GATA transcription factor 1 (AtGATA-1) [ 0.870 0.850 0.444 7e-42
297835478270 hypothetical protein ARALYDRAFT_479930 [ 0.870 0.844 0.467 4e-41
357481109331 GATA transcription factor [Medicago trun 0.835 0.661 0.447 3e-40
110743205134 GATA transcription factor 1 [Arabidopsis 0.454 0.888 0.620 3e-35
>gi|224057660|ref|XP_002299291.1| predicted protein [Populus trichocarpa] gi|222846549|gb|EEE84096.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  243 bits (620), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 156/277 (56%), Positives = 185/277 (66%), Gaps = 34/277 (12%)

Query: 1   MESLDLQVC-CIDDLLDF------NINDDECGKPTKRPRNALSSVNRNG---CDFDVFEA 50
           MESLD   C  +DDLLDF        + +E  + +K+ R AL S+N N      F+V E 
Sbjct: 1   MESLDTAACFMVDDLLDFCSDIGEEEDGEEHQRNSKKSRRALPSLNPNALHPASFNVLEH 60

Query: 51  GDDTDRLFPECAEEELEWLSN---FPTVET-FVDISSNP-NILKQQSPNSVLENSNSSSS 105
                 L PE AEEELEWLSN   FPTVET F  +S  P +I K  SP SVLENS +SS+
Sbjct: 61  S-----LLPEFAEEELEWLSNKDAFPTVETCFGSLSGEPGSIPKHHSPVSVLENSTTSST 115

Query: 106 TSTNGSTITNGNNNSNSIIMNCCGNLRVPVRARSKLRTRCRRELLNQEAWWGSVHGSVKA 165
           +++  S+       +++IIM+ C  LRVPV+ARSK   R  RE+  QE WW   +   + 
Sbjct: 116 SNSGNSS-------NSNIIMSYC-RLRVPVKARSKRHHRHPREIQEQECWWSQENFITR- 166

Query: 166 AKPVVSKVIIGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPTF 225
            KP VS   +GRKCQHCG EKTPQWRAGP GPKTLCNACGVR+KSGRLVPEYRPANSPTF
Sbjct: 167 -KPAVSVAKLGRKCQHCGVEKTPQWRAGPDGPKTLCNACGVRYKSGRLVPEYRPANSPTF 225

Query: 226 SSELHSNSHRKVVEMRRQKQMMGIELGVLGVKPVDKG 262
           SS+LHSNSHRKVVEMRRQKQM G+    L  KP+DKG
Sbjct: 226 SSKLHSNSHRKVVEMRRQKQMTGL----LVAKPMDKG 258




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255554246|ref|XP_002518163.1| GATA transcription factor, putative [Ricinus communis] gi|223542759|gb|EEF44296.1| GATA transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|449465254|ref|XP_004150343.1| PREDICTED: GATA transcription factor 1-like [Cucumis sativus] gi|449514819|ref|XP_004164489.1| PREDICTED: GATA transcription factor 1-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|225433393|ref|XP_002285624.1| PREDICTED: GATA transcription factor 1-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|37572445|dbj|BAC98492.1| AG-motif binding protein-2 [Nicotiana tabacum] Back     alignment and taxonomy information
>gi|15229571|ref|NP_189047.1| GATA transcription factor 1 [Arabidopsis thaliana] gi|62900367|sp|Q8LAU9.2|GATA1_ARATH RecName: Full=GATA transcription factor 1; Short=AtGATA-1 gi|2959730|emb|CAA73999.1| homologous to GATA-binding transcription factors [Arabidopsis thaliana] gi|9294674|dbj|BAB03023.1| protein homologous to GATA-binding transcription factors [Arabidopsis thaliana] gi|87116628|gb|ABD19678.1| At3g24050 [Arabidopsis thaliana] gi|332643327|gb|AEE76848.1| GATA transcription factor 1 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|21593190|gb|AAM65139.1| GATA transcription factor 1 (AtGATA-1) [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297835478|ref|XP_002885621.1| hypothetical protein ARALYDRAFT_479930 [Arabidopsis lyrata subsp. lyrata] gi|297331461|gb|EFH61880.1| hypothetical protein ARALYDRAFT_479930 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|357481109|ref|XP_003610840.1| GATA transcription factor [Medicago truncatula] gi|355512175|gb|AES93798.1| GATA transcription factor [Medicago truncatula] Back     alignment and taxonomy information
>gi|110743205|dbj|BAE99493.1| GATA transcription factor 1 [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query262
TAIR|locus:2076191274 GATA1 "GATA transcription fact 0.595 0.569 0.520 6e-46
TAIR|locus:2123738308 GATA9 "GATA transcription fact 0.270 0.230 0.802 2.8e-36
TAIR|locus:2155056339 GATA5 "GATA transcription fact 0.263 0.203 0.840 7.1e-34
TAIR|locus:2055589264 GATA2 "GATA transcription fact 0.270 0.268 0.788 3.8e-33
TAIR|locus:2139594269 GATA3 "GATA transcription fact 0.484 0.472 0.496 7.9e-33
TAIR|locus:2080828312 GATA6 "GATA transcription fact 0.263 0.221 0.797 1.3e-32
TAIR|locus:2103346240 GATA4 "GATA transcription fact 0.263 0.287 0.782 4.3e-32
TAIR|locus:2145259331 GATA12 "GATA transcription fac 0.267 0.211 0.785 3.8e-31
TAIR|locus:2122214238 GATA7 "GATA transcription fact 0.259 0.285 0.764 2e-30
TAIR|locus:2082637322 BME3 "BLUE MICROPYLAR END 3" [ 0.259 0.211 0.764 2.9e-29
TAIR|locus:2076191 GATA1 "GATA transcription factor 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 429 (156.1 bits), Expect = 6.0e-46, Sum P(2) = 6.0e-46
 Identities = 87/167 (52%), Positives = 100/167 (59%)

Query:    88 LKQQSPNSVLEXXXXXXXXXXXXXX------XXXXXXXXXXXXMNCCGNLRVPVRARSKL 141
             +KQ SP SVLE                                M+CC   + P +ARSK 
Sbjct:    98 VKQLSPVSVLETSSHSSTTTTSNSSGGSNGSTAVATTTTTPTIMSCCVGFKAPAKARSKR 157

Query:   142 RTRCRRELLNQEAWWGSVHGXXX---XXXXXXXXXIIGRKCQHCGAEKTPQWRAGPMGPK 198
             R   RR+L  +  W G+  G               I+GRKCQHCGAEKTPQWRAGP GPK
Sbjct:   158 RRTGRRDL--RVLWTGNEQGGIQKKKTMTVAAAALIMGRKCQHCGAEKTPQWRAGPAGPK 215

Query:   199 TLCNACGVRFKSGRLVPEYRPANSPTFSSELHSNSHRKVVEMRRQKQ 245
             TLCNACGVR+KSGRLVPEYRPANSPTF++ELHSNSHRK+VEMR+Q Q
Sbjct:   216 TLCNACGVRYKSGRLVPEYRPANSPTFTAELHSNSHRKIVEMRKQYQ 262


GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA
GO:0008270 "zinc ion binding" evidence=IEA
GO:0043565 "sequence-specific DNA binding" evidence=IEA
GO:0044212 "transcription regulatory region DNA binding" evidence=IDA
GO:0007623 "circadian rhythm" evidence=IEP
TAIR|locus:2123738 GATA9 "GATA transcription factor 9" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2155056 GATA5 "GATA transcription factor 5" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2055589 GATA2 "GATA transcription factor 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2139594 GATA3 "GATA transcription factor 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2080828 GATA6 "GATA transcription factor 6" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2103346 GATA4 "GATA transcription factor 4" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2145259 GATA12 "GATA transcription factor 12" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2122214 GATA7 "GATA transcription factor 7" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2082637 BME3 "BLUE MICROPYLAR END 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q8LAU9GATA1_ARATHNo assigned EC number0.44860.87020.8321yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query262
smart0040152 smart00401, ZnF_GATA, zinc finger binding to DNA c 6e-13
pfam0032036 pfam00320, GATA, GATA zinc finger 9e-13
cd0020254 cd00202, ZnF_GATA, Zinc finger DNA binding domain; 2e-11
>gnl|CDD|214648 smart00401, ZnF_GATA, zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
 Score = 61.7 bits (150), Expect = 6e-13
 Identities = 23/53 (43%), Positives = 29/53 (54%), Gaps = 2/53 (3%)

Query: 175 IGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPTFSS 227
            GR C +CG  +TP WR GP G KTLCNACG+ +K    +   RP +      
Sbjct: 2   SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLK--RPLSLKKDGI 52


Length = 52

>gnl|CDD|109380 pfam00320, GATA, GATA zinc finger Back     alignment and domain information
>gnl|CDD|238123 cd00202, ZnF_GATA, Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 262
cd0020254 ZnF_GATA Zinc finger DNA binding domain; binds spe 99.54
smart0040152 ZnF_GATA zinc finger binding to DNA consensus sequ 99.52
PF0032036 GATA: GATA zinc finger; InterPro: IPR000679 Zinc f 99.47
KOG1601340 consensus GATA-4/5/6 transcription factors [Transc 99.15
COG5641 498 GAT1 GATA Zn-finger-containing transcription facto 98.08
>cd00202 ZnF_GATA Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C Back     alignment and domain information
Probab=99.54  E-value=3.8e-15  Score=107.51  Aligned_cols=44  Identities=45%  Similarity=0.998  Sum_probs=38.5

Q ss_pred             ccccCCCCCCCCcccCCCCCcccchhhhhhhhhCCCCCCCCCCCCCC
Q 024771          178 KCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPT  224 (262)
Q Consensus       178 ~C~nCgtt~TP~WRrGP~G~~tLCNACGL~ykk~rllp~yrpa~sPt  224 (262)
                      .|+||++++||+||+||.|..+|||||||||++++   .+||...+.
T Consensus         1 ~C~~C~~~~Tp~WR~g~~~~~~LCNaCgl~~~k~~---~~rp~~~~~   44 (54)
T cd00202           1 ACSNCGTTTTPLWRRGPSGGSTLCNACGLYWKKHG---VMRPLSKRK   44 (54)
T ss_pred             CCCCCCCCCCcccccCCCCcchHHHHHHHHHHhcC---CCCCcccCc
Confidence            59999999999999999888999999999999876   567766544



>smart00401 ZnF_GATA zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
>PF00320 GATA: GATA zinc finger; InterPro: IPR000679 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>KOG1601 consensus GATA-4/5/6 transcription factors [Transcription] Back     alignment and domain information
>COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query262
2kae_A71 GATA-type transcription factor; zinc finger, GATA- 6e-12
1gnf_A46 Transcription factor GATA-1; zinc finger, transcri 4e-10
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-09
2vut_I43 AREA, nitrogen regulatory protein AREA; transcript 6e-08
3dfx_A63 Trans-acting T-cell-specific transcription factor 1e-06
4gat_A66 Nitrogen regulatory protein AREA; DNA binding prot 3e-06
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Length = 71 Back     alignment and structure
 Score = 59.1 bits (143), Expect = 6e-12
 Identities = 11/45 (24%), Positives = 18/45 (40%), Gaps = 3/45 (6%)

Query: 176 GRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPA 220
             +C +C   +T +WR         CNAC +  +      + RP 
Sbjct: 8   SFQCSNCSVTETIRWRNIRSKEGIQCNACFIYQR---KYNKTRPV 49


>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Length = 46 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Length = 43 Back     alignment and structure
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Length = 63 Back     alignment and structure
>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Length = 66 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query262
3dfx_A63 Trans-acting T-cell-specific transcription factor 99.65
1gnf_A46 Transcription factor GATA-1; zinc finger, transcri 99.61
2vut_I43 AREA, nitrogen regulatory protein AREA; transcript 99.6
2kae_A71 GATA-type transcription factor; zinc finger, GATA- 99.56
4gat_A66 Nitrogen regulatory protein AREA; DNA binding prot 99.53
4hc9_A115 Trans-acting T-cell-specific transcription factor; 99.51
4hc9_A115 Trans-acting T-cell-specific transcription factor; 99.22
2zjr_Z60 50S ribosomal protein L32; ribosome, large ribosom 80.43
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Back     alignment and structure
Probab=99.65  E-value=1.3e-17  Score=123.29  Aligned_cols=43  Identities=35%  Similarity=0.814  Sum_probs=37.6

Q ss_pred             CCCCcccccCCCCCCCCcccCCCCCcccchhhhhhhhhCCCCCCCCC
Q 024771          173 VIIGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRP  219 (262)
Q Consensus       173 ~~~~r~C~nCgtt~TP~WRrGP~G~~tLCNACGL~ykk~rllp~yrp  219 (262)
                      ......|+||++++||+||+||+|+ +|||||||+|++++.   +||
T Consensus         4 ~~~~~~C~~C~tt~Tp~WR~gp~G~-~LCNACGl~~~~~~~---~RP   46 (63)
T 3dfx_A            4 RRAGTSCANCQTTTTTLWRRNANGD-PVCNACGLYYKLHNI---NRP   46 (63)
T ss_dssp             CCTTCCCTTTCCSCCSSCCCCTTSC-CCCHHHHHHHHHHSS---CCC
T ss_pred             CCCCCcCCCcCCCCCCccCCCCCCC-chhhHHHHHHHHcCC---CCC
Confidence            3456789999999999999999996 999999999998775   455



>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Back     alignment and structure
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Back     alignment and structure
>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>2zjr_Z 50S ribosomal protein L32; ribosome, large ribosomal subunit, ribonucleoprotein, RNA-binding, rRNA-binding, tRNA-binding, methylation; 2.91A {Deinococcus radiodurans} SCOP: g.41.8.5 PDB: 1j5a_M* 1jzy_M* 1jzz_M* 1k01_M* 1nkw_Z 1ond_Z* 1sm1_Z* 1yl3_5 2b66_5 2b9n_5 2b9p_5 2zjp_Y* 2zjq_Z 1jzx_M 3cf5_Y* 3dll_Y* 3pio_Z* 3pip_Z* 1nwy_Z* 1nwx_Z* ... Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 262
d1y0ja139 g.39.1.1 (A:200-238) Erythroid transcription facto 8e-12
d2vuti142 g.39.1.1 (I:671-712) Erythroid transcription facto 4e-09
d3gata_66 g.39.1.1 (A:) Erythroid transcription factor GATA- 5e-09
>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Length = 39 Back     information, alignment and structure

class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Mouse (Mus musculus) [TaxId: 10090]
 Score = 56.3 bits (136), Expect = 8e-12
 Identities = 17/34 (50%), Positives = 20/34 (58%), Gaps = 1/34 (2%)

Query: 176 GRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFK 209
            R+C +CGA  TP WR    G   LCNACG+  K
Sbjct: 2   ARECVNCGATATPLWRRDRTG-HYLCNACGLYHK 34


>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Length = 42 Back     information, alignment and structure
>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Length = 66 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query262
d1y0ja139 Erythroid transcription factor GATA-1 {Mouse (Mus 99.68
d2vuti142 Erythroid transcription factor GATA-1 {Emericella 99.63
d3gata_66 Erythroid transcription factor GATA-1 {Chicken (Ga 99.62
d2zjrz158 Ribosomal protein L32p {Deinococcus radiodurans [T 83.32
d2j015159 Ribosomal protein L32p {Thermus thermophilus [TaxI 81.39
>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Mouse (Mus musculus) [TaxId: 10090]
Probab=99.68  E-value=4.8e-18  Score=113.90  Aligned_cols=37  Identities=46%  Similarity=0.959  Sum_probs=34.0

Q ss_pred             CCcccccCCCCCCCCcccCCCCCcccchhhhhhhhhCC
Q 024771          175 IGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGR  212 (262)
Q Consensus       175 ~~r~C~nCgtt~TP~WRrGP~G~~tLCNACGL~ykk~r  212 (262)
                      +.++|+||++++||+||+||+| ++|||||||||+.++
T Consensus         1 e~r~C~~Cgtt~Tp~WR~gp~G-~~LCNACGl~~r~~G   37 (39)
T d1y0ja1           1 EARECVNCGATATPLWRRDRTG-HYLCNACGLYHKMNG   37 (39)
T ss_dssp             CCCCCSSSCCCCCSCCEECTTS-CEECSSHHHHHHHSC
T ss_pred             CcCCCCCCCCCCCcccccCCCC-CEeeHHhHHHHHHhC
Confidence            3578999999999999999999 799999999999765



>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Back     information, alignment and structure
>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Back     information, alignment and structure
>d2zjrz1 g.41.8.5 (Z:2-59) Ribosomal protein L32p {Deinococcus radiodurans [TaxId: 1299]} Back     information, alignment and structure
>d2j0151 g.41.8.5 (5:2-60) Ribosomal protein L32p {Thermus thermophilus [TaxId: 274]} Back     information, alignment and structure