Citrus sinensis ID: 017834


Local Sequence Feature Prediction

Prediction (Method) and Result
Residue Number Marker
Protein Sequence
Secondary Structure (PSIPRED)
Secondary Structure Prediction (SSPRO)
Coil and Loop (DISEMBL)
Flexible Loop (DISEMBL)
Low Complexity Region (SEG)
Disordered Region (IsUnstruct)
Disordered Region (DISOPRED)
Disordered Region (DISEMBL)
Disordered Region (DISPRO)
Transmembrane Helix (TMHMM)
Transmembrane Helix (HMMTOP)
Transmembrane Helix (MEMSAT)
TM Helix, Signal Peptide (MEMSAT_SVM)
TM Helix, Signal Peptide (Phobius)
Signal Peptide (SignalP HMM Mode)
Signal Peptide (SignalP NN Mode)
Coiled Coils (COILS)
Positional Conservation
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-----
MFAMNPQPLQARPYIDTEEHDVAQTPIPIQNGSKQGDRYDEPEEVEDEAGASSVNRKSNDRGGSSVQSSTSTRTSELTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPSTVPSSAFAQPQNIMSGGSASNGSKLSQRIASLVRFREKRKERSFEKKIRYSCRKEVAQRMQRKNGQFTSSKATFNIASANSNPSNGSAPPESVSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLTKGARNICFEQHELETSSDIKPATTEAENSYANQDEQGSPHETKPAPLDPQNHSMRSNEQYLLESDDGFACPLPIQEDNSLMNLDDEDLQEAMDELANASGSDFEIPSNFDEQVFICV
cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEccEEEEccccccHHHHHHHHHccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHccccccccccccccccccEEcc
cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEccEEEEEccccHHHHHHHHHHHcccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHcccccEEEHHHHHHHHHHcccccccEEccccccccccccccccccccccccccHHEccccccccccccccccccccccHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHccccccccEEEccccccccHHHHccHHHHcccccccEEccccccccEEEEc
mfamnpqplqarpyidteehdvaqtpipiqngskqgdrydepeevedeagassvnrksndrggssvqsststrtseltvayegevyvfpavtPHKVQALLLLLgecdipstvpssafaqpqnimsggsasngskLSQRIASLVRFREKRKERSFEKKIRYSCRKEVAQRMQrkngqftsskATFNIasansnpsngsappesVSRICqhcgisekltpamrrgpagprtlcnacglmwankgtlrdltkgarnicfeqheletssdikpatteaensyanqdeqgsphetkpapldpqnhsmrsneqyllesddgfacplpiqednslmnlddeDLQEAMDELAnasgsdfeipsnfdeqvficv
mfamnpqplQARPYIDTEEHDVAqtpipiqngskqgdrYDEPEEVEdeagassvnrksndrggssvqsststrtseLTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPSTVPSSAFAQPQNImsggsasngsklSQRIASLVrfrekrkersfekkiryscrKEVAqrmqrkngqftssKATFNIASANSNPSNGSAPPESVSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLTKGARNICFEqheletssdikpATTEAENSYanqdeqgspheTKPAPLDPQNHSMRSNEQYLLESDDGFACPLPIQEDNSLMNLDDEDLQEAMDELANASgsdfeipsnfDEQVFICV
MFAMNPQPLQARPYIDTEEHDVAQTPIPIQNGSKQGDRYDEPEEVEDEAGASSVNRKSNDRGGssvqsststrtsELTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPSTVPSSAFAQPQNIMSGGSASNGSKLSQRIASLVrfrekrkersfekkIRYSCRKEVAQRMQRKNGQFTSSKATFNIasansnpsngsappesVSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLTKGARNICFEQHELETSSDIKPATTEAENSYANQDEQGSPHETKPAPLDPQNHSMRSNEQYLLESDDGFACPLPIQEDNSLMNLDDEDLQEAMDELANASGSDFEIPSNFDEQVFICV
****************************************************************************LTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPS***********************************************************************************************ICQHCGIS************GPRTLCNACGLMWANKGTLRDLTKGARNICFE************************************************************************************************************
*******************************************************************************AYEGEVYVFPAVTPHKVQA*******************************************************************************TSSKATFNIASANSNPSNGSAPPESVSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWA****************************************************************************************************************EIPSNFDEQVFICV
MFAMNPQPLQARPYIDTEEHDVAQTPIPIQNG*******************************************ELTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPSTVPSSAFAQPQNIMS*********LSQRIASLVRFREKRKERSFEKKIRYSCRKEVAQRMQRKNGQFTSSKATFNIASAN*************SRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLTKGARNICFEQHELETSSDIKPATT*******************PAPLDPQNHSMRSNEQYLLESDDGFACPLPIQEDNSLMNLDDEDLQEAMDELANASGSDFEIPSNFDEQVFICV
************************************************************************RTSELTVAYEGEVYVFPAVTPHKVQALLLLLGECDIP***************************QRIASLVRFREKRKERSFEKKIRYSCRKEVAQRMQ***********************************CQ*C*ISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDL*********************************************************NEQYLLESDDGFACPLPIQEDNSLMNLDDEDLQEAMDELANASGSDFEIPSNFDEQVFICV
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MFAMNPQPLQARPYIDTEEHDVAQTPIPIQNGSKQGDRYDEPEEVEDEAGASSVNRKSNDRGGSSVQSSTSTRTSELTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPSTVPSSAFAQPQNIMSGGSASNGSKLSQRIASLVRFREKRKERSFEKKIRYSCRKEVAQRMQRKNGQFTSSKATFNIASANSNPSNGSAPPESVSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLTKGARNICFEQHELETSSDIKPATTEAENSYANQDEQGSPHETKPAPLDPQNHSMRSNEQYLLESDDGFACPLPIQEDNSLMNLDDEDLQEAMDELANASGSDFEIPSNFDEQVFICV
no confident homologs detected
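
The per-residue tracks above are easier to read once collapsed into runs. The following is a minimal sketch, not part of the original report: the helper name `segments` is hypothetical, and the excerpt is a short stretch copied from the PSIPRED row, numbered from 1 within the excerpt rather than by its true residue position.

```python
from itertools import groupby


def segments(ss, start=1):
    """Collapse a per-residue state string into (state, first, last) runs.

    `start` is the number assigned to the first character of `ss`.
    """
    runs = []
    pos = start
    for state, group in groupby(ss):
        length = len(list(group))
        runs.append((state, pos, pos + length - 1))
        pos += length
    return runs


# Short excerpt of the PSIPRED track (E = strand, H = helix, c = coil).
excerpt = "EEEEccEEEEccccccHHHHHHHHH"
for state, first, last in segments(excerpt):
    print(state, first, last)
```

Passing the true residue number of the first character as `start` would give runs in query coordinates.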

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST

ID      Length  Definition                 RBH(Q2H)  RBH(H2Q)  Q cover  H cover  Identity  E-value
Query   365     2.2.26 [Sep-21-2011]
Q8GXL7  297     GATA transcription factor  yes       no        0.473    0.582    0.554     2e-50
Q8H1G0  302     GATA transcription factor  no        no        0.531    0.642    0.504     3e-48
Q9LRH6  309     GATA transcription factor  no        no        0.471    0.556    0.534     1e-41
Q93WK5  727     Two-component response re  no        no        0.136    0.068    0.52      1e-07
A2YQ93  742     Two-component response re  N/A       no        0.115    0.056    0.5       2e-06
Q0D3B6  742     Two-component response re  no        no        0.115    0.056    0.5       2e-06
Q550D5  872     Transcription factor stal  yes       no        0.112    0.047    0.534     2e-06
Q55C49  1006    GATA zinc finger domain-c  no        no        0.090    0.032    0.628     3e-06
Q9LKL2  618     Two-component response re  no        no        0.147    0.087    0.444     3e-06
Q9LVG4  495     Two-component response re  no        no        0.134    0.098    0.489     4e-06
>sp|Q8GXL7|GAT24_ARATH GATA transcription factor 24 OS=Arabidopsis thaliana GN=GATA24 PE=2 SV=2
 Score =  199 bits (506), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 102/184 (55%), Positives = 132/184 (71%), Gaps = 11/184 (5%)

Query: 76  ELTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPSTVPSSAFAQPQNIMSGGSASNGSKL 135
           +LT++++G+VYVF  V+P KVQA+LLLLG  ++P T+P++  +  QN    G +    +L
Sbjct: 79  QLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNRVLGLSGTPQRL 138

Query: 136 S--QRIASLVRFREKRKERSFEKKIRYSCRKEVAQRMQRKNGQFTSSKATFNIASANSNP 193
           S  QR+ASL+RFREKRK R+F+K IRY+ RKEVA RMQRK GQFTS+K++ N  S ++  
Sbjct: 139 SVPQRLASLLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKSS-NDDSGSTGS 197

Query: 194 SNGSAPPESVSR--------ICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLR 245
             GS    +V          +C+HCG SEK TP MRRGP GPRTLCNACGLMWANKGTLR
Sbjct: 198 DWGSNQSWAVEGTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTLR 257

Query: 246 DLTK 249
           DL+K
Sbjct: 258 DLSK 261
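
The fractional Identity value reported for this hit (0.554) can be recovered from the raw counts on the "Identities" line of the alignment header. The sketch below is illustrative only: `parse_counts` is a hypothetical helper, and it assumes the `name = n/m` layout seen in the header above.

```python
import re


def parse_counts(line):
    """Extract 'Name = n/m' pairs from a BLAST alignment header line.

    Returns {name: (n, m, n/m rounded to 3 decimals)}.
    """
    stats = {}
    for name, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        n, m = int(num), int(den)
        stats[name] = (n, m, round(n / m, 3))
    return stats


line = " Identities = 102/184 (55%), Positives = 132/184 (71%), Gaps = 11/184 (5%)"
stats = parse_counts(line)
print(stats["Identities"])  # counts and fraction of identical aligned columns
```

The denominator is the alignment length including gap columns (184 here), not the length of either sequence.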




Transcriptional activator that specifically binds 5'-GATA-3' or 5'-GAT-3' motifs within gene promoters.
Arabidopsis thaliana (taxid: 3702)
>sp|Q8H1G0|GAT28_ARATH GATA transcription factor 28 OS=Arabidopsis thaliana GN=GATA28 PE=2 SV=1
>sp|Q9LRH6|GAT25_ARATH GATA transcription factor 25 OS=Arabidopsis thaliana GN=GATA25 PE=2 SV=2
>sp|Q93WK5|APRR7_ARATH Two-component response regulator-like APRR7 OS=Arabidopsis thaliana GN=APRR7 PE=2 SV=1
>sp|A2YQ93|PRR37_ORYSI Two-component response regulator-like PRR37 OS=Oryza sativa subsp. indica GN=PRR37 PE=2 SV=2
>sp|Q0D3B6|PRR37_ORYSJ Two-component response regulator-like PRR37 OS=Oryza sativa subsp. japonica GN=PRR37 PE=2 SV=1
>sp|Q550D5|GTAA_DICDI Transcription factor stalky OS=Dictyostelium discoideum GN=stkA PE=1 SV=1
>sp|Q55C49|GTAG_DICDI GATA zinc finger domain-containing protein 7 OS=Dictyostelium discoideum GN=gtaG PE=4 SV=1
>sp|Q9LKL2|APRR1_ARATH Two-component response regulator-like APRR1 OS=Arabidopsis thaliana GN=APRR1 PE=1 SV=1
>sp|Q9LVG4|APRR3_ARATH Two-component response regulator-like APRR3 OS=Arabidopsis thaliana GN=APRR3 PE=1 SV=1

Close Homologs in the Non-Redundant Database Detected by BLAST

GI         Length  Definition                                Q cover  H cover  Identity  E-value
Query      365
359494710  371     PREDICTED: GATA transcription factor 28-  0.972    0.956    0.567     1e-102
224141135  368     predicted protein [Populus trichocarpa]   0.797    0.790    0.614     7e-95
356519473  1174    PREDICTED: dynamin-related protein 3A-li  0.8      0.248    0.591     4e-94
356528009  358     PREDICTED: GATA transcription factor 24-  0.810    0.826    0.577     1e-93
357476233  334     GATA transcription factor [Medicago trun  0.775    0.847    0.566     6e-82
359492959  368     PREDICTED: GATA transcription factor 24-  0.923    0.915    0.491     2e-81
255563366  313     hypothetical protein RCOM_0886650 [Ricin  0.638    0.744    0.661     3e-81
255572874  327     GATA transcription factor, putative [Ric  0.882    0.984    0.515     4e-80
302142082  324     unnamed protein product [Vitis vinifera]  0.830    0.935    0.521     5e-77
356508042  350     PREDICTED: GATA transcription factor 24-  0.852    0.888    0.517     4e-73
>gi|359494710|ref|XP_002268872.2| PREDICTED: GATA transcription factor 28-like [Vitis vinifera]
 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 206/363 (56%), Positives = 249/363 (68%), Gaps = 8/363 (2%)

Query: 1   MFAMNPQPLQARPYIDTEEHDVAQTPIPIQ-NGSKQGDRYDEPEEVEDEAGASSVNRKSN 59
           M  +NP+PLQA P+   EEHD     +PI+ NG++ G   ++      EA +     + +
Sbjct: 1   METVNPRPLQALPF---EEHDDDSMQVPIEINGNEGGFEVEDVTGGGGEAVSGGEGGRMS 57

Query: 60  DRGGSSVQSSTSTRTSELTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPSTVPSSAFAQ 119
                   S  + RTSELT+++EGEVYVF AVTP KVQA+LLLLG  + PS+V SS F  
Sbjct: 58  SVNADEKSSVVAQRTSELTISFEGEVYVFHAVTPDKVQAVLLLLGGHETPSSVSSSEFLL 117

Query: 120 PQNIMSGGSASNGSKLSQRIASLVRFREKRKERSFEKKIRYSCRKEVAQRMQRKNGQFTS 179
            QN+     AS  S L +RIASL+RFREKRKER FEKKIRY+CRKEVAQRM RKNGQF S
Sbjct: 118 QQNMKGLVDASKCSNLPRRIASLIRFREKRKERCFEKKIRYTCRKEVAQRMHRKNGQFAS 177

Query: 180 SKATFNIASANSNPSNGSAPPESVSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWA 239
            K +F +A+ N +PS+G+  PE V R CQHCGISEK TPAMRRGPAGPR+LCNACGLMWA
Sbjct: 178 VKESFKMATGNWDPSSGTPCPEYVFRRCQHCGISEKSTPAMRRGPAGPRSLCNACGLMWA 237

Query: 240 NKGTLRDLTKGARNICFEQHELETSSDIKPATTEAENSYANQDEQGSPHETKPAPLDPQN 299
           NKGTLRDL+KG+R I F Q ELETS DIKP T E E +Y N DE GS  E KP PL+  N
Sbjct: 238 NKGTLRDLSKGSRMIPFGQDELETSDDIKPVTMEREKAYGNHDELGSSEEMKPVPLESGN 297

Query: 300 HSM-RSNEQYLLESDDGFACPLPIQEDNSLMNLDDEDLQEAMDELANASGSDFEIPSNFD 358
            +  + NEQ LLE+       LP+  DNS +N D+   QE  + LAN SG+DFEIP+NFD
Sbjct: 298 PTTGQQNEQDLLETAVALVDHLPVPVDNSSINPDE---QENTEVLANVSGTDFEIPTNFD 354

Query: 359 EQV 361
           EQV
Sbjct: 355 EQV 357




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224141135|ref|XP_002323930.1| predicted protein [Populus trichocarpa] gi|222866932|gb|EEF04063.1| predicted protein [Populus trichocarpa]
>gi|356519473|ref|XP_003528397.1| PREDICTED: dynamin-related protein 3A-like [Glycine max]
>gi|356528009|ref|XP_003532598.1| PREDICTED: GATA transcription factor 24-like [Glycine max]
>gi|357476233|ref|XP_003608402.1| GATA transcription factor [Medicago truncatula] gi|355509457|gb|AES90599.1| GATA transcription factor [Medicago truncatula]
>gi|359492959|ref|XP_002283738.2| PREDICTED: GATA transcription factor 24-like [Vitis vinifera]
>gi|255563366|ref|XP_002522686.1| hypothetical protein RCOM_0886650 [Ricinus communis] gi|223538162|gb|EEF39773.1| hypothetical protein RCOM_0886650 [Ricinus communis]
>gi|255572874|ref|XP_002527369.1| GATA transcription factor, putative [Ricinus communis] gi|223533288|gb|EEF35041.1| GATA transcription factor, putative [Ricinus communis]
>gi|302142082|emb|CBI19285.3| unnamed protein product [Vitis vinifera]
>gi|356508042|ref|XP_003522771.1| PREDICTED: GATA transcription factor 24-like isoform 1 [Glycine max]

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology Terms Detected by BLAST

ID                      Length  Definition                      Q cover  H cover  Identity  E-value
Query                   365
TAIR|locus:505006360    297     ZML1 "ZIM-like 1" [Arabidopsis  0.473    0.582    0.483     1.1e-40
TAIR|locus:2017582      302     ZML2 "ZIM-LIKE 2" [Arabidopsis  0.526    0.635    0.437     1.2e-35
DICTYBASE|DDB_G0277147  872     stkA "GATA zinc finger domain-  0.331    0.138    0.290     1.4e-06
TAIR|locus:2076191      274     GATA1 "GATA transcription fact  0.210    0.281    0.421     1.8e-06
TAIR|locus:2155919      139     GATA16 "GATA transcription fac  0.219    0.575    0.322     1.4e-05
DICTYBASE|DDB_G0285139  640     gtaL "GATA zinc finger domain-  0.336    0.192    0.283     5.4e-05
TAIR|locus:2139594      269     GATA3 "GATA transcription fact  0.230    0.312    0.351     0.00035
TAIR|locus:504955441    197     AT4G16141 [Arabidopsis thalian  0.273    0.507    0.295     0.00053
DICTYBASE|DDB_G0270756  1006    gtaG "GATA zinc finger domain-  0.263    0.095    0.307     0.00074
TAIR|locus:2103346      240     GATA4 "GATA transcription fact  0.090    0.137    0.6       0.00076
TAIR|locus:505006360 ZML1 "ZIM-like 1" [Arabidopsis thaliana (taxid:3702)]
 Score = 401 (146.2 bits), Expect = 1.1e-40, Sum P(2) = 1.1e-40
 Identities = 89/184 (48%), Positives = 114/184 (61%)

Query:    76 ELTVAYEGEVYVFPAVTPHKVQALLLLLGECDIPSTVPSSAFAQPQNIMSGGSASNGSKL 135
             +LT++++G+VYVF  V+P KVQA+LLLLG  ++P T+P++  +  QN    G +    +L
Sbjct:    79 QLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNRVLGLSGTPQRL 138

Query:   136 S--QRIASLVXXXXXXXXXXXXXXIRYSCRKEVAQRMQRKNGQFTSSKATFNIXXXXXXX 193
             S  QR+ASL+              IRY+ RKEVA RMQRK GQFTS+K++ N        
Sbjct:   139 SVPQRLASLLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKSS-NDDSGSTGS 197

Query:   194 XXXXXXXXXVSR--------ICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLR 245
                      V          +C+HCG SEK TP MRRGP GPRTLCNACGLMWANKGTLR
Sbjct:   198 DWGSNQSWAVEGTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTLR 257

Query:   246 DLTK 249
             DL+K
Sbjct:   258 DLSK 261


GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA
GO:0008270 "zinc ion binding" evidence=IEA
GO:0043565 "sequence-specific DNA binding" evidence=IEA
TAIR|locus:2017582 ZML2 "ZIM-LIKE 2" [Arabidopsis thaliana (taxid:3702)]
DICTYBASE|DDB_G0277147 stkA "GATA zinc finger domain-containing protein 1" [Dictyostelium discoideum (taxid:44689)]
TAIR|locus:2076191 GATA1 "GATA transcription factor 1" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2155919 GATA16 "GATA transcription factor 16" [Arabidopsis thaliana (taxid:3702)]
DICTYBASE|DDB_G0285139 gtaL "GATA zinc finger domain-containing protein 12" [Dictyostelium discoideum (taxid:44689)]
TAIR|locus:2139594 GATA3 "GATA transcription factor 3" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:504955441 AT4G16141 [Arabidopsis thaliana (taxid:3702)]
DICTYBASE|DDB_G0270756 gtaG "GATA zinc finger domain-containing protein 7" [Dictyostelium discoideum (taxid:44689)]
TAIR|locus:2103346 GATA4 "GATA transcription factor 4" [Arabidopsis thaliana (taxid:3702)]

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries

No confident hit for EC number transfer was detected by BLAST in SWISS-PROT

EC Number Prediction by EzyPred Server

Failed to connect to the EzyPred server

EC Number Prediction by EFICAz Software

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING

Failed to connect to the STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST

ID         Length  Definition                                          E-value
Query      365
pfam06203  45      pfam06203, CCT, CCT motif                           5e-17
cd00202    54      cd00202, ZnF_GATA, Zinc finger DNA binding domain;  9e-14
pfam00320  36      pfam00320, GATA, GATA zinc finger                   7e-12
smart00401 52      smart00401, ZnF_GATA, zinc finger binding to DNA c  2e-11
pfam06200  36      pfam06200, tify, tify domain                        4e-05
smart00979 36      smart00979, TIFY, This short possible domain is fo  3e-04
>gnl|CDD|203407 pfam06203, CCT, CCT motif
 Score = 73.8 bits (182), Expect = 5e-17
 Identities = 22/44 (50%), Positives = 30/44 (68%)

Query: 138 RIASLVRFREKRKERSFEKKIRYSCRKEVAQRMQRKNGQFTSSK 181
           R A+L+R++EKRK R F+KKIRY+ RK VA+   R  G+F    
Sbjct: 1   REAALLRYKEKRKTRKFDKKIRYASRKAVAESRPRVKGRFVKQS 44


This short motif is found in a number of plant proteins. It is rich in basic amino acids and has been called a CCT motif after Co, Col and Toc1. The CCT motif is about 45 amino acids long and contains a putative nuclear localisation signal within the second half of the CCT motif. Toc1 mutants have been identified in this region. Length = 45

>gnl|CDD|238123 cd00202, ZnF_GATA, Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C
>gnl|CDD|109380 pfam00320, GATA, GATA zinc finger
>gnl|CDD|214648 smart00401, ZnF_GATA, zinc finger binding to DNA consensus sequence [AT]GATA[AG]
>gnl|CDD|203405 pfam06200, tify, tify domain
>gnl|CDD|198047 smart00979, TIFY, This short possible domain is found in a variety of plant transcription factors that contain GATA domains as well as other motifs
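
The zinc-finger topology quoted for cd00202, C-X(2)-C-X(17)-C-X(2)-C, can be checked against the query with a regular expression. The sketch below is illustrative: `find_fingers` is a hypothetical helper, the fragment is the ungapped query residues 194-249 taken from the BLAST alignment above, and the relaxed spacer range 17-20 is an assumption chosen for this example, not a published definition.

```python
import re

# Ungapped query residues 194-249, copied from the BLAST alignment above.
fragment = "SNGSAPPESVSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLTK"
offset = 194  # residue number of the first character of `fragment`


def find_fingers(seq, spacer, start=1):
    """Find C-X(2)-C-X(spacer)-C-X(2)-C matches.

    `spacer` is a regex repeat count such as "17" or "17,20".
    Returns (first_residue, last_residue, matched_text) tuples, 1-based.
    """
    pattern = re.compile(r"C.{2}C.{%s}C.{2}C" % spacer)
    return [(m.start() + start, m.end() + start - 1, m.group())
            for m in pattern.finditer(seq)]


strict = find_fingers(fragment, "17", offset)      # literal cd00202 topology
relaxed = find_fingers(fragment, "17,20", offset)  # allow a longer loop
print(strict)
print(relaxed)
```

In this fragment the four cysteines sit at residues 207, 210, 231 and 234, so the loop between the two cysteine pairs is 20 residues long and only the relaxed pattern matches.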

Conserved Domains Detected by HHsearch

ID         Length  Definition                                          Probability
Query      365
PF06200    36      tify: tify domain; InterPro: IPR010399 The tify do  99.59
cd00202    54      ZnF_GATA Zinc finger DNA binding domain; binds spe  99.57
smart00401 52      ZnF_GATA zinc finger binding to DNA consensus sequ  99.5
PF00320    36      GATA: GATA zinc finger; InterPro: IPR000679 Zinc f  99.49
PF06203    45      CCT: CCT motif; InterPro: IPR010402 The CCT (CONST  99.14
KOG1601    340     consensus GATA-4/5/6 transcription factors [Transc  98.62
COG5641    498     GAT1 GATA Zn-finger-containing transcription facto  98.27
PF09425    27      CCT_2: Divergent CCT motif; InterPro: IPR018467 Th  97.56
COG5641    498     GAT1 GATA Zn-finger-containing transcription facto  90.21
>PF06200 tify: tify domain; InterPro: IPR010399 The tify domain is a 36-amino acid domain only found among Embryophyta (land plants)
Probab=99.59  E-value=1.5e-15  Score=105.81  Aligned_cols=35  Identities=43%  Similarity=0.698  Sum_probs=32.7

Q ss_pred             CCCCCccceEEEccEEEEeCCCChHHHHHHHHHhc
Q 017834           70 TSTRTSELTVAYEGEVYVFPAVTPHKVQALLLLLG  104 (365)
Q Consensus        70 ~~~~t~QLTIfY~G~V~VFD~VppeKaqaImllag  104 (365)
                      +.+.++||||||+|+|+|||+||+|||++||+||+
T Consensus         1 ~~~~~~qLTIfY~G~V~Vfd~v~~~Ka~~im~lA~   35 (36)
T PF06200_consen    1 PSPETAQLTIFYGGQVCVFDDVPPDKAQEIMLLAS   35 (36)
T ss_pred             CCCCCCcEEEEECCEEEEeCCCCHHHHHHHHHHhc
Confidence            35678899999999999999999999999999997
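
Each HHsearch hit begins with a summary line of whitespace-separated `key=value` tokens. A minimal sketch for reading one such line follows; `parse_hh_summary` is a hypothetical helper, and the layout is assumed from the line shown above rather than taken from a format specification.

```python
def parse_hh_summary(line):
    """Split an HHsearch hit summary line into a {key: value-string} dict."""
    fields = {}
    for token in line.split():
        key, _, value = token.partition("=")
        fields[key] = value
    return fields


line = ("Probab=99.59  E-value=1.5e-15  Score=105.81  Aligned_cols=35  "
        "Identities=43%  Similarity=0.698  Sum_probs=32.7")
hit = parse_hh_summary(line)
print(float(hit["Probab"]), float(hit["E-value"]), int(hit["Aligned_cols"]))
```

Values are kept as strings so that fields such as `Identities=43%` survive unchanged; the caller converts only the fields it needs.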



The domain is named after its most conserved amino acid pattern (TIF[F/Y]XG) and was previously known as the ZIM domain. Because uppercase lettering (TIFY) might imply that the motif is fully conserved across proteins, lowercase lettering was chosen to reflect its natural variability. Based on domain architecture, tify domain-containing proteins fall into two groups. Group I proteins possess a CCT (CONSTANS, CO-like, and TOC1) domain and a GATA-type zinc finger in addition to the tify domain. Group II proteins contain the tify domain but lack a GATA-type zinc finger. Tify domain-containing proteins might be involved in developmental processes, and some have features characteristic of transcription factors: nuclear localisation and a putative DNA-binding domain. Proteins known to contain a tify domain include Arabidopsis thaliana Zinc-finger protein expressed in Inflorescence Meristem (ZIM), a putative transcription factor involved in inflorescence and flower development; A. thaliana ZIM-like proteins (ZML); and A. thaliana PEAPOD1 and PEAPOD2 (PPD1 and PPD2).

>cd00202 ZnF_GATA Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C
>smart00401 ZnF_GATA zinc finger binding to DNA consensus sequence [AT]GATA[AG]
>PF00320 GATA: GATA zinc finger; InterPro: IPR000679 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule
>PF06203 CCT: CCT motif; InterPro: IPR010402 The CCT (CONSTANS, CO-like, and TOC1) domain is a highly conserved basic module of ~43 amino acids, which is found near the C terminus of plant proteins often involved in light signal transduction
>KOG1601 consensus GATA-4/5/6 transcription factors [Transcription]
>COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription]
>PF09425 CCT_2: Divergent CCT motif; InterPro: IPR018467 The short CCT (CO, COL, TOC1) motif is found in a number of plant proteins, including Constans (CO), Constans-like (COL) and TOC1
>COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription]

Homologous Structure Templates

Structure Templates Detected by BLAST

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST

ID      Length  Definition                                          E-value
Query   365
2kae_A  71      GATA-type transcription factor; zinc finger, GATA-  4e-11
1gnf_A  46      Transcription factor GATA-1; zinc finger, transcri  2e-10
4gat_A  66      Nitrogen regulatory protein AREA; DNA binding prot  2e-07
3dfx_A  63      Trans-acting T-cell-specific transcription factor   3e-07
2vut_I  43      AREA, nitrogen regulatory protein AREA; transcript  3e-07
1vt4_I  1221    APAF-1 related killer DARK; drosophila apoptosome,  3e-07
1vt4_I  1221    APAF-1 related killer DARK; drosophila apoptosome,  6e-07
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Length = 71
 Score = 57.2 bits (138), Expect = 4e-11
 Identities = 12/48 (25%), Positives = 18/48 (37%), Gaps = 2/48 (4%)

Query: 201 ESVSRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLT 248
              S  C +C ++E  T   R   +     CNAC +        R +T
Sbjct: 5   NKKSFQCSNCSVTE--TIRWRNIRSKEGIQCNACFIYQRKYNKTRPVT 50


>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Length = 46
>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Length = 66
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Length = 63
>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Length = 43
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221

Structure Templates Detected by HHsearch

ID      Length  Definition                                          Probability
Query   365
3dfx_A  63      Trans-acting T-cell-specific transcription factor   99.69
1gnf_A  46      Transcription factor GATA-1; zinc finger, transcri  99.69
2vut_I  43      AREA, nitrogen regulatory protein AREA; transcript  99.68
4gat_A  66      Nitrogen regulatory protein AREA; DNA binding prot  99.65
2kae_A  71      GATA-type transcription factor; zinc finger, GATA-  99.54
4hc9_A  115     Trans-acting T-cell-specific transcription factor;  99.49
4hc9_A  115     Trans-acting T-cell-specific transcription factor;  99.4
3ogk_Q  22      JAZ1 incomplete degron peptide; leucine rich repea  97.23
3ogl_Q  21      JAZ1 incomplete degron peptide; leucine-rich repea  96.76
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A*
Probab=99.69  E-value=1e-17  Score=128.66  Aligned_cols=54  Identities=30%  Similarity=0.540  Sum_probs=46.7

Q ss_pred             ccccccccccccCCCccccCCCCCchhchHhhhhHHhcCCCCCCCcCCCcccccccc
Q 017834          204 SRICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLTKGARNICFEQHE  260 (365)
Q Consensus       204 ~~~C~~Cg~t~~~TP~WRrGP~G~~tLCNACGl~~~~~~~~r~~~k~~~~i~~~q~~  260 (365)
                      ...|+||+++  .||+||+||+|+ +|||||||+|++++++|+++.....|....+.
T Consensus         7 ~~~C~~C~tt--~Tp~WR~gp~G~-~LCNACGl~~~~~~~~RP~~~~~~~i~~R~Rk   60 (63)
T 3dfx_A            7 GTSCANCQTT--TTTLWRRNANGD-PVCNACGLYYKLHNINRPLTMKKEGIQTRNRK   60 (63)
T ss_dssp             TCCCTTTCCS--CCSSCCCCTTSC-CCCHHHHHHHHHHSSCCCGGGCCSSCCCCC--
T ss_pred             CCcCCCcCCC--CCCccCCCCCCC-chhhHHHHHHHHcCCCCCcCcCCCccccccCC
Confidence            6799999998  699999999996 99999999999999999998877666655443



>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A
>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I*
>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A*
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans}
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST

ID       Length  Definition                                            E-value
Query    365
d1y0ja1  39      g.39.1.1 (A:200-238) Erythroid transcription facto  8e-11
d2vuti1  42      g.39.1.1 (I:671-712) Erythroid transcription facto  3e-10
d3gata_  66      g.39.1.1 (A:) Erythroid transcription factor GATA-  9e-09
>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Length = 39

class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Mouse (Mus musculus) [TaxId: 10090]
 Score = 54.3 bits (131), Expect = 8e-11
 Identities = 17/39 (43%), Positives = 19/39 (48%), Gaps = 3/39 (7%)

Query: 205 RICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGT 243
           R C +CG +   TP  RR   G   LCNACGL     G 
Sbjct: 3   RECVNCGATA--TPLWRRDRTG-HYLCNACGLYHKMNGQ 38


>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Length = 42
>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Length = 66

Homologous Domains Detected by HHsearch

ID       Length  Definition                                          Probability
Query    365
d2vuti1  42      Erythroid transcription factor GATA-1 {Emericella   99.72
d3gata_  66      Erythroid transcription factor GATA-1 {Chicken (Ga  99.67
d1y0ja1  39      Erythroid transcription factor GATA-1 {Mouse (Mus   99.67
>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]}
class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Emericella nidulans [TaxId: 162425]
Probab=99.72  E-value=5.8e-19  Score=123.61  Aligned_cols=40  Identities=45%  Similarity=0.916  Sum_probs=37.9

Q ss_pred             ccccccccccCCCccccCCCCCchhchHhhhhHHhcCCCCCCC
Q 017834          206 ICQHCGISEKLTPAMRRGPAGPRTLCNACGLMWANKGTLRDLT  248 (365)
Q Consensus       206 ~C~~Cg~t~~~TP~WRrGP~G~~tLCNACGl~~~~~~~~r~~~  248 (365)
                      .|+||+++  +||+||+||+| ++|||||||||++++++||++
T Consensus         2 ~C~nC~tt--~Tp~WRr~~~G-~~lCNACGl~~k~~g~~RP~s   41 (42)
T d2vuti1           2 TCTNCFTQ--TTPLWRRNPEG-QPLCNACGLFLKLHGVVRPLS   41 (42)
T ss_dssp             CCSSSCCC--CCSCCEECTTS-CEECHHHHHHHHHHSSCCCCC
T ss_pred             cCCCCCCC--CCccceeCCCC-CCchhhhhHHHHHcCCCCCCC
Confidence            69999998  79999999999 799999999999999999976



>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]}
>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]}