Citrus Sinensis ID: 021178


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310------
MAPMYLNPAQDHSDPFRLAEAQKRDHQRLQVLHSSSHNRPAASVSRPIFLNSSTQDQGMIKLEGSQQHDQKRIAGGGSSDLQSSMSQPKTMTNKLAIRRREVGEGSTSDNSSYTSSSSGESMSSKMRLANKIINSSSVSTGTHDESVKVAEKLLHEHDNIEVHYFTTNSSNSNNTVRICSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARKAMQAAAESGTTTAKDNSSFSKIKLQNNMEKKPRTSHVAQYKKVQCNTPDPDPPHHEYRSQRKLCFKDFALSLSSNSALKQVFPRDVEEAAILLMELSCGFSHT
cccccccccccccccccccHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHcccccccccccccccHHHHHHcccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHccccccc
cccccccccccccccccHHHHcHHHccEEEEccccccccccccccccEEEEccccccccccccccccccccccccccccccccccccccccccccHHHcccccccccccccccccccccccccccccHHHHccccccccccccccccccccccccccccccccccccccccccccEEEEccccccccccccccccccccHHHccccEEcccccccccccccccccccccccHHHHHHHHHHHcccccccHHHHHHcccccccccccccccccccEEcccccEEcccccHHHHHcccccHHHHHHHHHHHHcccccc
mapmylnpaqdhsdpfRLAEAQKRDHQRLQVLhssshnrpaasvsrpiflnsstqdqgmiklegsqqhdqkriagggssdlqssmsqpktmtNKLAIRRrevgegstsdnssytssssgesmSSKMRLANKIinsssvstgthdESVKVAEKLLHehdnievhyfttnssnsnntvricsdcnttttplwrsgprgpkslcnacGIRQRKARKAMQAAAEsgtttakdnssfSKIKlqnnmekkprtshvaqykkvqcntpdpdpphheyrsqrkLCFKDFAlslssnsalkqvfpRDVEEAAILLMELSCGFSHT
mapmylnpaqdhsDPFRLAEAQKRDHQRLQVLhssshnrpaasvsrPIFLNSSTQDQGMIKLEGSQQHDQKRIAGGgssdlqssmsqpkTMTNKLAIrrrevgegstsdnssytssssgesmSSKMRLANKIInsssvstgthDESVKVAEKLLHEHDNIEVHyfttnssnsnntVRICSDCNTtttplwrsgprgpkslCNACGIRQRKARKAMQAAAesgtttakdnssfskiklqnnmekkprtshVAQYKKVQCNTPDPDPPHHEYRSQRKLCFKDFALSLSSNSALKQVFPRDVEEAAILLMELSCGFSHT
MAPMYLNPAQDHSDPFRLAEAQKRDHQRLQVLHSSSHNRPAASVSRPIFLNSSTQDQGMIKLEGSQQHDQKRIAGGGSSDLQSSMSQPKTMTNKLAIRRREVGEGstsdnssytssssgesmssKMRLANKIINSSSVSTGTHDESVKVAEKLLHEHDNIEVHYFttnssnsnntVRICSDCNTTTTPLWRSGPRGPKSLCNACGIrqrkarkamqaaaESGTTTAKDNSSFSKIKLQNNMEKKPRTSHVAQYKKVQCNTPDPDPPHHEYRSQRKLCFKDFALSLSSNSALKQVFPRDVEEAAILLMELSCGFSHT
******************************************************************************************************************************************************EKLLHEHDNIEVHYFTTNSSNSNNTVRICSDCNTTTTPLWR********LCNACG*********************************************************************KLCFKDFALSLSSNSALKQVFPRDVEEAAILLMELSCG****
************************************************************************************************************************************************************************************DCNTTTTPLWRSGPRGPKSLCNACGIRQ*********************************************************************************************AAILLMELSCGFSHT
********AQDHSDPFRLAEAQKRDHQRLQVL**********SVSRPIFLNSSTQDQGMIKLEG***************************TNKLAIRRR*************************MRLANKIINSS*********SVKVAEKLLHEHDNIEVHYFTTNSSNSNNTVRICSDCNTTTTPLWRSGPRGPKSLCNACGI************************SFSKIKLQNN************YKKVQCNTPDPDPPHHEYRSQRKLCFKDFALSLSSNSALKQVFPRDVEEAAILLMELSCGFSHT
******NP*QDHSDPFRLAEAQKRDHQRLQVLHSS*******SVSRPI*L*SS************************************************************************************************************************NTVRICSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARKAMQAAAESG**************************************************QRKLCFKDFALSLSSNSALKQVFPRDVEEAAILLMELSCGFSHT
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAPMYLNPAQDHSDPFRLAEAQKRDHQRLQVLHSSSHNRPAASVSRPIFLNSSTQDQGMIKLEGSQQHDQKRIAGGGSSDLQSSMSQPKTMTNKLAIRRREVGEGSTSDNSSYTSSSSGESMSSKMRLANKIINSSSVSTGTHDESVKVAEKLLHEHDNIEVHYFTTNSSNSNNTVRICSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARKAMQAAAESGTTTAKDNSSFSKIKLQNNMEKKPRTSHVAQYKKVQCNTPDPDPPHHEYRSQRKLCFKDFALSLSSNSALKQVFPRDVEEAAILLMELSCGFSHT
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query316 2.2.26 [Sep-21-2011]
Q9SZI6352 Putative GATA transcripti yes no 0.784 0.704 0.353 4e-22
Q5HZ36398 GATA transcription factor no no 0.433 0.344 0.390 2e-17
Q8LC59120 GATA transcription factor no no 0.161 0.425 0.568 7e-13
Q9FJ10139 GATA transcription factor no no 0.148 0.338 0.595 3e-12
Q8LG10149 GATA transcription factor no no 0.183 0.389 0.517 1e-11
Q6QPM2211 GATA transcription factor no no 0.170 0.255 0.551 1e-11
Q8VZP4308 GATA transcription factor no no 0.148 0.152 0.595 4e-11
Q8LC79295 GATA transcription factor no no 0.136 0.145 0.651 5e-11
Q6DBP8303 GATA transcription factor no no 0.148 0.155 0.595 5e-11
Q9LIB5190 GATA transcription factor no no 0.129 0.215 0.682 7e-11
>sp|Q9SZI6|GAT22_ARATH Putative GATA transcription factor 22 OS=Arabidopsis thaliana GN=GATA22 PE=2 SV=1 Back     alignment and function desciption
 Score =  105 bits (262), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 109/308 (35%), Positives = 145/308 (47%), Gaps = 60/308 (19%)

Query: 49  FLNSSTQDQGMIKLEGSQQHD-----------QKRIAGGGSSDLQSSMSQPKTMTNKLAI 97
           FL +S QDQ  +    +  HD            K     G S     M   K    KL I
Sbjct: 63  FLINSRQDQVYVGYNNNTFHDVLDTHISQPLETKNFVSDGGSSSSDQMVPKKETRLKLTI 122

Query: 98  RRREVGEGST----SDNSSYTSSSSGESMSSKMRLANK---IINSSSVST--GTHDESVK 148
           ++++  +  T    S     T ++S + +SSK+RL  K   II +S  S     +D+S  
Sbjct: 123 KKKDNHQDQTDLPQSPIKDMTGTNSLKWISSKVRLMKKKKAIITTSDSSKQHTNNDQSSN 182

Query: 149 VAEKLLHEHDNIEVHYFTTNSSNSNNTVRICSDCNTTTTPLWRSGPRGPKSLCNACGIRQ 208
           ++        N E      N  N++  +RICSDCNTT TPLWRSGPRGPKSLCNACGIRQ
Sbjct: 183 LS--------NSERQ----NGYNNDCVIRICSDCNTTKTPLWRSGPRGPKSLCNACGIRQ 230

Query: 209 RKARKAMQAAAE----SGTTTAKDNSS--------------FSKIKLQNNMEKKPRT--- 247
           RKAR+A  A A     SG +                      S + L+ N  K+  T   
Sbjct: 231 RKARRAAMATATATAVSGVSPPVMKKKMQNKNKISNGVYKILSPLPLKVNTCKRMITLEE 290

Query: 248 SHVAQYKKVQCNTPDPDPPHHEYRSQRKLCFKDFALSLSSNSALKQVFPRDVEEAAILLM 307
           + +A+  + Q N+           S   + F D AL LS +SA +QVFP+D +EAAILLM
Sbjct: 291 TALAEDLETQSNST-------MLSSSDNIYFDDLALLLSKSSAYQQVFPQDEKEAAILLM 343

Query: 308 ELSCGFSH 315
            LS G  H
Sbjct: 344 ALSHGMVH 351




Transcriptional regulator that specifically binds 5'-GATA-3' or 5'-GAT-3' motifs within gene promoters.
Arabidopsis thaliana (taxid: 3702)
>sp|Q5HZ36|GAT21_ARATH GATA transcription factor 21 OS=Arabidopsis thaliana GN=GATA21 PE=1 SV=2 Back     alignment and function description
>sp|Q8LC59|GAT23_ARATH GATA transcription factor 23 OS=Arabidopsis thaliana GN=GATA23 PE=2 SV=2 Back     alignment and function description
>sp|Q9FJ10|GAT16_ARATH GATA transcription factor 16 OS=Arabidopsis thaliana GN=GATA16 PE=2 SV=1 Back     alignment and function description
>sp|Q8LG10|GAT15_ARATH GATA transcription factor 15 OS=Arabidopsis thaliana GN=GATA15 PE=2 SV=2 Back     alignment and function description
>sp|Q6QPM2|GAT19_ARATH GATA transcription factor 19 OS=Arabidopsis thaliana GN=GATA19 PE=2 SV=2 Back     alignment and function description
>sp|Q8VZP4|GAT10_ARATH GATA transcription factor 10 OS=Arabidopsis thaliana GN=GATA10 PE=2 SV=1 Back     alignment and function description
>sp|Q8LC79|GAT18_ARATH GATA transcription factor 18 OS=Arabidopsis thaliana GN=GATA18 PE=2 SV=2 Back     alignment and function description
>sp|Q6DBP8|GAT11_ARATH GATA transcription factor 11 OS=Arabidopsis thaliana GN=GATA11 PE=2 SV=1 Back     alignment and function description
>sp|Q9LIB5|GAT17_ARATH GATA transcription factor 17 OS=Arabidopsis thaliana GN=GATA17 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query316
118487597303 unknown [Populus trichocarpa] 0.930 0.970 0.449 1e-47
225429550306 PREDICTED: putative GATA transcription f 0.920 0.950 0.410 2e-42
224088836234 predicted protein [Populus trichocarpa] 0.724 0.978 0.475 6e-41
118489347303 unknown [Populus trichocarpa x Populus d 0.933 0.973 0.447 9e-41
118488977306 unknown [Populus trichocarpa x Populus d 0.939 0.970 0.443 1e-39
255550794186 conserved hypothetical protein [Ricinus 0.582 0.989 0.538 1e-35
356554076306 PREDICTED: putative GATA transcription f 0.664 0.686 0.449 2e-34
255546095312 hypothetical protein RCOM_1046780 [Ricin 0.816 0.826 0.388 3e-32
147805325211 hypothetical protein VITISV_032017 [Viti 0.632 0.947 0.432 2e-29
356556282315 PREDICTED: putative GATA transcription f 0.607 0.609 0.409 1e-28
>gi|118487597|gb|ABK95624.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 146/325 (44%), Positives = 183/325 (56%), Gaps = 31/325 (9%)

Query: 1   MAPMYLNPAQDHSDPFRLAEAQKRDHQRLQVLHSSSHNRPAASVSRPI-FLNSSTQD-QG 58
           M P YLNPA   S PF       R+ Q LQ+  S   ++ A S+S P  F N+S  D Q 
Sbjct: 1   MTPAYLNPASS-SFPF----VDLREEQNLQLFLSP--HQAATSLSGPTNFFNTSAHDHQR 53

Query: 59  MIKLEGSQQHDQKR-----IAGGGSSDLQSSMSQPKTMTNKLAIRRREVGEGSTSDNSSY 113
             K   S+QHD +      I+ GGSS    S  QP+   +        +      D +  
Sbjct: 54  ETKPGESRQHDNQEVDMYNISHGGSS----SSFQPEVNDHNYNSNFHNLSSSKMEDGAEE 109

Query: 114 TSSSSGESMSSKMRLANKIINSSSVSTGTHDESVKVAEKLLHEHDNIEVHYFTTNSSNSN 173
           +  SS + M SKMRL  K+ NS+   T            +L  H+    +    +SSNSN
Sbjct: 110 SGESSVKWMPSKMRLMQKMTNSNCSETDHMPMKF-----MLKFHNQQYQNNEINSSSNSN 164

Query: 174 NTVRICSDCNTTTTPLWRSGPRGPKSLCNACGIRQRK--ARKAMQAAAESGTTTAKDNSS 231
           + +R+CSDCNTT+TPLWRSGPRGPKSLCNACGIRQRK     A  AAA +GT  A + SS
Sbjct: 165 SNIRVCSDCNTTSTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAANGTVIAIEASS 224

Query: 232 FSKIKLQNNMEKKPRTSHVAQYKKVQCNTPDPDPPHHEYRSQRKLCFKDFALSLSSNSAL 291
            ++    NN  KK RT+HV+Q KK+        PP    +SQ+KLCFK+ ALSLS N AL
Sbjct: 225 STRSTKVNNKVKKSRTNHVSQNKKLS------KPPESSLQSQKKLCFKNLALSLSKNPAL 278

Query: 292 KQVFPRDVEEAAILLMELSCGFSHT 316
           +QV P DVEEAAILLMELSCGF H+
Sbjct: 279 QQVLPHDVEEAAILLMELSCGFIHS 303




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225429550|ref|XP_002279283.1| PREDICTED: putative GATA transcription factor 22 [Vitis vinifera] gi|296081660|emb|CBI20665.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224088836|ref|XP_002308561.1| predicted protein [Populus trichocarpa] gi|222854537|gb|EEE92084.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|118489347|gb|ABK96478.1| unknown [Populus trichocarpa x Populus deltoides] Back     alignment and taxonomy information
>gi|118488977|gb|ABK96296.1| unknown [Populus trichocarpa x Populus deltoides] Back     alignment and taxonomy information
>gi|255550794|ref|XP_002516445.1| conserved hypothetical protein [Ricinus communis] gi|223544265|gb|EEF45786.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356554076|ref|XP_003545375.1| PREDICTED: putative GATA transcription factor 22-like [Glycine max] Back     alignment and taxonomy information
>gi|255546095|ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus communis] gi|223546563|gb|EEF48061.1| hypothetical protein RCOM_1046780 [Ricinus communis] Back     alignment and taxonomy information
>gi|147805325|emb|CAN63090.1| hypothetical protein VITISV_032017 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356556282|ref|XP_003546455.1| PREDICTED: putative GATA transcription factor 22-like [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query316
TAIR|locus:2120845352 CGA1 "cytokinin-responsive gat 0.443 0.397 0.415 7.1e-25
TAIR|locus:2170277398 GNC "GATA, nitrate-inducible, 0.098 0.077 0.903 2.1e-22
TAIR|locus:2148558120 GATA23 "GATA transcription fac 0.098 0.258 0.741 9.5e-14
TAIR|locus:2155919139 GATA16 "GATA transcription fac 0.094 0.215 0.733 1.2e-11
TAIR|locus:2083388149 GATA15 "GATA transcription fac 0.177 0.375 0.482 1.8e-11
TAIR|locus:2093678190 GATA17 "GATA transcription fac 0.417 0.694 0.326 8.9e-11
TAIR|locus:504955441197 AT4G16141 [Arabidopsis thalian 0.094 0.152 0.733 1.8e-10
TAIR|locus:2115195211 GATA19 "GATA transcription fac 0.094 0.142 0.8 2.3e-09
TAIR|locus:2077932295 MNP "MONOPOLE" [Arabidopsis th 0.294 0.315 0.356 2.6e-09
TAIR|locus:2062095208 GATA20 "GATA transcription fac 0.094 0.144 0.766 8.1e-09
TAIR|locus:2120845 CGA1 "cytokinin-responsive gata factor 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 243 (90.6 bits), Expect = 7.1e-25, Sum P(2) = 7.1e-25
 Identities = 64/154 (41%), Positives = 78/154 (50%)

Query:   176 VRICSDCNTTTTPLWRSGPRGPKSLCNACGI----XXXXXXXXXXXXXESGTTTAKDNSS 231
             +RICSDCNTT TPLWRSGPRGPKSLCNACGI                  SG +       
Sbjct:   198 IRICSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAMATATATAVSGVSPPVMKKK 257

Query:   232 F-SKIKLQNNMEK--KPRTSHVAQYKKV-----QCNTPDPDPPHHE--YRSQRKLCFKDF 281
               +K K+ N + K   P    V   K++          D +   +     S   + F D 
Sbjct:   258 MQNKNKISNGVYKILSPLPLKVNTCKRMITLEETALAEDLETQSNSTMLSSSDNIYFDDL 317

Query:   282 ALSLSSNSALKQVFPRDVEEAAILLMELSCGFSH 315
             AL LS +SA +QVFP+D +EAAILLM LS G  H
Sbjct:   318 ALLLSKSSAYQQVFPQDEKEAAILLMALSHGMVH 351


GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA
GO:0008270 "zinc ion binding" evidence=IEA
GO:0043565 "sequence-specific DNA binding" evidence=IEA
GO:0009735 "response to cytokinin stimulus" evidence=IEP
GO:0007623 "circadian rhythm" evidence=IEP
GO:0009416 "response to light stimulus" evidence=IEP
GO:0009740 "gibberellic acid mediated signaling pathway" evidence=IEP
GO:0009910 "negative regulation of flower development" evidence=IMP
GO:0010187 "negative regulation of seed germination" evidence=IEP
GO:0010380 "regulation of chlorophyll biosynthetic process" evidence=IMP
GO:0010468 "regulation of gene expression" evidence=IDA
GO:0044212 "transcription regulatory region DNA binding" evidence=IDA
TAIR|locus:2170277 GNC "GATA, nitrate-inducible, carbon metabolism-involved" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2148558 GATA23 "GATA transcription factor 23" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2155919 GATA16 "GATA transcription factor 16" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2083388 GATA15 "GATA transcription factor 15" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2093678 GATA17 "GATA transcription factor 17" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504955441 AT4G16141 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2115195 GATA19 "GATA transcription factor 19" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2077932 MNP "MONOPOLE" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2062095 GATA20 "GATA transcription factor 20" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
grail3.0013019801
SubName- Full=Putative uncharacterized protein; (234 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query316
smart0040152 smart00401, ZnF_GATA, zinc finger binding to DNA c 8e-17
pfam0032036 pfam00320, GATA, GATA zinc finger 5e-16
cd0020254 cd00202, ZnF_GATA, Zinc finger DNA binding domain; 5e-13
>gnl|CDD|214648 smart00401, ZnF_GATA, zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
 Score = 72.8 bits (179), Expect = 8e-17
 Identities = 22/45 (48%), Positives = 28/45 (62%)

Query: 174 NTVRICSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARKAMQAA 218
            + R CS+C TT TPLWR GP G K+LCNACG+  +K     +  
Sbjct: 1   GSGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLKRPL 45


Length = 52

>gnl|CDD|109380 pfam00320, GATA, GATA zinc finger Back     alignment and domain information
>gnl|CDD|238123 cd00202, ZnF_GATA, Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 316
cd0020254 ZnF_GATA Zinc finger DNA binding domain; binds spe 99.41
smart0040152 ZnF_GATA zinc finger binding to DNA consensus sequ 99.35
PF0032036 GATA: GATA zinc finger; InterPro: IPR000679 Zinc f 99.35
KOG1601340 consensus GATA-4/5/6 transcription factors [Transc 98.54
COG5641 498 GAT1 GATA Zn-finger-containing transcription facto 97.85
KOG3554 693 consensus Histone deacetylase complex, MTA1 compon 83.05
>cd00202 ZnF_GATA Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C Back     alignment and domain information
Probab=99.41  E-value=1.4e-13  Score=101.82  Aligned_cols=35  Identities=60%  Similarity=1.237  Sum_probs=32.8

Q ss_pred             ccccCCCCCCCccccCCCCCCccchhHhHHHHhhh
Q 021178          178 ICSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKAR  212 (316)
Q Consensus       178 ~CsnC~Tt~TPlWRrGP~G~~~LCNACGL~~rk~~  212 (316)
                      .|++|++++||+||+||.|...|||||||||++..
T Consensus         1 ~C~~C~~~~Tp~WR~g~~~~~~LCNaCgl~~~k~~   35 (54)
T cd00202           1 ACSNCGTTTTPLWRRGPSGGSTLCNACGLYWKKHG   35 (54)
T ss_pred             CCCCCCCCCCcccccCCCCcchHHHHHHHHHHhcC
Confidence            59999999999999999888999999999998875



>smart00401 ZnF_GATA zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
>PF00320 GATA: GATA zinc finger; InterPro: IPR000679 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>KOG1601 consensus GATA-4/5/6 transcription factors [Transcription] Back     alignment and domain information
>COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription] Back     alignment and domain information
>KOG3554 consensus Histone deacetylase complex, MTA1 component [Chromatin structure and dynamics] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query316
2vus_I43 Crystal Structure Of Unliganded Nmra-Area Zinc Fing 3e-04
>pdb|2VUS|I Chain I, Crystal Structure Of Unliganded Nmra-Area Zinc Finger Complex Length = 43 Back     alignment and structure

Iteration: 1

Score = 42.7 bits (99), Expect = 3e-04, Method: Composition-based stats. Identities = 17/28 (60%), Positives = 21/28 (75%), Gaps = 1/28 (3%) Query: 179 CSDCNTTTTPLWRSGPRGPKSLCNACGI 206 C++C T TTPLWR P G + LCNACG+ Sbjct: 4 CTNCFTQTTPLWRRNPEG-QPLCNACGL 30

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query316
2kae_A71 GATA-type transcription factor; zinc finger, GATA- 2e-13
1gnf_A46 Transcription factor GATA-1; zinc finger, transcri 1e-12
2vut_I43 AREA, nitrogen regulatory protein AREA; transcript 5e-11
3dfx_A63 Trans-acting T-cell-specific transcription factor 4e-09
4gat_A66 Nitrogen regulatory protein AREA; DNA binding prot 7e-09
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 6e-04
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 8e-04
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Length = 71 Back     alignment and structure
 Score = 63.7 bits (155), Expect = 2e-13
 Identities = 17/41 (41%), Positives = 21/41 (51%)

Query: 170 SNSNNTVRICSDCNTTTTPLWRSGPRGPKSLCNACGIRQRK 210
           S+ N     CS+C+ T T  WR+        CNAC I QRK
Sbjct: 2   SHMNKKSFQCSNCSVTETIRWRNIRSKEGIQCNACFIYQRK 42


>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Length = 46 Back     alignment and structure
>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Length = 43 Back     alignment and structure
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Length = 63 Back     alignment and structure
>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Length = 66 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query316
3dfx_A63 Trans-acting T-cell-specific transcription factor 99.7
4gat_A66 Nitrogen regulatory protein AREA; DNA binding prot 99.62
2kae_A71 GATA-type transcription factor; zinc finger, GATA- 99.52
2vut_I43 AREA, nitrogen regulatory protein AREA; transcript 99.51
1gnf_A46 Transcription factor GATA-1; zinc finger, transcri 99.49
4hc9_A115 Trans-acting T-cell-specific transcription factor; 99.47
4hc9_A115 Trans-acting T-cell-specific transcription factor; 99.15
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Back     alignment and structure
Probab=99.70  E-value=9.8e-18  Score=126.93  Aligned_cols=60  Identities=30%  Similarity=0.582  Sum_probs=48.9

Q ss_pred             CCCCCCcccccCCCCCCCccccCCCCCCccchhHhHHHHhhhhhhHHHhhcCcccccCCCccccccc
Q 021178          171 NSNNTVRICSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARKAMQAAAESGTTTAKDNSSFSKIKL  237 (316)
Q Consensus       171 ~~~~~~~~CsnC~Tt~TPlWRrGP~G~~~LCNACGL~~rk~~k~~~aa~~ng~~~kke~~qstR~K~  237 (316)
                      ++......|++|++++||+||+||+|+ +|||||||||++++.      .++..+.+++++++++|+
T Consensus         2 ~~~~~~~~C~~C~tt~Tp~WR~gp~G~-~LCNACGl~~~~~~~------~RP~~~~~~~i~~R~Rk~   61 (63)
T 3dfx_A            2 AARRAGTSCANCQTTTTTLWRRNANGD-PVCNACGLYYKLHNI------NRPLTMKKEGIQTRNRKM   61 (63)
T ss_dssp             CCCCTTCCCTTTCCSCCSSCCCCTTSC-CCCHHHHHHHHHHSS------CCCGGGCCSSCCCCC---
T ss_pred             CCCCCCCcCCCcCCCCCCccCCCCCCC-chhhHHHHHHHHcCC------CCCcCcCCCccccccCCC
Confidence            346678899999999999999999996 999999999988754      367788888887766554



>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Back     alignment and structure
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Back     alignment and structure
>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Back     alignment and structure
>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 316
d1y0ja139 g.39.1.1 (A:200-238) Erythroid transcription facto 1e-12
d2vuti142 g.39.1.1 (I:671-712) Erythroid transcription facto 4e-10
d3gata_66 g.39.1.1 (A:) Erythroid transcription factor GATA- 5e-08
>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Length = 39 Back     information, alignment and structure

class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Mouse (Mus musculus) [TaxId: 10090]
 Score = 59.3 bits (144), Expect = 1e-12
 Identities = 16/34 (47%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 177 RICSDCNTTTTPLWRSGPRGPKSLCNACGIRQRK 210
           R C +C  T TPLWR    G   LCNACG+  + 
Sbjct: 3   RECVNCGATATPLWRRDRTG-HYLCNACGLYHKM 35


>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Length = 42 Back     information, alignment and structure
>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Length = 66 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query316
d3gata_66 Erythroid transcription factor GATA-1 {Chicken (Ga 99.69
d1y0ja139 Erythroid transcription factor GATA-1 {Mouse (Mus 99.58
d2vuti142 Erythroid transcription factor GATA-1 {Emericella 99.58
>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Back     information, alignment and structure
class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Chicken (Gallus gallus) [TaxId: 9031]
Probab=99.69  E-value=5.5e-18  Score=127.62  Aligned_cols=60  Identities=33%  Similarity=0.664  Sum_probs=48.5

Q ss_pred             CCCcccccCCCCCCCccccCCCCCCccchhHhHHHHhhhhhhHHHhhcCcccccCCCccccccccCC
Q 021178          174 NTVRICSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARKAMQAAAESGTTTAKDNSSFSKIKLQNN  240 (316)
Q Consensus       174 ~~~~~CsnC~Tt~TPlWRrGP~G~~~LCNACGL~~rk~~k~~~aa~~ng~~~kke~~qstR~K~~~~  240 (316)
                      ..+.+|+||+|++||+||+||.| .+|||||||||+++..      .++..+.++.++.+++++...
T Consensus         2 r~g~~C~nCgt~~Tp~WRr~~~G-~~lCNACGl~~~~~~~------~RP~~~~~~~i~~r~r~~~~~   61 (66)
T d3gata_           2 RAGTVCSNCQTSTTTLWRRSPMG-DPVCNACGLYYKLHQV------NRPLTMRKDGIQTRNRKVSSK   61 (66)
T ss_dssp             CTTCCCTTTCCCCCSSEEECTTS-CEEEHHHHHHHHHHCS------CCCGGGCCSSCCCCSCCCCCC
T ss_pred             CCCCCCCCCCCCCCcccccCCCC-CcccchhHHHHHHhCC------cCCccccccccccccCCCCCc
Confidence            35678999999999999999999 7999999999988643      366777788887766555433



>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Back     information, alignment and structure