Citrus Sinensis ID: 027756
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 219 | ||||||
| 356526306 | 300 | PREDICTED: GATA transcription factor 28- | 0.885 | 0.646 | 0.716 | 4e-84 | |
| 225443335 | 299 | PREDICTED: GATA transcription factor 28- | 0.881 | 0.645 | 0.742 | 2e-82 | |
| 297735780 | 244 | unnamed protein product [Vitis vinifera] | 0.881 | 0.790 | 0.742 | 1e-81 | |
| 449491798 | 303 | PREDICTED: GATA transcription factor 24- | 0.881 | 0.636 | 0.692 | 3e-81 | |
| 224113267 | 294 | predicted protein [Populus trichocarpa] | 0.803 | 0.598 | 0.776 | 3e-81 | |
| 449447986 | 299 | PREDICTED: GATA transcription factor 24- | 0.881 | 0.645 | 0.692 | 3e-81 | |
| 18403600 | 302 | GATA transcription factor 28 [Arabidopsi | 0.917 | 0.665 | 0.689 | 2e-79 | |
| 15028099 | 302 | putative flowering protein CONSTANS [Ara | 0.917 | 0.665 | 0.689 | 6e-79 | |
| 30686115 | 295 | GATA transcription factor 24 [Arabidopsi | 0.917 | 0.681 | 0.694 | 1e-78 | |
| 18402914 | 297 | GATA transcription factor 24 [Arabidopsi | 0.917 | 0.676 | 0.688 | 2e-78 |
| >gi|356526306|ref|XP_003531759.1| PREDICTED: GATA transcription factor 28-like [Glycine max] | Back alignment and taxonomy information |
|---|
Score = 316 bits (809), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 152/212 (71%), Positives = 174/212 (82%), Gaps = 18/212 (8%)
Query: 14 GDQLTLSFQGQVYVFDSVSPEKVQAVLLLLGGREVPSTTPAIPIANNQNNR--------- 64
GDQLTLSFQGQVYVFDSVSPEKVQAVLLLLGGRE+P T PA+P++ N NNR
Sbjct: 86 GDQLTLSFQGQVYVFDSVSPEKVQAVLLLLGGREIPPTMPAMPVSPNHNNRGYTGTPQKF 145
Query: 65 ----RLASLIRFREKRKERNFEKKIRYTVRKEVALRMQRNKGQFTSAKSNNEDSASAISS 120
RLASLIRFREKRKERN++KKIRYTVRKEVALRMQRNKGQFTS+KSNN++SAS ++
Sbjct: 146 SVPQRLASLIRFREKRKERNYDKKIRYTVRKEVALRMQRNKGQFTSSKSNNDESASNATN 205
Query: 121 WGSNQSWAGDVNGSQNQDIVCRHCGISEKSTPMMRRGPEGPRTLCNACGLMWANKGTLRD 180
WG +++W D +GSQ QDIVCRHCGISEKSTPMMRRGPEGPRTLCNACGLMWANKG LRD
Sbjct: 206 WGMDENWTADNSGSQQQDIVCRHCGISEKSTPMMRRGPEGPRTLCNACGLMWANKGILRD 265
Query: 181 LSKAAPQAGQTSSLNKNEENGTLKAEQVIRAV 212
LS+AAP +G + EN +L+A Q++ V
Sbjct: 266 LSRAAPLSGTIKN-----ENKSLEANQIVHRV 292
|
Source: Glycine max Species: Glycine max Genus: Glycine Family: Fabaceae Order: Fabales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|225443335|ref|XP_002263707.1| PREDICTED: GATA transcription factor 28-like [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|297735780|emb|CBI18467.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|449491798|ref|XP_004159006.1| PREDICTED: GATA transcription factor 24-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|224113267|ref|XP_002316441.1| predicted protein [Populus trichocarpa] gi|222865481|gb|EEF02612.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|449447986|ref|XP_004141747.1| PREDICTED: GATA transcription factor 24-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|18403600|ref|NP_564593.1| GATA transcription factor 28 [Arabidopsis thaliana] gi|42571823|ref|NP_974002.1| GATA transcription factor 28 [Arabidopsis thaliana] gi|71660840|sp|Q8H1G0.1|GAT28_ARATH RecName: Full=GATA transcription factor 28; AltName: Full=Protein TIFY 2A; AltName: Full=ZIM-like 2 protein gi|23297318|gb|AAN12940.1| putative flowering protein CONSTANS [Arabidopsis thaliana] gi|38603660|dbj|BAD02931.1| GATA-type zinc finger protein [Arabidopsis thaliana] gi|332194567|gb|AEE32688.1| GATA transcription factor 28 [Arabidopsis thaliana] gi|332194568|gb|AEE32689.1| GATA transcription factor 28 [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|15028099|gb|AAK76580.1| putative flowering protein CONSTANS [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|30686115|ref|NP_850618.1| GATA transcription factor 24 [Arabidopsis thaliana] gi|14596059|gb|AAK68757.1| Unknown protein [Arabidopsis thaliana] gi|17978695|gb|AAL47341.1| unknown protein [Arabidopsis thaliana] gi|332642950|gb|AEE76471.1| GATA transcription factor 24 [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|18402914|ref|NP_566676.1| GATA transcription factor 24 [Arabidopsis thaliana] gi|71660846|sp|Q8GXL7.2|GAT24_ARATH RecName: Full=GATA transcription factor 24; AltName: Full=Protein TIFY 2B; AltName: Full=ZIM-like 1 protein gi|9280218|dbj|BAB01708.1| unnamed protein product [Arabidopsis thaliana] gi|38603658|dbj|BAD02930.1| GATA-type zinc finger protein [Arabidopsis thaliana] gi|332642949|gb|AEE76470.1| GATA transcription factor 24 [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 219 | ||||||
| TAIR|locus:505006360 | 297 | ZML1 "ZIM-like 1" [Arabidopsis | 0.931 | 0.686 | 0.619 | 6.9e-63 | |
| TAIR|locus:2017582 | 302 | ZML2 "ZIM-LIKE 2" [Arabidopsis | 0.917 | 0.665 | 0.625 | 2.3e-62 | |
| CGD|CAL0005605 | 442 | orf19.1577 [Candida albicans ( | 0.356 | 0.176 | 0.418 | 1.4e-07 | |
| UNIPROTKB|Q5ALK1 | 442 | CaO19.1577 "Putative uncharact | 0.356 | 0.176 | 0.418 | 1.4e-07 | |
| CGD|CAL0004848 | 446 | BRG1 [Candida albicans (taxid: | 0.374 | 0.183 | 0.408 | 7.4e-07 | |
| UNIPROTKB|Q59LY1 | 446 | GAT2 "Putative uncharacterized | 0.374 | 0.183 | 0.408 | 7.4e-07 | |
| DICTYBASE|DDB_G0267640 | 952 | gtaE "GATA zinc finger domain- | 0.337 | 0.077 | 0.395 | 1.6e-06 | |
| TAIR|locus:2083388 | 149 | GATA15 "GATA transcription fac | 0.296 | 0.436 | 0.385 | 3.4e-06 | |
| DICTYBASE|DDB_G0295707 | 695 | gtaP "GATA zinc finger domain- | 0.301 | 0.094 | 0.383 | 1.9e-05 | |
| DICTYBASE|DDB_G0270756 | 1006 | gtaG "GATA zinc finger domain- | 0.159 | 0.034 | 0.594 | 1.9e-05 |
| TAIR|locus:505006360 ZML1 "ZIM-like 1" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 642 (231.1 bits), Expect = 6.9e-63, P = 6.9e-63
Identities = 137/221 (61%), Positives = 155/221 (70%)
Query: 14 GDQLTLSFQGQVYVFDSVSPEKVQAVLLLLGGREVPSTTPAIPIANNQNNR--------- 64
GDQLTLSFQGQVYVFD VSPEKVQAVLLLLGGREVP T P + +QNNR
Sbjct: 77 GDQLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNRVLGLSGTPQ 136
Query: 65 ------RLASLXXXXXXXXXXXXXXXXXYTVRKEVALRMQRNKGQFTSAKSNNEDSASAI 118
RLASL YTVRKEVALRMQR KGQFTSAKS+N+DS S
Sbjct: 137 RLSVPQRLASLLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKSSNDDSGSTG 196
Query: 119 SSWGSNQSWAGDVNGSQNQDIVCRHCGISEKSTPMMRRGPEGPRTLCNACGLMWANKGTL 178
S WGSNQSWA + +Q +++CRHCG SEKSTPMMRRGP+GPRTLCNACGLMWANKGTL
Sbjct: 197 SDWGSNQSWAVEGTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTL 256
Query: 179 RDLSKAAP-QAGQTSSLNKNEENGTLKAEQVIRAVGNINGS 218
RDLSK P Q Q SLNKNE+ L+A+Q++ G+I+ +
Sbjct: 257 RDLSKVPPPQTPQHLSLNKNED-ANLEADQMMEVTGDISNT 296
|
|
| TAIR|locus:2017582 ZML2 "ZIM-LIKE 2" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| CGD|CAL0005605 orf19.1577 [Candida albicans (taxid:5476)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q5ALK1 CaO19.1577 "Putative uncharacterized protein" [Candida albicans SC5314 (taxid:237561)] | Back alignment and assigned GO terms |
|---|
| CGD|CAL0004848 BRG1 [Candida albicans (taxid:5476)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q59LY1 GAT2 "Putative uncharacterized protein" [Candida albicans SC5314 (taxid:237561)] | Back alignment and assigned GO terms |
|---|
| DICTYBASE|DDB_G0267640 gtaE "GATA zinc finger domain-containing protein 5" [Dictyostelium discoideum (taxid:44689)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2083388 GATA15 "GATA transcription factor 15" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| DICTYBASE|DDB_G0295707 gtaP "GATA zinc finger domain-containing protein 16" [Dictyostelium discoideum (taxid:44689)] | Back alignment and assigned GO terms |
|---|
| DICTYBASE|DDB_G0270756 gtaG "GATA zinc finger domain-containing protein 7" [Dictyostelium discoideum (taxid:44689)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| estExt_fgenesh4_pm.C_LG_X0958 | hypothetical protein (294 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 219 | |||
| pfam06203 | 45 | pfam06203, CCT, CCT motif | 1e-16 | |
| cd00202 | 54 | cd00202, ZnF_GATA, Zinc finger DNA binding domain; | 7e-13 | |
| smart00401 | 52 | smart00401, ZnF_GATA, zinc finger binding to DNA c | 8e-13 | |
| pfam00320 | 36 | pfam00320, GATA, GATA zinc finger | 1e-12 | |
| pfam06200 | 36 | pfam06200, tify, tify domain | 1e-10 | |
| smart00979 | 36 | smart00979, TIFY, This short possible domain is fo | 5e-10 | |
| pfam09425 | 27 | pfam09425, CCT_2, Divergent CCT motif | 3e-04 |
| >gnl|CDD|203407 pfam06203, CCT, CCT motif | Back alignment and domain information |
|---|
Score = 70.7 bits (174), Expect = 1e-16
Identities = 23/45 (51%), Positives = 29/45 (64%)
Query: 65 RLASLIRFREKRKERNFEKKIRYTVRKEVALRMQRNKGQFTSAKS 109
R A+L+R++EKRK R F+KKIRY RK VA R KG+F
Sbjct: 1 REAALLRYKEKRKTRKFDKKIRYASRKAVAESRPRVKGRFVKQSE 45
|
This short motif is found in a number of plant proteins. It is rich in basic amino acids and has been called a CCT motif after Co, Col and Toc1. The CCT motif is about 45 amino acids long and contains a putative nuclear localisation signal within the second half of the CCT motif. Toc1 mutants have been identified in this region. Length = 45 |
| >gnl|CDD|238123 cd00202, ZnF_GATA, Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C | Back alignment and domain information |
|---|
| >gnl|CDD|214648 smart00401, ZnF_GATA, zinc finger binding to DNA consensus sequence [AT]GATA[AG] | Back alignment and domain information |
|---|
| >gnl|CDD|109380 pfam00320, GATA, GATA zinc finger | Back alignment and domain information |
|---|
| >gnl|CDD|203405 pfam06200, tify, tify domain | Back alignment and domain information |
|---|
| >gnl|CDD|198047 smart00979, TIFY, This short possible domain is found in a variety of plant transcription factors that contain GATA domains as well as other motifs | Back alignment and domain information |
|---|
| >gnl|CDD|117965 pfam09425, CCT_2, Divergent CCT motif | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 219 | |||
| PF06200 | 36 | tify: tify domain; InterPro: IPR010399 The tify do | 99.65 | |
| cd00202 | 54 | ZnF_GATA Zinc finger DNA binding domain; binds spe | 99.63 | |
| smart00401 | 52 | ZnF_GATA zinc finger binding to DNA consensus sequ | 99.58 | |
| PF00320 | 36 | GATA: GATA zinc finger; InterPro: IPR000679 Zinc f | 99.54 | |
| PF06203 | 45 | CCT: CCT motif; InterPro: IPR010402 The CCT (CONST | 99.5 | |
| KOG1601 | 340 | consensus GATA-4/5/6 transcription factors [Transc | 98.71 | |
| PF09425 | 27 | CCT_2: Divergent CCT motif; InterPro: IPR018467 Th | 98.58 | |
| COG5641 | 498 | GAT1 GATA Zn-finger-containing transcription facto | 98.4 | |
| KOG3554 | 693 | consensus Histone deacetylase complex, MTA1 compon | 92.62 | |
| COG5641 | 498 | GAT1 GATA Zn-finger-containing transcription facto | 91.61 | |
| PF13619 | 60 | KTSC: KTSC domain | 82.83 | |
| PF14803 | 34 | Nudix_N_2: Nudix N-terminal; PDB: 3CNG_C. | 81.01 |
| >PF06200 tify: tify domain; InterPro: IPR010399 The tify domain is a 36-amino acid domain only found among Embryophyta (land plants) | Back alignment and domain information |
|---|
Probab=99.65 E-value=1.6e-16 Score=102.06 Aligned_cols=35 Identities=43% Similarity=0.653 Sum_probs=32.5
Q ss_pred cCCCCceEEEeccEEEEecCCCHHHHHHHHHHhcC
Q 027756 11 ERIGDQLTLSFQGQVYVFDSVSPEKVQAVLLLLGG 45 (219)
Q Consensus 11 ~~~~~qLTifY~G~v~Vfd~v~~~ka~~im~la~~ 45 (219)
.++++||||||+|+|+|||+||+|||++||++|+.
T Consensus 2 ~~~~~qLTIfY~G~V~Vfd~v~~~Ka~~im~lA~r 36 (36)
T PF06200_consen 2 SPETAQLTIFYGGQVCVFDDVPPDKAQEIMLLASR 36 (36)
T ss_pred CCCCCcEEEEECCEEEEeCCCCHHHHHHHHHHhcC
Confidence 46789999999999999999999999999999973
|
It has been named after the most conserved amino acid pattern (TIF[F/Y]XG) it contains, but was previously known as the Zim domain. As the use of uppercase characters (TIFY) might imply that the domain is fully conserved across proteins, a lowercase lettering has been chosen in an attempt to highlight the reality of its natural variability. Based on the domain architecture, tify domain containing proteins can be classified into two groups. Group I is formed by proteins possessing a CCT (CONSTANS, CO-like, and TOC1) domain and a GATA-type zinc finger in addition to the tify domain. Group II contains proteins characterised by the tify domain but lacking a GATA-type zinc finger. Tify domain containing proteins might be involved in developmental processes and some of them have features that are characteristic for transcription factors: a nuclear localisation and the presence of a putative DNA-binding domain []. Some proteins known to contain a tify domain include: Arabidopsis thaliana Zinc-finger protein expressed in Inflorescence Meristem (ZIM), a putative transcription factor involved in inflorescence and flower development [, ]. A. thaliana ZIM-like proteins (ZML) []. A. thaliana PEAPOD1 and PEAPOD2 (PPD1 and PPD2) []. |
| >cd00202 ZnF_GATA Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C | Back alignment and domain information |
|---|
| >smart00401 ZnF_GATA zinc finger binding to DNA consensus sequence [AT]GATA[AG] | Back alignment and domain information |
|---|
| >PF00320 GATA: GATA zinc finger; InterPro: IPR000679 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule | Back alignment and domain information |
|---|
| >PF06203 CCT: CCT motif; InterPro: IPR010402 The CCT (CONSTANS, CO-like, and TOC1) domain is a highly conserved basic module of ~43 amino acids, which is found near the C terminus of plant proteins often involved in light signal transduction | Back alignment and domain information |
|---|
| >KOG1601 consensus GATA-4/5/6 transcription factors [Transcription] | Back alignment and domain information |
|---|
| >PF09425 CCT_2: Divergent CCT motif; InterPro: IPR018467 The short CCT (CO, COL, TOC1) motif is found in a number of plant proteins, including Constans (CO), Constans-like (COL) and TOC1 | Back alignment and domain information |
|---|
| >COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription] | Back alignment and domain information |
|---|
| >KOG3554 consensus Histone deacetylase complex, MTA1 component [Chromatin structure and dynamics] | Back alignment and domain information |
|---|
| >COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription] | Back alignment and domain information |
|---|
| >PF13619 KTSC: KTSC domain | Back alignment and domain information |
|---|
| >PF14803 Nudix_N_2: Nudix N-terminal; PDB: 3CNG_C | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() | |
| Query | 219 | ||||
| 4gat_A | 66 | Solution Nmr Structure Of The Wild Type Dna Binding | 5e-05 | ||
| 7gat_A | 66 | Solution Nmr Structure Of The L22v Mutant Dna Bindi | 8e-05 |
| >pdb|4GAT|A Chain A, Solution Nmr Structure Of The Wild Type Dna Binding Domain Of Area Complexed To A 13bp Dna Containing A Cgata Site, Regularized Mean Structure Length = 66 | Back alignment and structure |
|
| >pdb|7GAT|A Chain A, Solution Nmr Structure Of The L22v Mutant Dna Binding Domain Of Area Complexed To A 13 Bp Dna Containing A Tgata Site, 34 Structures Length = 66 | Back alignment and structure |
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 219 | |||
| 2kae_A | 71 | GATA-type transcription factor; zinc finger, GATA- | 3e-11 | |
| 1gnf_A | 46 | Transcription factor GATA-1; zinc finger, transcri | 3e-09 | |
| 4gat_A | 66 | Nitrogen regulatory protein AREA; DNA binding prot | 7e-09 | |
| 2vut_I | 43 | AREA, nitrogen regulatory protein AREA; transcript | 2e-08 | |
| 3dfx_A | 63 | Trans-acting T-cell-specific transcription factor | 2e-07 |
| >2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Length = 71 | Back alignment and structure |
|---|
Score = 56.4 bits (136), Expect = 3e-11
Identities = 16/66 (24%), Positives = 23/66 (34%), Gaps = 4/66 (6%)
Query: 137 QDIVCRHCGISEKSTPMMRRGPEGPRTLCNACGLMWANKGTLRDLSKAAPQAGQTSSLNK 196
+ C +C ++E T R CNAC + R ++ A Q L
Sbjct: 7 KSFQCSNCSVTE--TIRWRNIRSKEGIQCNACFIYQRKYNKTRPVT--AVNKYQKRKLKV 62
Query: 197 NEENGT 202
E NG
Sbjct: 63 QETNGV 68
|
| >1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Length = 46 | Back alignment and structure |
|---|
| >4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Length = 66 | Back alignment and structure |
|---|
| >2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Length = 43 | Back alignment and structure |
|---|
| >3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Length = 63 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 219 | |||
| 1gnf_A | 46 | Transcription factor GATA-1; zinc finger, transcri | 99.75 | |
| 2vut_I | 43 | AREA, nitrogen regulatory protein AREA; transcript | 99.75 | |
| 3dfx_A | 63 | Trans-acting T-cell-specific transcription factor | 99.71 | |
| 4gat_A | 66 | Nitrogen regulatory protein AREA; DNA binding prot | 99.71 | |
| 2kae_A | 71 | GATA-type transcription factor; zinc finger, GATA- | 99.66 | |
| 4hc9_A | 115 | Trans-acting T-cell-specific transcription factor; | 99.53 | |
| 4hc9_A | 115 | Trans-acting T-cell-specific transcription factor; | 99.39 | |
| 3ogl_Q | 21 | JAZ1 incomplete degron peptide; leucine-rich repea | 98.62 | |
| 3ogk_Q | 22 | JAZ1 incomplete degron peptide; leucine rich repea | 98.1 |
| >1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A | Back alignment and structure |
|---|
Probab=99.75 E-value=6.4e-19 Score=118.01 Aligned_cols=44 Identities=41% Similarity=0.786 Sum_probs=40.5
Q ss_pred ccccccccccccCCCCccccCCCCCcccchhhhHHHHhcCCCCCCcc
Q 027756 137 QDIVCRHCGISEKSTPMMRRGPEGPRTLCNACGLMWANKGTLRDLSK 183 (219)
Q Consensus 137 ~~~~C~~C~~~~~~Tp~wR~Gp~G~~~LCNaCGl~~~~~~~~r~~~~ 183 (219)
....|+||++ ++||+||+||+|+ +|||||||||++++++||+.+
T Consensus 3 ~~~~C~~C~t--t~Tp~WR~gp~G~-~LCNaCGl~~k~~~~~RP~~~ 46 (46)
T 1gnf_A 3 EARECVNCGA--TATPLWRRDRTGH-YLCNACGLYHKMNGQNRPLIR 46 (46)
T ss_dssp CSCCCTTTCC--CCCSSCBCCTTCC-CBCSHHHHHHHHTCSCCCCCC
T ss_pred CCCCCCCcCC--CCCCcCccCCCCC-ccchHHHHHHHHcCCCCCCCC
Confidence 3567999999 7899999999997 999999999999999999874
|
| >2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* | Back alignment and structure |
|---|
| >3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* | Back alignment and structure |
|---|
| >4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* | Back alignment and structure |
|---|
| >2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} | Back alignment and structure |
|---|
| >4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A | Back alignment and structure |
|---|
| >4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 219 | ||||
| d2vuti1 | 42 | g.39.1.1 (I:671-712) Erythroid transcription facto | 3e-11 | |
| d1y0ja1 | 39 | g.39.1.1 (A:200-238) Erythroid transcription facto | 1e-10 | |
| d3gata_ | 66 | g.39.1.1 (A:) Erythroid transcription factor GATA- | 4e-08 |
| >d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Length = 42 | Back information, alignment and structure |
|---|
class: Small proteins fold: Glucocorticoid receptor-like (DNA-binding domain) superfamily: Glucocorticoid receptor-like (DNA-binding domain) family: Erythroid transcription factor GATA-1 domain: Erythroid transcription factor GATA-1 species: Emericella nidulans [TaxId: 162425]
Score = 54.4 bits (131), Expect = 3e-11
Identities = 20/43 (46%), Positives = 24/43 (55%), Gaps = 3/43 (6%)
Query: 140 VCRHCGISEKSTPMMRRGPEGPRTLCNACGLMWANKGTLRDLS 182
C +C TP+ RR PEG + LCNACGL G +R LS
Sbjct: 2 TCTNCFTQT--TPLWRRNPEG-QPLCNACGLFLKLHGVVRPLS 41
|
| >d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Length = 39 | Back information, alignment and structure |
|---|
| >d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Length = 66 | Back information, alignment and structure |
|---|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 219 | |||
| d2vuti1 | 42 | Erythroid transcription factor GATA-1 {Emericella | 99.79 | |
| d1y0ja1 | 39 | Erythroid transcription factor GATA-1 {Mouse (Mus | 99.73 | |
| d3gata_ | 66 | Erythroid transcription factor GATA-1 {Chicken (Ga | 99.71 |
| >d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} | Back information, alignment and structure |
|---|
class: Small proteins fold: Glucocorticoid receptor-like (DNA-binding domain) superfamily: Glucocorticoid receptor-like (DNA-binding domain) family: Erythroid transcription factor GATA-1 domain: Erythroid transcription factor GATA-1 species: Emericella nidulans [TaxId: 162425]
Probab=99.79 E-value=1.9e-20 Score=121.18 Aligned_cols=41 Identities=49% Similarity=0.980 Sum_probs=38.5
Q ss_pred cccccccccCCCCccccCCCCCcccchhhhHHHHhcCCCCCCcc
Q 027756 140 VCRHCGISEKSTPMMRRGPEGPRTLCNACGLMWANKGTLRDLSK 183 (219)
Q Consensus 140 ~C~~C~~~~~~Tp~wR~Gp~G~~~LCNaCGl~~~~~~~~r~~~~ 183 (219)
.|+||++ ++||+||+||+| ++|||||||||++|+++||+++
T Consensus 2 ~C~nC~t--t~Tp~WRr~~~G-~~lCNACGl~~k~~g~~RP~sl 42 (42)
T d2vuti1 2 TCTNCFT--QTTPLWRRNPEG-QPLCNACGLFLKLHGVVRPLSL 42 (42)
T ss_dssp CCSSSCC--CCCSCCEECTTS-CEECHHHHHHHHHHSSCCCCCC
T ss_pred cCCCCCC--CCCccceeCCCC-CCchhhhhHHHHHcCCCCCCCC
Confidence 5999999 789999999999 6999999999999999999864
|
| >d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} | Back information, alignment and structure |
|---|
| >d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} | Back information, alignment and structure |
|---|