Citrus Sinensis ID: 047722


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------14
MGSQLPNSPLNYNNVSINPDNNLQHAAAASALASMKGYTLINVPASRNQQLGTTASSSAAASAPRGAAGIPRPHGYPYKRCTNYNCNTNDTPMWRRGPLGPKTLCNACGIKYRKEEEKRKAKETETSNIDQNSNNQTT
cccccccccccccccccccccccccHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHcccccccccccccccc
cccccccccccccccccccccccccccccccccccccEEEEccccccccccccccccccccccccccccccccccccccEccccccccccccccccccccccHHHHHcccEEccHHcccccccccccccccccccccc
mgsqlpnsplnynnvsinpdnnLQHAAAASALASMKgytlinvpasrnqqlgttasssaaasaprgaagiprphgypykrctnyncntndtpmwrrgplgpktlcnacgIKYRKEEEKRKAKetetsnidqnsnnqtt
mgsqlpnsplnyNNVSINPDNNLQHAAAASALASMKGYTLINVPASRNQQLGTTASssaaasaprgaagiprphgypykRCTNYNCNTndtpmwrrgplgpKTLCNACGIKYRKEeekrkaketetsnidqnsnnqtt
MGSQLPNSPLNYNNVSINPDNNLQHaaaasalasMKGYTLINVPASRNQQLGTTasssaaasaPRGAAGIPRPHGYPYKRCTNYNCNTNDTPMWRRGPLGPKTLCNACGIkyrkeeekrkaketetSNIDQNSNNQTT
***********************************KGYTLIN*******************************HGYPYKRCTNYNCNTNDTPMWRRGPLGPKTLCNACGIKY**************************
***********************************************************************************YNCNTNDTPMWRRGPLGPKTLCNACGIKY**************************
MGSQLPNSPLNYNNVSINPDNNLQHAAAASALASMKGYTLINVPASRNQ******************AGIPRPHGYPYKRCTNYNCNTNDTPMWRRGPLGPKTLCNACGIKYRK************************
*************N**INPDN*LQHAAAASALASMKGYT**NVP*********************************YKRCTNYNCNTNDTPMWRRGPLGPKTLCNACGIKYRK************************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGSQLPNSPLNYNNVSINPDNNLQHAAAASALASMKGYTLINVPASRNQQLGTTASSSAAASAPRGAAGIPRPHGYPYKRCTNYNCNTNDTPMWRRGPLGPKTLCNACGIKYRKEEEKRKAKETETSNIDQNSNNQTT
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query138 2.2.26 [Sep-21-2011]
Q9LT45208 GATA transcription factor yes no 0.673 0.447 0.505 1e-16
Q6QPM2211 GATA transcription factor no no 0.340 0.222 0.530 4e-10
Q8LC79295 GATA transcription factor no no 0.297 0.138 0.581 7e-10
Q9ZPX0208 GATA transcription factor no no 0.333 0.221 0.530 8e-10
Q54NM5640 GATA zinc finger domain-c yes no 0.246 0.053 0.676 2e-09
Q00858457 Cutinase gene palindrome- N/A no 0.268 0.080 0.615 4e-09
P69781331 GATA transcription factor no no 0.297 0.123 0.534 1e-08
Q9SV30322 GATA transcription factor no no 0.260 0.111 0.552 2e-08
Q55GK0 952 GATA zinc finger domain-c no no 0.268 0.038 0.594 2e-08
Q8L4M6269 GATA transcription factor no no 0.239 0.122 0.6 3e-08
>sp|Q9LT45|GAT29_ARATH GATA transcription factor 29 OS=Arabidopsis thaliana GN=GATA29 PE=2 SV=1 Back     alignment and function desciption
 Score = 85.1 bits (209), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 50/99 (50%), Positives = 57/99 (57%), Gaps = 6/99 (6%)

Query: 38  YTLINVPASR---NQQLGTTASSSAAASAPRGAAGIPRPHGYP--YKRCTNYNCNTNDTP 92
           Y LI+VPA R   N     T S    A+  R         G     K+CTN NCN  +TP
Sbjct: 111 YVLIDVPARRARRNNSTVMTNSWKENATPKRIRGCGGFCGGRIEGMKKCTNMNCNALNTP 170

Query: 93  MWRRGPLGPKTLCNACGIKYRKEEEKRKAKETETSNIDQ 131
           MWRRGPLGPK+LCNACGIK+RKEEE RKAK      +D 
Sbjct: 171 MWRRGPLGPKSLCNACGIKFRKEEE-RKAKRNVVIVLDD 208




Transcriptional regulator that specifically binds 5'-GATA-3' or 5'-GAT-3' motifs within gene promoters.
Arabidopsis thaliana (taxid: 3702)
>sp|Q6QPM2|GAT19_ARATH GATA transcription factor 19 OS=Arabidopsis thaliana GN=GATA19 PE=2 SV=2 Back     alignment and function description
>sp|Q8LC79|GAT18_ARATH GATA transcription factor 18 OS=Arabidopsis thaliana GN=GATA18 PE=2 SV=2 Back     alignment and function description
>sp|Q9ZPX0|GAT20_ARATH GATA transcription factor 20 OS=Arabidopsis thaliana GN=GATA20 PE=2 SV=2 Back     alignment and function description
>sp|Q54NM5|GTAL_DICDI GATA zinc finger domain-containing protein 12 OS=Dictyostelium discoideum GN=gtaL PE=4 SV=1 Back     alignment and function description
>sp|Q00858|CGPB_FUSSO Cutinase gene palindrome-binding protein OS=Fusarium solani subsp. pisi PE=2 SV=1 Back     alignment and function description
>sp|P69781|GAT12_ARATH GATA transcription factor 12 OS=Arabidopsis thaliana GN=GATA12 PE=2 SV=1 Back     alignment and function description
>sp|Q9SV30|GATA8_ARATH GATA transcription factor 8 OS=Arabidopsis thaliana GN=GATA8 PE=2 SV=1 Back     alignment and function description
>sp|Q55GK0|GTAE_DICDI GATA zinc finger domain-containing protein 5 OS=Dictyostelium discoideum GN=gtaE PE=4 SV=1 Back     alignment and function description
>sp|Q8L4M6|GATA3_ARATH GATA transcription factor 3 OS=Arabidopsis thaliana GN=GATA3 PE=2 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query138
255574434161 conserved hypothetical protein [Ricinus 0.695 0.596 0.536 4e-20
109676364 1238 unknown [Populus trichocarpa] 0.333 0.037 0.782 6e-17
297830770213 predicted protein [Arabidopsis lyrata su 0.666 0.431 0.520 2e-16
449448980 240 PREDICTED: GATA transcription factor 29- 0.644 0.370 0.516 6e-16
15232346208 GATA transcription factor 29 [Arabidopsi 0.673 0.447 0.505 9e-15
224073208226 predicted protein [Populus trichocarpa] 0.615 0.376 0.451 3e-12
413948588 243 hypothetical protein ZEAMMB73_192746 [Ze 0.362 0.205 0.547 5e-09
297798242213 zinc finger family protein [Arabidopsis 0.340 0.220 0.551 1e-08
449457634226 PREDICTED: GATA transcription factor 18- 0.376 0.230 0.5 2e-08
452844631 534 hypothetical protein DOTSEDRAFT_70543 [D 0.318 0.082 0.545 3e-08
>gi|255574434|ref|XP_002528130.1| conserved hypothetical protein [Ricinus communis] gi|223532469|gb|EEF34260.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 52/97 (53%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 32  LASMKGYTLINVPASRNQQL-GTTASSSAAASAPRGAAGIPRPHGYPYKRCTNYNCNTND 90
           + +M  YTL++    R   +     SSS               +  P KRCTNYNCNTND
Sbjct: 61  IHTMNDYTLLDSTPRRVAHMEDVGGSSSINFRRRDSRRQRAGSYNDPTKRCTNYNCNTND 120

Query: 91  TPMWRRGPLGPKTLCNACGIKYRKEEEKRKAKETETS 127
           TPMWR+GPLGPKTLCNACGIKYRKE EKR+AKE  +S
Sbjct: 121 TPMWRKGPLGPKTLCNACGIKYRKEVEKRRAKEAASS 157




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|109676364|gb|ABG37664.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297830770|ref|XP_002883267.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297329107|gb|EFH59526.1| predicted protein [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|449448980|ref|XP_004142243.1| PREDICTED: GATA transcription factor 29-like [Cucumis sativus] gi|449503487|ref|XP_004162027.1| PREDICTED: GATA transcription factor 29-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|15232346|ref|NP_188711.1| GATA transcription factor 29 [Arabidopsis thaliana] gi|71660799|sp|Q9LT45.1|GAT29_ARATH RecName: Full=GATA transcription factor 29 gi|9294402|dbj|BAB02483.1| unnamed protein product [Arabidopsis thaliana] gi|225898665|dbj|BAH30463.1| hypothetical protein [Arabidopsis thaliana] gi|332642898|gb|AEE76419.1| GATA transcription factor 29 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|224073208|ref|XP_002304024.1| predicted protein [Populus trichocarpa] gi|222841456|gb|EEE79003.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|413948588|gb|AFW81237.1| hypothetical protein ZEAMMB73_192746 [Zea mays] Back     alignment and taxonomy information
>gi|297798242|ref|XP_002867005.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] gi|297312841|gb|EFH43264.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|449457634|ref|XP_004146553.1| PREDICTED: GATA transcription factor 18-like [Cucumis sativus] gi|449529098|ref|XP_004171538.1| PREDICTED: GATA transcription factor 18-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|452844631|gb|EME46565.1| hypothetical protein DOTSEDRAFT_70543 [Dothistroma septosporum NZE10] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query138
TAIR|locus:2091886208 GATA29 "GATA transcription fac 0.528 0.350 0.487 1e-13
DICTYBASE|DDB_G0285139640 gtaL "GATA zinc finger domain- 0.188 0.040 0.807 3.3e-09
ASPGD|ASPL0000039361461 nsdD [Emericella nidulans (tax 0.231 0.069 0.588 4.1e-09
DICTYBASE|DDB_G0281661536 gtaI "GATA zinc finger domain- 0.188 0.048 0.692 1.2e-08
TAIR|locus:2115195211 GATA19 "GATA transcription fac 0.253 0.165 0.594 1.3e-08
TAIR|locus:2139594269 GATA3 "GATA transcription fact 0.217 0.111 0.656 1.6e-08
TAIR|locus:2062095208 GATA20 "GATA transcription fac 0.246 0.163 0.594 2.1e-08
DICTYBASE|DDB_G0270756 1006 gtaG "GATA zinc finger domain- 0.195 0.026 0.703 5.4e-08
DICTYBASE|DDB_G0277589587 gtaC "GATA zinc finger domain- 0.384 0.090 0.415 6.2e-08
TAIR|locus:2103346240 GATA4 "GATA transcription fact 0.217 0.125 0.656 6.8e-08
TAIR|locus:2091886 GATA29 "GATA transcription factor 29" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 178 (67.7 bits), Expect = 1.0e-13, P = 1.0e-13
 Identities = 38/78 (48%), Positives = 45/78 (57%)

Query:    38 YTLINVPASRNQQLGTTXXXXX--XXXXP---RGAAGIPRPHGYPYKRCTNYNCNTNDTP 92
             Y LI+VPA R ++  +T           P   RG  G         K+CTN NCN  +TP
Sbjct:   111 YVLIDVPARRARRNNSTVMTNSWKENATPKRIRGCGGFCGGRIEGMKKCTNMNCNALNTP 170

Query:    93 MWRRGPLGPKTLCNACGI 110
             MWRRGPLGPK+LCNACGI
Sbjct:   171 MWRRGPLGPKSLCNACGI 188




GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA
GO:0008270 "zinc ion binding" evidence=IEA
GO:0043565 "sequence-specific DNA binding" evidence=IEA
DICTYBASE|DDB_G0285139 gtaL "GATA zinc finger domain-containing protein 12" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
ASPGD|ASPL0000039361 nsdD [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0281661 gtaI "GATA zinc finger domain-containing protein 9" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
TAIR|locus:2115195 GATA19 "GATA transcription factor 19" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2139594 GATA3 "GATA transcription factor 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2062095 GATA20 "GATA transcription factor 20" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0270756 gtaG "GATA zinc finger domain-containing protein 7" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0277589 gtaC "GATA zinc finger domain-containing protein 3" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
TAIR|locus:2103346 GATA4 "GATA transcription factor 4" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
Al_scaffold_0003_2224
annotation not avaliable (213 aa)
(Arabidopsis lyrata)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query138
pfam0032036 pfam00320, GATA, GATA zinc finger 3e-13
smart0040152 smart00401, ZnF_GATA, zinc finger binding to DNA c 1e-12
cd0020254 cd00202, ZnF_GATA, Zinc finger DNA binding domain; 3e-10
>gnl|CDD|109380 pfam00320, GATA, GATA zinc finger Back     alignment and domain information
 Score = 59.2 bits (144), Expect = 3e-13
 Identities = 21/30 (70%), Positives = 24/30 (80%)

Query: 85  NCNTNDTPMWRRGPLGPKTLCNACGIKYRK 114
           NC T  TP+WRRGP G +TLCNACG+ YRK
Sbjct: 3   NCGTTKTPLWRRGPDGNRTLCNACGLYYRK 32


This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the domain. Length = 36

>gnl|CDD|214648 smart00401, ZnF_GATA, zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
>gnl|CDD|238123 cd00202, ZnF_GATA, Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 138
cd0020254 ZnF_GATA Zinc finger DNA binding domain; binds spe 99.66
smart0040152 ZnF_GATA zinc finger binding to DNA consensus sequ 99.6
PF0032036 GATA: GATA zinc finger; InterPro: IPR000679 Zinc f 99.57
KOG1601340 consensus GATA-4/5/6 transcription factors [Transc 98.84
COG5641 498 GAT1 GATA Zn-finger-containing transcription facto 98.41
COG5641 498 GAT1 GATA Zn-finger-containing transcription facto 85.29
>cd00202 ZnF_GATA Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C Back     alignment and domain information
Probab=99.66  E-value=5.7e-17  Score=105.76  Aligned_cols=51  Identities=43%  Similarity=0.959  Sum_probs=43.5

Q ss_pred             cccCCCCCCCCCCccccCCCCCcchhhHHHHHHHhhhcccchhhhhccccccc
Q 047722           80 RCTNYNCNTNDTPMWRRGPLGPKTLCNACGIKYRKEEEKRKAKETETSNIDQN  132 (138)
Q Consensus        80 ~C~~~~C~t~~TP~WRrGp~G~~~LCNACGL~~~k~~~~r~~~~~~~~~~~~~  132 (138)
                      .|+  +|++++||+||+||.+...|||||||||++++..|+........+..+
T Consensus         1 ~C~--~C~~~~Tp~WR~g~~~~~~LCNaCgl~~~k~~~~rp~~~~~~~~~~~~   51 (54)
T cd00202           1 ACS--NCGTTTTPLWRRGPSGGSTLCNACGLYWKKHGVMRPLSKRKKDQIKRR   51 (54)
T ss_pred             CCC--CCCCCCCcccccCCCCcchHHHHHHHHHHhcCCCCCcccCcccccccc
Confidence            488  999999999999998889999999999999999888877664444433



>smart00401 ZnF_GATA zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
>PF00320 GATA: GATA zinc finger; InterPro: IPR000679 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>KOG1601 consensus GATA-4/5/6 transcription factors [Transcription] Back     alignment and domain information
>COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription] Back     alignment and domain information
>COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query138
2kae_A71 GATA-type transcription factor; zinc finger, GATA- 2e-12
1gnf_A46 Transcription factor GATA-1; zinc finger, transcri 4e-10
2vut_I43 AREA, nitrogen regulatory protein AREA; transcript 4e-09
4gat_A66 Nitrogen regulatory protein AREA; DNA binding prot 8e-07
3dfx_A63 Trans-acting T-cell-specific transcription factor 1e-06
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Length = 71 Back     alignment and structure
 Score = 57.6 bits (139), Expect = 2e-12
 Identities = 15/60 (25%), Positives = 22/60 (36%), Gaps = 2/60 (3%)

Query: 79  KRCTNYNCNTNDTPMWRRGPLGPKTLCNACGIKYRKEEEKRKAKETETSNIDQNSNNQTT 138
            +C+N  C+  +T  WR         CNAC I  RK  + R           +    +T 
Sbjct: 9   FQCSN--CSVTETIRWRNIRSKEGIQCNACFIYQRKYNKTRPVTAVNKYQKRKLKVQETN 66


>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Length = 46 Back     alignment and structure
>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Length = 43 Back     alignment and structure
>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Length = 66 Back     alignment and structure
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Length = 63 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query138
3dfx_A63 Trans-acting T-cell-specific transcription factor 99.78
1gnf_A46 Transcription factor GATA-1; zinc finger, transcri 99.74
2vut_I43 AREA, nitrogen regulatory protein AREA; transcript 99.74
4gat_A66 Nitrogen regulatory protein AREA; DNA binding prot 99.74
2kae_A71 GATA-type transcription factor; zinc finger, GATA- 99.69
4hc9_A115 Trans-acting T-cell-specific transcription factor; 99.65
4hc9_A115 Trans-acting T-cell-specific transcription factor; 99.46
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Back     alignment and structure
Probab=99.78  E-value=1.9e-19  Score=120.20  Aligned_cols=56  Identities=30%  Similarity=0.686  Sum_probs=47.3

Q ss_pred             CCccccCCCCCCCCCCccccCCCCCcchhhHHHHHHHhhhcccchhhhhcccccccccCC
Q 047722           77 PYKRCTNYNCNTNDTPMWRRGPLGPKTLCNACGIKYRKEEEKRKAKETETSNIDQNSNNQ  136 (138)
Q Consensus        77 ~~~~C~~~~C~t~~TP~WRrGp~G~~~LCNACGL~~~k~~~~r~~~~~~~~~~~~~~~~~  136 (138)
                      ....|+  +|++++||+||+||+|+ +|||||||+|+++++.|++.. ....|+.|+|+-
T Consensus         6 ~~~~C~--~C~tt~Tp~WR~gp~G~-~LCNACGl~~~~~~~~RP~~~-~~~~i~~R~Rk~   61 (63)
T 3dfx_A            6 AGTSCA--NCQTTTTTLWRRNANGD-PVCNACGLYYKLHNINRPLTM-KKEGIQTRNRKM   61 (63)
T ss_dssp             TTCCCT--TTCCSCCSSCCCCTTSC-CCCHHHHHHHHHHSSCCCGGG-CCSSCCCCC---
T ss_pred             CCCcCC--CcCCCCCCccCCCCCCC-chhhHHHHHHHHcCCCCCcCc-CCCccccccCCC
Confidence            446799  99999999999999996 999999999999999888775 567799998864



>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Back     alignment and structure
>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Back     alignment and structure
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 138
d1y0ja139 g.39.1.1 (A:200-238) Erythroid transcription facto 1e-10
d2vuti142 g.39.1.1 (I:671-712) Erythroid transcription facto 5e-09
d3gata_66 g.39.1.1 (A:) Erythroid transcription factor GATA- 8e-07
>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Length = 39 Back     information, alignment and structure

class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Mouse (Mus musculus) [TaxId: 10090]
 Score = 50.9 bits (122), Expect = 1e-10
 Identities = 15/36 (41%), Positives = 20/36 (55%), Gaps = 3/36 (8%)

Query: 79  KRCTNYNCNTNDTPMWRRGPLGPKTLCNACGIKYRK 114
           + C N  C    TP+WRR   G   LCNACG+ ++ 
Sbjct: 3   RECVN--CGATATPLWRRDRTG-HYLCNACGLYHKM 35


>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Length = 42 Back     information, alignment and structure
>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Length = 66 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query138
d2vuti142 Erythroid transcription factor GATA-1 {Emericella 99.78
d3gata_66 Erythroid transcription factor GATA-1 {Chicken (Ga 99.75
d1y0ja139 Erythroid transcription factor GATA-1 {Mouse (Mus 99.75
d1neea237 Zinc-binding domain of translation initiation fact 80.68
>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Back     information, alignment and structure
class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Emericella nidulans [TaxId: 162425]
Probab=99.78  E-value=2.5e-20  Score=113.97  Aligned_cols=40  Identities=48%  Similarity=0.983  Sum_probs=37.5

Q ss_pred             cccCCCCCCCCCCccccCCCCCcchhhHHHHHHHhhhcccchh
Q 047722           80 RCTNYNCNTNDTPMWRRGPLGPKTLCNACGIKYRKEEEKRKAK  122 (138)
Q Consensus        80 ~C~~~~C~t~~TP~WRrGp~G~~~LCNACGL~~~k~~~~r~~~  122 (138)
                      .|+  ||++++||+||+||+| ++|||||||||+++++.||+.
T Consensus         2 ~C~--nC~tt~Tp~WRr~~~G-~~lCNACGl~~k~~g~~RP~s   41 (42)
T d2vuti1           2 TCT--NCFTQTTPLWRRNPEG-QPLCNACGLFLKLHGVVRPLS   41 (42)
T ss_dssp             CCS--SSCCCCCSCCEECTTS-CEECHHHHHHHHHHSSCCCCC
T ss_pred             cCC--CCCCCCCccceeCCCC-CCchhhhhHHHHHcCCCCCCC
Confidence            588  9999999999999999 899999999999999998864



>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Back     information, alignment and structure
>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1neea2 g.59.1.1 (A:99-135) Zinc-binding domain of translation initiation factor 2 beta {Archaeon Methanobacterium thermoautotrophicum [TaxId: 145262]} Back     information, alignment and structure