Citrus Sinensis ID: 030971


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------17
MNQNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDKKKDKTTNAAAAAASASAKVVTEKADGTGNASCCVNGLGDSFKMRLMALGQEVLLQRSSPSSSSPSTTSKSSPSLFLPSVVAKKQRCQRKRKLKEEEQAAFSLMALSCGFVFA
cccccccccccccccccccccccccccccccccHHHHHHHHHccccccccccccccccccccHHHHHHHHccccccccccccccccccccccccccHHHHHHHccHHHHccccccccccccccccccccccccHHHHHHHHHHHcccccHHHHHHHHHHHHccccccc
cccccccEEEEccccccccccccccccccccHHHccccEEcccccccccccccccccccccccccHHHHHHHHHHccccccccccccccccccccHHHHHHHcccccEEEccccccccccccccccccccccccHHHHHHHHHHcccccHHHHHHHHHHHHHcccEcc
mnqndsnlkyctdckttktplwrggpagpkslcnacgirfrkkKRVFEGLkkrsdkkkdkTTNAAAAAASASAKVVtekadgtgnasccvnglgDSFKMRLMALGQEVLlqrsspsssspsttsksspslflpsvvAKKQRCQRKRKLKEEEQAAFSLMALSCGFVFA
mnqndsnlkycTDCKttktplwrggpagpkslcnacgirfrkkkrvfeglkkrsdkkkdkttnAAAAaasasakvvtekadgtgNASCCVNGLGDSFKMRLMALGQEVLLQRsspsssspsttsksspslflpsvvakKQRCQRKRKLKEEEQAAFslmalscgfvfa
MNQNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLkkrsdkkkdkttnaaaaaasasakvvtekaDGTGNASCCVNGLGDSFKMRLMALGQEVLLQRsspsssspsttsksspslflpsVVakkqrcqrkrklkeeeqaaFSLMALSCGFVFA
********KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVF*************************************NASCCVNGLGDSFKMRLMALGQEVL**********************************************FSLMALSCGFV**
***********TDCKTTKTPLWRGGPAGPKSLCNACGIRF******************************************************************************************************************AFSLMALSCGFVFA
MNQNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLK*******************************TGNASCCVNGLGDSFKMRLMALGQEVL*******************SLFLPSVV***************EQAAFSLMALSCGFVFA
*****SNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRS****************************************D**KMRLMALGQEVLLQ******************LFLPSVVAKKQRCQ*KRKLKEEEQAAFSLMALSCGFVFA
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MNQNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDKKKDKTTNAAAAAASASAKVVTEKADGTGNASCCVNGLGDSFKMRLMALGQEVLLQRSSPSSSSPSTTSKSSPSLFLPSVVAKKQRCQRKRKLKEEEQAAFSLMALSCGFVFA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query168 2.2.26 [Sep-21-2011]
Q8LG10149 GATA transcription factor yes no 0.607 0.684 0.422 4e-22
Q9FJ10139 GATA transcription factor no no 0.666 0.805 0.372 3e-17
Q8LC59120 GATA transcription factor no no 0.267 0.375 0.644 1e-13
Q5HZ36398 GATA transcription factor no no 0.232 0.097 0.717 1e-11
Q9SZI6352 Putative GATA transcripti no no 0.214 0.102 0.75 2e-11
Q6QPM2211 GATA transcription factor no no 0.309 0.246 0.508 6e-11
Q6DBP8303 GATA transcription factor no no 0.267 0.148 0.6 6e-11
Q9ZPX0208 GATA transcription factor no no 0.208 0.168 0.742 6e-11
Q8LC79295 GATA transcription factor no no 0.208 0.118 0.742 8e-11
Q8VZP4308 GATA transcription factor no no 0.279 0.152 0.510 1e-09
>sp|Q8LG10|GAT15_ARATH GATA transcription factor 15 OS=Arabidopsis thaliana GN=GATA15 PE=2 SV=2 Back     alignment and function desciption
 Score =  103 bits (258), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 65/154 (42%), Positives = 76/154 (49%), Gaps = 52/154 (33%)

Query: 9   KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDKKKDKTTNAAAAA 68
           K C  C T+KTPLWRGGPAGPKSLCNACGIR RKK+R    +  RS+ KK K+ N     
Sbjct: 41  KSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTL--ISNRSEDKKKKSHNRNPK- 97

Query: 69  ASASAKVVTEKADGTGNASCCVNGLGDSFKMRLMALGQEVLLQRSSPSSSSPSTTSKSSP 128
                                    GDS K RLM LG+EV++QRS+  +           
Sbjct: 98  ------------------------FGDSLKQRLMELGREVMMQRSTAEN----------- 122

Query: 129 SLFLPSVVAKKQRCQRKRKLKEEEQAAFSLMALS 162
                         QR+ KL EEEQAA  LMALS
Sbjct: 123 --------------QRRNKLGEEEQAAVLLMALS 142




Transcriptional regulator that specifically binds 5'-GATA-3' or 5'-GAT-3' motifs within gene promoters.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9FJ10|GAT16_ARATH GATA transcription factor 16 OS=Arabidopsis thaliana GN=GATA16 PE=2 SV=1 Back     alignment and function description
>sp|Q8LC59|GAT23_ARATH GATA transcription factor 23 OS=Arabidopsis thaliana GN=GATA23 PE=2 SV=2 Back     alignment and function description
>sp|Q5HZ36|GAT21_ARATH GATA transcription factor 21 OS=Arabidopsis thaliana GN=GATA21 PE=1 SV=2 Back     alignment and function description
>sp|Q9SZI6|GAT22_ARATH Putative GATA transcription factor 22 OS=Arabidopsis thaliana GN=GATA22 PE=2 SV=1 Back     alignment and function description
>sp|Q6QPM2|GAT19_ARATH GATA transcription factor 19 OS=Arabidopsis thaliana GN=GATA19 PE=2 SV=2 Back     alignment and function description
>sp|Q6DBP8|GAT11_ARATH GATA transcription factor 11 OS=Arabidopsis thaliana GN=GATA11 PE=2 SV=1 Back     alignment and function description
>sp|Q9ZPX0|GAT20_ARATH GATA transcription factor 20 OS=Arabidopsis thaliana GN=GATA20 PE=2 SV=2 Back     alignment and function description
>sp|Q8LC79|GAT18_ARATH GATA transcription factor 18 OS=Arabidopsis thaliana GN=GATA18 PE=2 SV=2 Back     alignment and function description
>sp|Q8VZP4|GAT10_ARATH GATA transcription factor 10 OS=Arabidopsis thaliana GN=GATA10 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query168
224123912161 predicted protein [Populus trichocarpa] 0.821 0.857 0.543 5e-33
225431869153 PREDICTED: GATA transcription factor 16 0.708 0.777 0.478 2e-25
147814791125 hypothetical protein VITISV_042395 [Viti 0.636 0.856 0.443 6e-24
225450647124 PREDICTED: GATA transcription factor 16- 0.636 0.862 0.443 7e-24
449432898148 PREDICTED: GATA transcription factor 16- 0.702 0.797 0.436 1e-23
449432896151 PREDICTED: GATA transcription factor 16- 0.702 0.781 0.436 1e-23
255556286149 GATA transcription factor, putative [Ric 0.714 0.805 0.466 2e-23
224110254125 predicted protein [Populus trichocarpa] 0.666 0.896 0.462 7e-23
357511771129 GATA transcription factor [Medicago trun 0.642 0.837 0.418 7e-22
255542842151 conserved hypothetical protein [Ricinus 0.803 0.894 0.451 1e-21
>gi|224123912|ref|XP_002330240.1| predicted protein [Populus trichocarpa] gi|222871696|gb|EEF08827.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  145 bits (366), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 87/160 (54%), Positives = 104/160 (65%), Gaps = 22/160 (13%)

Query: 9   KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRSDKKKDKTTNAAAAA 68
           K CTDCKTTKTPLWRGGPAGPKSLCNACGIR+RKK+ V   L+K  +KK++KTT +    
Sbjct: 24  KACTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKKRSVMR-LEKGPEKKREKTTTSNTTT 82

Query: 69  ASASAKVVTEKADGTGNASCCVNGLGDSFKMRLMALGQEVLLQRSSPSSSSPSTTSKSSP 128
           A+  + + T     T         + +S +M LM LG+E++LQR                
Sbjct: 83  ATDISTITTATTTNTAQVVSGNGLISESLRMSLMVLGEEMMLQR---------------- 126

Query: 129 SLFLPSVVAKKQRCQRKRKLKEEEQAAFSLMALSCGFVFA 168
               PSVV KKQRCQRKRKL+EEEQAAFSLMALSCG VFA
Sbjct: 127 ----PSVV-KKQRCQRKRKLREEEQAAFSLMALSCGSVFA 161




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225431869|ref|XP_002275498.1| PREDICTED: GATA transcription factor 16 [Vitis vinifera] gi|296083288|emb|CBI22924.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|147814791|emb|CAN74414.1| hypothetical protein VITISV_042395 [Vitis vinifera] Back     alignment and taxonomy information
>gi|225450647|ref|XP_002278369.1| PREDICTED: GATA transcription factor 16-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|449432898|ref|XP_004134235.1| PREDICTED: GATA transcription factor 16-like isoform 2 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449432896|ref|XP_004134234.1| PREDICTED: GATA transcription factor 16-like isoform 1 [Cucumis sativus] Back     alignment and taxonomy information
>gi|255556286|ref|XP_002519177.1| GATA transcription factor, putative [Ricinus communis] gi|223541492|gb|EEF43041.1| GATA transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|224110254|ref|XP_002315462.1| predicted protein [Populus trichocarpa] gi|222864502|gb|EEF01633.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|357511771|ref|XP_003626174.1| GATA transcription factor [Medicago truncatula] gi|355501189|gb|AES82392.1| GATA transcription factor [Medicago truncatula] Back     alignment and taxonomy information
>gi|255542842|ref|XP_002512484.1| conserved hypothetical protein [Ricinus communis] gi|223548445|gb|EEF49936.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query168
TAIR|locus:2083388149 GATA15 "GATA transcription fac 0.261 0.295 0.733 8.1e-19
TAIR|locus:2155919139 GATA16 "GATA transcription fac 0.244 0.294 0.761 1.7e-18
TAIR|locus:504955441197 AT4G16141 [Arabidopsis thalian 0.25 0.213 0.690 1.4e-15
TAIR|locus:2093678190 GATA17 "GATA transcription fac 0.238 0.210 0.7 2.2e-15
TAIR|locus:2148558120 GATA23 "GATA transcription fac 0.267 0.375 0.644 2.3e-14
TAIR|locus:2120845352 CGA1 "cytokinin-responsive gat 0.25 0.119 0.714 4.3e-13
TAIR|locus:2170277398 GNC "GATA, nitrate-inducible, 0.261 0.110 0.659 1.3e-12
TAIR|locus:2115195211 GATA19 "GATA transcription fac 0.208 0.165 0.742 6.4e-12
TAIR|locus:2062095208 GATA20 "GATA transcription fac 0.208 0.168 0.742 8.2e-12
TAIR|locus:2077932295 MNP "MONOPOLE" [Arabidopsis th 0.244 0.138 0.690 8.7e-12
TAIR|locus:2083388 GATA15 "GATA transcription factor 15" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 172 (65.6 bits), Expect = 8.1e-19, Sum P(2) = 8.1e-19
 Identities = 33/45 (73%), Positives = 36/45 (80%)

Query:     2 NQNDSN-LKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKR 45
             N+  SN  K C  C T+KTPLWRGGPAGPKSLCNACGIR RKK+R
Sbjct:    33 NEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRR 77


GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA
GO:0008270 "zinc ion binding" evidence=IEA
GO:0043565 "sequence-specific DNA binding" evidence=IEA
GO:0009407 "toxin catabolic process" evidence=RCA
TAIR|locus:2155919 GATA16 "GATA transcription factor 16" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504955441 AT4G16141 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2093678 GATA17 "GATA transcription factor 17" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2148558 GATA23 "GATA transcription factor 23" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2120845 CGA1 "cytokinin-responsive gata factor 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2170277 GNC "GATA, nitrate-inducible, carbon metabolism-involved" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2115195 GATA19 "GATA transcription factor 19" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2062095 GATA20 "GATA transcription factor 20" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2077932 MNP "MONOPOLE" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_fgenesh4_pg.C_1330084
hypothetical protein (161 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query168
pfam0032036 pfam00320, GATA, GATA zinc finger 1e-14
smart0040152 smart00401, ZnF_GATA, zinc finger binding to DNA c 2e-13
cd0020254 cd00202, ZnF_GATA, Zinc finger DNA binding domain; 3e-11
>gnl|CDD|109380 pfam00320, GATA, GATA zinc finger Back     alignment and domain information
 Score = 63.8 bits (156), Expect = 1e-14
 Identities = 21/36 (58%), Positives = 27/36 (75%)

Query: 11 CTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRV 46
          C++C TTKTPLWR GP G ++LCNACG+ +RK    
Sbjct: 1  CSNCGTTKTPLWRRGPDGNRTLCNACGLYYRKHGLK 36


This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the domain. Length = 36

>gnl|CDD|214648 smart00401, ZnF_GATA, zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
>gnl|CDD|238123 cd00202, ZnF_GATA, Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 168
smart0040152 ZnF_GATA zinc finger binding to DNA consensus sequ 99.51
cd0020254 ZnF_GATA Zinc finger DNA binding domain; binds spe 99.51
PF0032036 GATA: GATA zinc finger; InterPro: IPR000679 Zinc f 99.44
KOG1601340 consensus GATA-4/5/6 transcription factors [Transc 98.71
COG5641 498 GAT1 GATA Zn-finger-containing transcription facto 98.18
KOG3554 693 consensus Histone deacetylase complex, MTA1 compon 93.61
COG5641498 GAT1 GATA Zn-finger-containing transcription facto 82.51
>smart00401 ZnF_GATA zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
Probab=99.51  E-value=8.1e-15  Score=98.46  Aligned_cols=46  Identities=46%  Similarity=0.988  Sum_probs=39.7

Q ss_pred             CCcccccCCCCCCCcccCCCCCCcccchHHHHHHHhhccccccCcc
Q 030971            7 NLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKK   52 (168)
Q Consensus         7 ~~~~C~~Cgtt~Tp~WRrGp~G~~~LCNACGL~y~k~~~~~~~l~~   52 (168)
                      ....|++|+++.||+||+||.|+..|||+|||+|++.....+|+..
T Consensus         2 ~~~~C~~C~~~~T~~WR~g~~g~~~LCnaCgl~~~k~~~~~rp~~~   47 (52)
T smart00401        2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLKRPLSL   47 (52)
T ss_pred             CCCCcCCCCCCCCCccccCCCCCCcEeecccHHHHHcCCCCCcccc
Confidence            4689999999999999999999879999999988887665466554



>cd00202 ZnF_GATA Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C Back     alignment and domain information
>PF00320 GATA: GATA zinc finger; InterPro: IPR000679 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>KOG1601 consensus GATA-4/5/6 transcription factors [Transcription] Back     alignment and domain information
>COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription] Back     alignment and domain information
>KOG3554 consensus Histone deacetylase complex, MTA1 component [Chromatin structure and dynamics] Back     alignment and domain information
>COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query168
2kae_A71 GATA-type transcription factor; zinc finger, GATA- 3e-13
1gnf_A46 Transcription factor GATA-1; zinc finger, transcri 4e-12
2vut_I43 AREA, nitrogen regulatory protein AREA; transcript 1e-10
4gat_A66 Nitrogen regulatory protein AREA; DNA binding prot 4e-09
3dfx_A63 Trans-acting T-cell-specific transcription factor 1e-07
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Length = 71 Back     alignment and structure
 Score = 60.7 bits (147), Expect = 3e-13
 Identities = 17/67 (25%), Positives = 25/67 (37%), Gaps = 2/67 (2%)

Query: 2  NQNDSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRK--KKRVFEGLKKRSDKKKD 59
          +  +     C++C  T+T  WR   +     CNAC I  RK  K R    + K   +K  
Sbjct: 2  SHMNKKSFQCSNCSVTETIRWRNIRSKEGIQCNACFIYQRKYNKTRPVTAVNKYQKRKLK 61

Query: 60 KTTNAAA 66
                 
Sbjct: 62 VQETNGV 68


>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Length = 46 Back     alignment and structure
>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Length = 43 Back     alignment and structure
>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Length = 66 Back     alignment and structure
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Length = 63 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query168
3dfx_A63 Trans-acting T-cell-specific transcription factor 99.67
4gat_A66 Nitrogen regulatory protein AREA; DNA binding prot 99.67
1gnf_A46 Transcription factor GATA-1; zinc finger, transcri 99.66
2vut_I43 AREA, nitrogen regulatory protein AREA; transcript 99.65
2kae_A71 GATA-type transcription factor; zinc finger, GATA- 99.61
4hc9_A115 Trans-acting T-cell-specific transcription factor; 99.5
4hc9_A115 Trans-acting T-cell-specific transcription factor; 99.42
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Back     alignment and structure
Probab=99.67  E-value=3.3e-17  Score=113.81  Aligned_cols=48  Identities=31%  Similarity=0.737  Sum_probs=40.8

Q ss_pred             CCCCcccccCCCCCCCcccCCCCCCcccchHHHHHHHhhccccccCcccc
Q 030971            5 DSNLKYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKKRS   54 (168)
Q Consensus         5 ~~~~~~C~~Cgtt~Tp~WRrGp~G~~~LCNACGL~y~k~~~~~~~l~~~~   54 (168)
                      ......|++|+++.||+||+||+|+ +|||||||+|++ +++.+|+.+.+
T Consensus         4 ~~~~~~C~~C~tt~Tp~WR~gp~G~-~LCNACGl~~~~-~~~~RP~~~~~   51 (63)
T 3dfx_A            4 RRAGTSCANCQTTTTTLWRRNANGD-PVCNACGLYYKL-HNINRPLTMKK   51 (63)
T ss_dssp             CCTTCCCTTTCCSCCSSCCCCTTSC-CCCHHHHHHHHH-HSSCCCGGGCC
T ss_pred             CCCCCcCCCcCCCCCCccCCCCCCC-chhhHHHHHHHH-cCCCCCcCcCC
Confidence            4567899999999999999999996 999999997765 56678887754



>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Back     alignment and structure
>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Back     alignment and structure
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 168
d1y0ja139 g.39.1.1 (A:200-238) Erythroid transcription facto 4e-12
d2vuti142 g.39.1.1 (I:671-712) Erythroid transcription facto 8e-11
d3gata_66 g.39.1.1 (A:) Erythroid transcription factor GATA- 2e-07
>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Length = 39 Back     information, alignment and structure

class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Mouse (Mus musculus) [TaxId: 10090]
 Score = 55.5 bits (134), Expect = 4e-12
 Identities = 15/37 (40%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 9  KYCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKR 45
          + C +C  T TPLWR    G   LCNACG+  +   +
Sbjct: 3  RECVNCGATATPLWRRDRTG-HYLCNACGLYHKMNGQ 38


>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Length = 42 Back     information, alignment and structure
>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Length = 66 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query168
d2vuti142 Erythroid transcription factor GATA-1 {Emericella 99.71
d1y0ja139 Erythroid transcription factor GATA-1 {Mouse (Mus 99.64
d3gata_66 Erythroid transcription factor GATA-1 {Chicken (Ga 99.64
d1neea237 Zinc-binding domain of translation initiation fact 84.43
>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Back     information, alignment and structure
class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Emericella nidulans [TaxId: 162425]
Probab=99.71  E-value=1.5e-18  Score=110.72  Aligned_cols=41  Identities=51%  Similarity=1.023  Sum_probs=37.0

Q ss_pred             ccccCCCCCCCcccCCCCCCcccchHHHHHHHhhccccccCcc
Q 030971           10 YCTDCKTTKTPLWRGGPAGPKSLCNACGIRFRKKKRVFEGLKK   52 (168)
Q Consensus        10 ~C~~Cgtt~Tp~WRrGp~G~~~LCNACGL~y~k~~~~~~~l~~   52 (168)
                      .|+||++++||+||+||+| ++||||||| |.|.+++.||+.+
T Consensus         2 ~C~nC~tt~Tp~WRr~~~G-~~lCNACGl-~~k~~g~~RP~sl   42 (42)
T d2vuti1           2 TCTNCFTQTTPLWRRNPEG-QPLCNACGL-FLKLHGVVRPLSL   42 (42)
T ss_dssp             CCSSSCCCCCSCCEECTTS-CEECHHHHH-HHHHHSSCCCCCC
T ss_pred             cCCCCCCCCCccceeCCCC-CCchhhhhH-HHHHcCCCCCCCC
Confidence            6999999999999999999 799999999 5577888888864



>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Back     information, alignment and structure
>d1neea2 g.59.1.1 (A:99-135) Zinc-binding domain of translation initiation factor 2 beta {Archaeon Methanobacterium thermoautotrophicum [TaxId: 145262]} Back     information, alignment and structure