Citrus Sinensis ID: 023942


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-----
MYGQSQSMNISSQMSGGGAAADEDDVSVAADDHHLSYDPHSALENGIVVVEDVAHDSGYATGGNELSNSSQLTLSFRGQVYVFDSVTPDKVQAVLLLLGGCELSSSPQGMEVIPHSQRGIADYPAKCTQPQRAASLDRFRQKRKERCFDKKVRYSVRQEVALRMQRNKGQFTSAKKCEGGALGWSNAQDPGQDDSPSETSCTHCGISSKSTPMMRRGPSGPRSLCNACGLFWANKGALRDLGKKMEDQPLTPAEQRSKHGNLFLWHISSFRVKVK
cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEccEEEEccccccHHHHHHHHHcccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHcccccccccccccccccccccccccccEEEEEEccEEEEcc
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEccEEEEEccccHHHHHHHHHHHccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHcccccEEEHHHHHHHHHHcccccccEcccccccccccccccccccccccccccEEEccccccccccccccccccccccHHHHHHHHHHHcccccccccccccccccccccccccccEEEEEEccEEEEEc
mygqsqsmnissqmsgggaaadeddvsvaaddhhlsydphsaleNGIVVVEdvahdsgyatggnelsnssqltlsfrgqvyvfdsvtPDKVQAVLLLLGgcelssspqgmeviphsqrgiadypakctqpqraaSLDRFRQKRKERCFDKKVRYSVRQEVALRMQRnkgqftsakkceggalgwsnaqdpgqddspsetscthcgisskstpmmrrgpsgprslcnacglfwankGALRDlgkkmedqpltpaeqrskhgnlfLWHISSFRVKVK
MYGQSQSMNISSQMSGGGAAADEDDVSVAADDHHLSYDPHSALENGIVVVEDVAHDSGYATggnelsnssqLTLSFRGQVYVFDSVTPDKVQAVLLLLGGCELSSSPQGMEVIPHSQRGIADypakctqpqraasldrfrqkrkercfdkkvrysvRQEVAlrmqrnkgqftsakkceggaLGWSNAQDPGQDDSPSETSCTHcgisskstpmmrRGPSGPRSLCNACGLFWANKGALRDLGKKMEDQPLTpaeqrskhgnlflwhissfrvkvk
MYgqsqsmnissqmsgggaaadeddvsvaaddHHLSYDPHSALENGIVVVEDVAHDSGYATGGNELSNSSQLTLSFRGQVYVFDSVTPDKVQAVllllggcelsssPQGMEVIPHSQRGIADYPAKCTQPQRAASLDRFRQKRKERCFDKKVRYSVRQEVALRMQRNKGQFTSAKKCEGGALGWSNAQDPGQDDSPSETSCTHCGISSKSTPMMRRGPSGPRSLCNACGLFWANKGALRDLGKKMEDQPLTPAEQRSKHGNLFLWHISSFRVKVK
****************************************SALENGIVVVEDVAHDSGYATG********QLTLSFRGQVYVFDSVTPDKVQAVLLLLGGCEL*******************************************CFDKKVRYSVR******************************************************************LCNACGLFWANKGALR********************GNLFLWHISSFR****
******************************************************************************************************************************************************************************KKCEGGALGWSNAQDPGQDDSPSETSCTHCGISSKSTPMMRRGPSGPRSLCNACGLFWA******************************************
**********************EDDVSVAADDHHLSYDPHSALENGIVVVEDVAHDSGYATGGNELSNSSQLTLSFRGQVYVFDSVTPDKVQAVLLLLGGCELSSSPQGMEVIPHSQRGIADYPAKCTQPQRAASLDRFRQKRKERCFDKKVRYSVRQEVALRMQRNKGQFTSAKKCEGGALGWSNA*****************GISSKSTPMMRRGPSGPRSLCNACGLFWANKGALRDLGKKMEDQPLTPAEQRSKHGNLFLWHISSFRVKVK
*******************************************************************NSSQLTLSFRGQVYVFDSVTPDKVQAVLLLLGGCEL**********************KCTQPQRAASLDRFRQKRKERCFDKKVRYSVRQEVALRMQ********************************ETSCTHCGISSKSTPMMRRGPSGPRSLCNACGLFWANKGA********************KHGNLFLWHISSFRVKVK
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MYGQSQSMNISSQMSGGGAAADEDDVSVAADDHHLSYDPHSALENGIVVVEDVAHDSGYATGGNELSNSSQLTLSFRGQVYVFDSVTPDKVQAVLLLLGGCELSSSPQGMEVIPHSQRGIADYPAKCTQPQRAASLDRFRQKRKERCFDKKVRYSVRQEVALRMQRNKGQFTSAKKCEGGALGWSNAQDPGQDDSPSETSCTHCGISSKSTPMMRRGPSGPRSLCNACGLFWANKGALRDLGKKMEDQPLTPAEQRSKHGNLFLWHISSFRVKVK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query275 2.2.26 [Sep-21-2011]
Q9LRH6309 GATA transcription factor yes no 0.901 0.802 0.528 5e-67
Q8H1G0302 GATA transcription factor no no 0.785 0.715 0.509 5e-55
Q8GXL7297 GATA transcription factor no no 0.647 0.599 0.579 4e-53
Q8L500468 Two-component response re no no 0.432 0.254 0.338 8e-09
Q93WK5727 Two-component response re no no 0.152 0.057 0.595 3e-07
Q689G9518 Two-component response re no no 0.152 0.081 0.595 9e-07
Q689G6623 Two-component response re no no 0.152 0.067 0.571 2e-06
Q6LA42558 Two-component response re no no 0.152 0.075 0.547 3e-06
Q00858457 Cutinase gene palindrome- N/A no 0.149 0.089 0.534 6e-06
B0G188 695 GATA zinc finger domain-c yes no 0.127 0.050 0.567 7e-06
>sp|Q9LRH6|GAT25_ARATH GATA transcription factor 25 OS=Arabidopsis thaliana GN=GATA25 PE=2 SV=2 Back     alignment and function desciption
 Score =  254 bits (649), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 140/265 (52%), Positives = 175/265 (66%), Gaps = 17/265 (6%)

Query: 1   MYGQSQSMNISSQMSGGGAAADEDDVSVAADDHHLSYD-------PHSALENGIVVVEDV 53
           M+G+  S+  ++Q+    A+A ED VS +A   H+ YD       P S       ++ D 
Sbjct: 1   MFGR-HSIIPNNQIGTASASAGEDHVSASATSGHIPYDDMEEIPHPDSIYGAASDLIPDG 59

Query: 54  AHDSGYATGGNEL------SNSSQLTLSFRGQVYVFDSVTPDKVQAVLLLLGG-CELSSS 106
           +    + + G+EL        ++QLT+SFRGQVYVFD+V  DKV AVL LLGG  EL+  
Sbjct: 60  SQLVAHRSDGSELLVSRPPEGANQLTISFRGQVYVFDAVGADKVDAVLSLLGGSTELAPG 119

Query: 107 PQGMEVIPH-SQRGIADYPAKCTQPQRAASLDRFRQKRKERCFDKKVRYSVRQEVALRMQ 165
           PQ ME+    +   + +Y ++C+ PQRA SLDRFR+KR  RCF+KKVRY VRQEVALRM 
Sbjct: 120 PQVMELAQQQNHMPVVEYQSRCSLPQRAQSLDRFRKKRNARCFEKKVRYGVRQEVALRMA 179

Query: 166 RNKGQFTSAKKCEGGALGWSNAQDPGQDDSPSETSCTHCGISSKSTPMMRRGPSGPRSLC 225
           RNKGQFTS+K  +G A      QD  QDD+  E SCTHCGISSK TPMMRRGPSGPR+LC
Sbjct: 180 RNKGQFTSSKMTDG-AYNSGTDQDSAQDDAHPEISCTHCGISSKCTPMMRRGPSGPRTLC 238

Query: 226 NACGLFWANKGALRDLGKKMEDQPL 250
           NACGLFWAN+G LRDL KK E+  L
Sbjct: 239 NACGLFWANRGTLRDLSKKTEENQL 263




Transcriptional activator that specifically binds 5'-GATA-3' or 5'-GAT-3' motifs within gene promoters.
Arabidopsis thaliana (taxid: 3702)
>sp|Q8H1G0|GAT28_ARATH GATA transcription factor 28 OS=Arabidopsis thaliana GN=GATA28 PE=2 SV=1 Back     alignment and function description
>sp|Q8GXL7|GAT24_ARATH GATA transcription factor 24 OS=Arabidopsis thaliana GN=GATA24 PE=2 SV=2 Back     alignment and function description
>sp|Q8L500|APRR9_ARATH Two-component response regulator-like APRR9 OS=Arabidopsis thaliana GN=APRR9 PE=2 SV=2 Back     alignment and function description
>sp|Q93WK5|APRR7_ARATH Two-component response regulator-like APRR7 OS=Arabidopsis thaliana GN=APRR7 PE=2 SV=1 Back     alignment and function description
>sp|Q689G9|PRR1_ORYSJ Two-component response regulator-like PRR1 OS=Oryza sativa subsp. japonica GN=PRR1 PE=2 SV=2 Back     alignment and function description
>sp|Q689G6|PRR95_ORYSJ Two-component response regulator-like PRR95 OS=Oryza sativa subsp. japonica GN=PRR95 PE=2 SV=1 Back     alignment and function description
>sp|Q6LA42|APRR5_ARATH Two-component response regulator-like APRR5 OS=Arabidopsis thaliana GN=APRR5 PE=1 SV=2 Back     alignment and function description
>sp|Q00858|CGPB_FUSSO Cutinase gene palindrome-binding protein OS=Fusarium solani subsp. pisi PE=2 SV=1 Back     alignment and function description
>sp|B0G188|GTAP_DICDI GATA zinc finger domain-containing protein 16 OS=Dictyostelium discoideum GN=gtaP PE=4 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query275
255572876324 GATA transcription factor, putative [Ric 0.850 0.722 0.662 6e-86
356552974307 PREDICTED: GATA transcription factor 25- 0.92 0.824 0.658 7e-86
356498865310 PREDICTED: GATA transcription factor 25- 0.901 0.8 0.657 5e-85
224067118290 predicted protein [Populus trichocarpa] 0.894 0.848 0.623 8e-85
224084544288 predicted protein [Populus trichocarpa] 0.909 0.868 0.637 9e-85
225459060294 PREDICTED: GATA transcription factor 25 0.901 0.843 0.593 2e-82
302398803306 GATA domain class transcription factor [ 0.883 0.794 0.619 5e-80
363808184286 uncharacterized protein LOC100784789 [Gl 0.825 0.793 0.656 1e-79
356510132309 PREDICTED: GATA transcription factor 25- 0.8 0.711 0.673 9e-79
449469795293 PREDICTED: GATA transcription factor 25- 0.952 0.894 0.560 4e-76
>gi|255572876|ref|XP_002527370.1| GATA transcription factor, putative [Ricinus communis] gi|223533289|gb|EEF35042.1| GATA transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  323 bits (827), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 165/249 (66%), Positives = 191/249 (76%), Gaps = 15/249 (6%)

Query: 1   MYGQSQSMNISSQMSGGGAAADEDDVSVAADD---HHLSYDPHSALENGIVVVEDVAHDS 57
           MY  SQSMN+  Q+        +D+  VAAD    HH+ Y+  +A   G+V+ +DV+HDS
Sbjct: 1   MYSHSQSMNVHHQIHTPTGPTGDDEDGVAADSIDHHHIRYEDGNA--TGVVLEDDVSHDS 58

Query: 58  GY---ATGGNELS-----NSSQLTLSFRGQVYVFDSVTPDKVQAVLLLLGGCELSSSPQG 109
            Y   +  G+EL+      SSQLTL+FRGQVYVFD+VTPDKVQAVLLLLGGCEL+S P G
Sbjct: 59  VYVPTSAAGSELAIQGNDVSSQLTLTFRGQVYVFDAVTPDKVQAVLLLLGGCELTSGPHG 118

Query: 110 MEVIPHSQR-GIADYPAKCTQPQRAASLDRFRQKRKERCFDKKVRYSVRQEVALRMQRNK 168
           +EV   +QR  + DYP +CTQPQRAASL+RFRQKRKER FDKKVRYSVRQEVALRMQRNK
Sbjct: 119 LEVASQNQRSAVVDYPGRCTQPQRAASLNRFRQKRKERNFDKKVRYSVRQEVALRMQRNK 178

Query: 169 GQFTSAKKCEGGALGWSNAQDPGQDDSPSETSCTHCGISSKSTPMMRRGPSGPRSLCNAC 228
           GQFTS+KK +G   GW   QD GQDDS  ETSCTHCG SSKSTPMMRRGPSGPRSLCNAC
Sbjct: 179 GQFTSSKKSDG-TYGWGGGQDSGQDDSQQETSCTHCGTSSKSTPMMRRGPSGPRSLCNAC 237

Query: 229 GLFWANKGA 237
           GLFWAN+G+
Sbjct: 238 GLFWANRGS 246




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356552974|ref|XP_003544834.1| PREDICTED: GATA transcription factor 25-like [Glycine max] Back     alignment and taxonomy information
>gi|356498865|ref|XP_003518268.1| PREDICTED: GATA transcription factor 25-like [Glycine max] Back     alignment and taxonomy information
>gi|224067118|ref|XP_002302364.1| predicted protein [Populus trichocarpa] gi|222844090|gb|EEE81637.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224084544|ref|XP_002307332.1| predicted protein [Populus trichocarpa] gi|222856781|gb|EEE94328.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225459060|ref|XP_002283753.1| PREDICTED: GATA transcription factor 25 [Vitis vinifera] Back     alignment and taxonomy information
>gi|302398803|gb|ADL36696.1| GATA domain class transcription factor [Malus x domestica] Back     alignment and taxonomy information
>gi|363808184|ref|NP_001242484.1| uncharacterized protein LOC100784789 [Glycine max] gi|255641630|gb|ACU21087.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356510132|ref|XP_003523794.1| PREDICTED: GATA transcription factor 25-like [Glycine max] Back     alignment and taxonomy information
>gi|449469795|ref|XP_004152604.1| PREDICTED: GATA transcription factor 25-like [Cucumis sativus] gi|449511422|ref|XP_004163952.1| PREDICTED: GATA transcription factor 25-like [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query275
TAIR|locus:2017582302 ZML2 "ZIM-LIKE 2" [Arabidopsis 0.647 0.589 0.562 2.3e-46
TAIR|locus:505006360297 ZML1 "ZIM-like 1" [Arabidopsis 0.672 0.622 0.532 1.3e-45
TAIR|locus:2062749183 AT2G46670 "AT2G46670" [Arabido 0.429 0.644 0.318 4.9e-08
TAIR|locus:2083388149 GATA15 "GATA transcription fac 0.229 0.422 0.397 1.1e-06
TAIR|locus:2044376468 PRR9 "pseudo-response regulato 0.429 0.252 0.325 1.2e-06
CGD|CAL0003020 688 GAT1 [Candida albicans (taxid: 0.356 0.142 0.327 2.4e-05
UNIPROTKB|Q5A432 688 GAT1 "Putative uncharacterized 0.356 0.142 0.327 2.4e-05
TAIR|locus:2155919139 GATA16 "GATA transcription fac 0.16 0.316 0.469 2.5e-05
TAIR|locus:2151206727 PRR7 "pseudo-response regulato 0.149 0.056 0.609 3.4e-05
CGD|CAL0005605442 orf19.1577 [Candida albicans ( 0.167 0.104 0.509 3.7e-05
TAIR|locus:2017582 ZML2 "ZIM-LIKE 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
 Identities = 108/192 (56%), Positives = 126/192 (65%)

Query:    63 GNELSNSSQLTLSFRGQVYVFDSVTPDKVQAVXXXXXXXXX-XXXPQGMEVIPHSQRGIA 121
             G+E     QLTLSF+GQVYVFDSV P+KVQAV             P G+   PH    ++
Sbjct:    77 GSE--QGDQLTLSFQGQVYVFDSVLPEKVQAVLLLLGGRELPQAAPPGLGS-PHQNNRVS 133

Query:   122 DYPA---KCTQPQRAASLDRFRQKRKERCFDKKVRYSVRQEVALRMQRNKGQFTSAKKC- 177
               P    + + PQR ASL RFR+KRK R FDKK+RY+VR+EVALRMQRNKGQFTSAK   
Sbjct:   134 SLPGTPQRFSIPQRLASLVRFREKRKGRNFDKKIRYTVRKEVALRMQRNKGQFTSAKSNN 193

Query:   178 -EGGALG--WSNAQDPGQDDSPS---ETSCTHCGISSKSTPMMRRGPSGPRSLCNACGLF 231
              E  + G  W + Q    + S +   E SC HCGI  KSTPMMRRGP+GPR+LCNACGL 
Sbjct:   194 DEAASAGSSWGSNQTWAIESSEAQHQEISCRHCGIGEKSTPMMRRGPAGPRTLCNACGLM 253

Query:   232 WANKGALRDLGK 243
             WANKGA RDL K
Sbjct:   254 WANKGAFRDLSK 265




GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA
GO:0008270 "zinc ion binding" evidence=IEA
GO:0043565 "sequence-specific DNA binding" evidence=IEA
TAIR|locus:505006360 ZML1 "ZIM-like 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2062749 AT2G46670 "AT2G46670" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2083388 GATA15 "GATA transcription factor 15" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2044376 PRR9 "pseudo-response regulator 9" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
CGD|CAL0003020 GAT1 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms
UNIPROTKB|Q5A432 GAT1 "Putative uncharacterized protein GAT1" [Candida albicans SC5314 (taxid:237561)] Back     alignment and assigned GO terms
TAIR|locus:2155919 GATA16 "GATA transcription factor 16" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2151206 PRR7 "pseudo-response regulator 7" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
CGD|CAL0005605 orf19.1577 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q9LRH6GAT25_ARATHNo assigned EC number0.52830.90180.8025yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00021024
hypothetical protein (291 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query275
pfam0620345 pfam06203, CCT, CCT motif 5e-18
smart0040152 smart00401, ZnF_GATA, zinc finger binding to DNA c 2e-13
cd0020254 cd00202, ZnF_GATA, Zinc finger DNA binding domain; 8e-13
pfam0032036 pfam00320, GATA, GATA zinc finger 2e-12
pfam0620036 pfam06200, tify, tify domain 3e-11
smart0097936 smart00979, TIFY, This short possible domain is fo 2e-10
>gnl|CDD|203407 pfam06203, CCT, CCT motif Back     alignment and domain information
 Score = 75.3 bits (186), Expect = 5e-18
 Identities = 21/44 (47%), Positives = 29/44 (65%)

Query: 132 RAASLDRFRQKRKERCFDKKVRYSVRQEVALRMQRNKGQFTSAK 175
           R A+L R+++KRK R FDKK+RY+ R+ VA    R KG+F    
Sbjct: 1   REAALLRYKEKRKTRKFDKKIRYASRKAVAESRPRVKGRFVKQS 44


This short motif is found in a number of plant proteins. It is rich in basic amino acids and has been called a CCT motif after Co, Col and Toc1. The CCT motif is about 45 amino acids long and contains a putative nuclear localisation signal within the second half of the CCT motif. Toc1 mutants have been identified in this region. Length = 45

>gnl|CDD|214648 smart00401, ZnF_GATA, zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
>gnl|CDD|238123 cd00202, ZnF_GATA, Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C Back     alignment and domain information
>gnl|CDD|109380 pfam00320, GATA, GATA zinc finger Back     alignment and domain information
>gnl|CDD|203405 pfam06200, tify, tify domain Back     alignment and domain information
>gnl|CDD|198047 smart00979, TIFY, This short possible domain is found in a variety of plant transcription factors that contain GATA domains as well as other motifs Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 275
PF0620036 tify: tify domain; InterPro: IPR010399 The tify do 99.7
cd0020254 ZnF_GATA Zinc finger DNA binding domain; binds spe 99.59
smart0040152 ZnF_GATA zinc finger binding to DNA consensus sequ 99.56
PF0032036 GATA: GATA zinc finger; InterPro: IPR000679 Zinc f 99.47
PF0942527 CCT_2: Divergent CCT motif; InterPro: IPR018467 Th 99.06
PF0620345 CCT: CCT motif; InterPro: IPR010402 The CCT (CONST 98.95
KOG1601340 consensus GATA-4/5/6 transcription factors [Transc 98.71
COG5641 498 GAT1 GATA Zn-finger-containing transcription facto 98.35
COG5641 498 GAT1 GATA Zn-finger-containing transcription facto 90.85
KOG3554 693 consensus Histone deacetylase complex, MTA1 compon 84.93
>PF06200 tify: tify domain; InterPro: IPR010399 The tify domain is a 36-amino acid domain only found among Embryophyta (land plants) Back     alignment and domain information
Probab=99.70  E-value=1.7e-17  Score=110.72  Aligned_cols=35  Identities=46%  Similarity=0.731  Sum_probs=32.6

Q ss_pred             CCCCcceeEEeCCEEEEecCCChHHHHHHHHHhcC
Q 023942           66 LSNSSQLTLSFRGQVYVFDSVTPDKVQAVLLLLGG  100 (275)
Q Consensus        66 ~~~~~QLTIfY~G~V~Vfd~V~~dKaq~Im~lA~~  100 (275)
                      .+.++||||||+|+|+|||+||+|||++||+||+.
T Consensus         2 ~~~~~qLTIfY~G~V~Vfd~v~~~Ka~~im~lA~r   36 (36)
T PF06200_consen    2 SPETAQLTIFYGGQVCVFDDVPPDKAQEIMLLASR   36 (36)
T ss_pred             CCCCCcEEEEECCEEEEeCCCCHHHHHHHHHHhcC
Confidence            36789999999999999999999999999999973



It has been named after the most conserved amino acid pattern (TIF[F/Y]XG) it contains, but was previously known as the Zim domain. As the use of uppercase characters (TIFY) might imply that the domain is fully conserved across proteins, a lowercase lettering has been chosen in an attempt to highlight the reality of its natural variability. Based on the domain architecture, tify domain containing proteins can be classified into two groups. Group I is formed by proteins possessing a CCT (CONSTANS, CO-like, and TOC1) domain and a GATA-type zinc finger in addition to the tify domain. Group II contains proteins characterised by the tify domain but lacking a GATA-type zinc finger. Tify domain containing proteins might be involved in developmental processes and some of them have features that are characteristic for transcription factors: a nuclear localisation and the presence of a putative DNA-binding domain []. Some proteins known to contain a tify domain include: Arabidopsis thaliana Zinc-finger protein expressed in Inflorescence Meristem (ZIM), a putative transcription factor involved in inflorescence and flower development [, ]. A. thaliana ZIM-like proteins (ZML) []. A. thaliana PEAPOD1 and PEAPOD2 (PPD1 and PPD2) [].

>cd00202 ZnF_GATA Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C Back     alignment and domain information
>smart00401 ZnF_GATA zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
>PF00320 GATA: GATA zinc finger; InterPro: IPR000679 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>PF09425 CCT_2: Divergent CCT motif; InterPro: IPR018467 The short CCT (CO, COL, TOC1) motif is found in a number of plant proteins, including Constans (CO), Constans-like (COL) and TOC1 Back     alignment and domain information
>PF06203 CCT: CCT motif; InterPro: IPR010402 The CCT (CONSTANS, CO-like, and TOC1) domain is a highly conserved basic module of ~43 amino acids, which is found near the C terminus of plant proteins often involved in light signal transduction Back     alignment and domain information
>KOG1601 consensus GATA-4/5/6 transcription factors [Transcription] Back     alignment and domain information
>COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription] Back     alignment and domain information
>COG5641 GAT1 GATA Zn-finger-containing transcription factor [Transcription] Back     alignment and domain information
>KOG3554 consensus Histone deacetylase complex, MTA1 component [Chromatin structure and dynamics] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query275
4gat_A66 Solution Nmr Structure Of The Wild Type Dna Binding 1e-04
7gat_A66 Solution Nmr Structure Of The L22v Mutant Dna Bindi 2e-04
2vus_I43 Crystal Structure Of Unliganded Nmra-Area Zinc Fing 2e-04
>pdb|4GAT|A Chain A, Solution Nmr Structure Of The Wild Type Dna Binding Domain Of Area Complexed To A 13bp Dna Containing A Cgata Site, Regularized Mean Structure Length = 66 Back     alignment and structure

Iteration: 1

Score = 43.9 bits (102), Expect = 1e-04, Method: Composition-based stats. Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 5/56 (8%) Query: 191 GQDDSPSETSCTHCGISSKSTPMMRRGPSGPRSLCNACGLFWANKGALRDLGKKME 246 G+ + P T+CT+C +++TP+ RR P G + LCNACGLF G +R L K + Sbjct: 4 GEQNGP--TTCTNC--FTQTTPLWRRNPEG-QPLCNACGLFLKLHGVVRPLSLKTD 54
>pdb|7GAT|A Chain A, Solution Nmr Structure Of The L22v Mutant Dna Binding Domain Of Area Complexed To A 13 Bp Dna Containing A Tgata Site, 34 Structures Length = 66 Back     alignment and structure
>pdb|2VUS|I Chain I, Crystal Structure Of Unliganded Nmra-Area Zinc Finger Complex Length = 43 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query275
2kae_A71 GATA-type transcription factor; zinc finger, GATA- 2e-11
1gnf_A46 Transcription factor GATA-1; zinc finger, transcri 3e-10
2vut_I43 AREA, nitrogen regulatory protein AREA; transcript 7e-09
4gat_A66 Nitrogen regulatory protein AREA; DNA binding prot 3e-08
3dfx_A63 Trans-acting T-cell-specific transcription factor 2e-07
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Length = 71 Back     alignment and structure
 Score = 57.6 bits (139), Expect = 2e-11
 Identities = 11/49 (22%), Positives = 16/49 (32%), Gaps = 2/49 (4%)

Query: 191 GQDDSPSETSCTHCGISSKSTPMMRRGPSGPRSLCNACGLFWANKGALR 239
           G   +     C++C      T   R   S     CNAC ++       R
Sbjct: 1   GSHMNKKSFQCSNCS--VTETIRWRNIRSKEGIQCNACFIYQRKYNKTR 47


>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Length = 46 Back     alignment and structure
>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Length = 43 Back     alignment and structure
>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Length = 66 Back     alignment and structure
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Length = 63 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query275
3dfx_A63 Trans-acting T-cell-specific transcription factor 99.73
2vut_I43 AREA, nitrogen regulatory protein AREA; transcript 99.72
1gnf_A46 Transcription factor GATA-1; zinc finger, transcri 99.72
4gat_A66 Nitrogen regulatory protein AREA; DNA binding prot 99.7
2kae_A71 GATA-type transcription factor; zinc finger, GATA- 99.63
4hc9_A115 Trans-acting T-cell-specific transcription factor; 99.56
4hc9_A115 Trans-acting T-cell-specific transcription factor; 99.47
3ogl_Q21 JAZ1 incomplete degron peptide; leucine-rich repea 98.89
3ogk_Q22 JAZ1 incomplete degron peptide; leucine rich repea 98.87
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Back     alignment and structure
Probab=99.73  E-value=1.5e-18  Score=127.91  Aligned_cols=55  Identities=33%  Similarity=0.624  Sum_probs=49.1

Q ss_pred             CCCCccceeecccCCCCCceecCCCCCCccchhhhhHHHhcCCCCCCCCCCCCCCCCc
Q 023942          195 SPSETSCTHCGISSKSTPMMRRGPSGPRSLCNACGLFWANKGALRDLGKKMEDQPLTP  252 (275)
Q Consensus       195 ~~~~~~C~~Cgtt~~~Tp~WRrGp~G~~~LCNACGl~~~~~~~~r~~~~k~~~~~~~~  252 (275)
                      ......|+||+++  .||+||+||+|+ +|||||||||++++..||++++++.+....
T Consensus         4 ~~~~~~C~~C~tt--~Tp~WR~gp~G~-~LCNACGl~~~~~~~~RP~~~~~~~i~~R~   58 (63)
T 3dfx_A            4 RRAGTSCANCQTT--TTTLWRRNANGD-PVCNACGLYYKLHNINRPLTMKKEGIQTRN   58 (63)
T ss_dssp             CCTTCCCTTTCCS--CCSSCCCCTTSC-CCCHHHHHHHHHHSSCCCGGGCCSSCCCCC
T ss_pred             CCCCCcCCCcCCC--CCCccCCCCCCC-chhhHHHHHHHHcCCCCCcCcCCCcccccc
Confidence            4567789999999  999999999997 999999999999999999999988666543



>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Back     alignment and structure
>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Back     alignment and structure
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 275
d2vuti142 g.39.1.1 (I:671-712) Erythroid transcription facto 8e-12
d1y0ja139 g.39.1.1 (A:200-238) Erythroid transcription facto 2e-10
d3gata_66 g.39.1.1 (A:) Erythroid transcription factor GATA- 6e-09
>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Length = 42 Back     information, alignment and structure

class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Emericella nidulans [TaxId: 162425]
 Score = 56.7 bits (137), Expect = 8e-12
 Identities = 21/43 (48%), Positives = 29/43 (67%), Gaps = 3/43 (6%)

Query: 199 TSCTHCGISSKSTPMMRRGPSGPRSLCNACGLFWANKGALRDL 241
           T+CT+C   +++TP+ RR P G + LCNACGLF    G +R L
Sbjct: 1   TTCTNCF--TQTTPLWRRNPEG-QPLCNACGLFLKLHGVVRPL 40


>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Length = 39 Back     information, alignment and structure
>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Length = 66 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query275
d2vuti142 Erythroid transcription factor GATA-1 {Emericella 99.77
d3gata_66 Erythroid transcription factor GATA-1 {Chicken (Ga 99.73
d1y0ja139 Erythroid transcription factor GATA-1 {Mouse (Mus 99.7
>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Back     information, alignment and structure
class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Emericella nidulans [TaxId: 162425]
Probab=99.77  E-value=3.5e-20  Score=124.52  Aligned_cols=41  Identities=49%  Similarity=1.009  Sum_probs=39.3

Q ss_pred             cceeecccCCCCCceecCCCCCCccchhhhhHHHhcCCCCCCCC
Q 023942          200 SCTHCGISSKSTPMMRRGPSGPRSLCNACGLFWANKGALRDLGK  243 (275)
Q Consensus       200 ~C~~Cgtt~~~Tp~WRrGp~G~~~LCNACGl~~~~~~~~r~~~~  243 (275)
                      .|+||+++  .||+||+||+| ++|||||||||++|+++||++|
T Consensus         2 ~C~nC~tt--~Tp~WRr~~~G-~~lCNACGl~~k~~g~~RP~sl   42 (42)
T d2vuti1           2 TCTNCFTQ--TTPLWRRNPEG-QPLCNACGLFLKLHGVVRPLSL   42 (42)
T ss_dssp             CCSSSCCC--CCSCCEECTTS-CEECHHHHHHHHHHSSCCCCCC
T ss_pred             cCCCCCCC--CCccceeCCCC-CCchhhhhHHHHHcCCCCCCCC
Confidence            69999998  99999999999 6999999999999999999986



>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Back     information, alignment and structure
>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure