Citrus Sinensis ID: 015162
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 412 | ||||||
| 302398871 | 440 | HSF domain class transcription factor [M | 0.978 | 0.915 | 0.483 | 1e-107 | |
| 225446710 | 442 | PREDICTED: heat stress transcription fac | 0.978 | 0.911 | 0.494 | 1e-103 | |
| 255569843 | 464 | Heat shock factor protein, putative [Ric | 0.927 | 0.823 | 0.471 | 6e-96 | |
| 224131376 | 443 | predicted protein [Populus trichocarpa] | 0.961 | 0.893 | 0.459 | 1e-94 | |
| 147856184 | 409 | hypothetical protein VITISV_017449 [Viti | 0.769 | 0.775 | 0.554 | 6e-90 | |
| 302143480 | 348 | unnamed protein product [Vitis vinifera] | 0.825 | 0.977 | 0.482 | 3e-89 | |
| 255562528 | 402 | Heat shock factor protein, putative [Ric | 0.929 | 0.952 | 0.470 | 9e-88 | |
| 224114609 | 406 | predicted protein [Populus trichocarpa] | 0.922 | 0.935 | 0.428 | 9e-82 | |
| 224076924 | 407 | predicted protein [Populus trichocarpa] | 0.941 | 0.953 | 0.426 | 2e-80 | |
| 356525313 | 402 | PREDICTED: heat stress transcription fac | 0.956 | 0.980 | 0.430 | 8e-79 |
| >gi|302398871|gb|ADL36730.1| HSF domain class transcription factor [Malus x domestica] | Back alignment and taxonomy information |
|---|
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 216/447 (48%), Positives = 286/447 (63%), Gaps = 44/447 (9%)
Query: 1 MDESQGQGSSNSPAPFLIKTYEMVDDPLTNSLVSWSQSGGSFIVWNPPDFAKDLLPLYFK 60
M+ SQG GS +PAPFL KTY++VDDP +N +VSW++SG SF+VW+P +FAK++LP+YFK
Sbjct: 1 MEGSQG-GSGGAPAPFLTKTYDLVDDPSSNHMVSWTESGSSFVVWDPTEFAKEMLPMYFK 59
Query: 61 HNNFSSFVRQLNTYGFRKIDPDQWEFANEEFIRGQRHLLKNIHRRKPVHSHSTQS---PT 117
HNNFSSFVRQLNTYGFRKIDP+QWEFANEEF+RG RHLLKNIHRRKP+HSHS Q+
Sbjct: 60 HNNFSSFVRQLNTYGFRKIDPEQWEFANEEFLRGGRHLLKNIHRRKPIHSHSMQNHEYTV 119
Query: 118 PITETERNEFEAKIERLKKENSLLQLELRRRESGNHEFEYHVQSVRERLRYLENRQTQMI 177
+++TER E+E KI RL + SLL+LEL+R + N EFE+ VQ +RE+L+ +ENRQ Q
Sbjct: 120 ALSDTEREEYEKKISRLNHDKSLLELELQRHQRENQEFEFQVQILREQLQNMENRQKQYT 179
Query: 178 ACLANLLKKPGFSSALNQQAEVHSKKRRLLRPTTIEYLDDVKSASLSVNEQKENPDATFD 237
A LA L++KPGF+S L QQ+E+HSKKRRLL + DD L++N QKEN +
Sbjct: 180 AFLAQLVQKPGFASVLVQQSEIHSKKRRLLNSNN--FPDDFGMEGLNLNPQKENLGSI-- 235
Query: 238 PSPALNLEQIDRLESSVDFWTFFLCGIGEAFNQDVYDFGVLRPVSAVTDDEIEE-GVSSE 296
+P + L+Q++ +ESS++FW FL GIGEA +DV D G L S + E ++ ++S
Sbjct: 236 STPIIKLDQLETMESSLNFWEDFLHGIGEAMPEDVNDIGPLSQASPIIVTETQDTSLNSR 295
Query: 297 EHDGLSNVSSPKSMDVHSSPKQAASSENPVDGASAT------------------------ 332
S++SSP SM+ HS P+ AA S N +D S T
Sbjct: 296 PCSPRSHLSSPNSMNAHSFPEVAAGSANILDILSITSMFPTVDFRSKSSGIDMNSKPDSA 355
Query: 333 ----EDETANSSVKN----RANDQFWEQFLTETPAAAASSSARPESRDPESGSTYVNKPA 384
++ VKN +AND FWEQ LTETP A + E RD + ++ N
Sbjct: 356 PALAKEMVVELEVKNAEPTKANDVFWEQCLTETPGLADALEVHSERRDGDGKASDANP-- 413
Query: 385 EADSKKFWWHSNNLDTLTRHFGNLTSA 411
A KK WW+++N+D T G LT A
Sbjct: 414 -AIQKKHWWNTDNVDNFTNQIGRLTPA 439
|
Source: Malus x domestica Species: Malus x domestica Genus: Malus Family: Rosaceae Order: Rosales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|225446710|ref|XP_002278037.1| PREDICTED: heat stress transcription factor A-4b [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255569843|ref|XP_002525885.1| Heat shock factor protein, putative [Ricinus communis] gi|223534799|gb|EEF36489.1| Heat shock factor protein, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|224131376|ref|XP_002321069.1| predicted protein [Populus trichocarpa] gi|222861842|gb|EEE99384.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|147856184|emb|CAN80282.1| hypothetical protein VITISV_017449 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|302143480|emb|CBI22041.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255562528|ref|XP_002522270.1| Heat shock factor protein, putative [Ricinus communis] gi|223538523|gb|EEF40128.1| Heat shock factor protein, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|224114609|ref|XP_002316809.1| predicted protein [Populus trichocarpa] gi|222859874|gb|EEE97421.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|224076924|ref|XP_002305052.1| predicted protein [Populus trichocarpa] gi|222848016|gb|EEE85563.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|356525313|ref|XP_003531269.1| PREDICTED: heat stress transcription factor A-4a-like [Glycine max] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 412 | ||||||
| TAIR|locus:2117139 | 401 | HSF A4A "AT4G18880" [Arabidops | 0.949 | 0.975 | 0.385 | 3.2e-65 | |
| TAIR|locus:2172003 | 345 | RHA1 "AT5G45710" [Arabidopsis | 0.531 | 0.634 | 0.474 | 4.2e-59 | |
| TAIR|locus:2129276 | 466 | AT-HSFA5 "AT4G13980" [Arabidop | 0.487 | 0.431 | 0.461 | 1.8e-54 | |
| TAIR|locus:2075447 | 468 | HSFA1E "AT3G02990" [Arabidopsi | 0.584 | 0.514 | 0.414 | 3.7e-52 | |
| TAIR|locus:2005495 | 495 | HSF1 "AT4G17750" [Arabidopsis | 0.266 | 0.222 | 0.684 | 5e-50 | |
| TAIR|locus:2149050 | 481 | HSF3 "AT5G16820" [Arabidopsis | 0.475 | 0.407 | 0.481 | 2.6e-49 | |
| TAIR|locus:2028326 | 485 | HSFA1D "AT1G32330" [Arabidopsi | 0.759 | 0.645 | 0.368 | 2.3e-48 | |
| TAIR|locus:2144603 | 412 | HSFA3 "AT5G03720" [Arabidopsis | 0.444 | 0.444 | 0.478 | 1.4e-44 | |
| TAIR|locus:2057371 | 345 | HSFA2 "AT2G26150" [Arabidopsis | 0.546 | 0.652 | 0.424 | 9.3e-43 | |
| TAIR|locus:2083690 | 272 | HSFA7A "AT3G51910" [Arabidopsi | 0.439 | 0.665 | 0.430 | 2.2e-40 |
| TAIR|locus:2117139 HSF A4A "AT4G18880" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 664 (238.8 bits), Expect = 3.2e-65, P = 3.2e-65
Identities = 161/418 (38%), Positives = 226/418 (54%)
Query: 1 MDESQGQGSSNSPAPFLIKTYEMVDDPLTNSLVSWSQSGGSFIVWNPPDFAKDLLPLYFK 60
MDE+ SS+S PFL KTYEMVDD ++S+VSWSQS SFIVWNPP+F++DLLP +FK
Sbjct: 1 MDENNHGVSSSSLPPFLTKTYEMVDDSSSDSIVSWSQSNKSFIVWNPPEFSRDLLPRFFK 60
Query: 61 HNNFSSFVRQLNTYGFRKIDPDQWEFANEEFIRGQRHLLKNIHRRKPVHSHST---QSP- 116
HNNFSSF+RQLNTYGFRK DP+QWEFAN++F+RGQ HL+KNIHRRKPVHSHS Q+
Sbjct: 61 HNNFSSFIRQLNTYGFRKADPEQWEFANDDFVRGQPHLMKNIHRRKPVHSHSLPNLQAQL 120
Query: 117 TPITETERNEFEAKIERLKKENSXXXXXXXXXXSGNHEFEYHVQSVRERLRYLENRQTQM 176
P+T++ER +IERL KE FE V+ ++ERL+++E RQ M
Sbjct: 121 NPLTDSERVRMNNQIERLTKEKEGLLEELHKQDEEREVFEMQVKELKERLQHMEKRQKTM 180
Query: 177 IACLANLLKKPGFSSALNQQA-EVHSKKRRLLRPTTIEYLDDVKSASLSVNEQKENPDAT 235
++ ++ +L+KPG + L+ E + +KRR R IE+ D + + +
Sbjct: 181 VSFVSQVLEKPGLALNLSPCVPETNERKRRFPR---IEFFPDEPMLEENKTCVVVREEGS 237
Query: 236 FDPSPALNLEQIDRLESSVDFWTFFLCGIGEAFNQDVYDFGVLRPVSAVTDDEIEEGVSS 295
PS Q+++LESS+ W E D + +L+ S +T D ++E +
Sbjct: 238 TSPSSHTREHQVEQLESSIAIW--------ENLVSDSCE-SMLQSRSMMTLD-VDESSTF 287
Query: 296 EEHDGLSNVSSPKSMDVHSSPKQAASSEN-PVDGASATEDETANSSVKNR-ANDQFWEQF 353
E LS + + S P N DG+ A AND FW+QF
Sbjct: 288 PESPPLSCIQLSVDSRLKSPPSPRIIDMNCEPDGSKEQNTVAAPPPPPVAGANDGFWQQF 347
Query: 354 LTETXXXXXXXXXXXXXXXXXXGSTYVNKPAEADSKKFWWHSNNLDTLTRHFGNLTSA 411
+E +K A ++K WW+S N++ +T G+LTS+
Sbjct: 348 FSENPGSTEQREVQLERKDDK------DK-AGVRTEKCWWNSRNVNAITEQLGHLTSS 398
|
|
| TAIR|locus:2172003 RHA1 "AT5G45710" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2129276 AT-HSFA5 "AT4G13980" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2075447 HSFA1E "AT3G02990" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2005495 HSF1 "AT4G17750" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2149050 HSF3 "AT5G16820" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2028326 HSFA1D "AT1G32330" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2144603 HSFA3 "AT5G03720" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2057371 HSFA2 "AT2G26150" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2083690 HSFA7A "AT3G51910" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| GSVIVG00018812001 | SubName- Full=Chromosome chr12 scaffold_18, whole genome shotgun sequence; (442 aa) | |||||||
(Vitis vinifera) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 412 | |||
| pfam00447 | 96 | pfam00447, HSF_DNA-bind, HSF-type DNA-binding | 1e-51 | |
| smart00415 | 105 | smart00415, HSF, heat shock factor | 6e-51 | |
| COG5169 | 282 | COG5169, HSF1, Heat shock transcription factor [Tr | 5e-29 |
| >gnl|CDD|215924 pfam00447, HSF_DNA-bind, HSF-type DNA-binding | Back alignment and domain information |
|---|
Score = 167 bits (426), Expect = 1e-51
Identities = 56/95 (58%), Positives = 70/95 (73%), Gaps = 2/95 (2%)
Query: 15 PFLIKTYEMVDDPLTNSLVSWSQSGGSFIVWNPPDFAKDLLPLYFKHNNFSSFVRQLNTY 74
PFL K YE+++DP + ++SWS+ G SFI+W+P +FAK +LP YFKHNNFSSFVRQLN Y
Sbjct: 1 PFLHKLYEILEDPSYDDIISWSEDGNSFIIWDPEEFAKQVLPKYFKHNNFSSFVRQLNMY 60
Query: 75 GFRKIDPD--QWEFANEEFIRGQRHLLKNIHRRKP 107
GF K+ D WEFA+ F RGQ+ LL I RRK
Sbjct: 61 GFHKVKDDRNGWEFAHPNFKRGQKELLSKIKRRKS 95
|
Length = 96 |
| >gnl|CDD|214654 smart00415, HSF, heat shock factor | Back alignment and domain information |
|---|
| >gnl|CDD|227497 COG5169, HSF1, Heat shock transcription factor [Transcription] | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 412 | |||
| KOG0627 | 304 | consensus Heat shock transcription factor [Transcr | 100.0 | |
| smart00415 | 105 | HSF heat shock factor. | 100.0 | |
| PF00447 | 103 | HSF_DNA-bind: HSF-type DNA-binding; InterPro: IPR0 | 100.0 | |
| COG5169 | 282 | HSF1 Heat shock transcription factor [Transcriptio | 99.98 | |
| PF00178 | 85 | Ets: Ets-domain; InterPro: IPR000418 Transcription | 95.3 | |
| smart00413 | 87 | ETS erythroblast transformation specific domain. v | 93.25 | |
| COG3074 | 79 | Uncharacterized protein conserved in bacteria [Fun | 92.77 | |
| KOG3806 | 177 | consensus Predicted transcription factor [Transcri | 89.89 | |
| PRK15422 | 79 | septal ring assembly protein ZapB; Provisional | 89.79 | |
| PF11932 | 251 | DUF3450: Protein of unknown function (DUF3450); In | 89.04 | |
| PF06005 | 72 | DUF904: Protein of unknown function (DUF904); Inte | 88.27 | |
| PF12325 | 120 | TMF_TATA_bd: TATA element modulatory factor 1 TATA | 86.99 | |
| PF04340 | 225 | DUF484: Protein of unknown function, DUF484; Inter | 86.85 | |
| PF02183 | 45 | HALZ: Homeobox associated leucine zipper; InterPro | 86.56 | |
| PF10473 | 140 | CENP-F_leu_zip: Leucine-rich repeats of kinetochor | 86.4 | |
| PF06005 | 72 | DUF904: Protein of unknown function (DUF904); Inte | 86.04 | |
| PF10168 | 717 | Nup88: Nuclear pore component; InterPro: IPR019321 | 85.09 | |
| PF08826 | 61 | DMPK_coil: DMPK coiled coil domain like; InterPro: | 82.0 | |
| PF06156 | 107 | DUF972: Protein of unknown function (DUF972); Inte | 81.83 | |
| TIGR02894 | 161 | DNA_bind_RsfA transcription factor, RsfA family. I | 81.53 | |
| PF10779 | 71 | XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemoly | 81.38 | |
| PF04111 | 314 | APG6: Autophagy protein Apg6; InterPro: IPR007243 | 80.8 |
| >KOG0627 consensus Heat shock transcription factor [Transcription] | Back alignment and domain information |
|---|
Probab=100.00 E-value=3.1e-44 Score=355.05 Aligned_cols=192 Identities=44% Similarity=0.780 Sum_probs=172.3
Q ss_pred CCCCCCCCCchHHHHHHhhcCCCCCCceEEcCCCCeEEEeCCchhhhhccccccCCCCchhhhhccccccccccC--CCC
Q 015162 6 GQGSSNSPAPFLIKTYEMVDDPLTNSLVSWSQSGGSFIVWNPPDFAKDLLPLYFKHNNFSSFVRQLNTYGFRKID--PDQ 83 (412)
Q Consensus 6 ~~~~~~~~~~Fl~KLy~mv~d~~~~~iI~W~~~G~sFiI~d~~~F~~~vLp~~Fkh~nfsSFvRQLN~YGF~Kv~--~~~ 83 (412)
|+.....+++|+.|||.||+||++++||+|+++|.+|||||+.+|++.+||+||||+||+|||||||+||||||+ +++
T Consensus 5 ~~~~~~~~~~Fl~K~y~~v~Dps~~~iisWs~~g~sFvv~d~~~F~~~~Lp~~FKh~NfsSFvRQLN~YgFrKv~~~~~~ 84 (304)
T KOG0627|consen 5 GLSEASGPPPFLEKLYEMVEDPSTDEIISWSPSGNSFVIWNPEEFAKVLLPLYFKHNNFSSFVRQLNMYGFRKVDFKSDR 84 (304)
T ss_pred CccccCCCCcHHHHHHHHhcCCCCCCceEECCCCCccccCCHHHHHHHHhHHhccccCccceeeeecccceeecCCCCCc
Confidence 345566899999999999999999999999999999999999999999999999999999999999999999999 999
Q ss_pred cccccccccccccccccccccccCCCCCCC---CC---CCCC-----ChhhhhhHHHHHHHHHHHhHHHHHHHHHHhhcc
Q 015162 84 WEFANEEFIRGQRHLLKNIHRRKPVHSHST---QS---PTPI-----TETERNEFEAKIERLKKENSLLQLELRRRESGN 152 (412)
Q Consensus 84 ~eF~h~~F~Rg~~~lL~~I~Rr~~~~s~~~---~~---~~~~-----t~~e~~~l~~eI~~LK~d~~~L~~EL~rL~qqq 152 (412)
|||+|++|+||+++||++|+|||+.+.... .. ...+ ....+..+..++.+|+++++.|+.++.+|++++
T Consensus 85 wEF~n~~F~rg~~~LL~~I~rrk~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~lr~~~ 164 (304)
T KOG0627|consen 85 WEFSNPCFVRGQKLLLKNIKRRKSASRIFQTKDSPKSFERQLNLYGFVKIRQLNLKESAKSLSKENEVLQRELVELRQQQ 164 (304)
T ss_pred eeecChhHhcChHHHHHHHhhhccccCCcccccCcchhhhhhhHHHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhhH
Confidence 999999999999999999999998776543 11 0001 112244578899999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCchhhhhhhhh
Q 015162 153 HEFEYHVQSVRERLRYLENRQTQMIACLANLLKKPGFSSALNQQA 197 (412)
Q Consensus 153 q~l~~qvq~L~erL~~~e~~Qq~mlsfL~~vlq~p~f~~~L~q~~ 197 (412)
..+..+++.+.+++...+.+|+.|+.|++++++.|+|+.++.+..
T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 209 (304)
T KOG0627|consen 165 DALRATIQTSKRVVKSKETRNSLILSFLARDVQSPGFLNQAPQRQ 209 (304)
T ss_pred HHHHHHHHhhccccCchhhHHHHHhhHHHhhccCccchhcccchh
Confidence 999999999999999999999999999999999999998876544
|
|
| >smart00415 HSF heat shock factor | Back alignment and domain information |
|---|
| >PF00447 HSF_DNA-bind: HSF-type DNA-binding; InterPro: IPR000232 Heat shock factor (HSF) is a transcriptional activator of heat shock genes []: it binds specifically to heat shock promoter elements, which are palindromic sequences rich with repetitive purine and pyrimidine motifs [] | Back alignment and domain information |
|---|
| >COG5169 HSF1 Heat shock transcription factor [Transcription] | Back alignment and domain information |
|---|
| >PF00178 Ets: Ets-domain; InterPro: IPR000418 Transcription factors are protein molecules that bind to specific DNA sequences in the genome, resulting in the induction or inhibition of gene transcription [] | Back alignment and domain information |
|---|
| >smart00413 ETS erythroblast transformation specific domain | Back alignment and domain information |
|---|
| >COG3074 Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
| >KOG3806 consensus Predicted transcription factor [Transcription] | Back alignment and domain information |
|---|
| >PRK15422 septal ring assembly protein ZapB; Provisional | Back alignment and domain information |
|---|
| >PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function | Back alignment and domain information |
|---|
| >PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation | Back alignment and domain information |
|---|
| >PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes [] | Back alignment and domain information |
|---|
| >PF04340 DUF484: Protein of unknown function, DUF484; InterPro: IPR007435 This family consists of several proteins of uncharacterised function | Back alignment and domain information |
|---|
| >PF02183 HALZ: Homeobox associated leucine zipper; InterPro: IPR003106 This region is a plant specific leucine zipper that is always found associated with a homeobox [] | Back alignment and domain information |
|---|
| >PF10473 CENP-F_leu_zip: Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; InterPro: IPR019513 Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway [, ] | Back alignment and domain information |
|---|
| >PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation | Back alignment and domain information |
|---|
| >PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs | Back alignment and domain information |
|---|
| >PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure | Back alignment and domain information |
|---|
| >PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication | Back alignment and domain information |
|---|
| >TIGR02894 DNA_bind_RsfA transcription factor, RsfA family | Back alignment and domain information |
|---|
| >PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [] | Back alignment and domain information |
|---|
| >PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() | |
| Query | 412 | ||||
| 2ldu_A | 125 | Solution Nmr Structure Of Heat Shock Factor Protein | 1e-22 | ||
| 1hks_A | 106 | Solution Structure Of The Dna-Binding Domain Of Dro | 3e-18 | ||
| 3hts_B | 102 | Heat Shock Transcription FactorDNA COMPLEX Length = | 7e-14 | ||
| 1fym_A | 92 | Serendipitous Crystal Structure Containing The Heat | 6e-13 | ||
| 1fbu_A | 90 | Heat Shock Transcription Factor Dna Binding Domain | 7e-13 | ||
| 1fbs_A | 90 | Heat Shock Transcription Factor Dna Binding Domain | 4e-12 | ||
| 1fbq_A | 90 | Heat Shock Transcription Factor Dna Binding Domain | 4e-12 | ||
| 1fyk_A | 92 | Serendipitous Crystal Structure Containing The Heat | 4e-12 |
| >pdb|2LDU|A Chain A, Solution Nmr Structure Of Heat Shock Factor Protein 1 Dna Binding Domain From Homo Sapiens, Northeast Structural Genomics Consortium Target Hr3023c Length = 125 | Back alignment and structure |
|
| >pdb|1HKS|A Chain A, Solution Structure Of The Dna-Binding Domain Of Drosophila Heat Shock Transcription Factor Length = 106 | Back alignment and structure |
| >pdb|3HTS|B Chain B, Heat Shock Transcription FactorDNA COMPLEX Length = 102 | Back alignment and structure |
| >pdb|1FYM|A Chain A, Serendipitous Crystal Structure Containing The Heat Shock Transcription Factor's Dna Binding Domain And Cognate Dna In A Tail-To-Tail Orientation Length = 92 | Back alignment and structure |
| >pdb|1FBU|A Chain A, Heat Shock Transcription Factor Dna Binding Domain Length = 90 | Back alignment and structure |
| >pdb|1FBS|A Chain A, Heat Shock Transcription Factor Dna Binding Domain Containing The P237a Mutation Length = 90 | Back alignment and structure |
| >pdb|1FBQ|A Chain A, Heat Shock Transcription Factor Dna Binding Domain Containing The P237k Mutation Length = 90 | Back alignment and structure |
| >pdb|1FYK|A Chain A, Serendipitous Crystal Structure Containing The Heat Shock Transcription Factor's Dna Binding Domain And Cognate Dna That Is Translationally Disordered Length = 92 | Back alignment and structure |
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 412 | |||
| 1hks_A | 106 | Heat-shock transcription factor; transcription reg | 1e-52 | |
| 2ldu_A | 125 | Heat shock factor protein 1; structural genomics, | 2e-52 | |
| 3hts_B | 102 | Heat shock transcription factor; transcription reg | 4e-43 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 3e-05 |
| >1hks_A Heat-shock transcription factor; transcription regulation; NMR {Drosophila melanogaster} SCOP: a.4.5.22 PDB: 1hkt_A Length = 106 | Back alignment and structure |
|---|
Score = 170 bits (432), Expect = 1e-52
Identities = 46/105 (43%), Positives = 62/105 (59%), Gaps = 10/105 (9%)
Query: 11 NSPAPFLIKTYEMVDDPLTNSLVSWSQSGGSFIVWNPPDFAKDLLPLYFKHNNFSSFVRQ 70
+ FL K + +VDD TN L+ W++ G SF++ N FAK+LLPL +KHNN +SF+RQ
Sbjct: 2 SGVPAFLAKLWRLVDDADTNRLICWTKDGQSFVIQNQAQFAKELLPLNYKHNNMASFIRQ 61
Query: 71 LNTYGFRKI----------DPDQWEFANEEFIRGQRHLLKNIHRR 105
LN YGF KI D D+ EF++ F R LL I R+
Sbjct: 62 LNMYGFHKITSIDNGGLRFDRDEIEFSHPFFKRNSPFLLDQIKRK 106
|
| >2ldu_A Heat shock factor protein 1; structural genomics, northeast structural genomics consortiu DNA-binding, PSI-biology, protein structure initiative; NMR {Homo sapiens} Length = 125 | Back alignment and structure |
|---|
| >3hts_B Heat shock transcription factor; transcription regulation, DNA-binding protein, complex (WING helix_TURN_ helix-DNA); 1.75A {Kluyveromyces lactis} SCOP: a.4.5.22 PDB: 2hts_A 1fyk_A* 1fym_A 1fyl_A 3hsf_A 1fbu_A 1fbs_A 1fbq_A Length = 102 | Back alignment and structure |
|---|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 412 | |||
| 1hks_A | 106 | Heat-shock transcription factor; transcription reg | 100.0 | |
| 2ldu_A | 125 | Heat shock factor protein 1; structural genomics, | 100.0 | |
| 3hts_B | 102 | Heat shock transcription factor; transcription reg | 99.97 | |
| 2dao_A | 118 | Transcription factor ETV6; ETS domain, structural | 92.31 | |
| 2jee_A | 81 | YIIU; FTSZ, septum, coiled-coil, cell division, ce | 91.81 | |
| 1wwx_A | 107 | E74-like factor 5 ESE-2B; DNA binding, transcripti | 90.64 | |
| 1awc_A | 110 | Protein (GA binding protein alpha); complex (trans | 89.61 | |
| 1fli_A | 98 | FLI-1; transcription/DNA; NMR {Homo sapiens} SCOP: | 89.49 | |
| 1bc8_C | 93 | SAP-1, protein (SAP-1 ETS domain); DNA-binding dom | 88.9 | |
| 2ypr_A | 102 | Protein FEV; transcription; 2.64A {Homo sapiens} P | 87.86 | |
| 4avp_A | 106 | ETS translocation variant 1; transcription, transc | 87.45 | |
| 1gvj_A | 146 | C-ETS-1 protein, P54; transcription, autoinhibitio | 85.98 | |
| 1wt6_A | 81 | Myotonin-protein kinase; coiled-coil, kinase activ | 84.1 | |
| 2nny_A | 171 | C-ETS-1 protein, P54; protein-DNA complex, transcr | 83.54 | |
| 2jee_A | 81 | YIIU; FTSZ, septum, coiled-coil, cell division, ce | 82.68 | |
| 1yo5_C | 97 | SAM pointed domain containing ETS transcription fa | 81.27 | |
| 1deb_A | 54 | APC protein, adenomatous polyposis coli protein; c | 80.98 |
| >1hks_A Heat-shock transcription factor; transcription regulation; NMR {Drosophila melanogaster} SCOP: a.4.5.22 PDB: 1hkt_A | Back alignment and structure |
|---|
Probab=100.00 E-value=1.2e-39 Score=276.63 Aligned_cols=95 Identities=47% Similarity=0.840 Sum_probs=91.3
Q ss_pred CCCCchHHHHHHhhcCCCCCCceEEcCCCCeEEEeCCchhhhhccccccCCCCchhhhhccccccccccC----------
Q 015162 11 NSPAPFLIKTYEMVDDPLTNSLVSWSQSGGSFIVWNPPDFAKDLLPLYFKHNNFSSFVRQLNTYGFRKID---------- 80 (412)
Q Consensus 11 ~~~~~Fl~KLy~mv~d~~~~~iI~W~~~G~sFiI~d~~~F~~~vLp~~Fkh~nfsSFvRQLN~YGF~Kv~---------- 80 (412)
+++|+||.|||+||+||++++||+|+++|++|||+|+.+|+++|||+||||+||+|||||||+|||||+.
T Consensus 2 ~~~p~F~~KL~~mv~d~~~~~iI~W~~~G~sFvI~d~~~F~~~vLp~yFkh~n~~SFvRQLN~YGF~Kv~~~~~~~~~~~ 81 (106)
T 1hks_A 2 SGVPAFLAKLWRLVDDADTNRLICWTKDGQSFVIQNQAQFAKELLPLNYKHNNMASFIRQLNMYGFHKITSIDNGGLRFD 81 (106)
T ss_dssp TTCCTTHHHHHHHHSSSTTTTTSEESTTTSCEECSCCSTTTTTTSTTTTSCCCHHHHHHHHHHHCCCCSSCSSSCCSSCT
T ss_pred CCcCcHHHHHHHHhcCCCCCCEEEEeCCCCEEEECCHHHHHHHHhHHhcCCCcHHHHHHhhhcCCCeEEecccccCccCC
Confidence 4678999999999999999999999999999999999999999999999999999999999999999984
Q ss_pred CCCcccccccccccccccccccccc
Q 015162 81 PDQWEFANEEFIRGQRHLLKNIHRR 105 (412)
Q Consensus 81 ~~~~eF~h~~F~Rg~~~lL~~I~Rr 105 (412)
++.|+|+|++|+||+|+||.+|+||
T Consensus 82 ~~~~ef~h~~F~Rg~~~LL~~IkRk 106 (106)
T 1hks_A 82 RDEIEFSHPFFKRNSPFLLDQIKRK 106 (106)
T ss_dssp TSTTEECCTTCCSSCTTSTTTCCCC
T ss_pred CCceEEECcCccCcCHHHHhhCcCC
Confidence 4789999999999999999999996
|
| >2ldu_A Heat shock factor protein 1; structural genomics, northeast structural genomics consortiu DNA-binding, PSI-biology, protein structure initiative; NMR {Homo sapiens} | Back alignment and structure |
|---|
| >3hts_B Heat shock transcription factor; transcription regulation, DNA-binding protein, complex (WING helix_TURN_ helix-DNA); 1.75A {Kluyveromyces lactis} SCOP: a.4.5.22 PDB: 2hts_A 1fyk_A* 1fym_A 1fyl_A 3hsf_A 1fbu_A 1fbs_A 1fbq_A | Back alignment and structure |
|---|
| >2dao_A Transcription factor ETV6; ETS domain, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} | Back alignment and structure |
|---|
| >2jee_A YIIU; FTSZ, septum, coiled-coil, cell division, cell cycle, hypothetical protein; 2.8A {Escherichia coli} | Back alignment and structure |
|---|
| >1wwx_A E74-like factor 5 ESE-2B; DNA binding, transcriptional activation and repression, structural genomics; NMR {Homo sapiens} SCOP: a.4.5.21 | Back alignment and structure |
|---|
| >1awc_A Protein (GA binding protein alpha); complex (transcription regulation/DNA), DNA-binding, nuclear protein, ETS domain, ankyrin repeats; HET: DNA BRU CBR; 2.15A {Mus musculus} SCOP: a.4.5.21 | Back alignment and structure |
|---|
| >1fli_A FLI-1; transcription/DNA; NMR {Homo sapiens} SCOP: a.4.5.21 | Back alignment and structure |
|---|
| >1bc8_C SAP-1, protein (SAP-1 ETS domain); DNA-binding domain, winged helix-turn-helix, DNA-binding specificity, transcription/DNA complex; HET: DNA; 1.93A {Homo sapiens} SCOP: a.4.5.21 PDB: 1bc7_C* 1k6o_A 1dux_C* | Back alignment and structure |
|---|
| >2ypr_A Protein FEV; transcription; 2.64A {Homo sapiens} PDB: 1fli_A | Back alignment and structure |
|---|
| >4avp_A ETS translocation variant 1; transcription, transcriptional activation and repression, DN binding protein, E twenty-SIX, erwing sarcoma; 1.82A {Homo sapiens} PDB: 4b06_A | Back alignment and structure |
|---|
| >1gvj_A C-ETS-1 protein, P54; transcription, autoinhibition, ETS domain; 1.53A {Homo sapiens} SCOP: a.4.5.21 PDB: 1md0_A 1r36_A 1k78_B 1k79_A* 1k7a_A* 2stt_A* 2stw_A* | Back alignment and structure |
|---|
| >1wt6_A Myotonin-protein kinase; coiled-coil, kinase activation, DMPK, molecular replacement, transferase; 1.60A {Homo sapiens} | Back alignment and structure |
|---|
| >2nny_A C-ETS-1 protein, P54; protein-DNA complex, transcription/DNA complex; 2.58A {Homo sapiens} PDB: 3ri4_A 3mfk_A 1mdm_B | Back alignment and structure |
|---|
| >2jee_A YIIU; FTSZ, septum, coiled-coil, cell division, cell cycle, hypothetical protein; 2.8A {Escherichia coli} | Back alignment and structure |
|---|
| >1yo5_C SAM pointed domain containing ETS transcription factor; protein-DNA complex, double helix, transcription/DNA complex; 2.00A {Homo sapiens} SCOP: a.4.5.21 | Back alignment and structure |
|---|
| >1deb_A APC protein, adenomatous polyposis coli protein; coiled coil, tumor suppressor, structural protein; 2.40A {Homo sapiens} SCOP: h.1.18.1 | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 412 | ||||
| d1hksa_ | 106 | a.4.5.22 (A:) Heat-shock transcription factor {Dro | 2e-39 | |
| d2htsa_ | 92 | a.4.5.22 (A:) Heat-shock transcription factor {Mil | 3e-32 |
| >d1hksa_ a.4.5.22 (A:) Heat-shock transcription factor {Drosophila melanogaster [TaxId: 7227]} Length = 106 | Back information, alignment and structure |
|---|
class: All alpha proteins fold: DNA/RNA-binding 3-helical bundle superfamily: "Winged helix" DNA-binding domain family: Heat-shock transcription factor domain: Heat-shock transcription factor species: Drosophila melanogaster [TaxId: 7227]
Score = 134 bits (339), Expect = 2e-39
Identities = 46/105 (43%), Positives = 62/105 (59%), Gaps = 10/105 (9%)
Query: 11 NSPAPFLIKTYEMVDDPLTNSLVSWSQSGGSFIVWNPPDFAKDLLPLYFKHNNFSSFVRQ 70
+ FL K + +VDD TN L+ W++ G SF++ N FAK+LLPL +KHNN +SF+RQ
Sbjct: 2 SGVPAFLAKLWRLVDDADTNRLICWTKDGQSFVIQNQAQFAKELLPLNYKHNNMASFIRQ 61
Query: 71 LNTYGFRKI----------DPDQWEFANEEFIRGQRHLLKNIHRR 105
LN YGF KI D D+ EF++ F R LL I R+
Sbjct: 62 LNMYGFHKITSIDNGGLRFDRDEIEFSHPFFKRNSPFLLDQIKRK 106
|
| >d2htsa_ a.4.5.22 (A:) Heat-shock transcription factor {Milk yeast (Kluyveromyces lactis) [TaxId: 28985]} Length = 92 | Back information, alignment and structure |
|---|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 412 | |||
| d1hksa_ | 106 | Heat-shock transcription factor {Drosophila melano | 100.0 | |
| d2htsa_ | 92 | Heat-shock transcription factor {Milk yeast (Kluyv | 99.95 | |
| d1awca_ | 110 | GA binding protein (GABP) alpha {Mouse (Mus muscul | 91.92 | |
| d1pp7u_ | 114 | 39 kda initiator binding protein, IBP39, N-termina | 91.37 | |
| d1flia_ | 98 | Fli-1 {Human (Homo sapiens) [TaxId: 9606]} | 91.27 | |
| d1duxc_ | 86 | Elk-1 {Human (Homo sapiens) [TaxId: 9606]} | 89.57 | |
| d1gvja_ | 142 | ETS-1 transcription factor, residues 331-440 {Huma | 89.31 | |
| d1hbxg_ | 155 | Serum response factor accessory protein 1a, SAP-1 | 88.0 | |
| d1puee_ | 88 | Transcription factor PU.1, residues 171-259 {Mouse | 84.47 |
| >d1hksa_ a.4.5.22 (A:) Heat-shock transcription factor {Drosophila melanogaster [TaxId: 7227]} | Back information, alignment and structure |
|---|
class: All alpha proteins fold: DNA/RNA-binding 3-helical bundle superfamily: "Winged helix" DNA-binding domain family: Heat-shock transcription factor domain: Heat-shock transcription factor species: Drosophila melanogaster [TaxId: 7227]
Probab=100.00 E-value=2.3e-36 Score=254.29 Aligned_cols=95 Identities=47% Similarity=0.840 Sum_probs=90.3
Q ss_pred CCCCchHHHHHHhhcCCCCCCceEEcCCCCeEEEeCCchhhhhccccccCCCCchhhhhccccccccccC----------
Q 015162 11 NSPAPFLIKTYEMVDDPLTNSLVSWSQSGGSFIVWNPPDFAKDLLPLYFKHNNFSSFVRQLNTYGFRKID---------- 80 (412)
Q Consensus 11 ~~~~~Fl~KLy~mv~d~~~~~iI~W~~~G~sFiI~d~~~F~~~vLp~~Fkh~nfsSFvRQLN~YGF~Kv~---------- 80 (412)
+++|+|+.|||+||+||++++||+|+++|++|||+|+.+|++.|||+||+|++|+||+||||+|||+||.
T Consensus 2 ~~vp~F~~KL~~ml~~~~~~~iI~W~~~G~~fiI~d~~~f~~~vLp~~Fk~~~~~SF~RQLn~yGF~kv~~~~~~~~~~~ 81 (106)
T d1hksa_ 2 SGVPAFLAKLWRLVDDADTNRLICWTKDGQSFVIQNQAQFAKELLPLNYKHNNMASFIRQLNMYGFHKITSIDNGGLRFD 81 (106)
T ss_dssp TTCCTTHHHHHHHHSSSTTTTTSEESTTTSCEECSCCSTTTTTTSTTTTSCCCHHHHHHHHHHHCCCCSSCSSSCCSSCT
T ss_pred CCCCcHHHHHHHHHhCcCCCCEEEEcCCCCEEEEcCHHHHHHHHhhhhccccCHHHHHHHHHHCCCEEeeccccccccCC
Confidence 5688999999999999999999999999999999999999999999999999999999999999999984
Q ss_pred CCCcccccccccccccccccccccc
Q 015162 81 PDQWEFANEEFIRGQRHLLKNIHRR 105 (412)
Q Consensus 81 ~~~~eF~h~~F~Rg~~~lL~~I~Rr 105 (412)
++.++|+|++|+||+|+||..|+||
T Consensus 82 ~~~~~f~h~~F~Rg~p~ll~~IkRr 106 (106)
T d1hksa_ 82 RDEIEFSHPFFKRNSPFLLDQIKRK 106 (106)
T ss_dssp TSTTEECCTTCCSSCTTSTTTCCCC
T ss_pred CCccEEECccccCCCHHHHhhCcCC
Confidence 3568999999999999999999986
|
| >d2htsa_ a.4.5.22 (A:) Heat-shock transcription factor {Milk yeast (Kluyveromyces lactis) [TaxId: 28985]} | Back information, alignment and structure |
|---|
| >d1awca_ a.4.5.21 (A:) GA binding protein (GABP) alpha {Mouse (Mus musculus) [TaxId: 10090]} | Back information, alignment and structure |
|---|
| >d1pp7u_ a.4.5.44 (U:) 39 kda initiator binding protein, IBP39, N-terminal domain {Trichomonas vaginalis [TaxId: 5722]} | Back information, alignment and structure |
|---|
| >d1flia_ a.4.5.21 (A:) Fli-1 {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
| >d1duxc_ a.4.5.21 (C:) Elk-1 {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
| >d1gvja_ a.4.5.21 (A:) ETS-1 transcription factor, residues 331-440 {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
| >d1hbxg_ a.4.5.21 (G:) Serum response factor accessory protein 1a, SAP-1 {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
| >d1puee_ a.4.5.21 (E:) Transcription factor PU.1, residues 171-259 {Mouse (Mus musculus) [TaxId: 10090]} | Back information, alignment and structure |
|---|