Citrus Sinensis ID: 018255


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------36
MEEVEEANKIAVESCYRVLNLLSQHNKDPLQVHHQYRNLALETDEVVFKFKRVVSLLSHGTTRVRKLRKNYLKPTTPSNLPQNIFLDSPNTNLSKPILSPKPLQVIPPHDFFETTPSSSSIRREKLSTFNNNNNNNNNNRSPALQMATPSQSLQQFLYQQQIQRLQFQQQHMGMMMYSSGGNNGGINLKFDGSTSCCTPSNMTMSSTKSLISSLSLDGGSNNSFHLIGVPQHSSQISQHSKKRCFSSRAEDGSLKCSTSGKCHCSKRRKHRVKRSIKVPAISNKVADIPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEHNHSRIMQSSQSAHT
cHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHcccccEEccccccccccHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEccccccccccccHHHHcccccccccccccccEEEccccccccccccccccccccEEEEEEEccccccccccccccccc
cHHHHHHHHHHHHHHHHHHHHHHccccccccccccHccHHHHHHHHHHHHHHHHHHHccccccccEEccccccccccccccHHHHcccccccccccccccccccccccccccccccccccccccHcccccccccccccccccccccccccccccccccccEEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEEEEEEccccccccccccEEEcccEEcccccccccEEEEccccccccHHHHHHccccccEEEEEEcccccccccccccccccc
MEEVEEANKIAVESCYRVLNLLsqhnkdplqvHHQYRNLALETDEVVFKFKRVVSLLSHGTTRVRKLRKnylkpttpsnlpqnifldspntnlskpilspkplqvipphdffettpssssirREKLSTfnnnnnnnnnnrspalqmatpsQSLQQFLYQQQIQRLQFQQQHMGMMmyssggnngginlkfdgstscctpsnmtmsSTKSLISSlsldggsnnsfhligvpqhssqisqhskkrcfssraedgslkcstsgkchcskrrkhRVKRsikvpaisnkvadippdeytwrkygqkpikgsphprgyykcssvrgcparkhvercpeepsMLIVTYEgehnhsrimqssqsaht
MEEVEEANKIAVESCYRVLNLLSQHNKDPLQVHHQYRNLALETDEVVFKFKRVVsllshgttrvrklrknylkpttpsnlpqnifLDSPNTNLSKPILSPKPLQVIPPHDFFETTPSSSSIRREKLSTFNNNNNNNNNNRSPALQMATPSQSLQQFLYQQQIQRLQFQQQHMGMMMYSSGGNNGGINLKFDGSTSCCTPSNMTMSSTKSLISSLSLDGGSNNSFHLIGVPQHSSQISQHSKKRCFSsraedgslkcstsgkchcskrrkhrvkrsikvpaisnkvadippdeyTWRKYgqkpikgsphprgyykcSSVRGCParkhvercpeePSMLIVTYEGEHNhsrimqssqsaht
MEEVEEANKIAVESCYRVLNLLSQHNKDPLQVHHQYRNLALETDEVVFKFKRVVSLLSHGTTRVRKLRKNYLKPTTPSNLPQNIFLDSPNTNLSKPILSPKPLQVIPPHDFFETTPSSSSIRREKLSTFnnnnnnnnnnRSPALQMATPSqslqqflyqqqiqrlqfqqqHmgmmmyssggnnggINLKFDGSTSCCTPSNMTMsstkslisslsldggsnnsFHLIGVPqhssqisqhsKKRCFSSRAEDGSLKCSTSGKCHCSKRRKHRVKRSIKVPAISNKVADIPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEHNHSRIMQSSQSAHT
*********IAVESCYRVLNLLSQHNKDPLQVHHQYRNLALETDEVVFKFKRVVSLLSHGTTRVRKLRKNYL***********************************************************************************FLYQQQIQRLQFQ*****************INLKF*****************************************************************************************AISNKVADIPPDEYTWRKYGQ**********GYYKCSSVRGCPA*************LIVTY******************
MEEVEEANKIAVESCYRVL************************DEVVF***************************************************************************************************************************************************************************************************************************************************EYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEHN*************
********KIAVESCYRVLNLLSQHNKDPLQVHHQYRNLALETDEVVFKFKRVVSLLSHGTTRVRKLRKNYLKPTTPSNLPQNIFLDSPNTNLSKPILSPKPLQVIPPHDFFET********REKLSTFNNNNNNNNNNRSPALQMATPSQSLQQFLYQQQIQRLQFQQQHMGMMMYSSGGNNGGINLKFDGSTSCCTPSNMTMSSTKSLISSLSLDGGSNNSFHLIGVP********************************************SIKVPAISNKVADIPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEHNH************
*******************N*L************QYRNLALETDEVVFKFKRVVSLLSHGTTRVRKLRKN*************************************************SIRREKLSTFNNNNNNNNNNRSPALQMATPSQSLQQFLYQQQIQRLQFQQ****************************************************************************************************HRVKRSIKVPAISNKVADIPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGE***************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEEVEEANKIAVESCYRVLNLLSQHNKDPLQVHHQYRNLALETDEVVFKFKRVVSLLSHGTTRVRKLRKNYLKPTTPSNLPQNIFLDSPNTNLSKPILSPKPLQVIPPHDFFETTPSSSSIRREKLSTFNNNNNNNNNNRSPALQMATPSQSLQQFLYQQQIQRLQFQQQHMGMMMYSSGGNNGGINLKFDGSTSCCTPSNMTMSSTKSLISSLSLDGGSNNSFHLIGVPQHSSQISQHSKKRCFSSRAEDGSLKCSTSGKCHCSKRRKHRVKRSIKVPAISNKVADIPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEHNHSRIMQSSQSAHT
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query359 2.2.26 [Sep-21-2011]
Q93WU6330 Probable WRKY transcripti yes no 0.883 0.960 0.540 1e-87
Q9SR07330 Probable WRKY transcripti no no 0.852 0.927 0.501 5e-84
O04336380 Probable WRKY transcripti no no 0.451 0.426 0.676 3e-63
Q32SG4397 Protein WRKY1 OS=Zea mays N/A no 0.933 0.843 0.405 1e-59
Q9SV15325 Probable WRKY transcripti no no 0.766 0.846 0.365 2e-43
O22176317 Probable WRKY transcripti no no 0.314 0.356 0.669 1e-42
Q9SJA8321 Probable WRKY transcripti no no 0.298 0.333 0.681 3e-42
Q9STX0353 Probable WRKY transcripti no no 0.423 0.430 0.547 2e-41
O64747 427 Probable WRKY transcripti no no 0.222 0.187 0.550 2e-22
Q9LP56 259 Probable WRKY transcripti no no 0.256 0.355 0.525 3e-22
>sp|Q93WU6|WRK74_ARATH Probable WRKY transcription factor 74 OS=Arabidopsis thaliana GN=WRKY74 PE=2 SV=2 Back     alignment and function desciption
 Score =  323 bits (828), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 201/372 (54%), Positives = 243/372 (65%), Gaps = 55/372 (14%)

Query: 1   MEEVEEANKIAVESCYRVLNLLSQHNKDPLQVHHQYRNLALETDEVVFKFKRVVSLLSHG 60
           MEEVE ANK AVESC+ VLNLLSQ   D        +++ +ET E V KFKRV SLLS G
Sbjct: 1   MEEVEAANKAAVESCHGVLNLLSQQTNDS-------KSIMVETREAVCKFKRVSSLLSRG 53

Query: 61  TT--RVRKLRKNYLKPTTPSNLPQNIFLDSP---NTNLSK--PILSPKPLQVIPPHDFFE 113
               +++KL  N  K ++ S LPQ++FL+SP   N  +S   PIL+PKPLQ++P      
Sbjct: 54  LGQRKIKKLNNNNYKFSS-SLLPQHMFLESPVCSNNAISGCIPILAPKPLQIVPAGP--- 109

Query: 114 TTPSSSSIRREKLSTFNNNNNNNNNNRSPALQMATPSQSLQQFLYQQQIQRLQFQQQHMG 173
                       L  FN N   + +     L++  PS    + +  +  Q +   QQ   
Sbjct: 110 ----------PPLMLFNQNMCLDKS----FLELKPPS---SRAVDPKPYQFIHTHQQG-- 150

Query: 174 MMMYSSGGNNGGINLKFDGS--TSCCTPSNMTMSSTKSLISSLSLDGG----SNNSFHLI 227
             +YS   +  G+NLKFDGS   SC +PS    + ++S +SSLS+DG       NSFHLI
Sbjct: 151 --VYSR--SKSGLNLKFDGSIGASCYSPS--ISNGSRSFVSSLSMDGSVTDYDRNSFHLI 204

Query: 228 GVPQHSSQISQHSKKRCFSSRAEDGSLKCSTSGKCHCSKRRKHRVKRSIKVPAISNKVAD 287
           G+PQ S  ISQHS++   S     GSLKC +  KCHCSK+RK RVKRSIKVPAISNK+AD
Sbjct: 205 GLPQGSDHISQHSRRTSCS-----GSLKCGSKSKCHCSKKRKLRVKRSIKVPAISNKIAD 259

Query: 288 IPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEHNH 347
           IPPDEY+WRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERC EE SMLIVTYEGEHNH
Sbjct: 260 IPPDEYSWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCVEETSMLIVTYEGEHNH 319

Query: 348 SRIMQSSQSAHT 359
           SRI+ SSQSAHT
Sbjct: 320 SRIL-SSQSAHT 330




Transcription factor. Interacts specifically with the W box (5'-(T)TGAC[CT]-3'), a frequently occurring elicitor-responsive cis-acting element.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9SR07|WRK39_ARATH Probable WRKY transcription factor 39 OS=Arabidopsis thaliana GN=WRKY39 PE=2 SV=1 Back     alignment and function description
>sp|O04336|WRK21_ARATH Probable WRKY transcription factor 21 OS=Arabidopsis thaliana GN=WRKY21 PE=2 SV=1 Back     alignment and function description
>sp|Q32SG4|WRKY1_MAIZE Protein WRKY1 OS=Zea mays PE=1 SV=1 Back     alignment and function description
>sp|Q9SV15|WRK11_ARATH Probable WRKY transcription factor 11 OS=Arabidopsis thaliana GN=WRKY11 PE=2 SV=2 Back     alignment and function description
>sp|O22176|WRK15_ARATH Probable WRKY transcription factor 15 OS=Arabidopsis thaliana GN=WRKY15 PE=2 SV=1 Back     alignment and function description
>sp|Q9SJA8|WRK17_ARATH Probable WRKY transcription factor 17 OS=Arabidopsis thaliana GN=WRKY17 PE=2 SV=2 Back     alignment and function description
>sp|Q9STX0|WRKY7_ARATH Probable WRKY transcription factor 7 OS=Arabidopsis thaliana GN=WRKY7 PE=1 SV=1 Back     alignment and function description
>sp|O64747|WRK35_ARATH Probable WRKY transcription factor 35 OS=Arabidopsis thaliana GN=WRKY35 PE=2 SV=1 Back     alignment and function description
>sp|Q9LP56|WRK65_ARATH Probable WRKY transcription factor 65 OS=Arabidopsis thaliana GN=WRKY65 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query359
259121415354 WRKY transcription factor 25 [(Populus t 0.933 0.946 0.621 1e-104
224123454355 predicted protein [Populus trichocarpa] 0.938 0.949 0.620 1e-104
224131226353 predicted protein [Populus trichocarpa] 0.930 0.946 0.605 1e-100
296087892304 unnamed protein product [Vitis vinifera] 0.830 0.980 0.586 6e-97
225463956362 PREDICTED: probable WRKY transcription f 0.935 0.928 0.571 2e-96
350540816342 WRKY8, partial [Panax quinquefolius] 0.930 0.976 0.548 2e-88
224082746347 predicted protein [Populus trichocarpa] 0.910 0.942 0.534 4e-87
255559953353 WRKY transcription factor, putative [Ric 0.908 0.923 0.512 4e-87
206574999341 WRKY74-1 transcription factor [Brassica 0.905 0.953 0.538 1e-86
15241875330 putative WRKY transcription factor 74 [A 0.883 0.960 0.540 7e-86
>gi|259121415|gb|ACV92027.1| WRKY transcription factor 25 [(Populus tomentosa x P. bolleana) x P. tomentosa] Back     alignment and taxonomy information
 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 235/378 (62%), Positives = 272/378 (71%), Gaps = 43/378 (11%)

Query: 1   MEEVEEANKIAVESCYRVLNLLSQHNKDPLQVHHQYRNLALETDEVVFKFKRVVSLLSHG 60
           MEEVEEA+K A+ESC RV+ LL Q  KD +Q      NL +ET E VFKFKRV+SLLS G
Sbjct: 1   MEEVEEAHKAAIESCNRVIGLLCQE-KDQVQ----GSNLMVETRETVFKFKRVISLLSTG 55

Query: 61  TT--RVRKLRKNYLKPTTPSNLPQNIFLDSPNTNLSKPILSPKPLQVIPPHDFFET---- 114
               RVRK++K  L+P+    LPQNIFLDSPN    K ILSPKPLQ++PP +F ET    
Sbjct: 56  LGHGRVRKMKK--LRPS----LPQNIFLDSPNC---KTILSPKPLQMVPP-NFLETPLTD 105

Query: 115 --TPSSSSIRREKLSTFNNNNNNNNNNRSPALQM--ATPSQSLQQFLYQQQIQRLQFQQQ 170
               S  SI+  +     N     N+N SP +Q+    P Q+ Q     QQ+QR+ FQQQ
Sbjct: 106 MDAKSKPSIQISQKMFLENPVLELNSNISPPVQIMQTKPPQNFQLVQQHQQVQRMHFQQQ 165

Query: 171 HMGMM-----MYSSGGNNGGINLKFDGSTSCCTPSNMTMSSTKSLISSLSLDGGSNN--- 222
              M      +YS   +NGGINLKFDGST  CTP   TMSST+S ISSLS+DG  +N   
Sbjct: 166 QQQMKYQADRVYSR--SNGGINLKFDGST--CTP---TMSSTRSFISSLSMDGAVSNFDG 218

Query: 223 -SFHLIGVPQHSSQISQHSKKRCFSSRAEDGSLKCSTSGKCHCSKRRKHRVKRSIKVPAI 281
            SFHLIG+P  S  ISQ +++RC S R EDG+ KCS+SGKCHCSKRRK RVKRSIKVPAI
Sbjct: 219 DSFHLIGMPHSSDHISQQTRRRC-SGRGEDGNAKCSSSGKCHCSKRRKLRVKRSIKVPAI 277

Query: 282 SNKVADIPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTY 341
           SNKVADIPPDEY+WRKYGQKPIKGSPHPRGYYKCSS+RGCPARKHVERC E+PSMLIVTY
Sbjct: 278 SNKVADIPPDEYSWRKYGQKPIKGSPHPRGYYKCSSLRGCPARKHVERCLEDPSMLIVTY 337

Query: 342 EGEHNHSRIMQSSQSAHT 359
           EGEHNHSR++ SSQSAHT
Sbjct: 338 EGEHNHSRLI-SSQSAHT 354




Source: (Populus tomentosa x P. bolleana) x P. tomentosa

Species: (Populus tomentosa x P. bolleana) x P. tomentosa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224123454|ref|XP_002319082.1| predicted protein [Populus trichocarpa] gi|222857458|gb|EEE95005.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224131226|ref|XP_002328486.1| predicted protein [Populus trichocarpa] gi|222838201|gb|EEE76566.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|296087892|emb|CBI35175.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|225463956|ref|XP_002270614.1| PREDICTED: probable WRKY transcription factor 74-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|350540816|gb|AEQ29021.1| WRKY8, partial [Panax quinquefolius] Back     alignment and taxonomy information
>gi|224082746|ref|XP_002306823.1| predicted protein [Populus trichocarpa] gi|222856272|gb|EEE93819.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255559953|ref|XP_002520995.1| WRKY transcription factor, putative [Ricinus communis] gi|223539832|gb|EEF41412.1| WRKY transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|206574999|gb|ACI14408.1| WRKY74-1 transcription factor [Brassica napus] Back     alignment and taxonomy information
>gi|15241875|ref|NP_198217.1| putative WRKY transcription factor 74 [Arabidopsis thaliana] gi|332278119|sp|Q93WU6.2|WRK74_ARATH RecName: Full=Probable WRKY transcription factor 74; AltName: Full=WRKY DNA-binding protein 74 gi|17064168|gb|AAL35291.1|AF442398_1 WRKY transcription factor 74 [Arabidopsis thaliana] gi|332006441|gb|AED93824.1| putative WRKY transcription factor 74 [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query359
TAIR|locus:2148815330 WRKY74 "WRKY DNA-binding prote 0.467 0.509 0.646 1.2e-77
TAIR|locus:2084943330 WRKY39 "WRKY DNA-binding prote 0.459 0.5 0.644 3.7e-74
TAIR|locus:2064377380 WRKY21 "WRKY DNA-binding prote 0.451 0.426 0.580 3.3e-63
UNIPROTKB|Q32SG4397 Q32SG4 "Protein WRKY1" [Zea ma 0.456 0.413 0.556 2.3e-58
TAIR|locus:2125239325 WRKY11 "WRKY DNA-binding prote 0.325 0.36 0.652 4.2e-47
TAIR|locus:2046653321 WRKY17 "WRKY DNA-binding prote 0.320 0.358 0.644 1.6e-45
TAIR|locus:2058568317 WRKY15 "WRKY DNA-binding prote 0.300 0.340 0.688 5.3e-45
TAIR|locus:2135967353 WRKY7 "WRKY DNA-binding protei 0.284 0.288 0.705 2.3e-44
TAIR|locus:2029919 259 WRKY65 "WRKY DNA-binding prote 0.256 0.355 0.525 2.9e-23
TAIR|locus:2044777 427 WRKY35 "WRKY DNA-binding prote 0.228 0.192 0.540 2.1e-22
TAIR|locus:2148815 WRKY74 "WRKY DNA-binding protein 74" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 574 (207.1 bits), Expect = 1.2e-77, Sum P(2) = 1.2e-77
 Identities = 115/178 (64%), Positives = 126/178 (70%)

Query:   186 INLKFDGS--TSCCTPS--NMTMXXXXXXXXXXXXXXXXXXXFHLIGVPXXXXXXXXXXK 241
             +NLKFDGS   SC +PS  N +                    FHLIG+P          +
Sbjct:   159 LNLKFDGSIGASCYSPSISNGSRSFVSSLSMDGSVTDYDRNSFHLIGLPQGSDHISQHSR 218

Query:   242 KRCFSSRAEDGSLKCSTSGKCHCSKRRKHRVKRSIKVPAISNKVADIPPDEYTWRKYGQK 301
             +   S     GSLKC +  KCHCSK+RK RVKRSIKVPAISNK+ADIPPDEY+WRKYGQK
Sbjct:   219 RTSCS-----GSLKCGSKSKCHCSKKRKLRVKRSIKVPAISNKIADIPPDEYSWRKYGQK 273

Query:   302 PIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEHNHSRIMQSSQSAHT 359
             PIKGSPHPRGYYKCSSVRGCPARKHVERC EE SMLIVTYEGEHNHSRI+ SSQSAHT
Sbjct:   274 PIKGSPHPRGYYKCSSVRGCPARKHVERCVEETSMLIVTYEGEHNHSRIL-SSQSAHT 330


GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM;IC
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA;ISS
GO:0043565 "sequence-specific DNA binding" evidence=IEA
GO:0005516 "calmodulin binding" evidence=IDA
TAIR|locus:2084943 WRKY39 "WRKY DNA-binding protein 39" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2064377 WRKY21 "WRKY DNA-binding protein 21" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q32SG4 Q32SG4 "Protein WRKY1" [Zea mays (taxid:4577)] Back     alignment and assigned GO terms
TAIR|locus:2125239 WRKY11 "WRKY DNA-binding protein 11" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2046653 WRKY17 "WRKY DNA-binding protein 17" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2058568 WRKY15 "WRKY DNA-binding protein 15" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2135967 WRKY7 "WRKY DNA-binding protein 7" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2029919 WRKY65 "WRKY DNA-binding protein 65" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2044777 WRKY35 "WRKY DNA-binding protein 35" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q93WU6WRK74_ARATHNo assigned EC number0.54030.88300.9606yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query359
smart0077459 smart00774, WRKY, DNA binding domain 2e-34
pfam0310660 pfam03106, WRKY, WRKY DNA -binding domain 2e-32
pfam1053347 pfam10533, Plant_zn_clust, Plant zinc cluster doma 4e-19
>gnl|CDD|214815 smart00774, WRKY, DNA binding domain Back     alignment and domain information
 Score =  120 bits (304), Expect = 2e-34
 Identities = 36/59 (61%), Positives = 46/59 (77%)

Query: 289 PPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEHNH 347
             D Y WRKYGQK IKGSP+PR YY+C+  +GCPA+K V+R  ++PS++ VTYEGEH H
Sbjct: 1   LDDGYQWRKYGQKVIKGSPYPRSYYRCTYTQGCPAKKQVQRSDDDPSVVEVTYEGEHTH 59


The WRKY domain is a DNA binding domain found in one or two copies in a superfamily of plant transcription factors. These transcription factors are involved in the regulation of various physiological programs that are unique to plants, including pathogen defense, senescence and trichome development. The domain is a 60 amino acid region that is defined by the conserved amino acid sequence WRKYGQK at its N-terminal end, together with a novel zinc-finger-like motif. It binds specifically to the DNA sequence motif (T)(T)TGAC(C/T), which is known as the W box. The invariant TGAC core is essential for function and WRKY binding. Length = 59

>gnl|CDD|145969 pfam03106, WRKY, WRKY DNA -binding domain Back     alignment and domain information
>gnl|CDD|192617 pfam10533, Plant_zn_clust, Plant zinc cluster domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 359
smart0077459 WRKY DNA binding domain. The WRKY domain is a DNA 99.96
PF0310660 WRKY: WRKY DNA -binding domain; InterPro: IPR00365 99.96
PF1053347 Plant_zn_clust: Plant zinc cluster domain; InterPr 99.31
PF0450062 FLYWCH: FLYWCH zinc finger domain; InterPro: IPR00 89.95
PF0310191 FAR1: FAR1 DNA-binding domain; InterPro: IPR004330 89.38
>smart00774 WRKY DNA binding domain Back     alignment and domain information
Probab=99.96  E-value=1.2e-30  Score=199.45  Aligned_cols=59  Identities=61%  Similarity=1.217  Sum_probs=56.8

Q ss_pred             CCCCccccccCCCCCCCCCCCCccccccCCCCCCcccceeecCCCCceEEEEEeccCCC
Q 018255          289 PPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEHNH  347 (359)
Q Consensus       289 p~Dgy~WRKYGQK~Ikgs~~PRsYYRCs~~~gC~ArK~Vqr~~dDp~~~~VtY~GeH~h  347 (359)
                      ++|||.|||||||.|+|+++||+||||++..||+|+|+|||+++|+.+++|||+|+|||
T Consensus         1 ~~DGy~WRKYGQK~ikgs~~pRsYYrCt~~~~C~a~K~Vq~~~~d~~~~~vtY~g~H~h   59 (59)
T smart00774        1 LDDGYQWRKYGQKVIKGSPFPRSYYRCTYSQGCPAKKQVQRSDDDPSVVEVTYEGEHTH   59 (59)
T ss_pred             CCCcccccccCcEecCCCcCcceEEeccccCCCCCcccEEEECCCCCEEEEEEeeEeCC
Confidence            47999999999999999999999999997689999999999999999999999999998



The WRKY domain is a DNA binding domain found in one or two copies in a superfamily of plant transcription factors. These transcription factors are involved in the regulation of various physiological programs that are unique to plants, including pathogen defense, senescence and trichome development. The domain is a 60 amino acid region that is defined by the conserved amino acid sequence WRKYGQK at its N-terminal end, together with a novel zinc-finger-like motif. It binds specifically to the DNA sequence motif (T)(T)TGAC(C/T), which is known as the W box. The invariant TGAC core is essential for function and WRKY binding.

>PF03106 WRKY: WRKY DNA -binding domain; InterPro: IPR003657 The WRKY domain is a 60 amino acid region that is defined by the conserved amino acid sequence WRKYGQK at its N-terminal end, together with a novel zinc-finger- like motif Back     alignment and domain information
>PF10533 Plant_zn_clust: Plant zinc cluster domain; InterPro: IPR018872 This zinc binding domain is found associated with the WRKY domain IPR003657 from INTERPRO [] Back     alignment and domain information
>PF04500 FLYWCH: FLYWCH zinc finger domain; InterPro: IPR007588 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>PF03101 FAR1: FAR1 DNA-binding domain; InterPro: IPR004330 Phytochrome A is the primary photoreceptor for mediating various far-red light-induced responses in higher plants Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query359
2ayd_A76 Crystal Structure Of The C-Terminal Wrky Domainof A 1e-16
1wj2_A78 Solution Structure Of The C-Terminal Wrky Domain Of 4e-16
>pdb|2AYD|A Chain A, Crystal Structure Of The C-Terminal Wrky Domainof Atwrky1, An Sa-Induced And Partially Npr1-Dependent Transcription Factor Length = 76 Back     alignment and structure

Iteration: 1

Score = 84.0 bits (206), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 38/67 (56%), Positives = 48/67 (71%), Gaps = 1/67 (1%) Query: 282 SNKVADIPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTY 341 + + DI D Y WRKYGQK +KGSP+PR YY+CSS GCP +KHVER + +LI TY Sbjct: 7 TQTLFDIVNDGYRWRKYGQKSVKGSPYPRSYYRCSSP-GCPVKKHVERSSHDTKLLITTY 65 Query: 342 EGEHNHS 348 EG+H+H Sbjct: 66 EGKHDHD 72
>pdb|1WJ2|A Chain A, Solution Structure Of The C-Terminal Wrky Domain Of Atwrky4 Length = 78 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query359
2ayd_A76 WRKY transcription factor 1; beta strands, zinc fi 2e-35
1wj2_A78 Probable WRKY transcription factor 4; DNA-binding 1e-34
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 3e-06
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-04
>2ayd_A WRKY transcription factor 1; beta strands, zinc finger; 1.60A {Arabidopsis thaliana} Length = 76 Back     alignment and structure
 Score =  122 bits (309), Expect = 2e-35
 Identities = 38/71 (53%), Positives = 50/71 (70%), Gaps = 1/71 (1%)

Query: 277 KVPAISNKVADIPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSM 336
           ++   +  + DI  D Y WRKYGQK +KGSP+PR YY+CSS  GCP +KHVER   +  +
Sbjct: 2   RIVVHTQTLFDIVNDGYRWRKYGQKSVKGSPYPRSYYRCSS-PGCPVKKHVERSSHDTKL 60

Query: 337 LIVTYEGEHNH 347
           LI TYEG+H+H
Sbjct: 61  LITTYEGKHDH 71


>1wj2_A Probable WRKY transcription factor 4; DNA-binding domain, zinc-binding, structural genomics; NMR {Arabidopsis thaliana} SCOP: g.79.1.1 PDB: 2lex_A* Length = 78 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query359
2ayd_A76 WRKY transcription factor 1; beta strands, zinc fi 99.97
1wj2_A78 Probable WRKY transcription factor 4; DNA-binding 99.97
>2ayd_A WRKY transcription factor 1; beta strands, zinc finger; 1.60A {Arabidopsis thaliana} Back     alignment and structure
Probab=99.97  E-value=1.8e-32  Score=217.53  Aligned_cols=65  Identities=58%  Similarity=1.139  Sum_probs=62.3

Q ss_pred             ccCCCCCCccccccCCCCCCCCCCCCccccccCCCCCCcccceeecCCCCceEEEEEeccCCCCCC
Q 018255          285 VADIPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEHNHSRI  350 (359)
Q Consensus       285 ~~dip~Dgy~WRKYGQK~Ikgs~~PRsYYRCs~~~gC~ArK~Vqr~~dDp~~~~VtY~GeH~h~~~  350 (359)
                      ..++++|||.|||||||.|||++|||+||||++ .||+|+|+|||+++|+.+++|||+|+|||+.|
T Consensus        10 ~~~~~~DGy~WRKYGQK~ikgs~~PRsYYrCt~-~gC~a~K~Ver~~~d~~~~~~tY~G~H~H~~p   74 (76)
T 2ayd_A           10 LFDIVNDGYRWRKYGQKSVKGSPYPRSYYRCSS-PGCPVKKHVERSSHDTKLLITTYEGKHDHDMP   74 (76)
T ss_dssp             SSSCCCCSSCEEEEEEECCTTCSSCEEEEEECS-TTCCCEEEEEECSSSTTEEEEEEESCCSSCCC
T ss_pred             CCCcCCCCchhhhCcccccCCCCCceeEeEcCC-CCCCceeeEEEECCCCCEEEEEEccCcCCCCC
Confidence            468899999999999999999999999999995 79999999999999999999999999999987



>1wj2_A Probable WRKY transcription factor 4; DNA-binding domain, zinc-binding, structural genomics; NMR {Arabidopsis thaliana} SCOP: g.79.1.1 PDB: 2lex_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 359
d1wj2a_71 g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cr 6e-28
>d1wj2a_ g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 71 Back     information, alignment and structure

class: Small proteins
fold: WRKY DNA-binding domain
superfamily: WRKY DNA-binding domain
family: WRKY DNA-binding domain
domain: WRKY DNA-binding protein 4
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score =  102 bits (255), Expect = 6e-28
 Identities = 35/64 (54%), Positives = 45/64 (70%), Gaps = 1/64 (1%)

Query: 286 ADIPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEH 345
            D+  D Y WRKYGQK +KG+P+PR YYKC++  GC  RKHVER   +P  ++ TYEG+H
Sbjct: 7   VDLLDDGYRWRKYGQKVVKGNPYPRSYYKCTT-PGCGVRKHVERAATDPKAVVTTYEGKH 65

Query: 346 NHSR 349
           NH  
Sbjct: 66  NHDL 69


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query359
d1wj2a_71 WRKY DNA-binding protein 4 {Thale cress (Arabidops 99.97
>d1wj2a_ g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: Small proteins
fold: WRKY DNA-binding domain
superfamily: WRKY DNA-binding domain
family: WRKY DNA-binding domain
domain: WRKY DNA-binding protein 4
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.97  E-value=1.3e-32  Score=214.11  Aligned_cols=64  Identities=55%  Similarity=1.101  Sum_probs=61.2

Q ss_pred             cCCCCCCccccccCCCCCCCCCCCCccccccCCCCCCcccceeecCCCCceEEEEEeccCCCCCC
Q 018255          286 ADIPPDEYTWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERCPEEPSMLIVTYEGEHNHSRI  350 (359)
Q Consensus       286 ~dip~Dgy~WRKYGQK~Ikgs~~PRsYYRCs~~~gC~ArK~Vqr~~dDp~~~~VtY~GeH~h~~~  350 (359)
                      .++++|||.|||||||.|+|++|||+||||++ .||+|+|+|||+++|+.+++|||+|+|||+.|
T Consensus         7 ~~~~dDGy~WRKYGQK~ikgs~~pRsYYrCt~-~~C~a~K~Vqr~~~d~~~~~vtY~G~H~h~~P   70 (71)
T d1wj2a_           7 VDLLDDGYRWRKYGQKVVKGNPYPRSYYKCTT-PGCGVRKHVERAATDPKAVVTTYEGKHNHDLP   70 (71)
T ss_dssp             CCCCCSSSCBCCCEEECCTTCSSCEEEEEEEC-SSCEEEEEEEEETTTTSEEEEEEESCCSSCCC
T ss_pred             cccCCCCcEecccCceeccCCCCceEEEEccc-cCCCCcceEEEEcCCCCEEEEEEeeEeCCCCC
Confidence            47889999999999999999999999999995 79999999999999999999999999999876