Citrus sinensis ID: 016623


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence
Secondary Structure (PSIPRED)
Secondary Structure Prediction (SSPRO)
Coil and Loop (DISEMBL)
Flexible Loop (DISEMBL)
Low Complexity Region (SEG)
Disordered Region (IsUnstruct)
Disordered Region (DISOPRED)
Disordered Region (DISEMBL)
Disordered Region (DISPRO)
Transmembrane Helix (TMHMM)
Transmembrane Helix (HMMTOP)
Transmembrane Helix (MEMSAT)
TM Helix, Signal Peptide (MEMSAT_SVM)
TM Helix, Signal Peptide (Phobius)
Signal Peptide (SignalP HMM Mode)
Signal Peptide (SignalP NN Mode)
Coiled Coils (COILS)
Positional Conservation
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380------
MVPRQFIGLGPSAETDHEVSNCSSDEERTLSGTPPNIVEAASKEHVNSNGKNEIVSFDDQAAAAAAAENSNGKRIGREESPESETQGWGPNNKVQKLSSAKGIDQSNEATMRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATISASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSKFSGLQLSQNIGSNSQSGSHQTLPPPLQQPQQLADTVSAATAAITADPNFTAALAAAITSIIGGAQNPFSNNSNNNNRSCIIFTNFFFQY
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccEEEEEEcccccccccccccccccccccccccccccccccccccccccccccHHccccccEEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHcccccHHHHHHHHHHHHHccccccccccccccccccccccccEEEc
cccHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEEcccccccccccHHHcccEEEccccccccEEEEccccccccHHHHEEcccccEEEEEEEcccccccccccHccccccccccHHHccccccccccccccccccccccccccccccccccccccccEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHccccHHHHHHHHHHHHHcccccccccccccccccccccccEEEcc
mvprqfiglgpsaetdhevsncssdeertlsgtppniveAASKehvnsngkneivsFDDQAAAAAAAENsngkrigreespesetqgwgpnnkvqklssakgidqsneATMRKARVSVrarseapmitdgcqwrkygqkmakgnpcprayyrctmavgcpvrkqvqrCAEDRTILITtyegnhnhplppaamAMASTTTAAASMLLSgsmssadgimnpnlLARAilpcsssmatisasapfptvtldlthspnplqLQRQAAQFQvqfpgqpqnlasvtntqlpqvFGQALynqskfsglqlsqnigsnsqsgshqtlppplqqpqqLADTVSAATAAITADPNFTAALAAAITSIIggaqnpfsnnsnnnnrsciiftnfffqy
mvprqfiglgpsaetdhevsnCSSDEERTLSGTPPNIVEAASKEHVNSNGKNEIVSFDDQAAAAAAAensngkrigreespesetqgwgpnnkvqklssakgidqsneatmrkarvsvrarseapmitdgcqwrkygqKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATISASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSKFSGLQLSQNIGSNSQSGSHQTLPPPLQQPQQLADTVSAATAAITADPNFTAALAAAITSIIGGAQNpfsnnsnnnnrsCIIFTNFFFQY
MVPRQFIGLGPSAETDHEVSNCSSDEERTLSGTPPNIVEAASKEHVNSNGKNEIVSFDDQaaaaaaaENSNGKRIGREESPESETQGWGPNNKVQKLSSAKGIDQSNEATMRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHplppaamamastttaaasmllsgsmssaDGIMNPNLLARAILPCSSSMATISASAPFPTVTLDLTHSPNPlqlqrqaaqfqvqfPGQPQNLASVTNTQLPQVFGQALYNQSKFSGlqlsqnigsnsqsgsHQTlppplqqpqqlADTVSAATAAITADPNFtaalaaaitsiiggaQNPFsnnsnnnnRSCIIFTNFFFQY
*****************************************************************************************************************************MITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGN************************************PNLLARAILPCSSSMATI*****FPTVTL***********************************QLPQVFGQALYN*****************************************ATAAITADPNFTAALAAAITSIIGGAQN*********NRSCIIFTNFFF**
MVPRQFIG*************************************************************************************************************VRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHP**************************************************************************************************************************************************VSAATAAITADPNFTAALAAAITS***********************TNFFFQY
MVPRQFIGLGPS*******************GTPPNIVEAASKEHVNSNGKNEIVSFDDQ*****************************PNNKVQKLSSAKGIDQSNEATMRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATISASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSKFSGLQLSQN****************************AATAAITADPNFTAALAAAITSIIGGAQNPFSNNSNNNNRSCIIFTNFFFQY
**************************************************************************************************************MRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNH*L*******************************************S******ASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSKFSGLQLSQNIGS********************ADTVSAATAAITADPNFTAALAAAITSIIGG***************CIIFTNFFFQY
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MVPRQFIGLGPSAETDHEVSNCSSDEERTLSGTPPNIVEAASKEHVNSNGKNEIVSFDDQAAAAAAAENSNGKRIGREESPESETQGWGPNNKVQKLSSAKGIDQSNEATMRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATISASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSKFSGLQLSQNIGSNSQSGSHQTLPPPLQQPQQLADTVSAATAAITADPNFTAALAAAITSIIGGAQNPFSNNSNNNNRSCIIFTNFFFQY
no confident homologs detected
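The per-residue tracks above are easier to read as residue ranges. The following is a minimal sketch of that conversion, assuming the track rows appear in the same order as the labels listed above them, so that the third row is the SSPRO prediction. The function name `track_segments` is hypothetical, and only the first 20 residues of that row are used here.

```python
from itertools import groupby

def track_segments(track, states):
    """Return (state, start, end) runs, 1-based and inclusive, for the given states."""
    segments = []
    position = 1
    for state, run in groupby(track):
        length = len(list(run))
        if state in states:
            segments.append((state, position, position + length - 1))
        position += length
    return segments

# First 20 residues of the row assumed to be the SSPRO track:
# three coil residues, four helix residues, then coil.
sspro_prefix = "cccHHHH" + "c" * 13

structured = track_segments(sspro_prefix, {"H", "E"})
print(structured)
```

The same function could be applied to any of the single-character tracks, for example with `{"h"}` on a transmembrane row to list predicted helix spans.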

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST

ID      Length  Definition                 RBH(Q2H)  RBH(H2Q)  Q cover  H cover  Identity  E-value
Query   386     2.2.26 [Sep-21-2011]
Q9XEC3  528     Probable WRKY transcripti  yes       no        0.784    0.573    0.584     1e-101
Q9C519  553     WRKY transcription factor  no        no        0.834    0.582    0.591     1e-100
Q93WT0  538     Probable WRKY transcripti  no        no        0.810    0.581    0.587     1e-81
Q9ZSI7  489     Probable WRKY transcripti  no        no        0.518    0.408    0.494     2e-49
Q9LXG8  548     Probable WRKY transcripti  no        no        0.712    0.501    0.408     2e-45
Q9C9F0  374     Probable WRKY transcripti  no        no        0.269    0.278    0.694     5e-40
Q8VWV6  480     Probable WRKY transcripti  no        no        0.629    0.506    0.430     6e-40
Q9CAR4  387     Probable WRKY transcripti  no        no        0.391    0.390    0.522     8e-34
Q8S8P5  519     Probable WRKY transcripti  no        no        0.233    0.173    0.505     5e-23
Q93WV0  557     Probable WRKY transcripti  no        no        0.199    0.138    0.551     2e-21
>sp|Q9XEC3|WRK42_ARATH Probable WRKY transcription factor 42 OS=Arabidopsis thaliana GN=WRKY42 PE=2 SV=1
 Score =  367 bits (942), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 222/380 (58%), Positives = 251/380 (66%), Gaps = 77/380 (20%)

Query: 1   MVPRQFIGLGPSAETDHEVSNCSSDEERTL--SGTPPNIVEAASKEHVNSNGKNEIVSFD 58
           MVPRQFI LGP ++   EVS+    EERT   SG+PP+++E +S                
Sbjct: 179 MVPRQFIDLGPHSD---EVSS----EERTTVRSGSPPSLLEKSSSRQ------------- 218

Query: 59  DQAAAAAAAENSNGKRI-GREESPESETQGWGPNNKVQKL--------------SSAKGI 103
                       NGKR+  REESPE+E+ GW   NKV K               +S+K I
Sbjct: 219 ------------NGKRVLVREESPETESNGWRNPNKVPKHHASSSICGGNGSENASSKVI 266

Query: 104 DQ-SNEATMRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVR 162
           +Q + EATMRKARVSVRARSEAPM++DGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVR
Sbjct: 267 EQAAAEATMRKARVSVRARSEAPMLSDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVR 326

Query: 163 KQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAASMLLSGS-MSSADGIMNP-N 220
           KQVQRCAEDRTILITTYEGNHNHPLPPAAM MASTTTAAASMLLSGS MS+ DG+MNP N
Sbjct: 327 KQVQRCAEDRTILITTYEGNHNHPLPPAAMNMASTTTAAASMLLSGSTMSNQDGLMNPTN 386

Query: 221 LLARAILPCSSSMATISASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVT 280
           LLAR ILPCSSSMATISASAPFPT+TLDLT S              +QF  Q   L  + 
Sbjct: 387 LLARTILPCSSSMATISASAPFPTITLDLTES---PNGNNPTNNPLMQF-SQRSGLVELN 442

Query: 281 NTQLPQVFGQALY--NQSKFSGLQLSQNIGSNSQSGSHQTLPPPLQQPQQLADTVSAATA 338
            + LP + GQALY   QSKFSGL +                     QP    ++VSAATA
Sbjct: 443 QSVLPHMMGQALYYNQQSKFSGLHMP-------------------SQPLNAGESVSAATA 483

Query: 339 AITADPNFTAALAAAITSII 358
           AI ++PNF AALAAAITSII
Sbjct: 484 AIASNPNFAAALAAAITSII 503
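The Identity value reported for Q9XEC3 in the summary table (0.584) can be checked against the statistics line of this alignment. A minimal sketch, assuming the table value is simply identities divided by alignment length; the helper name `parse_blast_stats` is hypothetical and the regular expression only covers the `name = count/total` form seen in this report.

```python
import re

# Statistics line copied from the Q9XEC3 alignment above.
STATS_LINE = "Identities = 222/380 (58%), Positives = 251/380 (66%), Gaps = 77/380 (20%)"

def parse_blast_stats(line):
    """Map each statistic name to (count, alignment_length, fraction)."""
    stats = {}
    for name, count, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        stats[name] = (int(count), int(total), int(count) / int(total))
    return stats

stats = parse_blast_stats(STATS_LINE)
identity_fraction = round(stats["Identities"][2], 3)
print(identity_fraction)  # 222 / 380 rounded to three decimals
```

The Q cover and H cover columns are not recomputed here because the report does not state how they are defined.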




Transcription factor. Interacts specifically with the W box (5'-(T)TGAC[CT]-3'), a frequently occurring elicitor-responsive cis-acting element.
Arabidopsis thaliana (taxid: 3702)
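The W box consensus quoted above, 5'-(T)TGAC[CT]-3', translates directly into a regular expression with an optional leading T. The sketch below scans the forward strand only; the promoter string is a made-up example for illustration, not data from this report, and `find_w_boxes` is a hypothetical helper name.

```python
import re

# (T)TGAC[CT]: optional T, invariant TGAC core, then C or T.
W_BOX = re.compile(r"T?TGAC[CT]")

def find_w_boxes(dna):
    """Return (1-based start, matched text) for each forward-strand W box."""
    return [(m.start() + 1, m.group()) for m in W_BOX.finditer(dna.upper())]

# Hypothetical promoter fragment, invented for this example.
promoter = "AATTGACCGGATGACTAA"
hits = find_w_boxes(promoter)
print(hits)
```

A fuller scan would also search the reverse complement, since a cis-acting element can sit on either strand.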
>sp|Q9C519|WRKY6_ARATH WRKY transcription factor 6 OS=Arabidopsis thaliana GN=WRKY6 PE=1 SV=1
>sp|Q93WT0|WRK31_ARATH Probable WRKY transcription factor 31 OS=Arabidopsis thaliana GN=WRKY31 PE=2 SV=1
>sp|Q9ZSI7|WRK47_ARATH Probable WRKY transcription factor 47 OS=Arabidopsis thaliana GN=WRKY47 PE=2 SV=2
>sp|Q9LXG8|WRK72_ARATH Probable WRKY transcription factor 72 OS=Arabidopsis thaliana GN=WRKY72 PE=2 SV=1
>sp|Q9C9F0|WRKY9_ARATH Probable WRKY transcription factor 9 OS=Arabidopsis thaliana GN=WRKY9 PE=2 SV=1
>sp|Q8VWV6|WRK61_ARATH Probable WRKY transcription factor 61 OS=Arabidopsis thaliana GN=WRKY61 PE=2 SV=1
>sp|Q9CAR4|WRK36_ARATH Probable WRKY transcription factor 36 OS=Arabidopsis thaliana GN=WRKY36 PE=2 SV=1
>sp|Q8S8P5|WRK33_ARATH Probable WRKY transcription factor 33 OS=Arabidopsis thaliana GN=WRKY33 PE=1 SV=2
>sp|Q93WV0|WRK20_ARATH Probable WRKY transcription factor 20 OS=Arabidopsis thaliana GN=WRKY20 PE=2 SV=1

Close Homologs in the Non-Redundant Database Detected by BLAST

GI         Length  Definition                                Q cover  H cover  Identity  E-value
Query      386
255554813  580     WRKY transcription factor, putative [Ric  0.888    0.591    0.787     1e-139
259121421  602     WRKY transcription factor 28 [(Populus t  0.898    0.576    0.760     1e-131
224118042  578     predicted protein [Populus trichocarpa]   0.857    0.572    0.760     1e-131
225444291  535     PREDICTED: WRKY transcription factor 6-l  0.878    0.633    0.682     1e-128
147779800  535     hypothetical protein VITISV_002247 [Viti  0.836    0.603    0.709     1e-128
209867508  556     WRKY transcription factor [Picrorhiza ku  0.875    0.607    0.65      1e-124
356547095  614     PREDICTED: WRKY transcription factor 6 [  0.930    0.584    0.639     1e-123
147841888  620     hypothetical protein VITISV_024690 [Viti  0.886    0.551    0.663     1e-123
359485613  593     PREDICTED: WRKY transcription factor 6-l  0.886    0.576    0.663     1e-123
224115798  540     predicted protein [Populus trichocarpa]   0.883    0.631    0.766     1e-122
>gi|255554813|ref|XP_002518444.1| WRKY transcription factor, putative [Ricinus communis] gi|223542289|gb|EEF43831.1| WRKY transcription factor, putative [Ricinus communis]
 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 285/362 (78%), Positives = 304/362 (83%), Gaps = 19/362 (5%)

Query: 1   MVPRQFIGLGPSAETDHEVSNCSSDEERTLSGTPPNIVEAASKEHVNSNGKNEIVSFDDQ 60
           +VPRQF+ LGPSAETD E+S+ SSD+ERT SGTP    E AS   V +NGK E+ +FD +
Sbjct: 217 VVPRQFLDLGPSAETD-EISHSSSDDERTRSGTPQTNTETAS---VKNNGKIEMSTFDQE 272

Query: 61  AAAAAAAENSNGKRIGREESPESETQGWGPNNKVQKLSSA-KGIDQSNEATMRKARVSVR 119
            ++       +GK IGREESPESETQGW PN KVQKL+ A KGIDQ+ EATMRKARVSVR
Sbjct: 273 NSSF-----RDGKGIGREESPESETQGWNPN-KVQKLNPASKGIDQNAEATMRKARVSVR 326

Query: 120 ARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTY 179
           ARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAED +ILITTY
Sbjct: 327 ARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDTSILITTY 386

Query: 180 EGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATISAS 239
           EGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATISAS
Sbjct: 387 EGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATISAS 446

Query: 240 APFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSKFS 299
           APFPTVTLDLTHSPNPLQ+QR    FQV FPGQPQN ASVT  QLPQVFGQALYNQSKFS
Sbjct: 447 APFPTVTLDLTHSPNPLQVQRPPTHFQVPFPGQPQNFASVTPQQLPQVFGQALYNQSKFS 506

Query: 300 GLQLSQNIGSNSQSGSHQTLPPPLQQPQQLADTVSAATAAITADPNFTAALAAAITSIIG 359
           GLQLSQ +  + Q      L P   Q   L D+VSAATAAITADPNFTAALAAAITSIIG
Sbjct: 507 GLQLSQELPQSQQ------LHP--SQQHSLVDSVSAATAAITADPNFTAALAAAITSIIG 558

Query: 360 GA 361
           G 
Sbjct: 559 GG 560




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|259121421|gb|ACV92030.1| WRKY transcription factor 28 [(Populus tomentosa x P. bolleana) x P. tomentosa]
>gi|224118042|ref|XP_002331543.1| predicted protein [Populus trichocarpa] gi|222873767|gb|EEF10898.1| predicted protein [Populus trichocarpa]
>gi|225444291|ref|XP_002263115.1| PREDICTED: WRKY transcription factor 6-like [Vitis vinifera]
>gi|147779800|emb|CAN70362.1| hypothetical protein VITISV_002247 [Vitis vinifera]
>gi|209867508|gb|ACI90292.1| WRKY transcription factor [Picrorhiza kurrooa]
>gi|356547095|ref|XP_003541953.1| PREDICTED: WRKY transcription factor 6 [Glycine max]
>gi|147841888|emb|CAN65218.1| hypothetical protein VITISV_024690 [Vitis vinifera]
>gi|359485613|ref|XP_002269696.2| PREDICTED: WRKY transcription factor 6-like [Vitis vinifera]
>gi|224115798|ref|XP_002317127.1| predicted protein [Populus trichocarpa] gi|222860192|gb|EEE97739.1| predicted protein [Populus trichocarpa]

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology Terms Detected by BLAST

ID                  Length  Definition                      Q cover  H cover  Identity  E-value
Query               386
TAIR|locus:2018052  553     WRKY6 [Arabidopsis thaliana (t  0.665    0.464    0.560     1.7e-78
TAIR|locus:2137179  528     WRKY42 [Arabidopsis thaliana (  0.722    0.528    0.537     2.3e-74
TAIR|locus:2120623  538     WRKY31 "WRKY DNA-binding prote  0.787    0.565    0.510     7.5e-73
TAIR|locus:2133432  489     WRKY47 [Arabidopsis thaliana (  0.422    0.333    0.523     4.1e-45
TAIR|locus:2034964  480     WRKY61 "WRKY DNA-binding prote  0.621    0.5      0.362     5.7e-36
TAIR|locus:2150876  548     WRKY72 "WRKY DNA-binding prote  0.471    0.332    0.435     3.2e-33
TAIR|locus:2199317  374     WRKY9 "WRKY DNA-binding protei  0.290    0.299    0.610     2.6e-31
TAIR|locus:2196779  387     WRKY36 "WRKY DNA-binding prote  0.440    0.439    0.393     3.3e-30
TAIR|locus:2057212  519     WRKY33 "AT2G38470" [Arabidopsi  0.422    0.314    0.368     2.3e-22
UNIPROTKB|Q6IEL0    348     WRKY71 "Transcription factor W  0.272    0.301    0.447     1.1e-21
TAIR|locus:2018052 WRKY6 [Arabidopsis thaliana (taxid:3702)]
 Score = 733 (263.1 bits), Expect = 1.7e-78, Sum P(2) = 1.7e-78
 Identities = 162/289 (56%), Positives = 178/289 (61%)

Query:    70 SNGKRIGREESPESETQGWGPNNKVQKLSSAKG--IDQSNEATMRKARVSVRARSEAPMI 127
             SNGKR+GREESPE+E+      NK+QK++S      DQ+ EATMRKARVSVRARSEAPMI
Sbjct:   258 SNGKRLGREESPETES------NKIQKVNSTTPTTFDQTAEATMRKARVSVRARSEAPMI 311

Query:   128 TDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHXX 187
             +DGCQWRKYGQKMAKGNPCPRAYYRCTMA GCPVRKQVQRCAEDR+ILITTYEGNHNH  
Sbjct:   312 SDGCQWRKYGQKMAKGNPCPRAYYRCTMATGCPVRKQVQRCAEDRSILITTYEGNHNHPL 371

Query:   188 XXXXXXXXXXXXXXXXXXXXXXXXXXDGIMNP-NLLARAILPCSSSMATISASAPFPTVT 246
                                       DG+MNP NLLARA+LPCS+SMATISASAPFPTVT
Sbjct:   372 PPAAVAMASTTTAAANMLLSGSMSSHDGMMNPTNLLARAVLPCSTSMATISASAPFPTVT 431

Query:   247 LDLTHSP------NPXXXXXXXXXXXXXXPGQPQNLASVTNT---QLPQVFGQALYNQSK 297
             LDLTHSP      NP                  Q    +TN     LP V GQALYNQSK
Sbjct:   432 LDLTHSPPPPNGSNPSSSAATNNNHNSLMQRPQQQQQQMTNLPPGMLPHVIGQALYNQSK 491

Query:   298 FSGXXXXXXXXXXXXXXXHQTXXXXXXXXXXXADTVSAATAAITADPNF 346
             FSG                             ADT++A    +TADPNF
Sbjct:   492 FSGLQFSGGS----------PSTAAFSQSHAVADTITA----LTADPNF 526


GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS;IMP
GO:0005634 "nucleus" evidence=ISM
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA;ISS
GO:0043565 "sequence-specific DNA binding" evidence=IEA
GO:0010200 "response to chitin" evidence=IEP;RCA
GO:0005515 "protein binding" evidence=IPI
GO:0016036 "cellular response to phosphate starvation" evidence=IMP
GO:0044212 "transcription regulatory region DNA binding" evidence=IDA
GO:0045892 "negative regulation of transcription, DNA-dependent" evidence=IMP
GO:0080169 "cellular response to boron-containing substance deprivation" evidence=IMP
GO:0002679 "respiratory burst involved in defense response" evidence=RCA
GO:0009407 "toxin catabolic process" evidence=RCA
GO:0010583 "response to cyclopentenone" evidence=RCA
GO:0035556 "intracellular signal transduction" evidence=RCA
GO:0043090 "amino acid import" evidence=RCA
TAIR|locus:2137179 WRKY42 [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2120623 WRKY31 "WRKY DNA-binding protein 31" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2133432 WRKY47 [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2034964 WRKY61 "WRKY DNA-binding protein 61" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2150876 WRKY72 "WRKY DNA-binding protein 72" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2199317 WRKY9 "WRKY DNA-binding protein 9" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2196779 WRKY36 "WRKY DNA-binding protein 36" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2057212 WRKY33 "AT2G38470" [Arabidopsis thaliana (taxid:3702)]
UNIPROTKB|Q6IEL0 WRKY71 "Transcription factor WRKY71" [Oryza sativa Indica Group (taxid:39946)]

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries

No confident hit for EC number transfer was detected by BLAST in SWISS-PROT.

EC Number Prediction by EzyPred Server

Failed to connect to the EzyPred server.

EC Number Prediction by EFICAz Software

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING

Failed to connect to the STRING server.


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST

ID          Length  Definition                                 E-value
Query       386
smart00774  59      smart00774, WRKY, DNA binding domain       5e-37
pfam03106   60      pfam03106, WRKY, WRKY DNA -binding domain  4e-36
>gnl|CDD|214815 smart00774, WRKY, DNA binding domain
 Score =  127 bits (323), Expect = 5e-37
 Identities = 35/57 (61%), Positives = 43/57 (75%)

Query: 129 DGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNH 185
           DG QWRKYGQK+ KG+P PR+YYRCT   GCP +KQVQR  +D +++  TYEG H H
Sbjct: 3   DGYQWRKYGQKVIKGSPYPRSYYRCTYTQGCPAKKQVQRSDDDPSVVEVTYEGEHTH 59


The WRKY domain is a DNA binding domain found in one or two copies in a superfamily of plant transcription factors. These transcription factors are involved in the regulation of various physiological programs that are unique to plants, including pathogen defense, senescence and trichome development. The domain is a 60 amino acid region that is defined by the conserved amino acid sequence WRKYGQK at its N-terminal end, together with a novel zinc-finger-like motif. It binds specifically to the DNA sequence motif (T)(T)TGAC(C/T), which is known as the W box. The invariant TGAC core is essential for function and WRKY binding. Length = 59
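The description above places the WRKYGQK heptapeptide at the N-terminal end of the domain, and its position in the query can be located with a plain substring search. A minimal sketch, using only the query residues 114-190 as they appear in the structure-template alignments of this report; `locate_motif` is a hypothetical helper name.

```python
# Query residues 114-190, copied from the alignments in this report.
QUERY_FRAGMENT = (
    "ARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRT"
    "ILITTYEGNHNHPLPPA"
)
FRAGMENT_START = 114  # residue number of the first fragment character
MOTIF = "WRKYGQK"

def locate_motif(fragment, fragment_start, motif):
    """Return the 1-based (start, end) of the motif in query numbering, or None."""
    index = fragment.find(motif)
    if index < 0:
        return None
    start = fragment_start + index
    return (start, start + len(motif) - 1)

motif_span = locate_motif(QUERY_FRAGMENT, FRAGMENT_START, MOTIF)
print(motif_span)
```

The resulting span falls inside the query range 129-185 that RPS-BLAST aligned to smart00774, which is consistent with the motif sitting near the start of the domain.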

>gnl|CDD|145969 pfam03106, WRKY, WRKY DNA -binding domain

Conserved Domains Detected by HHsearch

ID          Length  Definition                                          Probability
Query       386
PF03106     60      WRKY: WRKY DNA -binding domain; InterPro: IPR00365  99.97
smart00774  59      WRKY DNA binding domain. The WRKY domain is a DNA   99.97
PF03101     91      FAR1: FAR1 DNA-binding domain; InterPro: IPR004330  93.85
PF04500     62      FLYWCH: FLYWCH zinc finger domain; InterPro: IPR00  93.8
>PF03106 WRKY: WRKY DNA -binding domain; InterPro: IPR003657 The WRKY domain is a 60 amino acid region that is defined by the conserved amino acid sequence WRKYGQK at its N-terminal end, together with a novel zinc-finger-like motif
Probab=99.97  E-value=4.2e-32  Score=208.74  Aligned_cols=60  Identities=65%  Similarity=1.255  Sum_probs=52.2

Q ss_pred             CCccchhhhccccccCCCCCCCccccccccCCcccccceeeecCCCcEEEEEeccCCCCCC
Q 016623          127 ITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPL  187 (386)
Q Consensus       127 ~~DGy~WRKYGQK~IKGsp~PRsYYRCT~~~gC~arKqVQR~~dD~~il~tTYeG~HnH~~  187 (386)
                      ++|||+|||||||.|+|+++||+||||++. +|+|+|+|||+.+|+.+++|||+|+|||+.
T Consensus         1 ~~Dgy~WRKYGqK~i~g~~~pRsYYrCt~~-~C~akK~Vqr~~~d~~~~~vtY~G~H~h~k   60 (60)
T PF03106_consen    1 LDDGYRWRKYGQKNIKGSPYPRSYYRCTHP-GCPAKKQVQRSADDPNIVIVTYEGEHNHPK   60 (60)
T ss_dssp             --SSS-EEEEEEEEETTTTCEEEEEEEECT-TEEEEEEEEEETTCCCEEEEEEES--SS--
T ss_pred             CCCCCchhhccCcccCCCceeeEeeecccc-ChhheeeEEEecCCCCEEEEEEeeeeCCCC
Confidence            579999999999999999999999999995 999999999999999999999999999973



The WRKY domain is found in one or two copies in a superfamily of plant transcription factors involved in the regulation of various physiological programs that are unique to plants, including pathogen defence, senescence, trichome development and the biosynthesis of secondary metabolites. The WRKY domain binds specifically to the DNA sequence motif (T)(T)TGAC(C/T), which is known as the W box. The invariant TGAC core of the W box is essential for function and WRKY binding. Some proteins known to contain a WRKY domain include Arabidopsis thaliana ZAP1 (Zinc-dependent Activator Protein-1) and AtWRKY44/TTG2, a protein involved in trichome development and anthocyanin pigmentation; and wild oat ABF1-2, two proteins involved in the gibberellic acid-induced expression of the alpha-Amy2 gene. Structural studies indicate that this domain is a four-stranded beta-sheet with a zinc binding pocket, forming a novel zinc and DNA binding structure. The WRKYGQK residues correspond to the most N-terminal beta-strand, which enables extensive hydrophobic interactions, contributing to the structural stability of the beta-sheet. GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2AYD_A 1WJ2_A 2LEX_A.

>smart00774 WRKY DNA binding domain
>PF03101 FAR1: FAR1 DNA-binding domain; InterPro: IPR004330 Phytochrome A is the primary photoreceptor for mediating various far-red light-induced responses in higher plants
>PF04500 FLYWCH: FLYWCH zinc finger domain; InterPro: IPR007588 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule

Homologous Structure Templates

Structure Templates Detected by BLAST

ID      Length  Definition                                           E-value
Query   386
2ayd_A  76      Crystal Structure Of The C-Terminal Wrky Domain Of A  4e-17
1wj2_A  78      Solution Structure Of The C-Terminal Wrky Domain Of  7e-17
>pdb|2AYD|A Chain A, Crystal Structure Of The C-Terminal Wrky Domain Of Atwrky1, An Sa-Induced And Partially Npr1-Dependent Transcription Factor Length = 76

Iteration: 1

 Score = 85.9 bits (211), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 37/72 (51%), Positives = 53/72 (73%), Gaps = 1/72 (1%)

Query: 114 ARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRT 173
           +R+ V  ++   ++ DG +WRKYGQK  KG+P PR+YYRC+ + GCPV+K V+R + D  
Sbjct: 1   SRIVVHTQTLFDIVNDGYRWRKYGQKSVKGSPYPRSYYRCS-SPGCPVKKHVERSSHDTK 59

Query: 174 ILITTYEGNHNH 185
           +LITTYEG H+H
Sbjct: 60  LLITTYEGKHDH 71

>pdb|1WJ2|A Chain A, Solution Structure Of The C-Terminal Wrky Domain Of Atwrky4 Length = 78

Structure Templates Detected by RPS-BLAST

ID      Length  Definition                                          E-value
Query   386
2ayd_A  76      WRKY transcription factor 1; beta strands, zinc fi  1e-48
1wj2_A  78      Probable WRKY transcription factor 4; DNA-binding   1e-47
>2ayd_A WRKY transcription factor 1; beta strands, zinc finger; 1.60A {Arabidopsis thaliana} Length = 76
 Score =  158 bits (400), Expect = 1e-48
 Identities = 39/77 (50%), Positives = 56/77 (72%), Gaps = 1/77 (1%)

Query: 114 ARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRT 173
           +R+ V  ++   ++ DG +WRKYGQK  KG+P PR+YYRC+ + GCPV+K V+R + D  
Sbjct: 1   SRIVVHTQTLFDIVNDGYRWRKYGQKSVKGSPYPRSYYRCS-SPGCPVKKHVERSSHDTK 59

Query: 174 ILITTYEGNHNHPLPPA 190
           +LITTYEG H+H +PP 
Sbjct: 60  LLITTYEGKHDHDMPPG 76


>1wj2_A Probable WRKY transcription factor 4; DNA-binding domain, zinc-binding, structural genomics; NMR {Arabidopsis thaliana} SCOP: g.79.1.1 PDB: 2lex_A* Length = 78

Structure Templates Detected by HHsearch

ID      Length  Definition                                          Probability
Query   386
2ayd_A  76      WRKY transcription factor 1; beta strands, zinc fi  100.0
1wj2_A  78      Probable WRKY transcription factor 4; DNA-binding   100.0
2rpr_A  87      Flywch-type zinc finger-containing protein 1; flyw  86.31
>2ayd_A WRKY transcription factor 1; beta strands, zinc finger; 1.60A {Arabidopsis thaliana}
Probab=100.00  E-value=4.4e-36  Score=240.08  Aligned_cols=75  Identities=52%  Similarity=1.105  Sum_probs=72.3

Q ss_pred             eeEEEEeccCCCCCCccchhhhccccccCCCCCCCccccccccCCcccccceeeecCCCcEEEEEeccCCCCCCCh
Q 016623          114 ARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPP  189 (386)
Q Consensus       114 ~RV~v~t~se~~~~~DGy~WRKYGQK~IKGsp~PRsYYRCT~~~gC~arKqVQR~~dD~~il~tTYeG~HnH~~P~  189 (386)
                      .||.|++.++.++++|||+|||||||.|||++|||+||||++ .||+|+|+|||+.+|+.+++|||+|+|||+.|.
T Consensus         1 ~r~~v~t~~~~~~~~DGy~WRKYGQK~ikgs~~PRsYYrCt~-~gC~a~K~Ver~~~d~~~~~~tY~G~H~H~~p~   75 (76)
T 2ayd_A            1 SRIVVHTQTLFDIVNDGYRWRKYGQKSVKGSPYPRSYYRCSS-PGCPVKKHVERSSHDTKLLITTYEGKHDHDMPP   75 (76)
T ss_dssp             CEEEEEEECSSSCCCCSSCEEEEEEECCTTCSSCEEEEEECS-TTCCCEEEEEECSSSTTEEEEEEESCCSSCCCC
T ss_pred             CeEEEEecCCCCcCCCCchhhhCcccccCCCCCceeEeEcCC-CCCCceeeEEEECCCCCEEEEEEccCcCCCCCC
Confidence            389999999999999999999999999999999999999998 699999999999999999999999999999885



>1wj2_A Probable WRKY transcription factor 4; DNA-binding domain, zinc-binding, structural genomics; NMR {Arabidopsis thaliana} SCOP: g.79.1.1 PDB: 2lex_A*
>2rpr_A Flywch-type zinc finger-containing protein 1; flywch domain, alternative splicing, DNA-binding, metal-binding, nucleus, metal binding protein; NMR {Homo sapiens}

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST

ID       Length  Definition                                          E-value
Query    386
d1wj2a_  71      g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cr  7e-36
>d1wj2a_ g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 71

class: Small proteins
fold: WRKY DNA-binding domain
superfamily: WRKY DNA-binding domain
family: WRKY DNA-binding domain
domain: WRKY DNA-binding protein 4
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score =  123 bits (311), Expect = 7e-36
 Identities = 41/71 (57%), Positives = 51/71 (71%), Gaps = 1/71 (1%)

Query: 118 VRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILIT 177
           V+  SE  ++ DG +WRKYGQK+ KGNP PR+YY+CT   GC VRK V+R A D   ++T
Sbjct: 1   VQTTSEVDLLDDGYRWRKYGQKVVKGNPYPRSYYKCTTP-GCGVRKHVERAATDPKAVVT 59

Query: 178 TYEGNHNHPLP 188
           TYEG HNH LP
Sbjct: 60  TYEGKHNHDLP 70


Homologous Domains Detected by HHsearch

ID       Length  Definition                                          Probability
Query    386
d1wj2a_  71      WRKY DNA-binding protein 4 {Thale cress (Arabidops  100.0
>d1wj2a_ g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]}
class: Small proteins
fold: WRKY DNA-binding domain
superfamily: WRKY DNA-binding domain
family: WRKY DNA-binding domain
domain: WRKY DNA-binding protein 4
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=100.00  E-value=1.1e-35  Score=233.29  Aligned_cols=71  Identities=58%  Similarity=1.111  Sum_probs=67.4

Q ss_pred             EEeccCCCCCCccchhhhccccccCCCCCCCccccccccCCcccccceeeecCCCcEEEEEeccCCCCCCCh
Q 016623          118 VRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPP  189 (386)
Q Consensus       118 v~t~se~~~~~DGy~WRKYGQK~IKGsp~PRsYYRCT~~~gC~arKqVQR~~dD~~il~tTYeG~HnH~~P~  189 (386)
                      |++.++.++++|||+|||||||.|||+++||+||||++ .+|+|+|+|||+++|+.+++|||+|+|||+.|.
T Consensus         1 v~t~~~~~~~dDGy~WRKYGQK~ikgs~~pRsYYrCt~-~~C~a~K~Vqr~~~d~~~~~vtY~G~H~h~~Ps   71 (71)
T d1wj2a_           1 VQTTSEVDLLDDGYRWRKYGQKVVKGNPYPRSYYKCTT-PGCGVRKHVERAATDPKAVVTTYEGKHNHDLPA   71 (71)
T ss_dssp             CCCCCCCCCCCSSSCBCCCEEECCTTCSSCEEEEEEEC-SSCEEEEEEEEETTTTSEEEEEEESCCSSCCCC
T ss_pred             CccccccccCCCCcEecccCceeccCCCCceEEEEccc-cCCCCcceEEEEcCCCCEEEEEEeeEeCCCCCC
Confidence            46778899999999999999999999999999999998 699999999999999999999999999999874