Citrus Sinensis ID: 025097


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------26
MDSHNSPKEKMKLLQAKLEHVRKQNENLRHLVKAMNNQCNDLLARIHEANRTYSSSDHHHFNNNINIGGVTAQVPPVPNAKQSRIFVKADSKDSSLIVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHNHDVQCSSLGQSSSLTNYCSPKSSIVHCPDYQTTDSFGSDVTLDLTLSGSNQETRPPRNLMQVCDDKKKIEEYVASLTKDPSFTIAVADAVASSINGPPHRPM
cccccccHHHHHHHHHHHHHHHHHcHHHHHHHHHHHcccHHHHHHHHHccccccccccccccccccccccccccccccccccccEEEEcccccccccccccccccccccccccccccccccEEcccccccccccccEEEcccccccEEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHccccHHHHHHHHHHHcccccccccc
ccccccHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEccccccccccccccHEHcccEEEccccccccEEEEccccccccccEEEEEcccccEEEEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHccccccccHHHHHHHHHHccccccccccc
MDSHNSPKEKMKLLQAKLEHVRKQNENLRHLVKAMNNQCNDLLARIHEAnrtysssdhhhfnnniniggvtaqvppvpnakqsrifvkadskdsslivkdghqwrkygqkvtkdnpsprayfrcsmassgcpvkkkvQRCMEDKSFLVATyegehnhdvqcsslgqsssltnycspkssivhcpdyqttdsfgsdvtldltlsgsnqetrpprnlmqvcDDKKKIEEYVASLTKDPSFTIAVADAVAssingpphrpm
mdshnspkeKMKLLQAKLEHVRKQNENLRHLVKAMNNQCNDLLARIHEANRTYSSSDHHHFNNNINIGGVTAQVPPVPNAKQSRIfvkadskdsslivkdghqwrkygqkvtkdnpsprayfrcsmassgcpvkkkVQRCMEDKSFLVATYEGEHNHDVQCSSLGQSSSLTNYCSPKSSIVHCPDYQTTDSFGSDVTLDLTLsgsnqetrpprnlmqvcDDKKKIEEYVASLTKDPSFTIAVADAVASsingpphrpm
MDSHNSPKEKMKLLQAKLEHVRKQNENLRHLVKAMNNQCNDLLARIHEANRTYSSSDHHHFnnniniggVTAQVPPVPNAKQSRIFVKADSKDSSLIVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHNHDVQCSSLGQSSSLTNYCSPKSSIVHCPDYQTTDSFGSDVTLDLTLSGSNQETRPPRNLMQVCDDKKKIEEYVASLTKDPSFTIAVADAVASSINGPPHRPM
****************************RHLVKAMNNQCNDLLARIHEANRTY***DHHHFNNNINIGGVTAQV**********IFV********LIVKDGHQWRKYG***********AYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHNHDVQ***********NYCSPKSSIVHCPDYQTTDSFGSDVTL*************************KIEEYVASLTKDPSFTIAVAD**************
***************************************************************************************KADSKDSSLIVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHN**************************************************************************SLTKDPSFTIAVADAVAS**********
**********MKLLQAKLEHVRKQNENLRHLVKAMNNQCNDLLARIHEANRTYSSSDHHHFNNNINIGGVTAQVPPVPNAKQSRIFVKADSKDSSLIVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHNHDVQCSSLGQSSSLTNYCSPKSSIVHCPDYQTTDSFGSDVTLDLTLSGSNQETRPPRNLMQVCDDKKKIEEYVASLTKDPSFTIAVADAVASS*********
*******KEKMKLLQAKLEHVRK********************************************************AKQSRIFVKADSKDSSLIVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEH*********************************************************************IEEYVASLTKDPSFTIAVADAVASSIN*******
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDSHNSPKEKxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxYSSSDHHHFNNNINIGGVTAQVPPVPNAKQSRIFVKADSKDSSLIVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHNHDVQCSSLGQSSSLTNYCSPKSSIVHCPDYQTTDSFGSDVTLDLTLSGSNQETRPPRNLMQVCDDKKKIEEYVASLTKDPSFTIAVADAVASSINGPPHRPM
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query258 2.2.26 [Sep-21-2011]
Q9SK33271 Probable WRKY transcripti yes no 0.806 0.767 0.346 6e-31
Q9SAH7302 Probable WRKY transcripti no no 0.662 0.566 0.454 6e-31
Q9C5T4310 WRKY transcription factor no no 0.562 0.467 0.482 6e-31
Q9XEC3528 Probable WRKY transcripti no no 0.399 0.195 0.462 8e-23
Q9C519553 WRKY transcription factor no no 0.286 0.133 0.584 1e-22
Q93WT0538 Probable WRKY transcripti no no 0.286 0.137 0.584 2e-22
Q9ZSI7489 Probable WRKY transcripti no no 0.593 0.312 0.314 9e-22
Q9C9F0374 Probable WRKY transcripti no no 0.437 0.302 0.433 6e-21
Q8VWV6 480 Probable WRKY transcripti no no 0.290 0.156 0.564 3e-20
Q9LXG8 548 Probable WRKY transcripti no no 0.333 0.156 0.494 7e-20
>sp|Q9SK33|WRK60_ARATH Probable WRKY transcription factor 60 OS=Arabidopsis thaliana GN=WRKY60 PE=1 SV=1 Back     alignment and function desciption
 Score =  134 bits (338), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 90/260 (34%), Positives = 130/260 (50%), Gaps = 52/260 (20%)

Query: 9   EKMKLLQAKLEHVRKQNENLRHLVK-------AMNNQCNDLLAR-----IHEANRTYSSS 56
           EK  +LQ ++  V  +N+ L  ++        A+NN   +L +R     ++  N+  +  
Sbjct: 40  EKRNMLQDEINRVNSENKKLTEMLARVCEKYYALNNLMEELQSRKSPESVNFQNKQLTGK 99

Query: 57  DHHHFNNNIN--IGGVTAQVPPVPNAKQ--SRIFVKADSKDSSLIVKDGHQWRKYGQKVT 112
                +  ++  IG     +  + N K   S  +  A+  D+SL VKDG+QWRKYGQK+T
Sbjct: 100 RKQELDEFVSSPIGLSLGPIENITNDKATVSTAYFAAEKSDTSLTVKDGYQWRKYGQKIT 159

Query: 113 KDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHNHDVQCSSLGQSSSLTN 172
           +DNPSPRAYFRCS + S C VKKKVQR  ED SFLVATYEG HNH               
Sbjct: 160 RDNPSPRAYFRCSFSPS-CLVKKKVQRSAEDPSFLVATYEGTHNH--------------- 203

Query: 173 YCSPKSSIVHCPDYQTTDSFGSDVTLDLTLSGSN--QETRPPRNLMQVCDDKKKIEEYVA 230
              P +S+               V LDL   G    +E +    + +V      +++  +
Sbjct: 204 -TGPHASVSRT------------VKLDLVQGGLEPVEEKKERGTIQEVL-----VQQMAS 245

Query: 231 SLTKDPSFTIAVADAVASSI 250
           SLTKDP FT A+A A++  +
Sbjct: 246 SLTKDPKFTAALATAISGRL 265




Transcription factor. Interacts specifically with the W box (5'-(T)TGAC[CT]-3'), a frequently occurring elicitor-responsive cis-acting element.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9SAH7|WRK40_ARATH Probable WRKY transcription factor 40 OS=Arabidopsis thaliana GN=WRKY40 PE=1 SV=1 Back     alignment and function description
>sp|Q9C5T4|WRK18_ARATH WRKY transcription factor 18 OS=Arabidopsis thaliana GN=WRKY18 PE=1 SV=2 Back     alignment and function description
>sp|Q9XEC3|WRK42_ARATH Probable WRKY transcription factor 42 OS=Arabidopsis thaliana GN=WRKY42 PE=2 SV=1 Back     alignment and function description
>sp|Q9C519|WRKY6_ARATH WRKY transcription factor 6 OS=Arabidopsis thaliana GN=WRKY6 PE=1 SV=1 Back     alignment and function description
>sp|Q93WT0|WRK31_ARATH Probable WRKY transcription factor 31 OS=Arabidopsis thaliana GN=WRKY31 PE=2 SV=1 Back     alignment and function description
>sp|Q9ZSI7|WRK47_ARATH Probable WRKY transcription factor 47 OS=Arabidopsis thaliana GN=WRKY47 PE=2 SV=2 Back     alignment and function description
>sp|Q9C9F0|WRKY9_ARATH Probable WRKY transcription factor 9 OS=Arabidopsis thaliana GN=WRKY9 PE=2 SV=1 Back     alignment and function description
>sp|Q8VWV6|WRK61_ARATH Probable WRKY transcription factor 61 OS=Arabidopsis thaliana GN=WRKY61 PE=2 SV=1 Back     alignment and function description
>sp|Q9LXG8|WRK72_ARATH Probable WRKY transcription factor 72 OS=Arabidopsis thaliana GN=WRKY72 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query258
147864790259 hypothetical protein VITISV_036455 [Viti 0.922 0.918 0.480 9e-51
359476150261 PREDICTED: WRKY transcription factor 18- 0.918 0.908 0.474 4e-47
296082063262 unnamed protein product [Vitis vinifera] 0.918 0.904 0.478 4e-47
224141959271 predicted protein [Populus trichocarpa] 0.883 0.841 0.476 3e-44
355398579271 WRKY transcription factor [(Populus tome 0.891 0.848 0.453 9e-43
351724539278 transcription factor [Glycine max] gi|16 0.875 0.812 0.434 2e-42
357437111255 WRKY transcription factor [Medicago trun 0.875 0.886 0.426 1e-41
255639751278 unknown [Glycine max] 0.875 0.812 0.434 2e-41
356552420278 PREDICTED: probable WRKY transcription f 0.860 0.798 0.424 4e-41
356509880261 PREDICTED: LOW QUALITY PROTEIN: probable 0.852 0.842 0.403 8e-38
>gi|147864790|emb|CAN84058.1| hypothetical protein VITISV_036455 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  206 bits (523), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 121/252 (48%), Positives = 159/252 (63%), Gaps = 14/252 (5%)

Query: 10  KMKLLQAKLEHVRKQNENLRHLVKAMNNQCNDLLARIHEANRTYSSSDHHHFNNNINIGG 69
           K+++L+ +LE +RK+NE+LR  ++ M ++   L A + +   T S       ++N     
Sbjct: 12  KVEVLKIELERLRKENEDLRLXLEIMGSKYEVLQAHLQKNMATISPDHGSSXDSNKR--- 68

Query: 70  VTAQVPPVPNAKQSRIFVKADSKDSSLIVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASS 129
              +   V  AK S++FV+ + KD SL VKDG QWRKYGQK+TKDNPSPRAYFRCSMA  
Sbjct: 69  --PRTEEVSVAKASQVFVRTNPKDKSLTVKDGFQWRKYGQKITKDNPSPRAYFRCSMAPQ 126

Query: 130 GCPVKKKVQRCMEDKSFLVATYEGEHNHDVQCSSLGQSSSLTNYCSPKSSIVHCPDYQTT 189
            CPVKKKVQRC+ED S LVATYEG HNH+    +    SS +     K  + + P   T 
Sbjct: 127 -CPVKKKVQRCLEDSSILVATYEGAHNHEPPHDAPAGGSSYSPDSPIKGLVANFPCPTTV 185

Query: 190 DSFGSDVTLDLTLS--GSNQETRPPRNLMQVCDDKK----KIEEYVASLTKDPSFTIAVA 243
           D F   VTLDLTLS  G+ QE R P+N M+  D +K    ++EEYVASLTKD +FT+A+A
Sbjct: 186 DPFQPTVTLDLTLSGTGTGQENRRPQNFMK--DYRKSNCGRVEEYVASLTKDTNFTLALA 243

Query: 244 DAVASSINGPPH 255
            AVA SI   P+
Sbjct: 244 AAVARSITDQPN 255




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359476150|ref|XP_002282812.2| PREDICTED: WRKY transcription factor 18-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|296082063|emb|CBI21068.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224141959|ref|XP_002324328.1| predicted protein [Populus trichocarpa] gi|222865762|gb|EEF02893.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|355398579|gb|AER70306.1| WRKY transcription factor [(Populus tomentosa x P. bolleana) x P. tomentosa] Back     alignment and taxonomy information
>gi|351724539|ref|NP_001237573.1| transcription factor [Glycine max] gi|166203234|gb|ABY84657.1| transcription factor [Glycine max] Back     alignment and taxonomy information
>gi|357437111|ref|XP_003588831.1| WRKY transcription factor [Medicago truncatula] gi|355477879|gb|AES59082.1| WRKY transcription factor [Medicago truncatula] Back     alignment and taxonomy information
>gi|255639751|gb|ACU20169.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356552420|ref|XP_003544566.1| PREDICTED: probable WRKY transcription factor 40-like [Glycine max] Back     alignment and taxonomy information
>gi|356509880|ref|XP_003523671.1| PREDICTED: LOW QUALITY PROTEIN: probable WRKY transcription factor 40-like [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query258
TAIR|locus:2124874310 WRKY18 "WRKY DNA-binding prote 0.461 0.383 0.532 1.1e-33
TAIR|locus:2025687302 WRKY40 "WRKY DNA-binding prote 0.682 0.582 0.446 5.2e-33
TAIR|locus:2047395271 WRKY60 "WRKY DNA-binding prote 0.620 0.590 0.396 2.6e-32
TAIR|locus:2133432489 WRKY47 [Arabidopsis thaliana ( 0.333 0.175 0.528 2.9e-32
UNIPROTKB|Q6IEL0348 WRKY71 "Transcription factor W 0.635 0.471 0.441 2.7e-29
TAIR|locus:2137179528 WRKY42 [Arabidopsis thaliana ( 0.383 0.187 0.480 8e-28
TAIR|locus:2018052553 WRKY6 [Arabidopsis thaliana (t 0.341 0.159 0.516 1.5e-27
TAIR|locus:2034964 480 WRKY61 "WRKY DNA-binding prote 0.534 0.287 0.4 5.4e-27
TAIR|locus:2120623538 WRKY31 "WRKY DNA-binding prote 0.341 0.163 0.516 2.4e-26
TAIR|locus:2150876 548 WRKY72 "WRKY DNA-binding prote 0.341 0.160 0.483 8.5e-26
TAIR|locus:2124874 WRKY18 "WRKY DNA-binding protein 18" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 313 (115.2 bits), Expect = 1.1e-33, Sum P(2) = 1.1e-33
 Identities = 66/124 (53%), Positives = 80/124 (64%)

Query:    50 NRTYSSSDHHHFXXXXXXXXVTAQVPPVPN----AKQSRIFVKADSKDSSLIVKDGHQWR 105
             N + +   HHH         + +   PV +    AK S ++V  ++ D+SL VKDG QWR
Sbjct:   123 NSSSNEDHHHHHQQHEQKNQLLSCKRPVTDSFNKAKVSTVYVPTETSDTSLTVKDGFQWR 182

Query:   106 KYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHNHDVQCSSLG 165
             KYGQKVT+DNPSPRAYFRCS A S CPVKKKVQR  ED S LVATYEG HNH    +S G
Sbjct:   183 KYGQKVTRDNPSPRAYFRCSFAPS-CPVKKKVQRSAEDPSLLVATYEGTHNHLGPNASEG 241

Query:   166 QSSS 169
              ++S
Sbjct:   242 DATS 245


GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS;IMP;IDA
GO:0005634 "nucleus" evidence=ISM;IDA
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA;ISS
GO:0043565 "sequence-specific DNA binding" evidence=IEA
GO:0031347 "regulation of defense response" evidence=IMP
GO:0009751 "response to salicylic acid stimulus" evidence=IEP
GO:0042742 "defense response to bacterium" evidence=IEP
GO:0005515 "protein binding" evidence=IPI
GO:0042802 "identical protein binding" evidence=IPI
GO:0050832 "defense response to fungus" evidence=IEP
GO:0050691 "regulation of defense response to virus by host" evidence=IGI
GO:0010200 "response to chitin" evidence=IEP;RCA
GO:0002237 "response to molecule of bacterial origin" evidence=IMP
TAIR|locus:2025687 WRKY40 "WRKY DNA-binding protein 40" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2047395 WRKY60 "WRKY DNA-binding protein 60" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2133432 WRKY47 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q6IEL0 WRKY71 "Transcription factor WRKY71" [Oryza sativa Indica Group (taxid:39946)] Back     alignment and assigned GO terms
TAIR|locus:2137179 WRKY42 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2018052 WRKY6 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2034964 WRKY61 "WRKY DNA-binding protein 61" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2120623 WRKY31 "WRKY DNA-binding protein 31" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2150876 WRKY72 "WRKY DNA-binding protein 72" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
fgenesh4_pg.C_LG_XVIII000488
hypothetical protein (271 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query258
smart0077459 smart00774, WRKY, DNA binding domain 5e-29
pfam0310660 pfam03106, WRKY, WRKY DNA -binding domain 2e-27
>gnl|CDD|214815 smart00774, WRKY, DNA binding domain Back     alignment and domain information
 Score =  103 bits (260), Expect = 5e-29
 Identities = 35/60 (58%), Positives = 44/60 (73%), Gaps = 1/60 (1%)

Query: 98  VKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHNH 157
           + DG+QWRKYGQKV K +P PR+Y+RC+    GCP KK+VQR  +D S +  TYEGEH H
Sbjct: 1   LDDGYQWRKYGQKVIKGSPYPRSYYRCTYT-QGCPAKKQVQRSDDDPSVVEVTYEGEHTH 59


The WRKY domain is a DNA binding domain found in one or two copies in a superfamily of plant transcription factors. These transcription factors are involved in the regulation of various physiological programs that are unique to plants, including pathogen defense, senescence and trichome development. The domain is a 60 amino acid region that is defined by the conserved amino acid sequence WRKYGQK at its N-terminal end, together with a novel zinc-finger-like motif. It binds specifically to the DNA sequence motif (T)(T)TGAC(C/T), which is known as the W box. The invariant TGAC core is essential for function and WRKY binding. Length = 59

>gnl|CDD|145969 pfam03106, WRKY, WRKY DNA -binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 258
PF0310660 WRKY: WRKY DNA -binding domain; InterPro: IPR00365 99.97
smart0077459 WRKY DNA binding domain. The WRKY domain is a DNA 99.97
PF0450062 FLYWCH: FLYWCH zinc finger domain; InterPro: IPR00 95.76
PF0310191 FAR1: FAR1 DNA-binding domain; InterPro: IPR004330 91.43
COG4026290 Uncharacterized protein containing TOPRIM domain, 90.21
PF0017064 bZIP_1: bZIP transcription factor cAMP response el 88.37
PHA03162135 hypothetical protein; Provisional 85.85
PHA03155115 hypothetical protein; Provisional 85.12
PF04201162 TPD52: Tumour protein D52 family; InterPro: IPR007 84.39
PF0865072 DASH_Dad4: DASH complex subunit Dad4; InterPro: IP 83.17
PF0537755 FlaC_arch: Flagella accessory protein C (FlaC); In 81.0
smart0033865 BRLZ basic region leucin zipper. 80.93
PF05812118 Herpes_BLRF2: Herpesvirus BLRF2 protein; InterPro: 80.84
KOG4196135 consensus bZIP transcription factor MafK [Transcri 80.41
>PF03106 WRKY: WRKY DNA -binding domain; InterPro: IPR003657 The WRKY domain is a 60 amino acid region that is defined by the conserved amino acid sequence WRKYGQK at its N-terminal end, together with a novel zinc-finger- like motif Back     alignment and domain information
Probab=99.97  E-value=5.9e-33  Score=203.51  Aligned_cols=59  Identities=56%  Similarity=1.141  Sum_probs=52.2

Q ss_pred             ccCCccccccCccccCCCCCCcceeecccCCCCCcccccceeecCCCcEEEEEeccccCCC
Q 025097           98 VKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHNHD  158 (258)
Q Consensus        98 ~~DGy~WRKYGQK~ikgn~~PRsYYRCs~~~~gC~akK~VQr~~~D~~~~~~tY~G~HnH~  158 (258)
                      ++|||+|||||||.|+|+++||+||||++.  +|+|+|+|||+.+|+.+++|||+|+|||+
T Consensus         1 ~~Dgy~WRKYGqK~i~g~~~pRsYYrCt~~--~C~akK~Vqr~~~d~~~~~vtY~G~H~h~   59 (60)
T PF03106_consen    1 LDDGYRWRKYGQKNIKGSPYPRSYYRCTHP--GCPAKKQVQRSADDPNIVIVTYEGEHNHP   59 (60)
T ss_dssp             --SSS-EEEEEEEEETTTTCEEEEEEEECT--TEEEEEEEEEETTCCCEEEEEEES--SS-
T ss_pred             CCCCCchhhccCcccCCCceeeEeeecccc--ChhheeeEEEecCCCCEEEEEEeeeeCCC
Confidence            589999999999999999999999999997  79999999999999999999999999997



The WRKY domain is found in one or two copies in a superfamily of plant transcription factors involved in the regulation of various physiological programs that are unique to plants, including pathogen defence, senescence, trichome development and the biosynthesis of secondary metabolites. The WRKY domain binds specifically to the DNA sequence motif (T)(T)TGAC(C/T), which is known as the W box. The invariant TGAC core of the W box is essential for function and WRKY binding []. Some proteins known to contain a WRKY domain include Arabidopsis thaliana ZAP1 (Zinc-dependent Activator Protein-1) and AtWRKY44/TTG2, a protein involved in trichome development and anthocyanin pigmentation; and wild oat ABF1-2, two proteins involved in the gibberelic acid-induced expression of the alpha-Amy2 gene. Structural studies indicate that this domain is a four-stranded beta-sheet with a zinc binding pocket, forming a novel zinc and DNA binding structure []. The WRKYGQK residues correspond to the most N-terminal beta-strand, which enables extensive hydrophobic interactions, contributing to the structural stability of the beta-sheet.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2AYD_A 1WJ2_A 2LEX_A.

>smart00774 WRKY DNA binding domain Back     alignment and domain information
>PF04500 FLYWCH: FLYWCH zinc finger domain; InterPro: IPR007588 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>PF03101 FAR1: FAR1 DNA-binding domain; InterPro: IPR004330 Phytochrome A is the primary photoreceptor for mediating various far-red light-induced responses in higher plants Back     alignment and domain information
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only] Back     alignment and domain information
>PF00170 bZIP_1: bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature; InterPro: IPR011616 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization Back     alignment and domain information
>PHA03162 hypothetical protein; Provisional Back     alignment and domain information
>PHA03155 hypothetical protein; Provisional Back     alignment and domain information
>PF04201 TPD52: Tumour protein D52 family; InterPro: IPR007327 The hD52 gene was originally identified through its elevated expression level in human breast carcinoma Back     alignment and domain information
>PF08650 DASH_Dad4: DASH complex subunit Dad4; InterPro: IPR013959 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis [] Back     alignment and domain information
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct [] Back     alignment and domain information
>smart00338 BRLZ basic region leucin zipper Back     alignment and domain information
>PF05812 Herpes_BLRF2: Herpesvirus BLRF2 protein; InterPro: IPR008642 This family consists of several herpes virus BLRF2 tegument proteins Back     alignment and domain information
>KOG4196 consensus bZIP transcription factor MafK [Transcription] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query258
2ayd_A76 Crystal Structure Of The C-Terminal Wrky Domainof A 5e-17
1wj2_A78 Solution Structure Of The C-Terminal Wrky Domain Of 1e-15
>pdb|2AYD|A Chain A, Crystal Structure Of The C-Terminal Wrky Domainof Atwrky1, An Sa-Induced And Partially Npr1-Dependent Transcription Factor Length = 76 Back     alignment and structure

Iteration: 1

Score = 84.3 bits (207), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 37/63 (58%), Positives = 47/63 (74%), Gaps = 2/63 (3%) Query: 97 IVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHN 156 IV DG++WRKYGQK K +P PR+Y+RCS S GCPVKK V+R D L+ TYEG+H+ Sbjct: 13 IVNDGYRWRKYGQKSVKGSPYPRSYYRCS--SPGCPVKKHVERSSHDTKLLITTYEGKHD 70 Query: 157 HDV 159 HD+ Sbjct: 71 HDM 73
>pdb|1WJ2|A Chain A, Solution Structure Of The C-Terminal Wrky Domain Of Atwrky4 Length = 78 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query258
1wj2_A78 Probable WRKY transcription factor 4; DNA-binding 2e-33
2ayd_A76 WRKY transcription factor 1; beta strands, zinc fi 7e-33
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-04
>1wj2_A Probable WRKY transcription factor 4; DNA-binding domain, zinc-binding, structural genomics; NMR {Arabidopsis thaliana} SCOP: g.79.1.1 PDB: 2lex_A* Length = 78 Back     alignment and structure
 Score =  115 bits (289), Expect = 2e-33
 Identities = 35/80 (43%), Positives = 51/80 (63%), Gaps = 4/80 (5%)

Query: 80  AKQSRIFVKADSKDSSLIVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQR 139
                  V+  S+    ++ DG++WRKYGQKV K NP PR+Y++C+  + GC V+K V+R
Sbjct: 1   GSSGSSGVQTTSEVD--LLDDGYRWRKYGQKVVKGNPYPRSYYKCT--TPGCGVRKHVER 56

Query: 140 CMEDKSFLVATYEGEHNHDV 159
              D   +V TYEG+HNHD+
Sbjct: 57  AATDPKAVVTTYEGKHNHDL 76


>2ayd_A WRKY transcription factor 1; beta strands, zinc finger; 1.60A {Arabidopsis thaliana} Length = 76 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query258
2ayd_A76 WRKY transcription factor 1; beta strands, zinc fi 100.0
1wj2_A78 Probable WRKY transcription factor 4; DNA-binding 100.0
3m91_A51 Proteasome-associated ATPase; coil COIL alpha heli 91.4
2zxx_A79 Geminin; coiled-coil, cell cycle, coiled coil, DNA 87.8
2rpr_A87 Flywch-type zinc finger-containing protein 1; flyw 87.42
1ci6_A63 Transcription factor ATF-4; BZIP; 2.60A {Homo sapi 84.87
1t2k_D61 Cyclic-AMP-dependent transcription factor ATF-2; p 83.17
2wt7_A63 Proto-oncogene protein C-FOS; transcription, trans 82.87
2oxj_A34 Hybrid alpha/beta peptide based on the GCN4-P1 Se 81.34
1wrd_A103 TOM1, target of MYB protein 1; three-helix bundle, 81.32
1jnm_A62 Proto-oncogene C-JUN; BZIP, protein-DNA complex, t 80.07
>2ayd_A WRKY transcription factor 1; beta strands, zinc finger; 1.60A {Arabidopsis thaliana} Back     alignment and structure
Probab=100.00  E-value=9.4e-36  Score=226.94  Aligned_cols=75  Identities=53%  Similarity=0.983  Sum_probs=70.1

Q ss_pred             ceEEEecccCCccccccCCccccccCccccCCCCCCcceeecccCCCCCcccccceeecCCCcEEEEEeccccCCCCCC
Q 025097           83 SRIFVKADSKDSSLIVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHNHDVQC  161 (258)
Q Consensus        83 ~rv~v~~~~~d~s~~~~DGy~WRKYGQK~ikgn~~PRsYYRCs~~~~gC~akK~VQr~~~D~~~~~~tY~G~HnH~~p~  161 (258)
                      +||.|++.++  ..+++|||+|||||||.|+|+++||+||||++.  ||+|+|+|||+++|+.+++|||+|+|||+.|.
T Consensus         1 ~r~~v~t~~~--~~~~~DGy~WRKYGQK~ikgs~~PRsYYrCt~~--gC~a~K~Ver~~~d~~~~~~tY~G~H~H~~p~   75 (76)
T 2ayd_A            1 SRIVVHTQTL--FDIVNDGYRWRKYGQKSVKGSPYPRSYYRCSSP--GCPVKKHVERSSHDTKLLITTYEGKHDHDMPP   75 (76)
T ss_dssp             CEEEEEEECS--SSCCCCSSCEEEEEEECCTTCSSCEEEEEECST--TCCCEEEEEECSSSTTEEEEEEESCCSSCCCC
T ss_pred             CeEEEEecCC--CCcCCCCchhhhCcccccCCCCCceeEeEcCCC--CCCceeeEEEECCCCCEEEEEEccCcCCCCCC
Confidence            5899999875  468899999999999999999999999999985  79999999999999999999999999999885



>1wj2_A Probable WRKY transcription factor 4; DNA-binding domain, zinc-binding, structural genomics; NMR {Arabidopsis thaliana} SCOP: g.79.1.1 PDB: 2lex_A* Back     alignment and structure
>3m91_A Proteasome-associated ATPase; coil COIL alpha helix, ATP-binding, chaperone, nucleotide-BI proteasome, S-nitrosylation; 1.80A {Mycobacterium tuberculosis} PDB: 3m9h_A Back     alignment and structure
>2zxx_A Geminin; coiled-coil, cell cycle, coiled coil, DNA replication inhibitor, phosphoprotein, DNA-binding, nucleus, proto-oncogene; HET: DNA; 2.80A {Mus musculus} Back     alignment and structure
>2rpr_A Flywch-type zinc finger-containing protein 1; flywch domain, alternative splicing, DNA-binding, metal- binding, nucleus, metal binding protein; NMR {Homo sapiens} Back     alignment and structure
>1ci6_A Transcription factor ATF-4; BZIP; 2.60A {Homo sapiens} SCOP: h.1.3.1 Back     alignment and structure
>1t2k_D Cyclic-AMP-dependent transcription factor ATF-2; protein DNA complex, transcription/DNA complex; 3.00A {Homo sapiens} SCOP: h.1.3.1 Back     alignment and structure
>2wt7_A Proto-oncogene protein C-FOS; transcription, transcription regulation, nucleus, activator, repressor, DNA-binding, phosphoprotein, differentiation; 2.30A {Mus musculus} PDB: 1fos_E* 1a02_F* 1s9k_D Back     alignment and structure
>2oxj_A Hybrid alpha/beta peptide based on the GCN4-P1 Se heptad positions B and F substituted...; helix bundle, foldamer, unknown function; HET: B3K B3D B3E B3S B3Y B3X B3A BAL; 2.00A {Synthetic} PDB: 2oxk_A* Back     alignment and structure
>1wrd_A TOM1, target of MYB protein 1; three-helix bundle, ubiquitin-binding protein, protein trans signaling protein complex; 1.75A {Homo sapiens} SCOP: a.7.8.1 Back     alignment and structure
>1jnm_A Proto-oncogene C-JUN; BZIP, protein-DNA complex, transcription/DNA complex; 2.20A {Homo sapiens} SCOP: h.1.3.1 PDB: 1fos_F 2h7h_A 1t2k_C 1a02_J* 1s9k_E 1jun_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 258
d1wj2a_71 g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cr 3e-24
>d1wj2a_ g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 71 Back     information, alignment and structure

class: Small proteins
fold: WRKY DNA-binding domain
superfamily: WRKY DNA-binding domain
family: WRKY DNA-binding domain
domain: WRKY DNA-binding protein 4
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 90.3 bits (224), Expect = 3e-24
 Identities = 33/63 (52%), Positives = 46/63 (73%), Gaps = 2/63 (3%)

Query: 97  IVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHN 156
           ++ DG++WRKYGQKV K NP PR+Y++C+    GC V+K V+R   D   +V TYEG+HN
Sbjct: 9   LLDDGYRWRKYGQKVVKGNPYPRSYYKCTTP--GCGVRKHVERAATDPKAVVTTYEGKHN 66

Query: 157 HDV 159
           HD+
Sbjct: 67  HDL 69


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query258
d1wj2a_71 WRKY DNA-binding protein 4 {Thale cress (Arabidops 100.0
d1wrda193 Target of Myb protein 1, TOM1 {Human (Homo sapiens 86.82
>d1wj2a_ g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: Small proteins
fold: WRKY DNA-binding domain
superfamily: WRKY DNA-binding domain
family: WRKY DNA-binding domain
domain: WRKY DNA-binding protein 4
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=100.00  E-value=1.7e-35  Score=221.14  Aligned_cols=64  Identities=52%  Similarity=1.040  Sum_probs=61.3

Q ss_pred             ccccCCccccccCccccCCCCCCcceeecccCCCCCcccccceeecCCCcEEEEEeccccCCCCCC
Q 025097           96 LIVKDGHQWRKYGQKVTKDNPSPRAYFRCSMASSGCPVKKKVQRCMEDKSFLVATYEGEHNHDVQC  161 (258)
Q Consensus        96 ~~~~DGy~WRKYGQK~ikgn~~PRsYYRCs~~~~gC~akK~VQr~~~D~~~~~~tY~G~HnH~~p~  161 (258)
                      .+++|||+|||||||.|+|+++||+||||++.  ||+|+|+|||+++|+.+++|||+|+|||+.|+
T Consensus         8 ~~~dDGy~WRKYGQK~ikgs~~pRsYYrCt~~--~C~a~K~Vqr~~~d~~~~~vtY~G~H~h~~Ps   71 (71)
T d1wj2a_           8 DLLDDGYRWRKYGQKVVKGNPYPRSYYKCTTP--GCGVRKHVERAATDPKAVVTTYEGKHNHDLPA   71 (71)
T ss_dssp             CCCCSSSCBCCCEEECCTTCSSCEEEEEEECS--SCEEEEEEEEETTTTSEEEEEEESCCSSCCCC
T ss_pred             ccCCCCcEecccCceeccCCCCceEEEEcccc--CCCCcceEEEEcCCCCEEEEEEeeEeCCCCCC
Confidence            57899999999999999999999999999985  79999999999999999999999999999885



>d1wrda1 a.7.8.1 (A:215-307) Target of Myb protein 1, TOM1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure