Citrus Sinensis ID: 013190


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------45
MNTENQRLRDMLSQVTNNYNALQMHIIALMQQQQENRRAPESNQAHEVVEGKDEGKKHDDQVMVPRQFIGLGPSAETDHEVSNCSSDEERTLSGTPPNIVEAASKEHVNSNGKNEIVSFDDQAAAAAAAENSNGKRIGREESPESETQGWGPNNKVQKLSSAKGIDQSNEATMRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATISASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSKFSGLQLSQNIGSNSQSGSHQTLPPPLQQPQQLADTVSAATAAITADPNFTAALAAAITSIIGGAQNPFSNNSNNNNRSCIIFTNFFFQY
ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEcccccccccccHHHHHccccccccccccccccccccccccccccHHHHcccccEEEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHccccHHHHHHHHHHHHHccccccccccccccccccccccccEEcc
ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccEEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEEcccccccccccHHHcccEEEccccccccEEEEccccccccHHHHEEcccccEEEEEEEcccccccccccHcccccccccHHHHccccccccccccccccccccccccccccccccccccccccEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHccccHHHHHHHHHHHHHcccccccccccccccccccccccEEEcc
MNTENQRLRDMLSQVTNNYNALQMHIIALMQQQqenrrapesnqahevvegkdegkkhddqvmvprqfiglgpsaetdhevsncssdeertlsgtppniveAASKehvnsngkneivsFDDQAAAAAAAENsngkrigreespesetqgwgpnnkvqklssakgidqsneATMRKARVSVrarseapmitdgcqwrkygqkmakgnpcprayyrctmavgcpvrkqvqrCAEDRTILITtyegnhnhplppaamAMASTTTAAASMLLSgsmssadgimnpnlLARAilpcsssmatisasapfptvtldlthspnplqLQRQAAQFQvqfpgqpqnlasvtntqlpqvFGQALynqskfsglqlsqnigsnsqsgshqtlppplqqpqqLADTVSAATAAITADPNFTAALAAAITSIIggaqnpfsnnsnnnnrsciiftnfffqy
mntenqrlRDMLSQVTNNYNALQMHIIALMQQQQENRRAPESNQAHEVvegkdegkkhddqvMVPRQFIGLGPSAETDHEVSNCSSDEERTLSGTPPNIVEAASKEHVNSNGKNEIVSFDDQAAAAAAAensngkrigreespesetqgwgpnnkvqklssakgidqsneatmrkarvsvrarseapmitdgcqwrkygqKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATISASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSKFSGLQLSQNIGSNSQSGSHQTLPPPLQQPQQLADTVSAATAAITADPNFTAALAAAITSIIGGAQNpfsnnsnnnnrsCIIFTNFFFQY
MNTENQRLRDMLSQVTNNYNALQMHIIALMQQQQENRRAPESNQAHEVVEGKDEGKKHDDQVMVPRQFIGLGPSAETDHEVSNCSSDEERTLSGTPPNIVEAASKEHVNSNGKNEIVSFDDQaaaaaaaENSNGKRIGREESPESETQGWGPNNKVQKLSSAKGIDQSNEATMRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHplppaamamastttaaasmllsgsmssaDGIMNPNLLARAILPCSSSMATISASAPFPTVTLDLTHSPNPlqlqrqaaqfqvqfPGQPQNLASVTNTQLPQVFGQALYNQSKFSGlqlsqnigsnsqsgsHQTlppplqqpqqlADTVSAATAAITADPNFtaalaaaitsiiggaQNPFsnnsnnnnRSCIIFTNFFFQY
**************VTNNYNALQMHIIAL**************************************************************************************************************************************************************MITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGN************************************PNLLARAILPCSSSMATI*****FPTVTL***********************************QLPQVFGQALYN*****************************************ATAAITADPNFTAALAAAITSIIGGAQN*********NRSCIIFTNFFF**
MNTENQRLRDMLSQVTNNYNALQMHII*********************************************************************************************************************************************************RARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHP***************************************************************************************************************************************************SAATAAITADPNFTAALAAAITS***********************TNFFFQY
********RDMLSQVTNNYNALQMHIIALMQQ*************************HDDQVMVPRQFIGLGPS*******************GTPPNIVEAASKEHVNSNGKNEIVSFDDQ*****************************PNNKVQKLSSAKGIDQSNEATMRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATISASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSKFSGLQLSQN****************************AATAAITADPNFTAALAAAITSIIGGAQNPFSNNSNNNNRSCIIFTNFFFQY
MNTENQRLRDMLSQVTNNYNALQMHIIALMQQQQEN*********HEVVEGKDEGK********************************************************************************************************************MRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPL***************************************************SAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSKFSGL*L***************************DTVSAATAAITADPNFTAALAAAITSIIGG***************CIIFTNFFFQ*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MNTENQRLRDMLSQVTNNYNALQMHIIALMQQQQENRRAPESNQAHEVVEGKDEGKKHDDQVMVPRQFIGLGPSAETDHEVSNCSSDEERTLSGTPPNIVEAASKEHVNSNGKNEIVSFDDQAAAAAAAENSNGKRIGREESPESETQGWGPNNKVQKLSSAKGIDQSNEATMRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATISASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSKFSGLQLSQNIGSNSQSGSHQTLPPPLQQPQQLADTVSAATAAITADPNFTAALAAAITSIIGGAQNPFSNNSNNNNRSCIIFTNFFFQY
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query448 2.2.26 [Sep-21-2011]
Q9XEC3528 Probable WRKY transcripti yes no 0.792 0.672 0.556 1e-112
Q9C519553 WRKY transcription factor no no 0.830 0.672 0.563 1e-110
Q93WT0538 Probable WRKY transcripti no no 0.828 0.689 0.574 5e-98
Q9ZSI7489 Probable WRKY transcripti no no 0.446 0.408 0.498 2e-49
Q9LXG8548 Probable WRKY transcripti no no 0.868 0.709 0.337 2e-49
Q8VWV6480 Probable WRKY transcripti no no 0.892 0.833 0.342 2e-44
Q9C9F0374 Probable WRKY transcripti no no 0.234 0.280 0.688 3e-40
Q9CAR4387 Probable WRKY transcripti no no 0.642 0.744 0.375 1e-39
Q8S8P5519 Probable WRKY transcripti no no 0.200 0.173 0.505 8e-23
Q9C5T4310 WRKY transcription factor no no 0.254 0.367 0.426 2e-21
>sp|Q9XEC3|WRK42_ARATH Probable WRKY transcription factor 42 OS=Arabidopsis thaliana GN=WRKY42 PE=2 SV=1 Back     alignment and function desciption
 Score =  405 bits (1042), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 245/440 (55%), Positives = 287/440 (65%), Gaps = 85/440 (19%)

Query: 4   ENQRLRDMLSQVTNNYNALQMHIIALMQQQQENRRAPESNQAHEVVEGKDEGK-KHDDQV 62
           +NQRL+ MLSQ TNN+N+LQM ++A+M+QQ+++             E  D  K +H+   
Sbjct: 126 DNQRLKQMLSQTTNNFNSLQMQLVAVMRQQEDHHHLA-------TTENNDNVKNRHEVPE 178

Query: 63  MVPRQFIGLGPSAETDHEVSNCSSDEERTL--SGTPPNIVEAASKEHVNSNGKNEIVSFD 120
           MVPRQFI LGP ++   EVS+    EERT   SG+PP+++E +S                
Sbjct: 179 MVPRQFIDLGPHSD---EVSS----EERTTVRSGSPPSLLEKSSSRQ------------- 218

Query: 121 DQAAAAAAAENSNGKRI-GREESPESETQGWGPNNKVQKL--------------SSAKGI 165
                       NGKR+  REESPE+E+ GW   NKV K               +S+K I
Sbjct: 219 ------------NGKRVLVREESPETESNGWRNPNKVPKHHASSSICGGNGSENASSKVI 266

Query: 166 DQ-SNEATMRKARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVR 224
           +Q + EATMRKARVSVRARSEAPM++DGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVR
Sbjct: 267 EQAAAEATMRKARVSVRARSEAPMLSDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVR 326

Query: 225 KQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAASMLLSGS-MSSADGIMNP-N 282
           KQVQRCAEDRTILITTYEGNHNHPLPPAAM MASTTTAAASMLLSGS MS+ DG+MNP N
Sbjct: 327 KQVQRCAEDRTILITTYEGNHNHPLPPAAMNMASTTTAAASMLLSGSTMSNQDGLMNPTN 386

Query: 283 LLARAILPCSSSMATISASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVT 342
           LLAR ILPCSSSMATISASAPFPT+TLDLT S              +QF  Q   L  + 
Sbjct: 387 LLARTILPCSSSMATISASAPFPTITLDLTES---PNGNNPTNNPLMQF-SQRSGLVELN 442

Query: 343 NTQLPQVFGQALY--NQSKFSGLQLSQNIGSNSQSGSHQTLPPPLQQPQQLADTVSAATA 400
            + LP + GQALY   QSKFSGL +                     QP    ++VSAATA
Sbjct: 443 QSVLPHMMGQALYYNQQSKFSGLHMP-------------------SQPLNAGESVSAATA 483

Query: 401 AITADPNFTAALAAAITSII 420
           AI ++PNF AALAAAITSII
Sbjct: 484 AIASNPNFAAALAAAITSII 503




Transcription factor. Interacts specifically with the W box (5'-(T)TGAC[CT]-3'), a frequently occurring elicitor-responsive cis-acting element.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9C519|WRKY6_ARATH WRKY transcription factor 6 OS=Arabidopsis thaliana GN=WRKY6 PE=1 SV=1 Back     alignment and function description
>sp|Q93WT0|WRK31_ARATH Probable WRKY transcription factor 31 OS=Arabidopsis thaliana GN=WRKY31 PE=2 SV=1 Back     alignment and function description
>sp|Q9ZSI7|WRK47_ARATH Probable WRKY transcription factor 47 OS=Arabidopsis thaliana GN=WRKY47 PE=2 SV=2 Back     alignment and function description
>sp|Q9LXG8|WRK72_ARATH Probable WRKY transcription factor 72 OS=Arabidopsis thaliana GN=WRKY72 PE=2 SV=1 Back     alignment and function description
>sp|Q8VWV6|WRK61_ARATH Probable WRKY transcription factor 61 OS=Arabidopsis thaliana GN=WRKY61 PE=2 SV=1 Back     alignment and function description
>sp|Q9C9F0|WRKY9_ARATH Probable WRKY transcription factor 9 OS=Arabidopsis thaliana GN=WRKY9 PE=2 SV=1 Back     alignment and function description
>sp|Q9CAR4|WRK36_ARATH Probable WRKY transcription factor 36 OS=Arabidopsis thaliana GN=WRKY36 PE=2 SV=1 Back     alignment and function description
>sp|Q8S8P5|WRK33_ARATH Probable WRKY transcription factor 33 OS=Arabidopsis thaliana GN=WRKY33 PE=1 SV=2 Back     alignment and function description
>sp|Q9C5T4|WRK18_ARATH WRKY transcription factor 18 OS=Arabidopsis thaliana GN=WRKY18 PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query448
255554813 580 WRKY transcription factor, putative [Ric 0.897 0.693 0.771 1e-162
259121421 602 WRKY transcription factor 28 [(Populus t 0.904 0.672 0.737 1e-150
224118042 578 predicted protein [Populus trichocarpa] 0.852 0.660 0.732 1e-149
225444291535 PREDICTED: WRKY transcription factor 6-l 0.895 0.749 0.665 1e-148
147779800535 hypothetical protein VITISV_002247 [Viti 0.859 0.719 0.686 1e-148
209867508 556 WRKY transcription factor [Picrorhiza ku 0.870 0.701 0.634 1e-142
224115798540 predicted protein [Populus trichocarpa] 0.890 0.738 0.740 1e-141
302144104510 unnamed protein product [Vitis vinifera] 0.839 0.737 0.640 1e-138
356547095 614 PREDICTED: WRKY transcription factor 6 [ 0.935 0.682 0.608 1e-138
147841888 620 hypothetical protein VITISV_024690 [Viti 0.881 0.637 0.633 1e-138
>gi|255554813|ref|XP_002518444.1| WRKY transcription factor, putative [Ricinus communis] gi|223542289|gb|EEF43831.1| WRKY transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  578 bits (1491), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 327/424 (77%), Positives = 353/424 (83%), Gaps = 22/424 (5%)

Query: 1   MNTENQRLRDMLSQVTNNYNALQMHIIALMQQQQENRRAPESNQAHEVVEGKDEGKKHDD 60
           MN ENQRLRDML+QVTNNYNALQMH++ALMQQQQ+    PE+   HEVV+GK E KKH+ 
Sbjct: 158 MNNENQRLRDMLTQVTNNYNALQMHLVALMQQQQQQNHGPEATHEHEVVQGKSEEKKHE- 216

Query: 61  QVMVPRQFIGLGPSAETDHEVSNCSSDEERTLSGTPPNIVEAASKEHVNSNGKNEIVSFD 120
             +VPRQF+ LGPSAETD E+S+ SSD+ERT SGTP    E AS   V +NGK E+ +FD
Sbjct: 217 --VVPRQFLDLGPSAETD-EISHSSSDDERTRSGTPQTNTETAS---VKNNGKIEMSTFD 270

Query: 121 DQAAAAAAAENSNGKRIGREESPESETQGWGPNNKVQKLSSA-KGIDQSNEATMRKARVS 179
            + ++       +GK IGREESPESETQGW PN KVQKL+ A KGIDQ+ EATMRKARVS
Sbjct: 271 QENSSF-----RDGKGIGREESPESETQGWNPN-KVQKLNPASKGIDQNAEATMRKARVS 324

Query: 180 VRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILIT 239
           VRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAED +ILIT
Sbjct: 325 VRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDTSILIT 384

Query: 240 TYEGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATIS 299
           TYEGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATIS
Sbjct: 385 TYEGNHNHPLPPAAMAMASTTTAAASMLLSGSMSSADGIMNPNLLARAILPCSSSMATIS 444

Query: 300 ASAPFPTVTLDLTHSPNPLQLQRQAAQFQVQFPGQPQNLASVTNTQLPQVFGQALYNQSK 359
           ASAPFPTVTLDLTHSPNPLQ+QR    FQV FPGQPQN ASVT  QLPQVFGQALYNQSK
Sbjct: 445 ASAPFPTVTLDLTHSPNPLQVQRPPTHFQVPFPGQPQNFASVTPQQLPQVFGQALYNQSK 504

Query: 360 FSGLQLSQNIGSNSQSGSHQTLPPPLQQPQQLADTVSAATAAITADPNFTAALAAAITSI 419
           FSGLQLSQ +  + Q      L P   Q   L D+VSAATAAITADPNFTAALAAAITSI
Sbjct: 505 FSGLQLSQELPQSQQ------LHP--SQQHSLVDSVSAATAAITADPNFTAALAAAITSI 556

Query: 420 IGGA 423
           IGG 
Sbjct: 557 IGGG 560




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|259121421|gb|ACV92030.1| WRKY transcription factor 28 [(Populus tomentosa x P. bolleana) x P. tomentosa] Back     alignment and taxonomy information
>gi|224118042|ref|XP_002331543.1| predicted protein [Populus trichocarpa] gi|222873767|gb|EEF10898.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225444291|ref|XP_002263115.1| PREDICTED: WRKY transcription factor 6-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|147779800|emb|CAN70362.1| hypothetical protein VITISV_002247 [Vitis vinifera] Back     alignment and taxonomy information
>gi|209867508|gb|ACI90292.1| WRKY transcription factor [Picrorhiza kurrooa] Back     alignment and taxonomy information
>gi|224115798|ref|XP_002317127.1| predicted protein [Populus trichocarpa] gi|222860192|gb|EEE97739.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|302144104|emb|CBI23209.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356547095|ref|XP_003541953.1| PREDICTED: WRKY transcription factor 6 [Glycine max] Back     alignment and taxonomy information
>gi|147841888|emb|CAN65218.1| hypothetical protein VITISV_024690 [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query448
TAIR|locus:2018052553 WRKY6 [Arabidopsis thaliana (t 0.573 0.464 0.560 1.5e-89
TAIR|locus:2120623538 WRKY31 "WRKY DNA-binding prote 0.808 0.672 0.509 5.3e-88
TAIR|locus:2137179528 WRKY42 [Arabidopsis thaliana ( 0.754 0.640 0.483 8.5e-77
TAIR|locus:2133432489 WRKY47 [Arabidopsis thaliana ( 0.363 0.333 0.523 5.8e-51
TAIR|locus:2034964480 WRKY61 "WRKY DNA-binding prote 0.665 0.620 0.341 1.5e-41
TAIR|locus:2150876548 WRKY72 "WRKY DNA-binding prote 0.656 0.536 0.332 8.9e-38
TAIR|locus:2196779387 WRKY36 "WRKY DNA-binding prote 0.520 0.602 0.338 1e-34
TAIR|locus:2199317374 WRKY9 "WRKY DNA-binding protei 0.25 0.299 0.610 1.3e-34
TAIR|locus:2124874310 WRKY18 "WRKY DNA-binding prote 0.258 0.374 0.410 2.5e-28
UNIPROTKB|Q6IEL0348 WRKY71 "Transcription factor W 0.234 0.301 0.447 2.1e-26
TAIR|locus:2018052 WRKY6 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 733 (263.1 bits), Expect = 1.5e-89, Sum P(2) = 1.5e-89
 Identities = 162/289 (56%), Positives = 178/289 (61%)

Query:   132 SNGKRIGREESPESETQGWGPNNKVQKLSSAKG--IDQSNEATMRKARVSVRARSEAPMI 189
             SNGKR+GREESPE+E+      NK+QK++S      DQ+ EATMRKARVSVRARSEAPMI
Sbjct:   258 SNGKRLGREESPETES------NKIQKVNSTTPTTFDQTAEATMRKARVSVRARSEAPMI 311

Query:   190 TDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHXX 249
             +DGCQWRKYGQKMAKGNPCPRAYYRCTMA GCPVRKQVQRCAEDR+ILITTYEGNHNH  
Sbjct:   312 SDGCQWRKYGQKMAKGNPCPRAYYRCTMATGCPVRKQVQRCAEDRSILITTYEGNHNHPL 371

Query:   250 XXXXXXXXXXXXXXXXXXXXXXXXXXDGIMNP-NLLARAILPCSSSMATISASAPFPTVT 308
                                       DG+MNP NLLARA+LPCS+SMATISASAPFPTVT
Sbjct:   372 PPAAVAMASTTTAAANMLLSGSMSSHDGMMNPTNLLARAVLPCSTSMATISASAPFPTVT 431

Query:   309 LDLTHSP------NPXXXXXXXXXXXXXXPGQPQNLASVTNT---QLPQVFGQALYNQSK 359
             LDLTHSP      NP                  Q    +TN     LP V GQALYNQSK
Sbjct:   432 LDLTHSPPPPNGSNPSSSAATNNNHNSLMQRPQQQQQQMTNLPPGMLPHVIGQALYNQSK 491

Query:   360 FSGXXXXXXXXXXXXXXXHQTXXXXXXXXXXXADTVSAATAAITADPNF 408
             FSG                             ADT++A    +TADPNF
Sbjct:   492 FSGLQFSGGS----------PSTAAFSQSHAVADTITA----LTADPNF 526


GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS;IMP
GO:0005634 "nucleus" evidence=ISM
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA;ISS
GO:0043565 "sequence-specific DNA binding" evidence=IEA
GO:0010200 "response to chitin" evidence=IEP;RCA
GO:0005515 "protein binding" evidence=IPI
GO:0016036 "cellular response to phosphate starvation" evidence=IMP
GO:0044212 "transcription regulatory region DNA binding" evidence=IDA
GO:0045892 "negative regulation of transcription, DNA-dependent" evidence=IMP
GO:0080169 "cellular response to boron-containing substance deprivation" evidence=IMP
GO:0002679 "respiratory burst involved in defense response" evidence=RCA
GO:0009407 "toxin catabolic process" evidence=RCA
GO:0010583 "response to cyclopentenone" evidence=RCA
GO:0035556 "intracellular signal transduction" evidence=RCA
GO:0043090 "amino acid import" evidence=RCA
TAIR|locus:2120623 WRKY31 "WRKY DNA-binding protein 31" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2137179 WRKY42 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2133432 WRKY47 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2034964 WRKY61 "WRKY DNA-binding protein 61" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2150876 WRKY72 "WRKY DNA-binding protein 72" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2196779 WRKY36 "WRKY DNA-binding protein 36" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2199317 WRKY9 "WRKY DNA-binding protein 9" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2124874 WRKY18 "WRKY DNA-binding protein 18" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q6IEL0 WRKY71 "Transcription factor WRKY71" [Oryza sativa Indica Group (taxid:39946)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q9XEC3WRK42_ARATHNo assigned EC number0.55680.79240.6723yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query448
smart0077459 smart00774, WRKY, DNA binding domain 4e-36
pfam0310660 pfam03106, WRKY, WRKY DNA -binding domain 2e-35
>gnl|CDD|214815 smart00774, WRKY, DNA binding domain Back     alignment and domain information
 Score =  126 bits (320), Expect = 4e-36
 Identities = 35/57 (61%), Positives = 43/57 (75%)

Query: 191 DGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNH 247
           DG QWRKYGQK+ KG+P PR+YYRCT   GCP +KQVQR  +D +++  TYEG H H
Sbjct: 3   DGYQWRKYGQKVIKGSPYPRSYYRCTYTQGCPAKKQVQRSDDDPSVVEVTYEGEHTH 59


The WRKY domain is a DNA binding domain found in one or two copies in a superfamily of plant transcription factors. These transcription factors are involved in the regulation of various physiological programs that are unique to plants, including pathogen defense, senescence and trichome development. The domain is a 60 amino acid region that is defined by the conserved amino acid sequence WRKYGQK at its N-terminal end, together with a novel zinc-finger-like motif. It binds specifically to the DNA sequence motif (T)(T)TGAC(C/T), which is known as the W box. The invariant TGAC core is essential for function and WRKY binding. Length = 59

>gnl|CDD|145969 pfam03106, WRKY, WRKY DNA -binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 448
PF0310660 WRKY: WRKY DNA -binding domain; InterPro: IPR00365 99.96
smart0077459 WRKY DNA binding domain. The WRKY domain is a DNA 99.96
PF0310191 FAR1: FAR1 DNA-binding domain; InterPro: IPR004330 93.76
PF0450062 FLYWCH: FLYWCH zinc finger domain; InterPro: IPR00 92.19
>PF03106 WRKY: WRKY DNA -binding domain; InterPro: IPR003657 The WRKY domain is a 60 amino acid region that is defined by the conserved amino acid sequence WRKYGQK at its N-terminal end, together with a novel zinc-finger- like motif Back     alignment and domain information
Probab=99.96  E-value=2.4e-31  Score=208.40  Aligned_cols=59  Identities=66%  Similarity=1.288  Sum_probs=52.2

Q ss_pred             CCCcchhhcccccccCCCCCCcccccccccCCcccccceeeecCCCcEEEEEecccCCCC
Q 013190          189 ITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHP  248 (448)
Q Consensus       189 ~~DGy~WRKYGQK~IKGnp~PRsYYRCt~~~gC~arKqVQr~~~D~~i~~tTY~G~HnH~  248 (448)
                      ++|||+|||||||.|+|+++||+||||++. +|+|+|+|||+.+|+.+++|||+|+|||+
T Consensus         1 ~~Dgy~WRKYGqK~i~g~~~pRsYYrCt~~-~C~akK~Vqr~~~d~~~~~vtY~G~H~h~   59 (60)
T PF03106_consen    1 LDDGYRWRKYGQKNIKGSPYPRSYYRCTHP-GCPAKKQVQRSADDPNIVIVTYEGEHNHP   59 (60)
T ss_dssp             --SSS-EEEEEEEEETTTTCEEEEEEEECT-TEEEEEEEEEETTCCCEEEEEEES--SS-
T ss_pred             CCCCCchhhccCcccCCCceeeEeeecccc-ChhheeeEEEecCCCCEEEEEEeeeeCCC
Confidence            479999999999999999999999999995 99999999999999999999999999997



The WRKY domain is found in one or two copies in a superfamily of plant transcription factors involved in the regulation of various physiological programs that are unique to plants, including pathogen defence, senescence, trichome development and the biosynthesis of secondary metabolites. The WRKY domain binds specifically to the DNA sequence motif (T)(T)TGAC(C/T), which is known as the W box. The invariant TGAC core of the W box is essential for function and WRKY binding []. Some proteins known to contain a WRKY domain include Arabidopsis thaliana ZAP1 (Zinc-dependent Activator Protein-1) and AtWRKY44/TTG2, a protein involved in trichome development and anthocyanin pigmentation; and wild oat ABF1-2, two proteins involved in the gibberelic acid-induced expression of the alpha-Amy2 gene. Structural studies indicate that this domain is a four-stranded beta-sheet with a zinc binding pocket, forming a novel zinc and DNA binding structure []. The WRKYGQK residues correspond to the most N-terminal beta-strand, which enables extensive hydrophobic interactions, contributing to the structural stability of the beta-sheet.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2AYD_A 1WJ2_A 2LEX_A.

>smart00774 WRKY DNA binding domain Back     alignment and domain information
>PF03101 FAR1: FAR1 DNA-binding domain; InterPro: IPR004330 Phytochrome A is the primary photoreceptor for mediating various far-red light-induced responses in higher plants Back     alignment and domain information
>PF04500 FLYWCH: FLYWCH zinc finger domain; InterPro: IPR007588 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query448
2ayd_A76 Crystal Structure Of The C-Terminal Wrky Domainof A 3e-17
1wj2_A78 Solution Structure Of The C-Terminal Wrky Domain Of 4e-17
>pdb|2AYD|A Chain A, Crystal Structure Of The C-Terminal Wrky Domainof Atwrky1, An Sa-Induced And Partially Npr1-Dependent Transcription Factor Length = 76 Back     alignment and structure

Iteration: 1

Score = 86.3 bits (212), Expect = 3e-17, Method: Composition-based stats. Identities = 37/72 (51%), Positives = 53/72 (73%), Gaps = 1/72 (1%) Query: 176 ARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRT 235 +R+ V ++ ++ DG +WRKYGQK KG+P PR+YYRC+ + GCPV+K V+R + D Sbjct: 1 SRIVVHTQTLFDIVNDGYRWRKYGQKSVKGSPYPRSYYRCS-SPGCPVKKHVERSSHDTK 59 Query: 236 ILITTYEGNHNH 247 +LITTYEG H+H Sbjct: 60 LLITTYEGKHDH 71
>pdb|1WJ2|A Chain A, Solution Structure Of The C-Terminal Wrky Domain Of Atwrky4 Length = 78 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query448
2ayd_A76 WRKY transcription factor 1; beta strands, zinc fi 5e-48
1wj2_A78 Probable WRKY transcription factor 4; DNA-binding 5e-47
>2ayd_A WRKY transcription factor 1; beta strands, zinc finger; 1.60A {Arabidopsis thaliana} Length = 76 Back     alignment and structure
 Score =  158 bits (400), Expect = 5e-48
 Identities = 39/77 (50%), Positives = 56/77 (72%), Gaps = 1/77 (1%)

Query: 176 ARVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRT 235
           +R+ V  ++   ++ DG +WRKYGQK  KG+P PR+YYRC+ + GCPV+K V+R + D  
Sbjct: 1   SRIVVHTQTLFDIVNDGYRWRKYGQKSVKGSPYPRSYYRCS-SPGCPVKKHVERSSHDTK 59

Query: 236 ILITTYEGNHNHPLPPA 252
           +LITTYEG H+H +PP 
Sbjct: 60  LLITTYEGKHDHDMPPG 76


>1wj2_A Probable WRKY transcription factor 4; DNA-binding domain, zinc-binding, structural genomics; NMR {Arabidopsis thaliana} SCOP: g.79.1.1 PDB: 2lex_A* Length = 78 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query448
2ayd_A76 WRKY transcription factor 1; beta strands, zinc fi 100.0
1wj2_A78 Probable WRKY transcription factor 4; DNA-binding 100.0
>2ayd_A WRKY transcription factor 1; beta strands, zinc finger; 1.60A {Arabidopsis thaliana} Back     alignment and structure
Probab=100.00  E-value=3.9e-35  Score=238.79  Aligned_cols=74  Identities=53%  Similarity=1.115  Sum_probs=71.7

Q ss_pred             eEEEEecCCCCCCCCcchhhcccccccCCCCCCcccccccccCCcccccceeeecCCCcEEEEEecccCCCCCCc
Q 013190          177 RVSVRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPP  251 (448)
Q Consensus       177 RV~v~t~se~~~~~DGy~WRKYGQK~IKGnp~PRsYYRCt~~~gC~arKqVQr~~~D~~i~~tTY~G~HnH~~p~  251 (448)
                      ||.|++.++.++++|||+|||||||.|||++|||+||||++ .||+|+|+|||+.+|+.+++|||+|+|||+.|.
T Consensus         2 r~~v~t~~~~~~~~DGy~WRKYGQK~ikgs~~PRsYYrCt~-~gC~a~K~Ver~~~d~~~~~~tY~G~H~H~~p~   75 (76)
T 2ayd_A            2 RIVVHTQTLFDIVNDGYRWRKYGQKSVKGSPYPRSYYRCSS-PGCPVKKHVERSSHDTKLLITTYEGKHDHDMPP   75 (76)
T ss_dssp             EEEEEEECSSSCCCCSSCEEEEEEECCTTCSSCEEEEEECS-TTCCCEEEEEECSSSTTEEEEEEESCCSSCCCC
T ss_pred             eEEEEecCCCCcCCCCchhhhCcccccCCCCCceeEeEcCC-CCCCceeeEEEECCCCCEEEEEEccCcCCCCCC
Confidence            89999999999999999999999999999999999999998 699999999999999999999999999999875



>1wj2_A Probable WRKY transcription factor 4; DNA-binding domain, zinc-binding, structural genomics; NMR {Arabidopsis thaliana} SCOP: g.79.1.1 PDB: 2lex_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 448
d1wj2a_71 g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cr 6e-35
>d1wj2a_ g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 71 Back     information, alignment and structure

class: Small proteins
fold: WRKY DNA-binding domain
superfamily: WRKY DNA-binding domain
family: WRKY DNA-binding domain
domain: WRKY DNA-binding protein 4
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score =  122 bits (307), Expect = 6e-35
 Identities = 41/71 (57%), Positives = 51/71 (71%), Gaps = 1/71 (1%)

Query: 180 VRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILIT 239
           V+  SE  ++ DG +WRKYGQK+ KGNP PR+YY+CT   GC VRK V+R A D   ++T
Sbjct: 1   VQTTSEVDLLDDGYRWRKYGQKVVKGNPYPRSYYKCTTP-GCGVRKHVERAATDPKAVVT 59

Query: 240 TYEGNHNHPLP 250
           TYEG HNH LP
Sbjct: 60  TYEGKHNHDLP 70


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query448
d1wj2a_71 WRKY DNA-binding protein 4 {Thale cress (Arabidops 100.0
>d1wj2a_ g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: Small proteins
fold: WRKY DNA-binding domain
superfamily: WRKY DNA-binding domain
family: WRKY DNA-binding domain
domain: WRKY DNA-binding protein 4
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=100.00  E-value=9.8e-35  Score=231.80  Aligned_cols=71  Identities=58%  Similarity=1.111  Sum_probs=67.2

Q ss_pred             EEecCCCCCCCCcchhhcccccccCCCCCCcccccccccCCcccccceeeecCCCcEEEEEecccCCCCCCc
Q 013190          180 VRARSEAPMITDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPP  251 (448)
Q Consensus       180 v~t~se~~~~~DGy~WRKYGQK~IKGnp~PRsYYRCt~~~gC~arKqVQr~~~D~~i~~tTY~G~HnH~~p~  251 (448)
                      |++.++.++++|||+|||||||.|||+++||+||||++ .+|+|+|+|||+++|+.+++|||+|+|||+.|.
T Consensus         1 v~t~~~~~~~dDGy~WRKYGQK~ikgs~~pRsYYrCt~-~~C~a~K~Vqr~~~d~~~~~vtY~G~H~h~~Ps   71 (71)
T d1wj2a_           1 VQTTSEVDLLDDGYRWRKYGQKVVKGNPYPRSYYKCTT-PGCGVRKHVERAATDPKAVVTTYEGKHNHDLPA   71 (71)
T ss_dssp             CCCCCCCCCCCSSSCBCCCEEECCTTCSSCEEEEEEEC-SSCEEEEEEEEETTTTSEEEEEEESCCSSCCCC
T ss_pred             CccccccccCCCCcEecccCceeccCCCCceEEEEccc-cCCCCcceEEEEcCCCCEEEEEEeeEeCCCCCC
Confidence            46778889999999999999999999999999999998 699999999999999999999999999999773