Citrus Sinensis ID: 023115


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------
MSVLPSSIVTFLSSAAAIRNGDNNNSDGESFQKFFESWLVEQNQHLQALISASKQQENNSSSSNFEEQVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLYSKSGLQLEGKLHDLIRGLSSGDLGDLSPTQLTRVDGLQRVIIKEEKDLSEKLAKYQETVADSSLVELSHVVTELMSTNDDRQDSDRRLMDDQIESNFVTKEAKLEEIFHKADDLRLKTFKAVIDILTPIQAVHFLIAAAELHLRLHDWGKQRDARHLYGATTSS
cccccccccEEcccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHccccHHHHHHHHHHHHcHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHcHHHHHHcccccccccc
ccccccccccEEccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHccccccHHHHHHHHHccccHHHHHHHHHHHccHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHcccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHccccHHHHcccccccc
MSVLPSSIVTFLSSAAairngdnnnsdgeSFQKFFESWLVEQNQHLQALISASkqqennssssnfEEQVRVVVEHYEQYYRAKSRWVKLDVLgmltpswrssledsflwiggwrpsmAFHLLYSKSGLQLEGKLHDLIRglssgdlgdlsptqltrvdGLQRVIIKEEKDLSEKLAKYQETVADSSLVELSHVVTELmstnddrqdsdrrlmdDQIESNFVTKEAKLEEIFHKADDLRLKTFKAVIDILTPIQAVHFLIAAAELHLRLHDwgkqrdarhlygattss
MSVLPSSIVTFLSSAAAIRNGDNNNSDGESFQKFFESWLVEQNQHLQALISASKqqennssssnfeeQVRVVVEHYEQYYraksrwvkldVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLYSKSGLQLEGKLHDLIRGLSsgdlgdlsptqltrvdglqrvIIKEEKDLSEKLAKYQEtvadsslvELSHVVTElmstnddrqdsdrrlmddqiesnfvtkeaklEEIFHKADDLRLKTFKAVIDILTPIQAVHFLIAAAELHLRLHDWGKQRDARHLYGATTSS
MSVLPSSIVTFLSSAAAIRNGDNNNSDGESFQKFFESWLVEQNQHLQALIsaskqqennssssnfeeqVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLYSKSGLQLEGKLHDLIRGLSSGDLGDLSPTQLTRVDGLQRVIIKEEKDLSEKLAKYQETVADSSLVELSHVVTELMSTNDDRQDSDRRLMDDQIESNFVTKEAKLEEIFHKADDLRLKTFKAVIDILTPIQAVHFLIAAAELHLRLHDWGKQRDARHLYGATTSS
*******************************QKFFESWLVEQNQHL**********************VRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLYSKSGLQLEGKLHDLIRGLSSGDLGDLSPTQLTRVDGLQRVIIKE*******************LVELSHVV*************************FVTKEAKLEEIFHKADDLRLKTFKAVIDILTPIQAVHFLIAAAELHLRLHDWGKQRDARHL*******
********************************KFFESWLVEQNQHLQALISA***********NFEEQVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLYSKSGLQ*******************LSPTQLTRVDGLQRVIIK**KDLSEKLAKYQETVADSSLVEL********************************KEAKLEEIFHKADDLRLKTFKAVIDILTPIQAVHFLIAAAELHLRLHDW****************
MSVLPSSIVTFLSSAAAIRNGDNNNSDGESFQKFFESWLVEQNQHLQALIS**************EEQVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLYSKSGLQLEGKLHDLIRGLSSGDLGDLSPTQLTRVDGLQRVIIKEEKDLSEKLAKYQETVADSSLVELSHVVTELMSTNDDRQDSDRRLMDDQIESNFVTKEAKLEEIFHKADDLRLKTFKAVIDILTPIQAVHFLIAAAELHLRLHDWGKQRDARHLYGATTSS
*************************SDGESFQKFFESWLVEQNQHLQALISASKQQENNSSSSNFEEQVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLYSKSGLQLEGKLHDLIRGLSSGDLGDLSPTQLTRVDGLQRVIIKEEKDLSEKLAKYQETVADSSLVELSHVVTEL*************LMDDQIESNFVTKEAKLEEIFHKADDLRLKTFKAVIDILTPIQAVHFLIAAAELHLRLHDWGKQ*************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSVLPSSIVTFLSSAAAIRNGDNNNSDGESFQKFFESWLVEQNQHLQALISASKQQENNSSSSNFEEQVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLYSKSGLQLEGKLHDLIRGLSSGDLGDLSPTQLTRVDGLQRVIIKEEKDLSEKLAKYQETVADSSLVELSHVVTELMSTNDDRQDSDRRLMDDQIESNFVTKEAKLEEIFHKADDLRLKTFKAVIDILTPIQAVHFLIAAAELHLRLHDWGKQRDARHLYGATTSS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query287 2.2.26 [Sep-21-2011]
P43273330 Transcription factor TGA2 no no 0.717 0.624 0.258 3e-12
O24160456 TGACG-sequence-specific D N/A no 0.780 0.491 0.255 5e-12
Q41558476 Transcription factor HBP- N/A no 0.696 0.420 0.257 9e-12
Q39140330 Transcription factor TGA6 no no 0.721 0.627 0.252 2e-11
Q39163330 Transcription factor TGA5 no no 0.763 0.663 0.263 2e-11
Q93ZE2368 Transcription factor TGA7 no no 0.700 0.546 0.254 4e-11
Q39162364 Transcription factor TGA4 no no 0.724 0.571 0.269 9e-11
Q39237368 Transcription factor TGA1 no no 0.759 0.592 0.272 2e-10
Q39234384 Transcription factor TGA3 no no 0.773 0.578 0.255 2e-10
P23923332 Transcription factor HBP- N/A no 0.749 0.647 0.248 2e-09
>sp|P43273|TGA2_ARATH Transcription factor TGA2 OS=Arabidopsis thaliana GN=TGA2 PE=1 SV=1 Back     alignment and function desciption
 Score = 72.8 bits (177), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 115/259 (44%), Gaps = 53/259 (20%)

Query: 21  GDNNNSDGESFQKFFES----WLVEQNQHLQALISASKQQENNSSSSNFEEQVRVVVE-- 74
           GD  +S G +    F++    WL E+N+ +  L SA      +S       ++R++V+  
Sbjct: 100 GDQAHSTGGNGALAFDAEHSRWLEEKNKQMNELRSALNAHAGDS-------ELRIIVDGV 152

Query: 75  --HYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLYSKSGLQLEG 132
             HYE+ +R KS   K DV  +L+  W++  E  FLW+GG+R S    LL +    QLE 
Sbjct: 153 MAHYEELFRIKSNAAKNDVFHLLSGMWKTPAERCFLWLGGFRSSELLKLLAN----QLE- 207

Query: 133 KLHDLIRGLSSGDLGDLSPTQLTRVDGLQRVIIKEEKDLSEKLAKYQETVADSSLVELSH 192
                           ++  QL  ++ LQ+   + E  LS+ +   Q+++AD+       
Sbjct: 208 ---------------PMTERQLMGINNLQQTSQQAEDALSQGMESLQQSLADTLSSGTLG 252

Query: 193 VVTELMSTNDDRQDSDRRLMDDQIESNFVTKEAKLEEIFHKADDLRLKTFKAVIDILTPI 252
             +     +   Q            +  + K   LE    +AD+LRL+T + +I +LT  
Sbjct: 253 SSSSGNVASYMGQ-----------MAMAMGKLGTLEGFIRQADNLRLQTLQQMIRVLTTR 301

Query: 253 QAVHFLIAAAELHLRLHDW 271
           Q+   L+A       +HD+
Sbjct: 302 QSARALLA-------IHDY 313




Transcriptional activator that binds specifically to the DNA sequence 5'-TGACG-3'. Recognizes ocs elements like the as-1 motif of the cauliflower mosaic virus 35S promoter. Binding to the as-1-like cis elements mediate auxin- and salicylic acid-inducible transcription. Required to induce the systemic acquired resistance (SAR) via the regulation of pathogenesis-related genes expression. Binding to the as-1 element of PR-1 promoter is salicylic acid-inducible and mediated by NPR1. Could also bind to the C-boxes (5'-ATGACGTCAT-3') with high affinity.
Arabidopsis thaliana (taxid: 3702)
>sp|O24160|TGA21_TOBAC TGACG-sequence-specific DNA-binding protein TGA-2.1 OS=Nicotiana tabacum GN=TGA21 PE=1 SV=1 Back     alignment and function description
>sp|Q41558|HBP1C_WHEAT Transcription factor HBP-1b(c1) (Fragment) OS=Triticum aestivum PE=1 SV=2 Back     alignment and function description
>sp|Q39140|TGA6_ARATH Transcription factor TGA6 OS=Arabidopsis thaliana GN=TGA6 PE=1 SV=2 Back     alignment and function description
>sp|Q39163|TGA5_ARATH Transcription factor TGA5 OS=Arabidopsis thaliana GN=TGA5 PE=1 SV=2 Back     alignment and function description
>sp|Q93ZE2|TGA7_ARATH Transcription factor TGA7 OS=Arabidopsis thaliana GN=TGA7 PE=1 SV=1 Back     alignment and function description
>sp|Q39162|TGA4_ARATH Transcription factor TGA4 OS=Arabidopsis thaliana GN=TGA4 PE=1 SV=2 Back     alignment and function description
>sp|Q39237|TGA1_ARATH Transcription factor TGA1 OS=Arabidopsis thaliana GN=TGA1 PE=1 SV=2 Back     alignment and function description
>sp|Q39234|TGA3_ARATH Transcription factor TGA3 OS=Arabidopsis thaliana GN=TGA3 PE=1 SV=1 Back     alignment and function description
>sp|P23923|HBP1B_WHEAT Transcription factor HBP-1b(c38) OS=Triticum aestivum PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query287
255574198271 conserved hypothetical protein [Ricinus 0.926 0.981 0.578 9e-80
224096129244 predicted protein [Populus trichocarpa] 0.815 0.959 0.602 1e-79
688423287 tumor-related protein [Nicotiana glauca 0.916 0.916 0.549 1e-78
356511199285 PREDICTED: transcription factor TGA4-lik 0.937 0.943 0.525 2e-77
449480891255 PREDICTED: transcription factor TGA6-lik 0.850 0.956 0.549 6e-76
225438349255 PREDICTED: transcription factor HBP-1b(c 0.846 0.952 0.591 3e-74
357520133272 Transcription factor HBP-1b(c1) [Medicag 0.850 0.897 0.506 2e-70
356565430277 PREDICTED: LOW QUALITY PROTEIN: transcri 0.818 0.848 0.503 5e-63
294610363260 hypothetical protein [Triticum aestivum] 0.808 0.892 0.430 1e-40
359493600246 PREDICTED: transcription factor TGA4-lik 0.804 0.939 0.367 3e-40
>gi|255574198|ref|XP_002528014.1| conserved hypothetical protein [Ricinus communis] gi|223532583|gb|EEF34370.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  302 bits (774), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 162/280 (57%), Positives = 204/280 (72%), Gaps = 14/280 (5%)

Query: 1   MSVLPSSIVTFLSSAAAIRNGDNNNSDGESFQKFFESWLVEQNQHLQALISASKQQENNS 60
           MS LPSS      S A++R+    +S   SF  FFE WLVEQ Q L  LISASKQ+ NN 
Sbjct: 1   MSDLPSS------SPASLRS---TSSSHNSFHDFFEWWLVEQKQELDELISASKQKLNNK 51

Query: 61  SSSN--FEEQVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMA 118
           ++++   +  +  V+EHYEQYY AKS+W K DVL M +PSW S LED+FLWIGGWRPSMA
Sbjct: 52  NNNHDSLQPLINRVLEHYEQYYSAKSKWAKHDVLAMFSPSWTSPLEDAFLWIGGWRPSMA 111

Query: 119 FHLLYSKSGLQLEGKLHDLIRGLSSGDLGDLSPTQLTRVDGLQRVIIKEEKDLSEKLAKY 178
           FHLLYSKSGLQLE +LHD+IRGL +GDLGDLSP QL +VD  QR II+EEK+++E +AK+
Sbjct: 112 FHLLYSKSGLQLEAQLHDVIRGLCTGDLGDLSPNQLVQVDEFQRRIIREEKNITENMAKH 171

Query: 179 QETVADSSLVELSHVVTELMSTNDDRQDSDRRLMDDQIESNFVTKEAKLEEIFHKADDLR 238
           Q TVAD+S+VEL+H +++   T  D   S    + +++ES   TK   LE++  KADDLR
Sbjct: 172 QATVADTSMVELTHAISK---TRRDEGSSIGNEVQERVESTLKTKLEGLEKVLQKADDLR 228

Query: 239 LKTFKAVIDILTPIQAVHFLIAAAELHLRLHDWGKQRDAR 278
           L+T K +IDILTP Q VHFLIAAAELHLRLH+ GK+ D +
Sbjct: 229 LRTLKGIIDILTPDQTVHFLIAAAELHLRLHELGKKMDTK 268




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224096129|ref|XP_002310543.1| predicted protein [Populus trichocarpa] gi|222853446|gb|EEE90993.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|688423|dbj|BAA05470.1| tumor-related protein [Nicotiana glauca x Nicotiana langsdorffii] Back     alignment and taxonomy information
>gi|356511199|ref|XP_003524316.1| PREDICTED: transcription factor TGA4-like [Glycine max] Back     alignment and taxonomy information
>gi|449480891|ref|XP_004156023.1| PREDICTED: transcription factor TGA6-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|225438349|ref|XP_002273632.1| PREDICTED: transcription factor HBP-1b(c1) [Vitis vinifera] gi|296082614|emb|CBI21619.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|357520133|ref|XP_003630355.1| Transcription factor HBP-1b(c1) [Medicago truncatula] gi|355524377|gb|AET04831.1| Transcription factor HBP-1b(c1) [Medicago truncatula] Back     alignment and taxonomy information
>gi|356565430|ref|XP_003550943.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor TGA2-like [Glycine max] Back     alignment and taxonomy information
>gi|294610363|dbj|BAJ05339.1| hypothetical protein [Triticum aestivum] Back     alignment and taxonomy information
>gi|359493600|ref|XP_003634633.1| PREDICTED: transcription factor TGA4-like [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query287
TAIR|locus:2086400244 AT3G14880 "AT3G14880" [Arabido 0.540 0.635 0.347 6.4e-37
TAIR|locus:2124157282 AT4G18690 "AT4G18690" [Arabido 0.832 0.847 0.284 1.8e-25
TAIR|locus:2152390291 DOG1 "AT5G45830" [Arabidopsis 0.592 0.584 0.340 2.9e-23
TAIR|locus:2124117232 AT4G18650 "AT4G18650" [Arabido 0.400 0.495 0.320 2.3e-18
TAIR|locus:2016620225 ZW2 [Arabidopsis thaliana (tax 0.334 0.426 0.33 6.7e-18
TAIR|locus:2124152235 AT4G18680 "AT4G18680" [Arabido 0.668 0.817 0.274 1.6e-15
TAIR|locus:2024412230 RAS1 "AT1G09950" [Arabidopsis 0.452 0.565 0.287 4.4e-14
TAIR|locus:2124132152 AT4G18660 "AT4G18660" [Arabido 0.449 0.848 0.302 3.3e-12
TAIR|locus:2030551384 TGA3 "AT1G22070" [Arabidopsis 0.317 0.236 0.308 4.8e-09
TAIR|locus:2184078364 TGA4 "AT5G10030" [Arabidopsis 0.358 0.282 0.311 1.2e-08
TAIR|locus:2086400 AT3G14880 "AT3G14880" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 273 (101.2 bits), Expect = 6.4e-37, Sum P(2) = 6.4e-37
 Identities = 57/164 (34%), Positives = 90/164 (54%)

Query:    30 SFQKFFESWLVEQNQHLQALIXXXXXXXXXXXXXXXXXXVRV--VVEHYEQYYRAKSRWV 87
             SF+KF +SW+ +   HL  L                     V  V+EH+ +Y+RAK    
Sbjct:     9 SFKKFQQSWIEQLRNHLNHLRSAQNHHRNSATGDEERLREAVDRVMEHFREYHRAKWAAT 68

Query:    88 KLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLYSKSGLQLEGKLHDLIRGLSSGDLG 147
               DV+ ++   W S+LE S  W+GGWRP+  FHL+Y++S +  E ++ D++RG  +GDL 
Sbjct:    69 DKDVIEVMASPWASALERSLQWVGGWRPTTLFHLVYTESSILFESRIVDILRGFRTGDLS 128

Query:   148 DLSPTQLT-------RVDGLQRVIIKEEKDLSEKLAKYQETVAD 184
             DLSP+Q          V  LQ   +KEE  ++E+L+++Q+  +D
Sbjct:   129 DLSPSQFRVPFVKGRTVSELQCETVKEENAITEELSEWQDDASD 172


GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0080167 "response to karrikin" evidence=IEP
TAIR|locus:2124157 AT4G18690 "AT4G18690" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2152390 DOG1 "AT5G45830" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2124117 AT4G18650 "AT4G18650" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2016620 ZW2 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2124152 AT4G18680 "AT4G18680" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2024412 RAS1 "AT1G09950" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2124132 AT4G18660 "AT4G18660" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2030551 TGA3 "AT1G22070" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2184078 TGA4 "AT5G10030" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
gw1.VII.1642.1
hypothetical protein (244 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query287
pfam1414479 pfam14144, DOG1, Seed dormancy control 3e-24
>gnl|CDD|206313 pfam14144, DOG1, Seed dormancy control Back     alignment and domain information
 Score = 93.0 bits (232), Expect = 3e-24
 Identities = 31/82 (37%), Positives = 43/82 (52%), Gaps = 3/82 (3%)

Query: 44  QHLQALISASKQQENNSSSSNFEEQVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSL 103
           + L  L +A       +S       V  V+ HY++Y+R KS   K DV  +L+  W++  
Sbjct: 1   RLLAELRAALNAH---ASDDELRSLVDKVLAHYDEYFRLKSAAAKADVFHLLSGMWKTPA 57

Query: 104 EDSFLWIGGWRPSMAFHLLYSK 125
           E  FLWIGG+RPS    LLYS 
Sbjct: 58  ERCFLWIGGFRPSELLKLLYSL 79


This family of plant proteins appears to be a highly specific controller seed dormancy. Length = 79

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 287
PF1414480 DOG1: Seed dormancy control 100.0
PF13801125 Metal_resist: Heavy-metal resistance; PDB: 3EPV_C 96.85
COG3678160 CpxP P pilus assembly/Cpx signaling pathway, perip 96.13
PRK12750170 cpxP periplasmic repressor CpxP; Reviewed 95.8
PRK10363166 cpxP periplasmic repressor CpxP; Reviewed 94.29
PRK10455161 periplasmic protein; Reviewed 93.95
PRK12751162 cpxP periplasmic stress adaptor protein CpxP; Revi 93.86
PF07813100 LTXXQ: LTXXQ motif family protein; InterPro: IPR01 90.93
>PF14144 DOG1: Seed dormancy control Back     alignment and domain information
Probab=100.00  E-value=4.2e-35  Score=226.97  Aligned_cols=79  Identities=48%  Similarity=0.907  Sum_probs=75.1

Q ss_pred             HHHHHHHHHhhhhcCCCCCchHHHHHHHHHHhHHHHHHHHhhhhhcccccccCCCCCChhHHHHHHhcCCCchHHHHHHH
Q 023115           44 QHLQALISASKQQENNSSSSNFEEQVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLY  123 (287)
Q Consensus        44 ~~l~eLr~Al~~~~~~~sd~eL~~LV~~~l~HY~~y~~~Ks~aA~~DV~~llsp~W~tplEr~fLWiGG~RPS~~l~Lly  123 (287)
                      |++.|||+|++++.  .+|.+|+.||+++++||++||++|+.+|++|||++|+|+|+||+||||+||||||||++|||||
T Consensus         1 ~~l~eLr~al~~~~--~~~~~L~~lV~~~~~Hy~~y~~~K~~aa~~DV~~~~s~~W~sp~Er~flWiGG~RPS~~~~ll~   78 (80)
T PF14144_consen    1 RQLNELRAALQSHA--DSDDELRSLVDKVMSHYDEYYRAKSAAAKADVFHLLSPPWKSPLERCFLWIGGWRPSELFKLLY   78 (80)
T ss_pred             CcHHHHHHHHHhcC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHhCCCCCCHHHHHHHHHhcCCHHHHHHHHh
Confidence            57899999998665  4699999999999999999999999999999999999999999999999999999999999998


Q ss_pred             h
Q 023115          124 S  124 (287)
Q Consensus       124 s  124 (287)
                      +
T Consensus        79 s   79 (80)
T PF14144_consen   79 S   79 (80)
T ss_pred             c
Confidence            6



>PF13801 Metal_resist: Heavy-metal resistance; PDB: 3EPV_C 2Y3D_A 2Y3H_D 2Y3G_B 2Y3B_A 2Y39_A 3LAY_H Back     alignment and domain information
>COG3678 CpxP P pilus assembly/Cpx signaling pathway, periplasmic inhibitor/zinc-resistance associated protein [Intracellular trafficking and secretion / Cell motility and secretio / Signal transduction mechanisms / Inorganic ion transport and metabolism] Back     alignment and domain information
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed Back     alignment and domain information
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed Back     alignment and domain information
>PRK10455 periplasmic protein; Reviewed Back     alignment and domain information
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed Back     alignment and domain information
>PF07813 LTXXQ: LTXXQ motif family protein; InterPro: IPR012899 This five residue motif is found in a number of bacterial proteins bearing similarity to the protein CpxP (P32158 from SWISSPROT) Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query287
2y39_A118 Nickel and cobalt resistance protein CNRR; metal b 96.97
3itf_A145 Periplasmic adaptor protein CPXP; CPXR, CPXA, cpxr 95.15
3o39_A108 Periplasmic protein related to spheroblast format; 92.9
3oeo_A138 Spheroplast protein Y; LTXXQ, extracytoplasmic str 91.35
3lay_A175 Zinc resistance-associated protein; salmonella typ 89.82
>2y39_A Nickel and cobalt resistance protein CNRR; metal binding protein; 1.41A {Cupriavidus metallidurans} PDB: 2y3b_A 2y3d_A 2y3g_A* 2y3h_A 3epv_A* Back     alignment and structure
Probab=96.97  E-value=0.0089  Score=48.65  Aligned_cols=89  Identities=13%  Similarity=0.120  Sum_probs=67.6

Q ss_pred             CCcHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhhhccCCCCCCcchhhhhhHHHHHHHHHhHHHH
Q 023115          148 DLSPTQLTRVDGLQRVIIKEEKDLSEKLAKYQETVADSSLVELSHVVTELMSTNDDRQDSDRRLMDDQIESNFVTKEAKL  227 (287)
Q Consensus       148 ~LS~~Ql~~I~~Lq~~t~~~E~aLs~~ma~lQ~slad~~~~~~a~~~~~~~g~~~~~~~~~~~~~~~~m~~a~~~kl~~L  227 (287)
                      +||++|-..|+.++..-.....+|..+|......+++.+..+        .+  ++          -++..|+...-.  
T Consensus        20 ~Lt~~Q~~~leaie~~fa~~r~~le~emRaan~~La~ai~~~--------~~--~~----------p~V~aaid~~h~--   77 (118)
T 2y39_A           20 PLDANEREILELKEDAFAQRRREIETRLRAANGKLADAIAKN--------PA--WS----------PEVEAATQEVER--   77 (118)
T ss_dssp             CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--------CS--CC----------HHHHHHHHHHHH--
T ss_pred             CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--------cc--cC----------HHHHHHHHHHHH--
Confidence            399999999999999999999999999999999999887421        11  22          245455433322  


Q ss_pred             HHHHHHhhHHHHHH---HHHHHhhcCHHHHHHHHHHHHH
Q 023115          228 EEIFHKADDLRLKT---FKAVIDILTPIQAVHFLIAAAE  263 (287)
Q Consensus       228 ~~~l~qAD~LR~~T---L~~l~~ILTp~QAA~fL~A~~e  263 (287)
                           ..-.|+..|   +-+|..||||-|+..|=-.+.+
T Consensus        78 -----~mG~LQkeTi~HvfeMR~VLtPeQ~~~fd~~vv~  111 (118)
T 2y39_A           78 -----AAGDLQRATLVHVFEMRAGLKPEHRPAYDRVLID  111 (118)
T ss_dssp             -----HHHHHHHHHHHHHHHHHHHSCGGGHHHHHHHHHH
T ss_pred             -----HHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence                 444588999   6667899999999999766554



>3itf_A Periplasmic adaptor protein CPXP; CPXR, CPXA, cpxrap, CPX-pathway, envelope stress, transduction; HET: MSE; 1.45A {Escherichia coli str} PDB: 3qzc_A Back     alignment and structure
>3o39_A Periplasmic protein related to spheroblast format; alpha-helical, structural genomics, montreal-kingston bacter structural genomics initiative; HET: MSE; 2.60A {Escherichia coli} Back     alignment and structure
>3oeo_A Spheroplast protein Y; LTXXQ, extracytoplasmic stress response-related, signaling P; 2.70A {Escherichia coli} Back     alignment and structure
>3lay_A Zinc resistance-associated protein; salmonella typhimurium L structural genomics, center for structural genomics of INFE diseases; 2.70A {Salmonella enterica subsp} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00