Citrus Sinensis ID: 037198


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------31
MSPHVGLRFYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQALYFFTQLKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFRYESRENLEQNGRWIMHEFTINAALFGDQYSHKARNFVLYCVKKNELENPSEKANLGGKRSSGSKRKLKEVVTSSSHIDHDDEGFQSKRLKAHQNSVLIEQSHSPCSSSSLHTVEHQGEIDNILEIVESEVGIDSLPNKLVFNILVFYFKKNSRGEETEVYLREESIVNEALIHHYCEIFLML
ccccccccccccccccccHHHHHHHHHHHHHccccccccccccccEEccccccccccccccccccccccccccEEEEcccccccccccccccccccccccccccccccEEEcccccEEEEEEEEEEEccccccccccccEEEEEcccccccccccccccccEEEEEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHccccccccccccccccccHHHHHcccccccccHHHHHHHccccHHHHHHHHHHHHcc
ccccccccccccccccccHHHHHHHHHHHHHccccccccccccccHHHccccccccccEHEccccccccccccEEEEEEccccccccccccccccccccEEEccccEEEEEEccccEEEEEEEEEEEcccccccccccEEEEEEEEccHHcccccccccccEEEEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEccccEEEccccccEEEEEEEEEEEcccccccEEEEEcHHHHHHHHHHHHHHHHHHHc
msphvglrfypvlrfypTEQELLEFYLLHKItqnhnlyqasddipvkdcdlysphepnliwdlyggdqlmAGQALYFFTQlkkaspkgsrfnikigfdawqgeDRAKLVLAVGsedqilgfkkrFRYESRENLEQNGRWIMHEFTINaalfgdqyshkaRNFVLYCVKknelenpsekanlggkrssgskrklkevvtssshidhddegfqskrlkahqnsvlieqshspcsssslhtvehqGEIDNILEIVEsevgidslpnkLVFNILVFYFkknsrgeeteVYLREESIVNEALIHHYCEIFLML
msphvglrfypVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQALYFFTQLKkaspkgsrFNIKIGFDAWQGEDRAKLVLavgsedqilgfkkrfryESRENLEQNGRWIMHEFTINAALFGDQYSHKARNFVLYCVKKnelenpsekanlggkrssgskrklkevvtssshidhddegfQSKRLKAHQNSVLIEQSHSPCSSSSLHTVEHQGEIDNILEIVESEVGIDSLPNKLVFNILVFYFKKNSRGEETEVYLREESIVNEALIHHYCEIFLML
MSPHVGLRFYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQALYFFTQLKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFRYESRENLEQNGRWIMHEFTINAALFGDQYSHKARNFVLYCVKKNELENPSEKANlggkrssgskrklkEVVTSSSHIDHDDEGFQSKRLKAHQNSVLIEQshspcssssLHTVEHQGEIDNILEIVESEVGIDSLPNKLVFNILVFYFKKNSRGEETEVYLREESIVNEALIHHYCEIFLML
****VGLRFYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQALYFFTQLKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFRYESRENLEQNGRWIMHEFTINAALFGDQYSHKARNFVLYCVK**************************************************************************GEIDNILEIVESEVGIDSLPNKLVFNILVFYFK*********************************
*****GL*FYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQALYFFTQLKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFRYESRENLEQNGRWIMHEFTI**************NFVLYCVK**************************************************************************************************FNI**********************IVNEALIHHYCEIFLML
MSPHVGLRFYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQALYFFTQLKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFRYESRENLEQNGRWIMHEFTINAALFGDQYSHKARNFVLYCVKKNELENPS***************************DHDDEGFQSKRLKAHQN********************HQGEIDNILEIVESEVGIDSLPNKLVFNILVFYFKKNSRGEETEVYLREESIVNEALIHHYCEIFLML
*****GLRFYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQALYFFTQLKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFRYESRENLEQNGRWIMHEFTINAALFGDQYSHKARNFVLYCVKKNE***************************************************************SLHT*EHQGEIDNILEIVESEVGIDSLPNKLVFNILVFYFKKNSRGEETEVYLREESIVNEALIHHYCEIFLML
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSPHVGLRFYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQALYFFTQLKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFRYESRENLEQNGRWIMHEFTINAALFGDQYSHKARNFVLYCVKKNELENPSEKANLGGKRSSGSKRKLKEVVTSSSHIDHDDEGFQSKRLKAHQNSVLIEQSHSPCSSSSLHTVEHQGEIDNILEIVESEVGIDSLPNKLVFNILVFYFKKNSRGEETEVYLREESIVNEALIHHYCEIFLML
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query308 2.2.26 [Sep-21-2011]
Q93VY3297 NAC domain-containing pro no no 0.425 0.441 0.324 9e-09
Q52QH4318 NAC domain-containing pro no no 0.662 0.641 0.282 2e-08
Q84WP6365 NAC domain-containing pro no no 0.405 0.342 0.307 2e-08
Q9M274334 NAC domain-containing pro no no 0.422 0.389 0.300 2e-08
Q9MA17371 Protein SOMBRERO OS=Arabi no no 0.431 0.358 0.303 3e-07
Q9FWX2395 NAC domain-containing pro no no 0.386 0.301 0.333 4e-07
Q9LPI7358 NAC domain-containing pro no no 0.435 0.374 0.281 6e-07
Q9C878305 Protein BEARSKIN1 OS=Arab no no 0.399 0.403 0.328 1e-06
Q8H4S4 425 NAC transcription factor no no 0.422 0.305 0.291 2e-06
Q9S851334 Protein CUP-SHAPED COTYLE no no 0.405 0.374 0.291 2e-06
>sp|Q93VY3|NAC72_ARATH NAC domain-containing protein 72 OS=Arabidopsis thaliana GN=NAC072 PE=2 SV=1 Back     alignment and function desciption
 Score = 61.2 bits (147), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 47/145 (32%), Positives = 62/145 (42%), Gaps = 14/145 (9%)

Query: 3   PHVGLRFYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWD 62
           P   L   P  RFYPT++ELL  YL  K+   H   Q      + D DLY   +P   WD
Sbjct: 8   PLAQLSLPPGFRFYPTDEELLVQYLCRKVAGYHFSLQV-----IGDIDLYK-FDP---WD 58

Query: 63  LYGGDQLMAGQALYFFTQLKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFK 122
           L     L   +  YFF+   +  P GSR N   G   W+     K++ A G     +G K
Sbjct: 59  L-PSKALFGEKEWYFFSPRDRKYPNGSRPNRVAGSGYWKATGTDKIITADGRR---VGIK 114

Query: 123 KRFR-YESRENLEQNGRWIMHEFTI 146
           K    Y  +        WIMHE+ +
Sbjct: 115 KALVFYAGKAPKGTKTNWIMHEYRL 139




Transcription factors that bind specifically to the 5'-CATGTG-3' motif.
Arabidopsis thaliana (taxid: 3702)
>sp|Q52QH4|NAC68_ORYSJ NAC domain-containing protein 68 OS=Oryza sativa subsp. japonica GN=NAC68 PE=2 SV=1 Back     alignment and function description
>sp|Q84WP6|NAC43_ARATH NAC domain-containing protein 43 OS=Arabidopsis thaliana GN=NAC043 PE=2 SV=2 Back     alignment and function description
>sp|Q9M274|NAC66_ARATH NAC domain-containing protein 66 OS=Arabidopsis thaliana GN=NAC066 PE=2 SV=1 Back     alignment and function description
>sp|Q9MA17|SMB_ARATH Protein SOMBRERO OS=Arabidopsis thaliana GN=SMB PE=1 SV=1 Back     alignment and function description
>sp|Q9FWX2|NAC7_ARATH NAC domain-containing protein 7 OS=Arabidopsis thaliana GN=NAC007 PE=2 SV=2 Back     alignment and function description
>sp|Q9LPI7|NAC12_ARATH NAC domain-containing protein 12 OS=Arabidopsis thaliana GN=NAC012 PE=2 SV=1 Back     alignment and function description
>sp|Q9C878|BRN1_ARATH Protein BEARSKIN1 OS=Arabidopsis thaliana GN=BRN1 PE=2 SV=1 Back     alignment and function description
>sp|Q8H4S4|NAC10_ORYSJ NAC transcription factor ONAC010 OS=Oryza sativa subsp. japonica GN=ONAC010 PE=2 SV=1 Back     alignment and function description
>sp|Q9S851|NAC31_ARATH Protein CUP-SHAPED COTYLEDON 3 OS=Arabidopsis thaliana GN=NAC031 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query308
255545410363 transcription factor, putative [Ricinus 0.571 0.484 0.367 6e-19
255577787316 conserved hypothetical protein [Ricinus 0.418 0.408 0.421 1e-18
255577785328 transcription factor, putative [Ricinus 0.548 0.515 0.359 4e-17
357439789325 NAC domain protein [Medicago truncatula] 0.470 0.446 0.380 7e-17
449448878306 PREDICTED: NAC domain-containing protein 0.581 0.584 0.321 1e-16
255553641215 conserved hypothetical protein [Ricinus 0.464 0.665 0.357 1e-16
449442783301 PREDICTED: NAC domain-containing protein 0.5 0.511 0.350 1e-15
255577751390 hypothetical protein RCOM_0574280 [Ricin 0.431 0.341 0.373 7e-15
357512399210 NAC domain-containing protein [Medicago 0.477 0.7 0.337 2e-14
255553637 457 conserved hypothetical protein [Ricinus 0.428 0.288 0.366 2e-14
>gi|255545410|ref|XP_002513765.1| transcription factor, putative [Ricinus communis] gi|223546851|gb|EEF48348.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 76/207 (36%), Positives = 105/207 (50%), Gaps = 31/207 (14%)

Query: 13  LRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAG 72
           +RF P ++ELL +YL HKI    N     +  PVKDCDLY  +EP  IWD +GG  LM  
Sbjct: 11  VRFRPLDEELLMYYLSHKI----NNLSLPNPCPVKDCDLYGKNEPWEIWDYFGGQNLMDV 66

Query: 73  QALYFFTQLKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFRYESREN 132
           + LYFFTQLK+ S   SR    +G   W G+   K VL   ++ +++G  K+F Y+  ++
Sbjct: 67  EFLYFFTQLKRKSSSSSRILRTVGTGTWHGDTSGKPVLDRKTK-KVIGLIKKFHYKKPKS 125

Query: 133 LEQNGRWIMHEFTINAALFGDQYSHKARNFVLYCVKKNELENPSEKANLGGKRSSGSKRK 192
              NG W+M E+    +L G     ++ + VL C+K N+               SG KR 
Sbjct: 126 A-HNGGWLMREY----SLVG-----QSSDCVLCCLKMNK---------------SGEKRN 160

Query: 193 LKEVVTSSSHIDHDDEGFQ-SKRLKAH 218
             E  T    I +DD     SKRLK  
Sbjct: 161 RLEAETKDYIIYNDDVANDVSKRLKTQ 187




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255577787|ref|XP_002529768.1| conserved hypothetical protein [Ricinus communis] gi|223530766|gb|EEF32634.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|255577785|ref|XP_002529767.1| transcription factor, putative [Ricinus communis] gi|223530765|gb|EEF32633.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|357439789|ref|XP_003590172.1| NAC domain protein [Medicago truncatula] gi|355479220|gb|AES60423.1| NAC domain protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|449448878|ref|XP_004142192.1| PREDICTED: NAC domain-containing protein 68-like [Cucumis sativus] gi|449516019|ref|XP_004165045.1| PREDICTED: NAC domain-containing protein 68-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|255553641|ref|XP_002517861.1| conserved hypothetical protein [Ricinus communis] gi|223542843|gb|EEF44379.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|449442783|ref|XP_004139160.1| PREDICTED: NAC domain-containing protein 68-like [Cucumis sativus] gi|449485396|ref|XP_004157155.1| PREDICTED: NAC domain-containing protein 68-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|255577751|ref|XP_002529750.1| hypothetical protein RCOM_0574280 [Ricinus communis] gi|223530748|gb|EEF32616.1| hypothetical protein RCOM_0574280 [Ricinus communis] Back     alignment and taxonomy information
>gi|357512399|ref|XP_003626488.1| NAC domain-containing protein [Medicago truncatula] gi|355501503|gb|AES82706.1| NAC domain-containing protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|255553637|ref|XP_002517859.1| conserved hypothetical protein [Ricinus communis] gi|223542841|gb|EEF44377.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query308
TAIR|locus:2141075262 NAC071 "NAC domain containing 0.490 0.576 0.338 1.3e-09
TAIR|locus:2167923348 NAC101 "NAC-domain protein 101 0.402 0.356 0.316 5e-09
TAIR|locus:2142285292 NAC096 "NAC domain containing 0.474 0.5 0.348 1.5e-08
TAIR|locus:2016049324 VND7 "vascular related NAC-dom 0.415 0.395 0.326 2.7e-08
TAIR|locus:2026232394 ANAC026 "Arabidopsis NAC domai 0.409 0.319 0.321 6.3e-08
TAIR|locus:2044360365 NST1 "NAC SECONDARY WALL THICK 0.405 0.342 0.307 8.2e-08
TAIR|locus:2079562334 NAC066 "NAC domain protein 66" 0.409 0.377 0.337 1.2e-07
TAIR|locus:2089764246 NAC057 "NAC domain containing 0.675 0.845 0.260 1.7e-07
TAIR|locus:2060979365 VND1 "vascular related NAC-dom 0.435 0.367 0.313 1.8e-07
TAIR|locus:2202028395 NAC007 "NAC 007" [Arabidopsis 0.396 0.308 0.326 2.2e-07
TAIR|locus:2141075 NAC071 "NAC domain containing protein 71" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 158 (60.7 bits), Expect = 1.3e-09, P = 1.3e-09
 Identities = 60/177 (33%), Positives = 82/177 (46%)

Query:    11 PVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLM 70
             P  RF+PT++EL+ +YL  KI     L    + IPV   DLY   +P   W+L  G   +
Sbjct:     8 PGFRFHPTDEELIGYYLSRKI---EGLEIELEVIPV--IDLYK-FDP---WEL-PGKSFL 57

Query:    71 AGQAL--YFFTQLKKASPKGSRFN--IKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFR 126
               + L  +FF    K    GSR N   K G+    G+DR    +   S   I G++K   
Sbjct:    58 PNRDLEWFFFCPRDKKYANGSRTNRATKAGYWKATGKDRK---ITCKSSHVIAGYRKTLV 114

Query:   127 -YESRENLEQNGRWIMHEFTINAALFGDQYSHKARNF---VLYC--VKKNELENPSE 177
              YE R  L     W MHE+ +      D +S K+ NF      C  VKKNEL+  S+
Sbjct:   115 FYEGRAPLGDRTNWFMHEYRLCDI---DDHSQKSPNFKGAFALCRVVKKNELKKNSK 168




GO:0003677 "DNA binding" evidence=IEA
GO:0005634 "nucleus" evidence=ISM
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA
GO:0007275 "multicellular organismal development" evidence=ISS
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
TAIR|locus:2167923 NAC101 "NAC-domain protein 101" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2142285 NAC096 "NAC domain containing protein 96" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2016049 VND7 "vascular related NAC-domain protein 7" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2026232 ANAC026 "Arabidopsis NAC domain containing protein 26" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2044360 NST1 "NAC SECONDARY WALL THICKENING PROMOTING FACTOR1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2079562 NAC066 "NAC domain protein 66" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2089764 NAC057 "NAC domain containing protein 57" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2060979 VND1 "vascular related NAC-domain protein 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2202028 NAC007 "NAC 007" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query308
pfam02365130 pfam02365, NAM, No apical meristem (NAM) protein 1e-20
>gnl|CDD|216997 pfam02365, NAM, No apical meristem (NAM) protein Back     alignment and domain information
 Score = 85.7 bits (213), Expect = 1e-20
 Identities = 41/138 (29%), Positives = 65/138 (47%), Gaps = 19/138 (13%)

Query: 14  RFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQ 73
           RF+PT++EL+ +YL  K+              + + D+Y   EP   WDL  G      +
Sbjct: 6   RFHPTDEELVVYYLKRKVLGK----PLPLLDVIPEVDIYK-FEP---WDLPDGKAKGGDR 57

Query: 74  ALYFFTQLKKASPKGSRFN--IKIGFDAW--QGEDRAKLVLAVGSEDQILGFKKRFR-YE 128
             YFF+   +  P GSR N     G+  W   G+D+   VL+ G   +++G KK    Y+
Sbjct: 58  EWYFFSPRDRKYPNGSRTNRATGSGY--WKATGKDKP--VLSKGG--EVVGMKKTLVFYK 111

Query: 129 SRENLEQNGRWIMHEFTI 146
            R    +   W+MHE+ +
Sbjct: 112 GRAPKGEKTDWVMHEYRL 129


This is a family of no apical meristem (NAM) proteins these are plant development proteins. Mutations in NAM result in the failure to develop a shoot apical meristem in petunia embryos. NAM is indicated as having a role in determining positions of meristems and primordial. One member of this family NAP (NAC-like, activated by AP3/PI) is encoded by the target genes of the AP3/PI transcriptional activators and functions in the transition between growth by cell division and cell expansion in stamens and petals. Length = 130

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 308
PF02365129 NAM: No apical meristem (NAM) protein; InterPro: I 100.0
>PF02365 NAM: No apical meristem (NAM) protein; InterPro: IPR003441 The NAC domain (for Petunia hybrida (Petunia) NAM and for Arabidopsis ATAF1, ATAF2, and CUC2) is an N-terminal module of ~160 amino acids, which is found in proteins of the NAC family of plant-specific transcriptional regulators (no apical meristem (NAM) proteins) [] Back     alignment and domain information
Probab=100.00  E-value=8e-42  Score=286.45  Aligned_cols=128  Identities=28%  Similarity=0.518  Sum_probs=91.8

Q ss_pred             CCCCceeCCChHHHHHHHHHHhHhcCCCCCCCCCCcccccCCCCCCCCcchhhhhcCCCcccCCceEEEEeeccccCCCC
Q 037198            9 FYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQALYFFTQLKKASPKG   88 (308)
Q Consensus         9 LPpGfRF~PTDeELI~~YL~~Ki~~g~~lp~~~~~~~I~evDlY~~~ePwdLp~~~g~~~~~~~~eWYFFs~r~rk~~~G   88 (308)
                      |||||||+|||+|||.+||++|+. |.+++   ...+|+++||| .+|||+|+..+    ..++.+||||+++++++++|
T Consensus         1 LP~G~rF~PtD~ELi~~yL~~k~~-g~~~~---~~~~i~~~Diy-~~~P~~L~~~~----~~~~~~~yFF~~~~~~~~~~   71 (129)
T PF02365_consen    1 LPPGFRFRPTDEELINHYLRPKIL-GEPLP---CEDVIHDVDIY-SAHPWELPAKF----KGGDEEWYFFSPRKKKYPNG   71 (129)
T ss_dssp             --TTEEE---HHHHHHCTHHHHHT-T-HHC---S-CHSEE--GG-GS-GGGCHHHS----SS-SSEEEEEEE--------
T ss_pred             CCCceEecCChHHHHHHHHHHHhc-CCCCC---cccceeecccC-ccChHHhhhhc----cCCCceEEEEEecccccCCc
Confidence            899999999999999999999999 66664   34689999999 59999999532    23677999999999999999


Q ss_pred             CccceeeccceeeecCCCeEEEecCCCCeEEEEEEEEeee-cCCCCCCCcCeEEEEEEeC
Q 037198           89 SRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFRYE-SRENLEQNGRWIMHEFTIN  147 (308)
Q Consensus        89 ~R~~Rat~~G~WK~tG~~k~I~~~~~~g~vVG~KKtLvF~-~r~p~g~kT~WvMhEY~L~  147 (308)
                      .|.+|++++|+||++|++++|.+.  ++.+||+|++|+|+ ++.+++.+|+|+||||+|+
T Consensus        72 ~r~~R~~~~G~Wk~~g~~~~i~~~--~g~~iG~k~~l~f~~~~~~~~~kt~W~M~EY~L~  129 (129)
T PF02365_consen   72 GRPNRVTGGGYWKSTGKEKPIKDP--GGKVIGFKKTLVFYSGKSPNGKKTGWVMHEYSLE  129 (129)
T ss_dssp             --S-EEETTEEEEEECEEEEEEE---TTCEEEEEEEEEEEESSTTS-EEEEEEEEEEEE-
T ss_pred             ccccccccceEEeecccccccccc--cceeeeeEEEEEEEeccCCCCCcCCeEEEEEEeC
Confidence            999999999999999999999986  38899999999997 4556789999999999984



NAC proteins are involved in developmental processes, including formation of the shoot apical meristem, floral organs and lateral shoots, as well as in plant hormonal control and defence. The NAC domain is accompanied by diverse C-terminal transcriptional activation domains. The NAC domain has been shown to be a DNA-binding domain (DBD) and a dimerization domain [,]. The NAC domain can be subdivided into five subdomains (A-E). Each subdomain is distinguishable by blocks of heterogeneous amino acids or gaps. While the NAC domains were rich in basic amino acids (R, K and H) as a whole, the distribution of positive and negative amino acids in each subdomain were unequal. Subdomains C and D are rich in basic amino acids but poor in acidic amino acids, while subdomain B contains a high proportion of acidic amino acids. Putative nuclear localization signals (NLS) have been detected in subdomains C and D []. The DBD is contained within a 60 amino acid region located within subdomains D and E []. The overall structure of the NAC domain monomer consists of a very twisted antiparallel beta-sheet, which packs against an N-terminal alpha-helix on one side and one shorter helix on the other side surrounded by a few helical elements. The structure suggests that the NAC domain mediates dimerization through conserved interactions including a salt bridge, and DNA binding through the NAC dimer face rich in positive charges [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1UT4_A 3SWM_B 4DUL_B 3SWP_D 1UT7_B 3ULX_A.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query308
3ulx_A174 Crystal Structural Of The Conserved Domain Of Rice 2e-07
3swm_A174 The Nac Domain Of Anac019 In Complex With Dna, Gold 5e-06
1ut4_A171 Structure Of The Conserved Domain Of Anac, A Member 5e-06
>pdb|3ULX|A Chain A, Crystal Structural Of The Conserved Domain Of Rice Stress-Responsive Nac1 Length = 174 Back     alignment and structure

Iteration: 1

Score = 52.8 bits (125), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 47/144 (32%), Positives = 60/144 (41%), Gaps = 20/144 (13%) Query: 7 LRFYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVK---DCDLYSPHEPNLIWDL 63 L P RF+PT+ EL+E YL K A +PV + DLY +P WDL Sbjct: 13 LNLPPGFRFHPTDDELVEHYLCRK--------AAGQRLPVPIIAEVDLYK-FDP---WDL 60 Query: 64 YGGDQLMAGQALYFFTQLKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKK 123 L + YFFT + P GSR N G W+ K V G + LG KK Sbjct: 61 -PERALFGAREWYFFTPRDRKYPNGSRPNRAAGNGYWKATGADKPVAPRG---RTLGIKK 116 Query: 124 RFR-YESRENLEQNGRWIMHEFTI 146 Y + WIMHE+ + Sbjct: 117 ALVFYAGKAPRGVKTDWIMHEYRL 140
>pdb|3SWM|A Chain A, The Nac Domain Of Anac019 In Complex With Dna, Gold Derivative Length = 174 Back     alignment and structure
>pdb|1UT4|A Chain A, Structure Of The Conserved Domain Of Anac, A Member Of The Nac Family Of Transcription Factors Length = 171 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query308
3ulx_A174 Stress-induced transcription factor NAC1; NAC fami 6e-16
1ut7_A171 No apical meristem protein; transcription regulati 2e-15
>3ulx_A Stress-induced transcription factor NAC1; NAC family, stress-responsive, DNA binding protein; 2.60A {Oryza sativa subsp} Length = 174 Back     alignment and structure
 Score = 73.1 bits (180), Expect = 6e-16
 Identities = 53/172 (30%), Positives = 70/172 (40%), Gaps = 30/172 (17%)

Query: 14  RFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQ 73
           RF+PT+ EL+E YL  K             I   + DLY   +P   WDL     L   +
Sbjct: 20  RFHPTDDELVEHYLCRKAAGQR---LPVPIIA--EVDLY-KFDP---WDL-PERALFGAR 69

Query: 74  ALYFFTQLKKASPKGSRFN--IKIGFDAWQ--GEDRAKLVLAVGSEDQILGFKKR--FRY 127
             YFFT   +  P GSR N     G+  W+  G D+      V    + LG KK   F Y
Sbjct: 70  EWYFFTPRDRKYPNGSRPNRAAGNGY--WKATGADK-----PVAPRGRTLGIKKALVF-Y 121

Query: 128 ESRENLEQNGRWIMHEFTINAALFGDQYSHKARN----FVLYCV--KKNELE 173
             +        WIMHE+ +  A      + K       +VL  +  KKNE E
Sbjct: 122 AGKAPRGVKTDWIMHEYRLADAGRAAAGAKKGSLRLDDWVLCRLYNKKNEWE 173


>1ut7_A No apical meristem protein; transcription regulation, transcription, transcription factor, DNA binding, abscisic acid response, NAC domain; 1.9A {Arabidopsis thaliana} SCOP: b.143.1.1 PDB: 1ut4_A 4dul_A 3swp_A 3swm_A Length = 171 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query308
3ulx_A174 Stress-induced transcription factor NAC1; NAC fami 100.0
1ut7_A171 No apical meristem protein; transcription regulati 100.0
>3ulx_A Stress-induced transcription factor NAC1; NAC family, stress-responsive, DNA binding protein; 2.60A {Oryza sativa subsp} SCOP: b.143.1.1 Back     alignment and structure
Probab=100.00  E-value=1.6e-54  Score=381.93  Aligned_cols=156  Identities=28%  Similarity=0.432  Sum_probs=129.6

Q ss_pred             CCCCCCCCCceeCCChHHHHHHHHHHhHhcCCCCCCCCCCcccccCCCCCCCCcchhhhhcCCCcccCCceEEEEeeccc
Q 037198            4 HVGLRFYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQALYFFTQLKK   83 (308)
Q Consensus         4 ~~~~~LPpGfRF~PTDeELI~~YL~~Ki~~g~~lp~~~~~~~I~evDlY~~~ePwdLp~~~g~~~~~~~~eWYFFs~r~r   83 (308)
                      .+++.|||||||+|||||||.|||++|+. |.++|    ..+|+++||| ++|||+||+.+    ..++++|||||++++
T Consensus        10 ~~~~~LPpGfRF~PTDeELV~~YL~~K~~-g~~~~----~~~I~evDvy-~~~Pw~Lp~~~----~~g~~ewYFFs~r~~   79 (174)
T 3ulx_A           10 EAELNLPPGFRFHPTDDELVEHYLCRKAA-GQRLP----VPIIAEVDLY-KFDPWDLPERA----LFGAREWYFFTPRDR   79 (174)
T ss_dssp             CSTTTCCTTCCCCCCHHHHHHHTHHHHHH-TCCCS----SSCCEECCGG-GSCGGGSGGGC----SSCSSEEEEEEECCC
T ss_pred             ccccCCCCcceeCCCHHHHHHHHHHHHhc-CCCCC----cCeeeecccc-cCCchhhhhhh----ccCCceEEEEecccc
Confidence            45679999999999999999999999999 77775    2689999999 69999999863    356789999999999


Q ss_pred             cCCCCCccceeeccceeeecCCCeEEEecCCCCeEEEEEEEEeee-cCCCCCCCcCeEEEEEEeCccccCC----cCCCC
Q 037198           84 ASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFRYE-SRENLEQNGRWIMHEFTINAALFGD----QYSHK  158 (308)
Q Consensus        84 k~~~G~R~~Rat~~G~WK~tG~~k~I~~~~~~g~vVG~KKtLvF~-~r~p~g~kT~WvMhEY~L~~~~~~~----~~~~~  158 (308)
                      ||++|.|++|+|++||||+||++++|...   |.+||+||+|+|| +++|++.+|+|+||||+|.......    .....
T Consensus        80 ky~~g~R~nR~t~~G~WkatG~dk~I~~~---g~~vG~KktLvFy~g~~p~g~kT~WvMhEY~L~~~~~~~~~~~~~~~~  156 (174)
T 3ulx_A           80 KYPNGSRPNRAAGNGYWKATGADKPVAPR---GRTLGIKKALVFYAGKAPRGVKTDWIMHEYRLADAGRAAAGAKKGSLR  156 (174)
T ss_dssp             -----CCSCEEETTEEEEECSCCEEECCS---SSCCEEEEEEEEEESSTTSCEEEEEEEEEEEECSCC-----------C
T ss_pred             ccCCCCCceeecCCceEccCCCCcEEeeC---CcEEEEEEEEEEecCCCCCCCcCCeEEEEEEeCCCCCcccccccCCCC
Confidence            99999999999999999999999999764   7899999999996 6678999999999999999875421    11235


Q ss_pred             CCcEEEEEEEEcCC
Q 037198          159 ARNFVLYCVKKNEL  172 (308)
Q Consensus       159 ~~~~VLCRIf~K~~  172 (308)
                      .++|||||||+|+.
T Consensus       157 ~~~wVlCrvf~K~~  170 (174)
T 3ulx_A          157 LDDWVLCRLYNKKN  170 (174)
T ss_dssp             CSSEEEEEEEESCC
T ss_pred             CCCEEEEEEEEcCC
Confidence            78999999999873



>1ut7_A No apical meristem protein; transcription regulation, transcription, transcription factor, DNA binding, abscisic acid response, NAC domain; 1.9A {Arabidopsis thaliana} SCOP: b.143.1.1 PDB: 1ut4_A 4dul_A 3swp_A 3swm_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 308
d1ut7a_166 b.143.1.1 (A:) No apical meristem (NAM, ANAC) {Mou 2e-22
>d1ut7a_ b.143.1.1 (A:) No apical meristem (NAM, ANAC) {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 166 Back     information, alignment and structure

class: All beta proteins
fold: NAC domain
superfamily: NAC domain
family: NAC domain
domain: No apical meristem (NAM, ANAC)
species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 89.3 bits (221), Expect = 2e-22
 Identities = 47/172 (27%), Positives = 67/172 (38%), Gaps = 15/172 (8%)

Query: 1   MSPHVGLRFYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLI 60
             P   L   P  RFYPT++EL+  YL  K               + + DLY   +P   
Sbjct: 9   TDPLTQLSLPPGFRFYPTDEELMVQYLCRKAAGYDF-----SLQLIAEIDLYK-FDP--- 59

Query: 61  WDLYGGDQLMAGQALYFFTQLKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILG 120
           W L     L   +  YFF+   +  P GSR N   G   W+     K    + +E Q +G
Sbjct: 60  WVL-PNKALFGEKEWYFFSPRDRKYPNGSRPNRVAGSGYWKATGTDK---IISTEGQRVG 115

Query: 121 FKKRF-RYESRENLEQNGRWIMHEFTINAALFGDQYSHKARNFVLYCVKKNE 171
            KK    Y  +        WIMHE+ +         S K  ++VL  + K +
Sbjct: 116 IKKALVFYIGKAPKGTKTNWIMHEYRLIEPSR-RNGSTKLDDWVLCRIYKKQ 166


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query308
d1ut7a_166 No apical meristem (NAM, ANAC) {Mouse-ear cress (A 100.0
>d1ut7a_ b.143.1.1 (A:) No apical meristem (NAM, ANAC) {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: All beta proteins
fold: NAC domain
superfamily: NAC domain
family: NAC domain
domain: No apical meristem (NAM, ANAC)
species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=100.00  E-value=1e-51  Score=358.61  Aligned_cols=157  Identities=27%  Similarity=0.472  Sum_probs=125.9

Q ss_pred             CCCCCCCCCCCCceeCCChHHHHHHHHHHhHhcCCCCCCCCCCcccccCCCCCCCCcchhhhhcCCCcccCCceEEEEee
Q 037198            1 MSPHVGLRFYPVLRFYPTEQELLEFYLLHKITQNHNLYQASDDIPVKDCDLYSPHEPNLIWDLYGGDQLMAGQALYFFTQ   80 (308)
Q Consensus         1 M~p~~~~~LPpGfRF~PTDeELI~~YL~~Ki~~g~~lp~~~~~~~I~evDlY~~~ePwdLp~~~g~~~~~~~~eWYFFs~   80 (308)
                      +++.++++|||||||+|||||||.|||++|+. |.+++.    .+|+++||| ++|||+||+.+    ..++++||||++
T Consensus         9 ~~~~~~l~LPpG~RF~PTDeELv~~YL~~Ki~-g~~l~~----~~I~~~Dvy-~~~Pw~Lp~~~----~~~~~~wyFft~   78 (166)
T d1ut7a_           9 TDPLTQLSLPPGFRFYPTDEELMVQYLCRKAA-GYDFSL----QLIAEIDLY-KFDPWVLPNKA----LFGEKEWYFFSP   78 (166)
T ss_dssp             ---CCSSCCCTTEEECCCHHHHHHHTHHHHHT-TCCCSS----CCSEECCGG-GSCGGGHHHHS----SSCSSEEEEEEE
T ss_pred             cCccccccCCCccccCCCcHHHHHHHHHHHHc-CCCCCc----ccceeccCC-cCChhhccchh----ccCcceEEEEee
Confidence            46778899999999999999999999999998 778853    589999999 69999999863    346889999999


Q ss_pred             ccccCCCCCccceeeccceeeecCCCeEEEecCCCCeEEEEEEEEeee-cCCCCCCCcCeEEEEEEeCccccCCcCCCCC
Q 037198           81 LKKASPKGSRFNIKIGFDAWQGEDRAKLVLAVGSEDQILGFKKRFRYE-SRENLEQNGRWIMHEFTINAALFGDQYSHKA  159 (308)
Q Consensus        81 r~rk~~~G~R~~Rat~~G~WK~tG~~k~I~~~~~~g~vVG~KKtLvF~-~r~p~g~kT~WvMhEY~L~~~~~~~~~~~~~  159 (308)
                      +.+++++|.|.+|+|++|+||++|+++.|.+.   |.+||+||+|+|| ++.+++.+|+|+||||+|.+..... .....
T Consensus        79 ~~~k~~~g~r~~R~~g~G~Wk~~g~~~~i~~~---g~~vG~kk~l~fy~~~~~~~~~t~W~M~EY~l~~~~~~~-~~~~~  154 (166)
T d1ut7a_          79 RDRKYPNGSRPNRVAGSGYWKATGTDKIISTE---GQRVGIKKALVFYIGKAPKGTKTNWIMHEYRLIEPSRRN-GSTKL  154 (166)
T ss_dssp             CCC-------CCEEETTEEEEEEEEEEEEEET---TEEEEEEEEEEEEESSTTSCEEEEEEEEEEEECCCC---------
T ss_pred             eccccCCCCccccccCCCEecccCCCceEecC---CcEEEEEEEEEEEecCCCCCCccCeEEEEEecCCccccc-Ccccc
Confidence            99999999999999999999999999988764   7899999999996 5677899999999999998875432 33567


Q ss_pred             CcEEEEEEEEcC
Q 037198          160 RNFVLYCVKKNE  171 (308)
Q Consensus       160 ~~~VLCRIf~K~  171 (308)
                      ++|||||||+|+
T Consensus       155 ~~~VLCrI~~Kk  166 (166)
T d1ut7a_         155 DDWVLCRIYKKQ  166 (166)
T ss_dssp             CCEEEEEEEECC
T ss_pred             CCEEEEEEEecC
Confidence            899999999985