Citrus Sinensis ID: 017473


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-
MESHMREWYKSQPSDELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFYKPRYTKSNDAHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEWGMHEYHVKDDPSYEKEFVLCRIFRRRNKKKRGISTTDEGESSEQLVSSPQQSPCDHSISNGNHSEESTHTIPQQQLQYHNSISYPRNLNVQNTFAYPQSITDDSPTQLSPGNHSEGTTHTYPQQLPNHNSISYTGNRIEQNMYLLKYLPNYSPAQQLPCDHFISARNYSEEYVPINQQPQPNDNLISHLGSQIEGFSPTNSLLPQNSNLEWQLPLHHPLEIHNSAENPFRKDVSQLEKSRHTVATIG
ccccccccccccccHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccEEccccccccccccccccccccccccccccccccccccEEEEEEEEEEccccccccccccccEEEEEEccccccccccEEEEEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHcccccccccccccccccccccccccccccccccccccHHHHHHcccccccccccccccccccccccccccccccccccccccccccHHHHHHccccccccc
ccccccccccccccHHHHHHHHHHHHccccccccEEHHHEEcccccccccccHcccccccEEEEEcccccccccccccccccccEEEEccccccEcccccEEEEEEEEEEEEcccccccccccEEEEEEEccccccccccEEEEEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHcccccccccccccccccccccccccccccccccccccccccHHHHHHccHcccEEEEEcc
meshmrewyksqpsdELILCLLKkkrldphfsygpikdighicslepwdlatesetdsedqtcyffykprytksndahratnagswkvtRKVSQIearngltgtkKSLTFYLRGlprkeaitewgmheyhvkddpsyekEFVLCRIFRrrnkkkrgisttdegesseqlvsspqqspcdhsisngnhseesthtipqqqlqyhnsisyprnlnvqntfaypqsitddsptqlspgnhsegtthtypqqlpnhnsisytgNRIEQNMYLLkylpnyspaqqlpcdhfisarnyseeyvpinqqpqpndnlishlgsqiegfsptnsllpqnsnlewqlplhhpleihnsaenpfrkdvSQLEKSRHTVATIG
meshmrewyksqpsdELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFYkprytksndahratnagswkvtrkvsqiearngltgtkksltfyLRGLPRKEAITEWGMheyhvkddpsyeKEFVLCRifrrrnkkkrgisttdegesseqlvsspqqspCDHSISNGNHSEESTHTIPQQQLQYHNSISYPRNLNVQNTFAYPQSITDDSPTQLSPGNHSEGTTHTYPQQLPNHNSISYTGNRIEQNMYLLKYLPNYSPAQQLPCDHFISARNYSEEYVPINQQPQPNDNLISHLGSQIEGFSPTNSLLPQNSNLEWQLPLHHPLEIHNSAENPFRKDVSQLEKSRHTVATIG
MESHMREWYKSQPSDELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFYKPRYTKSNDAHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEWGMHEYHVKDDPSYEKEFVLCRIFRRRNKKKRGISTTDEGESSEQLVSSPQQSPCDHSISNGNHSEESTHTIPQQQLQYHNSISYPRNLNVQNTFAYPQSITDDSPTQLSPGNHSEGTTHTYPQQLPNHNSISYTGNRIEQNMYLLKYLPNYSPAQQLPCDHFISARNYSEEYVPINQQPQPNDNLISHLGSQIEGFSPTNSLLPQNSNLEWQLPLHHPLEIHNSAENPFRKDVSQLEKSRHTVATIG
***************ELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATE******DQTCYFFYKPRYTKSN*AHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEWGMHEYHVKDDPSYEKEFVLCRIF****************************************************************LNVQNTF************************************ISYTGNRIEQNMYLLKYLPNYSPAQQLPCDHFISARNYSEEYVP************************************WQL**********************************
**SHMREWYKSQPSDELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFYKPRYTKSNDAHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEWGMHEYHVKDDPSYEKEFVLCRIFR******************************************************************************************************************************************************************************************WQLPLHH***********************H****I*
************PSDELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLA********DQTCYFFYKPRYTKSNDAHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEWGMHEYHVKDDPSYEKEFVLCRIFRRRN***********************************************QLQYHNSISYPRNLNVQNTFAYPQSITDDSPTQLSPGNHSEGTTHTYPQQLPNHNSISYTGNRIEQNMYLLKYLPNYSPAQQLPCDHFISARNYSEEYVPINQQPQPNDNLISHLGSQIEGFSPTNSLLPQNSNLEWQLPLHHPLEIHNSAENPFRKD***************
****MREWYKSQPSDELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFYKPRYTKSNDAHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEWGMHEYHVKDDPSYEKEFVLCRIFRRRN*****************************************************SISYPRNLNVQNTFAYPQSIT**************G*THTYPQQLPNHNS**Y*GNRIEQNMYLLKYL*********************************************************N*NLEWQLPLHHPLEIHNSAENPFRKDVSQLEKSRHTVATIG
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MESHMREWYKSQPSDELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFYKPRYTKSNDAHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEWGMHEYHVKDDPSYEKEFVLCRIFRRRNKKKRGISTTDEGESSEQLVSSPQQSPCDHSISNGNHSEESTHTIPQQQLQYHNSISYPRNLNVQNTFAYPQSITDDSPTQLSPGNHSEGTTHTYPQQLPNHNSISYTGNRIEQNMYLLKYLPNYSPAQQLPCDHFISARNYSEEYVPINQQPQPNDNLISHLGSQIEGFSPTNSLLPQNSNLEWQLPLHHPLEIHNSAENPFRKDVSQLEKSRHTVATIG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query371 2.2.26 [Sep-21-2011]
Q9FWX2395 NAC domain-containing pro yes no 0.423 0.397 0.319 6e-16
Q9SV87341 Protein BEARSKIN1 OS=Arab no no 0.517 0.563 0.305 1e-14
Q84K00 567 NAC domain-containing pro no no 0.644 0.421 0.262 2e-14
Q94F58340 NAC domain-containing pro no no 0.371 0.405 0.369 3e-13
Q9C878305 Protein BEARSKIN1 OS=Arab no no 0.393 0.478 0.324 4e-13
Q7GCL7 489 NAC domain-containing pro no no 0.361 0.274 0.356 5e-13
Q9FLJ2336 NAC domain-containing pro no no 0.390 0.431 0.329 9e-13
Q9M126457 NAC domain-containing pro no no 0.436 0.354 0.319 7e-12
Q9S851334 Protein CUP-SHAPED COTYLE no no 0.388 0.431 0.313 1e-11
O81913359 NAC domain-containing pro no no 0.417 0.431 0.307 2e-11
>sp|Q9FWX2|NAC7_ARATH NAC domain-containing protein 7 OS=Arabidopsis thaliana GN=NAC007 PE=2 SV=2 Back     alignment and function desciption
 Score = 85.5 bits (210), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 53/166 (31%), Positives = 86/166 (51%), Gaps = 9/166 (5%)

Query: 3   SHMREWYKSQPSDE-LILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQ 61
           SH+   ++  P+DE L+   L+KK          IKDI  +  +EPWDL    +   E+Q
Sbjct: 5   SHVPPGFRFHPTDEELVDYYLRKKVASKRIEIDFIKDID-LYKIEPWDLQELCKIGHEEQ 63

Query: 62  TCYFFYK---PRYTKSNDAHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRK 118
           + ++F+     +Y      +RAT AG WK T +   I  R+ L G +K+L FY    P  
Sbjct: 64  SDWYFFSHKDKKYPTGTRTNRATKAGFWKATGRDKAIYLRHSLIGMRKTLVFYKGRAPNG 123

Query: 119 EAITEWGMHEYHVKDDPS---YEKEFVLCRIFRRRNKKKRGISTTD 161
           +  ++W MHEY ++ D +    E+ +V+CR+F++R    R +   D
Sbjct: 124 QK-SDWIMHEYRLETDENGTPQEEGWVVCRVFKKRLAAVRRMGDYD 168




Transcription factor probably involved in xylem formation.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9SV87|BRN2_ARATH Protein BEARSKIN1 OS=Arabidopsis thaliana GN=BRN2 PE=2 SV=1 Back     alignment and function description
>sp|Q84K00|NAC78_ARATH NAC domain-containing protein 78 OS=Arabidopsis thaliana GN=NAC078 PE=2 SV=2 Back     alignment and function description
>sp|Q94F58|NAC89_ARATH NAC domain-containing protein 89 OS=Arabidopsis thaliana GN=NAC089 PE=1 SV=1 Back     alignment and function description
>sp|Q9C878|BRN1_ARATH Protein BEARSKIN1 OS=Arabidopsis thaliana GN=BRN1 PE=2 SV=1 Back     alignment and function description
>sp|Q7GCL7|NAC74_ORYSJ NAC domain-containing protein 74 OS=Oryza sativa subsp. japonica GN=NAC74 PE=2 SV=1 Back     alignment and function description
>sp|Q9FLJ2|NC100_ARATH NAC domain-containing protein 100 OS=Arabidopsis thaliana GN=NAC100 PE=2 SV=1 Back     alignment and function description
>sp|Q9M126|NAC69_ARATH NAC domain-containing protein 69 OS=Arabidopsis thaliana GN=NAC69 PE=2 SV=1 Back     alignment and function description
>sp|Q9S851|NAC31_ARATH Protein CUP-SHAPED COTYLEDON 3 OS=Arabidopsis thaliana GN=NAC031 PE=1 SV=1 Back     alignment and function description
>sp|O81913|NAC4_ARATH NAC domain-containing protein 4 OS=Arabidopsis thaliana GN=NAC004 PE=2 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query371
255548095 612 transcription factor, putative [Ricinus 0.417 0.253 0.344 3e-18
356560207 1061 PREDICTED: uncharacterized protein LOC10 0.442 0.154 0.370 2e-16
449526798 518 PREDICTED: uncharacterized LOC101203389 0.382 0.274 0.387 5e-16
449445005 506 PREDICTED: uncharacterized protein LOC10 0.382 0.280 0.387 7e-16
356522462 497 PREDICTED: NAC domain-containing protein 0.442 0.329 0.370 9e-16
356547116370 PREDICTED: NAC domain-containing protein 0.471 0.472 0.310 2e-15
449443648366 PREDICTED: NAC domain-containing protein 0.495 0.502 0.331 3e-15
449492840350 PREDICTED: NAC domain-containing protein 0.495 0.525 0.331 4e-15
357453161 589 NAC domain protein [Medicago truncatula] 0.425 0.268 0.327 4e-15
242079245364 hypothetical protein SORBIDRAFT_07g02120 0.377 0.384 0.369 1e-14
>gi|255548095|ref|XP_002515104.1| transcription factor, putative [Ricinus communis] gi|223545584|gb|EEF47088.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 60/174 (34%), Positives = 87/174 (50%), Gaps = 19/174 (10%)

Query: 6   REWYKSQPSDELILCLLKKKRLDPH-FSYGPIKDIGHICSLEPWDLATESETDSEDQTCY 64
           R  Y+  P+DE ++    + ++D H F       +  +C  +PWDL   +   S DQ  Y
Sbjct: 4   RVGYRFHPTDEELVSYFLRHKIDGHDFLVDDDIHVIDLCKFDPWDLPGFASQGSNDQVWY 63

Query: 65  FFY----------KPRYTKSNDAHRATNAGSWKVTRKVSQIEARNG--LTGTKKSLTFYL 112
           FFY          KPRY ++     AT  G WK T     ++A+    + G K++L FYL
Sbjct: 64  FFYRRELKYKSPSKPRYNRTT----ATGTGHWKPTGAERLVKAKRTKRVIGVKRTLVFYL 119

Query: 113 RGLPRKEAITEWGMHEYHVKDDPSYEKEFVLCRIFRRRNKKKRGISTTDEGESS 166
           RG P KE  T W +HEY  K  P ++ + VLC++ + +   K   S  DEGESS
Sbjct: 120 RGTP-KEVKTNWVIHEYEPKTSPPHQADLVLCKL-KEKADNKADTSGCDEGESS 171




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356560207|ref|XP_003548385.1| PREDICTED: uncharacterized protein LOC100778755 [Glycine max] Back     alignment and taxonomy information
>gi|449526798|ref|XP_004170400.1| PREDICTED: uncharacterized LOC101203389 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449445005|ref|XP_004140264.1| PREDICTED: uncharacterized protein LOC101203389 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356522462|ref|XP_003529865.1| PREDICTED: NAC domain-containing protein 69-like [Glycine max] Back     alignment and taxonomy information
>gi|356547116|ref|XP_003541963.1| PREDICTED: NAC domain-containing protein 48-like [Glycine max] Back     alignment and taxonomy information
>gi|449443648|ref|XP_004139589.1| PREDICTED: NAC domain-containing protein 89-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449492840|ref|XP_004159117.1| PREDICTED: NAC domain-containing protein 89-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|357453161|ref|XP_003596857.1| NAC domain protein [Medicago truncatula] gi|355485905|gb|AES67108.1| NAC domain protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|242079245|ref|XP_002444391.1| hypothetical protein SORBIDRAFT_07g021200 [Sorghum bicolor] gi|241940741|gb|EES13886.1| hypothetical protein SORBIDRAFT_07g021200 [Sorghum bicolor] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query371
TAIR|locus:2202028395 NAC007 "NAC 007" [Arabidopsis 0.563 0.529 0.303 1.1e-17
TAIR|locus:2095007314 NAC058 "NAC domain containing 0.776 0.917 0.271 2.7e-17
TAIR|locus:2061082316 NAC038 "NAC domain containing 0.636 0.746 0.3 4.6e-17
TAIR|locus:2122804341 NAC070 "NAC domain containing 0.517 0.563 0.324 5.4e-17
TAIR|locus:2196641305 NAC015 "NAC domain containing 0.541 0.659 0.328 8.1e-17
TAIR|locus:2167923348 NAC101 "NAC-domain protein 101 0.824 0.879 0.266 9e-17
TAIR|locus:2164895285 NAC6 "NAC domain containing pr 0.638 0.831 0.280 1.1e-16
TAIR|locus:2128013 534 NTL9 "NAC transcription factor 0.450 0.312 0.344 1.2e-16
TAIR|locus:2039600335 NTL8 "NTM1-like 8" [Arabidopsi 0.477 0.528 0.331 1.6e-16
TAIR|locus:2089764246 NAC057 "NAC domain containing 0.595 0.898 0.304 3.9e-16
TAIR|locus:2202028 NAC007 "NAC 007" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 235 (87.8 bits), Expect = 1.1e-17, P = 1.1e-17
 Identities = 69/227 (30%), Positives = 113/227 (49%)

Query:     3 SHMREWYKSQPSDE-LILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQ 61
             SH+   ++  P+DE L+   L+KK          IKDI  +  +EPWDL    +   E+Q
Sbjct:     5 SHVPPGFRFHPTDEELVDYYLRKKVASKRIEIDFIKDID-LYKIEPWDLQELCKIGHEEQ 63

Query:    62 TCYFFYK---PRYTKSNDAHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRK 118
             + ++F+     +Y      +RAT AG WK T +   I  R+ L G +K+L FY    P  
Sbjct:    64 SDWYFFSHKDKKYPTGTRTNRATKAGFWKATGRDKAIYLRHSLIGMRKTLVFYKGRAPNG 123

Query:   119 EAITEWGMHEYHVKDDPS---YEKEFVLCRIFRRRNKKKRGISTTDEGES---SEQL--V 170
             +  ++W MHEY ++ D +    E+ +V+CR+F++R    R +   D   S    +QL  +
Sbjct:   124 QK-SDWIMHEYRLETDENGTPQEEGWVVCRVFKKRLAAVRRMGDYDSSPSHWYDDQLSFM 182

Query:   171 SSPQQSPCDHSISNGNHSEESTHTIPQQQLQYH-NSISYP-RNLNVQ 215
             +S  ++     I   NH ++  H   QQ + Y  N+ +Y   N N+Q
Sbjct:   183 ASELETNGQRRILP-NHHQQQQHE-HQQHMPYGLNASAYALNNPNLQ 227




GO:0003677 "DNA binding" evidence=IEA
GO:0005634 "nucleus" evidence=ISM
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA;TAS
GO:0009793 "embryo development ending in seed dormancy" evidence=NAS
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0042546 "cell wall biogenesis" evidence=RCA
GO:0044036 "cell wall macromolecule metabolic process" evidence=RCA
TAIR|locus:2095007 NAC058 "NAC domain containing protein 58" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2061082 NAC038 "NAC domain containing protein 38" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2122804 NAC070 "NAC domain containing protein 70" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2196641 NAC015 "NAC domain containing protein 15" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2167923 NAC101 "NAC-domain protein 101" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2164895 NAC6 "NAC domain containing protein 6" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2128013 NTL9 "NAC transcription factor-like 9" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2039600 NTL8 "NTM1-like 8" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2089764 NAC057 "NAC domain containing protein 57" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
Sb07g021200.1
hypothetical protein (364 aa)
(Sorghum bicolor)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query371
pfam02365130 pfam02365, NAM, No apical meristem (NAM) protein 7e-26
>gnl|CDD|216997 pfam02365, NAM, No apical meristem (NAM) protein Back     alignment and domain information
 Score =  100 bits (251), Expect = 7e-26
 Identities = 44/123 (35%), Positives = 64/123 (52%), Gaps = 8/123 (6%)

Query: 13  PSD-ELILCLLKKKRLD-PHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFYKP- 69
           P+D EL++  LK+K L  P      I ++  I   EPWDL  + +    D+  YFF    
Sbjct: 9   PTDEELVVYYLKRKVLGKPLPLLDVIPEVD-IYKFEPWDLP-DGKAKGGDREWYFFSPRD 66

Query: 70  -RYTKSNDAHRATNAGSWKVTRKVSQIEARNG-LTGTKKSLTFYLRGLPRKEAITEWGMH 127
            +Y   +  +RAT +G WK T K   + ++ G + G KK+L FY +G   K   T+W MH
Sbjct: 67  RKYPNGSRTNRATGSGYWKATGKDKPVLSKGGEVVGMKKTLVFY-KGRAPKGEKTDWVMH 125

Query: 128 EYH 130
           EY 
Sbjct: 126 EYR 128


This is a family of no apical meristem (NAM) proteins these are plant development proteins. Mutations in NAM result in the failure to develop a shoot apical meristem in petunia embryos. NAM is indicated as having a role in determining positions of meristems and primordial. One member of this family NAP (NAC-like, activated by AP3/PI) is encoded by the target genes of the AP3/PI transcriptional activators and functions in the transition between growth by cell division and cell expansion in stamens and petals. Length = 130

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 371
PF02365129 NAM: No apical meristem (NAM) protein; InterPro: I 100.0
>PF02365 NAM: No apical meristem (NAM) protein; InterPro: IPR003441 The NAC domain (for Petunia hybrida (Petunia) NAM and for Arabidopsis ATAF1, ATAF2, and CUC2) is an N-terminal module of ~160 amino acids, which is found in proteins of the NAC family of plant-specific transcriptional regulators (no apical meristem (NAM) proteins) [] Back     alignment and domain information
Probab=100.00  E-value=1.6e-39  Score=277.27  Aligned_cols=124  Identities=32%  Similarity=0.600  Sum_probs=92.1

Q ss_pred             CCCCceECCc-HHHHHHHHhhhhcCCCCCC-CCccccCCccCCCCCCccccCCCCCCCceEEEee-ccC-CCCCCCCccc
Q 017473            5 MREWYKSQPS-DELILCLLKKKRLDPHFSY-GPIKDIGHICSLEPWDLATESETDSEDQTCYFFY-KPR-YTKSNDAHRA   80 (371)
Q Consensus         5 LPPGfRF~PT-EELV~~YLr~KI~G~plp~-~vI~ev~DVY~~ePWdL~~~~pGk~~d~eWYFFS-R~r-y~~G~R~nRa   80 (371)
                      |||||||+|| +|||.+||++|+.|.+++. ++|.++ |||++|||+|+..  ....+++||||+ +.+ +.+|.|.+|+
T Consensus         1 LP~G~rF~PtD~ELi~~yL~~k~~g~~~~~~~~i~~~-Diy~~~P~~L~~~--~~~~~~~~yFF~~~~~~~~~~~r~~R~   77 (129)
T PF02365_consen    1 LPPGFRFRPTDEELINHYLRPKILGEPLPCEDVIHDV-DIYSAHPWELPAK--FKGGDEEWYFFSPRKKKYPNGGRPNRV   77 (129)
T ss_dssp             --TTEEE---HHHHHHCTHHHHHTT-HHCS-CHSEE---GGGS-GGGCHHH--SSS-SSEEEEEEE----------S-EE
T ss_pred             CCCceEecCChHHHHHHHHHHHhcCCCCCcccceeec-ccCccChHHhhhh--ccCCCceEEEEEecccccCCccccccc
Confidence            8999999999 9999999999999999887 899999 9999999999732  233577999999 655 7789999999


Q ss_pred             CCCceeeeeCCceeEee-CCceEEEEEEEeeeccCCCCCCCCcCeEEEEEEeC
Q 017473           81 TNAGSWKVTRKVSQIEA-RNGLTGTKKSLTFYLRGLPRKEAITEWGMHEYHVK  132 (371)
Q Consensus        81 tg~G~WKatG~dK~I~~-~g~vVG~KKtLvFY~gr~~r~p~KT~WvMhEYrL~  132 (371)
                      +++|+||++|++++|.+ +|.+||+|++|+||.++.+ .+.+|+|+||||+|.
T Consensus        78 ~~~G~Wk~~g~~~~i~~~~g~~iG~k~~l~f~~~~~~-~~~kt~W~M~EY~L~  129 (129)
T PF02365_consen   78 TGGGYWKSTGKEKPIKDPGGKVIGFKKTLVFYSGKSP-NGKKTGWVMHEYSLE  129 (129)
T ss_dssp             ETTEEEEEECEEEEEEE-TTCEEEEEEEEEEEESSTT-S-EEEEEEEEEEEE-
T ss_pred             ccceEEeecccccccccccceeeeeEEEEEEEeccCC-CCCcCCeEEEEEEeC
Confidence            99999999999999999 8999999999999987521 344999999999984



NAC proteins are involved in developmental processes, including formation of the shoot apical meristem, floral organs and lateral shoots, as well as in plant hormonal control and defence. The NAC domain is accompanied by diverse C-terminal transcriptional activation domains. The NAC domain has been shown to be a DNA-binding domain (DBD) and a dimerization domain [,]. The NAC domain can be subdivided into five subdomains (A-E). Each subdomain is distinguishable by blocks of heterogeneous amino acids or gaps. While the NAC domains were rich in basic amino acids (R, K and H) as a whole, the distribution of positive and negative amino acids in each subdomain were unequal. Subdomains C and D are rich in basic amino acids but poor in acidic amino acids, while subdomain B contains a high proportion of acidic amino acids. Putative nuclear localization signals (NLS) have been detected in subdomains C and D []. The DBD is contained within a 60 amino acid region located within subdomains D and E []. The overall structure of the NAC domain monomer consists of a very twisted antiparallel beta-sheet, which packs against an N-terminal alpha-helix on one side and one shorter helix on the other side surrounded by a few helical elements. The structure suggests that the NAC domain mediates dimerization through conserved interactions including a salt bridge, and DNA binding through the NAC dimer face rich in positive charges [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1UT4_A 3SWM_B 4DUL_B 3SWP_D 1UT7_B 3ULX_A.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query371
1ut4_A171 Structure Of The Conserved Domain Of Anac, A Member 7e-09
3swm_A174 The Nac Domain Of Anac019 In Complex With Dna, Gold 9e-09
3ulx_A174 Crystal Structural Of The Conserved Domain Of Rice 2e-07
>pdb|1UT4|A Chain A, Structure Of The Conserved Domain Of Anac, A Member Of The Nac Family Of Transcription Factors Length = 171 Back     alignment and structure

Iteration: 1

Score = 58.2 bits (139), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 42/156 (26%), Positives = 76/156 (48%), Gaps = 14/156 (8%) Query: 9 YKSQPSDE-LILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFY 67 ++ P+DE L++ L +K FS I +I + +PW L ++ ++ ++F+ Sbjct: 21 FRFYPTDEELMVQYLCRKAAGYDFSLQLIAEI-DLYKFDPWVLPNKALFGEKE---WYFF 76 Query: 68 KPR---YTKSNDAHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEW 124 PR Y + +R +G WK T I G KK+L FY+ P K T W Sbjct: 77 SPRDRKYPNGSRPNRVAGSGYWKATGTDKIISTEGQRVGIKKALVFYIGKAP-KGTKTNW 135 Query: 125 GMHEYHV-----KDDPSYEKEFVLCRIFRRRNKKKR 155 MHEY + ++ + ++VLCRI+++++ ++ Sbjct: 136 IMHEYRLIEPSRRNGSTKLDDWVLCRIYKKQSSAQK 171
>pdb|3SWM|A Chain A, The Nac Domain Of Anac019 In Complex With Dna, Gold Derivative Length = 174 Back     alignment and structure
>pdb|3ULX|A Chain A, Crystal Structural Of The Conserved Domain Of Rice Stress-Responsive Nac1 Length = 174 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query371
1ut7_A171 No apical meristem protein; transcription regulati 3e-32
3ulx_A174 Stress-induced transcription factor NAC1; NAC fami 4e-31
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 3e-04
>1ut7_A No apical meristem protein; transcription regulation, transcription, transcription factor, DNA binding, abscisic acid response, NAC domain; 1.9A {Arabidopsis thaliana} SCOP: b.143.1.1 PDB: 1ut4_A 4dul_A 3swp_A 3swm_A Length = 171 Back     alignment and structure
 Score =  118 bits (297), Expect = 3e-32
 Identities = 44/152 (28%), Positives = 71/152 (46%), Gaps = 14/152 (9%)

Query: 13  PSD-ELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFYKPR- 70
           P+D EL++  L +K     FS   I +I  +   +PW L  ++     ++  YFF  PR 
Sbjct: 25  PTDEELMVQYLCRKAAGYDFSLQLIAEI-DLYKFDPWVLPNKAL--FGEKEWYFFS-PRD 80

Query: 71  --YTKSNDAHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEWGMHE 128
             Y   +  +R   +G WK T     I       G KK+L FY  G   K   T W MHE
Sbjct: 81  RKYPNGSRPNRVAGSGYWKATGTDKIISTEGQRVGIKKALVFY-IGKAPKGTKTNWIMHE 139

Query: 129 YHVKDD-----PSYEKEFVLCRIFRRRNKKKR 155
           Y + +       +   ++VLCRI+++++  ++
Sbjct: 140 YRLIEPSRRNGSTKLDDWVLCRIYKKQSSAQK 171


>3ulx_A Stress-induced transcription factor NAC1; NAC family, stress-responsive, DNA binding protein; 2.60A {Oryza sativa subsp} Length = 174 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query371
1ut7_A171 No apical meristem protein; transcription regulati 100.0
3ulx_A174 Stress-induced transcription factor NAC1; NAC fami 100.0
>1ut7_A No apical meristem protein; transcription regulation, transcription, transcription factor, DNA binding, abscisic acid response, NAC domain; 1.9A {Arabidopsis thaliana} SCOP: b.143.1.1 PDB: 1ut4_A 4dul_A 3swp_A 3swm_A Back     alignment and structure
Probab=100.00  E-value=7.6e-51  Score=366.11  Aligned_cols=147  Identities=29%  Similarity=0.575  Sum_probs=123.6

Q ss_pred             CCCCCCceECCc-HHHHHHHHhhhhcCCCCCCCCccccCCccCCCCCCccccCCCCCCCceEEEee-ccC-CCCCCCCcc
Q 017473            3 SHMREWYKSQPS-DELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFY-KPR-YTKSNDAHR   79 (371)
Q Consensus         3 ~~LPPGfRF~PT-EELV~~YLr~KI~G~plp~~vI~ev~DVY~~ePWdL~~~~pGk~~d~eWYFFS-R~r-y~~G~R~nR   79 (371)
                      ..|||||||+|| ||||.+||++|+.|.+++.++|.++ |||++|||+|+...  ...+++||||+ |++ |++|.|++|
T Consensus        15 ~~lPpGfRF~PTDeELv~~YL~~K~~g~~~~~~~I~e~-Diy~~~Pw~Lp~~~--~~g~~ewyFFs~r~~k~~~g~R~~R   91 (171)
T 1ut7_A           15 LSLPPGFRFYPTDEELMVQYLCRKAAGYDFSLQLIAEI-DLYKFDPWVLPNKA--LFGEKEWYFFSPRDRKYPNGSRPNR   91 (171)
T ss_dssp             SCCCTTEEECCCHHHHHHHTHHHHHTTCCCSSCCSEEC-CGGGSCGGGHHHHS--SSCSSEEEEEEECCC-------CCE
T ss_pred             cCCCCcceeCCChHHHHHHHHHHHhcCCCCCCCeEeec-ccccCChhHhhchh--hcCCccEEEEeccccccCCCCcccc
Confidence            579999999999 9999999999999999999999999 99999999997653  23478999999 777 999999999


Q ss_pred             cCCCceeeeeCCceeEeeCCceEEEEEEEeeeccCCCCCCCCcCeEEEEEEeCCCCC-----CCCCeEEEEEEEccCCC
Q 017473           80 ATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEWGMHEYHVKDDPS-----YEKEFVLCRIFRRRNKK  153 (371)
Q Consensus        80 atg~G~WKatG~dK~I~~~g~vVG~KKtLvFY~gr~~r~p~KT~WvMhEYrL~~~~~-----~~~d~VLCRIykK~~~~  153 (371)
                      +|++||||+||++++|..++.+||+||+|+||.|+.+ .+.||+|+||||+|.....     ..++|||||||+|+...
T Consensus        92 ~t~~G~Wk~tG~~k~I~~~~~~vG~KktLvFy~g~~p-~g~kT~WvMhEY~l~~~~~~~~~~~~~~~VlCrv~~k~~~~  169 (171)
T 1ut7_A           92 VAGSGYWKATGTDKIISTEGQRVGIKKALVFYIGKAP-KGTKTNWIMHEYRLIEPSRRNGSTKLDDWVLCRIYKKQSSA  169 (171)
T ss_dssp             EETTEEEEEEEEEEEEEETTEEEEEEEEEEEEESSTT-SCEEEEEEEEEEEECCCC--------CCEEEEEEEECC---
T ss_pred             cCCCCEEeccCCCceEEecCcEEEEEEEEEEEcCcCC-CCCcCCeEEEEEEcCCCccccCcccCCCEEEEEEEEcCCCC
Confidence            9999999999999999998999999999999999832 3349999999999998752     36899999999988654



>3ulx_A Stress-induced transcription factor NAC1; NAC family, stress-responsive, DNA binding protein; 2.60A {Oryza sativa subsp} SCOP: b.143.1.1 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 371
d1ut7a_166 b.143.1.1 (A:) No apical meristem (NAM, ANAC) {Mou 1e-24
>d1ut7a_ b.143.1.1 (A:) No apical meristem (NAM, ANAC) {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 166 Back     information, alignment and structure

class: All beta proteins
fold: NAC domain
superfamily: NAC domain
family: NAC domain
domain: No apical meristem (NAM, ANAC)
species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 96.2 bits (239), Expect = 1e-24
 Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 8/148 (5%)

Query: 9   YKSQPSD-ELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFY 67
           ++  P+D EL++  L +K     FS   I +I  +   +PW L  ++    ++   +   
Sbjct: 21  FRFYPTDEELMVQYLCRKAAGYDFSLQLIAEI-DLYKFDPWVLPNKALFGEKEWYFFSPR 79

Query: 68  KPRYTKSNDAHRATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEWGMH 127
             +Y   +  +R   +G WK T     I       G KK+L FY  G   K   T W MH
Sbjct: 80  DRKYPNGSRPNRVAGSGYWKATGTDKIISTEGQRVGIKKALVFY-IGKAPKGTKTNWIMH 138

Query: 128 EYHVKDD-----PSYEKEFVLCRIFRRR 150
           EY + +       +   ++VLCRI++++
Sbjct: 139 EYRLIEPSRRNGSTKLDDWVLCRIYKKQ 166


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query371
d1ut7a_166 No apical meristem (NAM, ANAC) {Mouse-ear cress (A 100.0
>d1ut7a_ b.143.1.1 (A:) No apical meristem (NAM, ANAC) {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: All beta proteins
fold: NAC domain
superfamily: NAC domain
family: NAC domain
domain: No apical meristem (NAM, ANAC)
species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=100.00  E-value=1.5e-48  Score=345.30  Aligned_cols=145  Identities=29%  Similarity=0.586  Sum_probs=122.7

Q ss_pred             CCCCCCCceECCc-HHHHHHHHhhhhcCCCCCCCCccccCCccCCCCCCccccCCCCCCCceEEEee-ccC-CCCCCCCc
Q 017473            2 ESHMREWYKSQPS-DELILCLLKKKRLDPHFSYGPIKDIGHICSLEPWDLATESETDSEDQTCYFFY-KPR-YTKSNDAH   78 (371)
Q Consensus         2 e~~LPPGfRF~PT-EELV~~YLr~KI~G~plp~~vI~ev~DVY~~ePWdL~~~~pGk~~d~eWYFFS-R~r-y~~G~R~n   78 (371)
                      +.+|||||||+|| ||||.+||++|+.|.+++.++|+++ |||++|||+|+...  ...+++||||+ |.+ +++|.|.+
T Consensus        14 ~l~LPpG~RF~PTDeELv~~YL~~Ki~g~~l~~~~I~~~-Dvy~~~Pw~Lp~~~--~~~~~~wyFft~~~~k~~~g~r~~   90 (166)
T d1ut7a_          14 QLSLPPGFRFYPTDEELMVQYLCRKAAGYDFSLQLIAEI-DLYKFDPWVLPNKA--LFGEKEWYFFSPRDRKYPNGSRPN   90 (166)
T ss_dssp             SSCCCTTEEECCCHHHHHHHTHHHHHTTCCCSSCCSEEC-CGGGSCGGGHHHHS--SSCSSEEEEEEECCC-------CC
T ss_pred             cccCCCccccCCCcHHHHHHHHHHHHcCCCCCcccceec-cCCcCChhhccchh--ccCcceEEEEeeeccccCCCCccc
Confidence            4579999999999 9999999999999999999999999 99999999997642  23478899999 776 89999999


Q ss_pred             ccCCCceeeeeCCceeEeeCCceEEEEEEEeeeccCCCCCCCCcCeEEEEEEeCCCCCC-----CCCeEEEEEEEcc
Q 017473           79 RATNAGSWKVTRKVSQIEARNGLTGTKKSLTFYLRGLPRKEAITEWGMHEYHVKDDPSY-----EKEFVLCRIFRRR  150 (371)
Q Consensus        79 Ratg~G~WKatG~dK~I~~~g~vVG~KKtLvFY~gr~~r~p~KT~WvMhEYrL~~~~~~-----~~d~VLCRIykK~  150 (371)
                      |++++|+||++|+++.|.++|.+||+||+|+||+++.+ .+.+|+|+||||+|.+....     .++|||||||+|+
T Consensus        91 R~~g~G~Wk~~g~~~~i~~~g~~vG~kk~l~fy~~~~~-~~~~t~W~M~EY~l~~~~~~~~~~~~~~~VLCrI~~Kk  166 (166)
T d1ut7a_          91 RVAGSGYWKATGTDKIISTEGQRVGIKKALVFYIGKAP-KGTKTNWIMHEYRLIEPSRRNGSTKLDDWVLCRIYKKQ  166 (166)
T ss_dssp             EEETTEEEEEEEEEEEEEETTEEEEEEEEEEEEESSTT-SCEEEEEEEEEEEECCCC--------CCEEEEEEEECC
T ss_pred             cccCCCEecccCCCceEecCCcEEEEEEEEEEEecCCC-CCCccCeEEEEEecCCcccccCccccCCEEEEEEEecC
Confidence            99999999999999999999999999999999999832 33499999999999886532     5789999999974