Citrus Sinensis ID: 044290


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80
MAAGLSMLKLSVLSLLSNLNSGLWPESSKGLKMFPPTCNRIECPSFETVHVGNGFEIRRYDSSMWMSTSPIQDISLVEAF
cccHHHHHHHHHHHHHHHHccccccccccccccccccccccccccEEEEEEcccEEEEEEccccEEEccccccccHHHcc
ccHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccEEEEccccEEEEEccccEEEEccccccccHHHcc
MAAGLSMLKLSVLSLLSNlnsglwpesskglkmfpptcnriecpsfetvhvgngfeirrydssmwmstspiqdisLVEAF
MAAGLSMLKLSVLSLLSNLNSGLWPESSKGLKMFPPTCNRIECpsfetvhvgNGFEIRRydssmwmstspiqdislveAF
MAAGlsmlklsvlsllsnlnsglWPESSKGLKMFPPTCNRIECPSFETVHVGNGFEIRRYDSSMWMSTSPIQDISLVEAF
*******LKLSVLSLLSNLNSGLWPESSKGLKMFPPTCNRIECPSFETVHVGNGFEIRRYDSSMWMS*************
*****SMLKLSVLSLLSNLNSGLWPESSKGLKMFPPTCNRIECPSFETVHVGNGFEIRRYDSSMWMSTSPIQDISLVEAF
MAAGLSMLKLSVLSLLSNLNSGLWPESSKGLKMFPPTCNRIECPSFETVHVGNGFEIRRYDSSMWMSTSPIQDISLVEAF
*AAGLSMLKLSVLSLLSNLNSGLWPESSKGLKMFPPTCNRIECPSFETVHVGNGFEIRRYDSSMWMSTSPIQDISLVEA*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
SSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAAGLSMLKLSVLSLLSNLNSGLWPESSKGLKMFPPTCNRIECPSFETVHVGNGFEIRRYDSSMWMSTSPIQDISLVEAF
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query80
225470739 227 PREDICTED: heme-binding protein 2 [Vitis 0.687 0.242 0.636 8e-16
351723865 234 uncharacterized protein LOC100305993 pre 0.725 0.247 0.603 1e-15
147832981 227 hypothetical protein VITISV_035042 [Viti 0.687 0.242 0.636 2e-15
312281589 254 unnamed protein product [Thellungiella h 0.687 0.216 0.660 2e-15
18411970 225 SOUL heme-binding-like protein [Arabidop 0.9 0.32 0.531 2e-15
21593898 225 SOUL-like protein [Arabidopsis thaliana] 0.9 0.32 0.531 3e-15
255578538 234 Heme-binding protein, putative [Ricinus 0.8 0.273 0.542 4e-15
297844644 232 soul heme-binding family protein [Arabid 0.687 0.237 0.603 1e-14
15220033 232 SOUL heme-binding protein [Arabidopsis t 0.687 0.237 0.603 1e-14
388517637 217 unknown [Lotus japonicus] 0.637 0.235 0.666 1e-14
>gi|225470739|ref|XP_002266519.1| PREDICTED: heme-binding protein 2 [Vitis vinifera] Back     alignment and taxonomy information
 Score = 87.8 bits (216), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 35/55 (63%), Positives = 47/55 (85%)

Query: 25 PESSKGLKMFPPTCNRIECPSFETVHVGNGFEIRRYDSSMWMSTSPIQDISLVEA 79
          P S K   +FPPTC+RIECP+++ +  GNG+EIRRY+S++W+STSPIQDISLV+A
Sbjct: 22 PNSQKSTGLFPPTCSRIECPTYDLIQAGNGYEIRRYNSTVWISTSPIQDISLVDA 76




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|351723865|ref|NP_001236014.1| uncharacterized protein LOC100305993 precursor [Glycine max] gi|255627213|gb|ACU13951.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|147832981|emb|CAN68405.1| hypothetical protein VITISV_035042 [Vitis vinifera] Back     alignment and taxonomy information
>gi|312281589|dbj|BAJ33660.1| unnamed protein product [Thellungiella halophila] Back     alignment and taxonomy information
>gi|18411970|ref|NP_565181.1| SOUL heme-binding-like protein [Arabidopsis thaliana] gi|4836887|gb|AAD30590.1|AC007260_21 Hypothetical protein [Arabidopsis thaliana] gi|8052530|gb|AAF71794.1|AC013430_3 F3F9.4 [Arabidopsis thaliana] gi|332197986|gb|AEE36107.1| SOUL heme-binding-like protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|21593898|gb|AAM65865.1| SOUL-like protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|255578538|ref|XP_002530132.1| Heme-binding protein, putative [Ricinus communis] gi|223530357|gb|EEF32248.1| Heme-binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|297844644|ref|XP_002890203.1| soul heme-binding family protein [Arabidopsis lyrata subsp. lyrata] gi|297336045|gb|EFH66462.1| soul heme-binding family protein [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|15220033|ref|NP_173153.1| SOUL heme-binding protein [Arabidopsis thaliana] gi|5734756|gb|AAD50021.1|AC007651_16 Similar to SOUL Protein [Arabidopsis thaliana] gi|21592576|gb|AAM64525.1| SOUL-like protein [Arabidopsis thaliana] gi|88196741|gb|ABD43013.1| At1g17100 [Arabidopsis thaliana] gi|332191420|gb|AEE29541.1| SOUL heme-binding protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|388517637|gb|AFK46880.1| unknown [Lotus japonicus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query80
TAIR|locus:2020307 232 HBP1 "AT1G17100" [Arabidopsis 0.687 0.237 0.603 4.7e-16
TAIR|locus:2032065 225 AT1G78450 "AT1G78450" [Arabido 0.562 0.2 0.688 1.8e-14
TAIR|locus:2032055 219 AT1G78460 [Arabidopsis thalian 0.662 0.242 0.464 3.2e-10
TAIR|locus:2020307 HBP1 "AT1G17100" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 200 (75.5 bits), Expect = 4.7e-16, P = 4.7e-16
 Identities = 35/58 (60%), Positives = 47/58 (81%)

Query:    25 PESSKGLKM---FPPTCNRIECPSFETVHVGNGFEIRRYDSSMWMSTSPIQDISLVEA 79
             P++  G+     FPP+CNRIECPS+E VH GNG+EIRRY++++W+ST PI DISLV+A
Sbjct:    26 PDAESGVAQIGKFPPSCNRIECPSYELVHSGNGYEIRRYNNTVWVSTEPIPDISLVDA 83




GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0005773 "vacuole" evidence=IDA
GO:0005774 "vacuolar membrane" evidence=IDA
GO:0019761 "glucosinolate biosynthetic process" evidence=RCA
TAIR|locus:2032065 AT1G78450 "AT1G78450" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2032055 AT1G78460 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query80
pfam04832 174 pfam04832, SOUL, SOUL heme-binding protein 5e-09
>gnl|CDD|218284 pfam04832, SOUL, SOUL heme-binding protein Back     alignment and domain information
 Score = 49.6 bits (119), Expect = 5e-09
 Identities = 9/34 (26%), Positives = 15/34 (44%)

Query: 40 RIECPSFETVHVGNGFEIRRYDSSMWMSTSPIQD 73
           +E P +  +     +EIRRY+  +W  T     
Sbjct: 1  AVETPPYTVLEREGDYEIRRYEPMVWAETEVDAG 34


This family represents a group of putative heme-binding proteins. Our family includes archaeal and bacterial homologues. Length = 174

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 80
PF04832 176 SOUL: SOUL heme-binding protein; InterPro: IPR0069 99.19
>PF04832 SOUL: SOUL heme-binding protein; InterPro: IPR006917 This family represents a group of putative haem-binding proteins [] Back     alignment and domain information
Probab=99.19  E-value=1.1e-11  Score=86.40  Aligned_cols=39  Identities=44%  Similarity=0.894  Sum_probs=30.7

Q ss_pred             CcCCCCeEEEeecCceeEEeeCCCCeEEecccccCcchhc
Q 044290           40 RIECPSFETVHVGNGFEIRRYDSSMWMSTSPIQDISLVEA   79 (80)
Q Consensus        40 ~~EcP~Y~vv~~~~dYEvR~Y~~skWVST~~V~~~s~~~A   79 (80)
                      ++|||+|+|+++.++||||+|++++||+|. +.+.++++|
T Consensus         1 ~~E~P~Y~v~~~~~~~EiR~Y~~~~w~~t~-~~~~~~~~a   39 (176)
T PF04832_consen    1 DIECPPYEVLKKGDDYEIRRYPPAKWASTT-VSGCSFEEA   39 (176)
T ss_dssp             --BS-SEEEECCCSSCEEEEE--CEEEEEE-EECS-HHHH
T ss_pred             CCcCCCeEEEEeCCCEEEEEECCceEEEEE-ecCCChhHH
Confidence            469999999999999999999999999997 888877765



It includes archaeal and bacterial homologues.; PDB: 2HVA_A 2GOV_A 4A1M_A 3R85_E 2YC9_A 3R8K_B 3R8J_B.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query80
4b0y_A 227 Heme-binding protein 2; apoptosis; 3.50A {Homo sap 3e-08
3r8j_A 212 Heme-binding protein 2; HEBP family, SOUL protein, 1e-06
2gov_A 195 P22HBP, heme-binding protein 1; structural genomic 1e-06
>4b0y_A Heme-binding protein 2; apoptosis; 3.50A {Homo sapiens} Length = 227 Back     alignment and structure
 Score = 47.2 bits (111), Expect = 3e-08
 Identities = 11/49 (22%), Positives = 20/49 (40%), Gaps = 5/49 (10%)

Query: 35 PPTCNRIECPSFETVHVGN----GFEIRRYDSSMWMSTSPIQDISLVEA 79
                +E P ++           +EIR Y  + W+STS ++ +    A
Sbjct: 36 DAAAQAVETPGWKAPEDAGPQPGSYEIRHYGPAKWVSTS-VESMDWDSA 83


>3r8j_A Heme-binding protein 2; HEBP family, SOUL protein, apoptosis; 1.60A {Homo sapiens} PDB: 3r8k_A 2yc9_A 3r85_E Length = 212 Back     alignment and structure
>2gov_A P22HBP, heme-binding protein 1; structural genomics, protein structure initiative, PSI, center for eukaryotic structural genomics, CESG; NMR {Mus musculus} SCOP: d.60.1.4 PDB: 4a1m_A 2hva_A Length = 195 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query80
2gov_A 195 P22HBP, heme-binding protein 1; structural genomic 99.21
4b0y_A 227 Heme-binding protein 2; apoptosis; 3.50A {Homo sap 99.16
3r8j_A 212 Heme-binding protein 2; HEBP family, SOUL protein, 99.08
>2gov_A P22HBP, heme-binding protein 1; structural genomics, protein structure initiative, PSI, center for eukaryotic structural genomics, CESG; NMR {Mus musculus} SCOP: d.60.1.4 PDB: 4a1m_A 2hva_A Back     alignment and structure
Probab=99.21  E-value=9.2e-12  Score=88.43  Aligned_cols=43  Identities=16%  Similarity=0.313  Sum_probs=39.5

Q ss_pred             CCcCCcCCCCeEEEeecCc----eeEEeeCCCCeEEecccccCcchhc
Q 044290           36 PTCNRIECPSFETVHVGNG----FEIRRYDSSMWMSTSPIQDISLVEA   79 (80)
Q Consensus        36 ~fC~~~EcP~Y~vv~~~~d----YEvR~Y~~skWVST~~V~~~s~~~A   79 (80)
                      ..|.++|||+|+|+++.++    ||||+|++.+||+|+ +++.++++|
T Consensus        13 ~~~~~~E~P~Y~Vl~~~~~~~~~yEiR~Y~~~~wa~t~-~~~~~~~~A   59 (195)
T 2gov_A           13 SLFGSVETWPWQVLSTGGKEDVSYEERACEGGKFATVE-VTDKPVDEA   59 (195)
T ss_dssp             CCCCSCCSCCCEEEEECCCSSSCEEEEECCCCEEEEEE-EESSCHHHH
T ss_pred             cccCCCCCCCeEEEEecCCCCCcEEEEEeCCeEEEEEE-ecCCChhHH
Confidence            5688999999999999887    999999999999997 999988876



>4b0y_A Heme-binding protein 2; apoptosis; 3.50A {Homo sapiens} Back     alignment and structure
>3r8j_A Heme-binding protein 2; HEBP family, SOUL protein, apoptosis; 1.60A {Homo sapiens} PDB: 3r8k_A 2yc9_A 4ayz_A 4ayz_B 3r85_E Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 80
d2gova1 184 d.60.1.4 (A:7-190) Heme-binding protein 1 {Mouse ( 5e-05
>d2gova1 d.60.1.4 (A:7-190) Heme-binding protein 1 {Mouse (Mus musculus) [TaxId: 10090]} Length = 184 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: Probable bacterial effector-binding domain
superfamily: Probable bacterial effector-binding domain
family: SOUL heme-binding protein
domain: Heme-binding protein 1
species: Mouse (Mus musculus) [TaxId: 10090]
 Score = 37.0 bits (85), Expect = 5e-05
 Identities = 7/43 (16%), Positives = 17/43 (39%), Gaps = 5/43 (11%)

Query: 41 IECPSFETVHVGN----GFEIRRYDSSMWMSTSPIQDISLVEA 79
          +E   ++ +  G      +E R  +   + +   + D  + EA
Sbjct: 7  VETWPWQVLSTGGKEDVSYEERACEGGKFATVE-VTDKPVDEA 48


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query80
d2gova1 184 Heme-binding protein 1 {Mouse (Mus musculus) [TaxI 99.11
>d2gova1 d.60.1.4 (A:7-190) Heme-binding protein 1 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: Probable bacterial effector-binding domain
superfamily: Probable bacterial effector-binding domain
family: SOUL heme-binding protein
domain: Heme-binding protein 1
species: Mouse (Mus musculus) [TaxId: 10090]
Probab=99.11  E-value=2e-11  Score=83.55  Aligned_cols=40  Identities=18%  Similarity=0.363  Sum_probs=36.1

Q ss_pred             CCcCCCCeEEEeecC----ceeEEeeCCCCeEEecccccCcchhc
Q 044290           39 NRIECPSFETVHVGN----GFEIRRYDSSMWMSTSPIQDISLVEA   79 (80)
Q Consensus        39 ~~~EcP~Y~vv~~~~----dYEvR~Y~~skWVST~~V~~~s~~~A   79 (80)
                      +.+|||+|+|+++.+    +||+|+|++++||+|+ +.+.++++|
T Consensus         5 ~~~E~~~y~v~~~~~~~~~~fE~R~Y~~~~wv~t~-~~~~~~~~a   48 (184)
T d2gova1           5 GSVETWPWQVLSTGGKEDVSYEERACEGGKFATVE-VTDKPVDEA   48 (184)
T ss_dssp             CSCCSCCCEEEEECCCSSSCEEEEECCCCEEEEEE-EESSCHHHH
T ss_pred             ccccCCCcEEEEECCccCCCEEEEecCCceEEEEE-ecCCChhhH
Confidence            478999999999865    7999999999999997 999998876