Citrus Sinensis ID: 045131


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------
ATVALECKESEGGEVVYSREVVSDQSGTYKIPIEGCHAKLCQVRLVKSPKPECSEIVADGLSSARIDLTPSVGSDPELIRYANDLGFMKKESLPECAKVLEEMFIHG
cEEEEEEEcccccEEEEEEEEEEcccEEEEEEEEcccccEEEEEEcccccccccccccccccEEEEEEcccccccccccEEEcccccccccccHHHHHHHHHHHHcc
cEEEEEEEEccccEEEEEEEEEEccccEEEEEEcccHHHHEEEEEEccccccccccccccccccEEEEEcccccccccEEEccccccccccccHHHHHHHHHHcccc
ATVALeckeseggevvysrevvsdqsgtykipiegCHAKLCQVrlvkspkpecsEIVADGLssaridltpsvgsdpelIRYANDlgfmkkesLPECAKVLEEMFIHG
atvaleckeseggevvysreVVSDQSGTYKIPIEGCHAKLCQVRLVKSPKPECSEIVAdglssaridltpsvgsdpeLIRYANDLGFMKKESLPECAKVLEEMFIHG
ATVALECKESEGGEVVYSREVVSDQSGTYKIPIEGCHAKLCQVRLVKSPKPECSEIVADGLSSARIDLTPSVGSDPELIRYANDLGFMKKESLPECAKVLEEMFIHG
**************VVYSREVVSDQSGTYKIPIEGCHAKLCQVRLVKSPKPECSEIVADGLS*ARIDL*******PELIRYANDLGFMKKESLPECAKVL*******
ATVALECKESEGGEVVYSREVVSDQSGTYKIPIEGCHAKLCQVRLVKSPK***********SSARIDLTPSVGSDPELIRYANDLGFMKKESLPECAKVLEEMFIHG
***********GGEVVYSREVVSDQSGTYKIPIEGCHAKLCQVRLVKSPKPECSEIVADGLSSARIDLTPSVGSDPELIRYANDLGFMKKESLPECAKVLEEMFIHG
ATVALECKESEGGEVVYSREVVSDQSGTYKIPIEGCHAKLCQVRLVKSPKPECSEIVADGLSSARIDLTPSVGSDPELIRYANDLGFMKKESLPECAKVLEEMFIHG
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
ATVALECKESEGGEVVYSREVVSDQSGTYKIPIEGCHAKLCQVRLVKSPKPECSEIVADGLSSARIDLTPSVGSDPELIRYANDLGFMKKESLPECAKVLEEMFIHG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query107 2.2.26 [Sep-21-2011]
O49813166 Olee1-like protein OS=Bet N/A no 0.971 0.626 0.490 5e-21
Q8LGR0168 Pollen allergen Che a 1 O N/A no 0.981 0.625 0.431 2e-16
Q7M1X5134 Major pollen allergen Lol N/A no 0.934 0.746 0.421 3e-14
Q8H6L7143 Pollen allergen Phl p 11 N/A no 0.934 0.699 0.421 3e-13
P13447161 Anther-specific protein L N/A no 0.943 0.627 0.394 2e-12
P19963145 Major pollen allergen Ole N/A no 0.953 0.703 0.311 6e-10
P33050170 Pollen-specific protein C N/A no 0.943 0.594 0.394 1e-09
O82015145 Major pollen allergen Lig N/A no 0.953 0.703 0.321 6e-09
P82242131 Major pollen allergen Pla N/A no 0.869 0.709 0.363 9e-09
>sp|O49813|OLEE1_BETPN Olee1-like protein OS=Betula pendula PE=2 SV=1 Back     alignment and function desciption
 Score = 99.4 bits (246), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 76/110 (69%), Gaps = 6/110 (5%)

Query: 1   ATVALECKESEGGEVVYSREVVSDQSGTYKIPIEGCH-AKLCQVRLVKSPKPECSEIVAD 59
           A V+LEC+  EGG ++YS +  +D+SGTY+IP++G H  ++C++ L KS  P+CSE+  D
Sbjct: 53  AKVSLECRNREGGTLIYSSDSETDKSGTYRIPVDGDHEEEICEIALKKSSDPDCSEVSKD 112

Query: 60  GL--SSARIDLTPSVG-SDPELIRYANDLGFMKKESLPECAKVLEEMFIH 106
                SARI LT + G S P  +R AN LGFMKK+ LPECAK L E+ ++
Sbjct: 113 PFLKKSARISLTKNNGISTP--VRLANPLGFMKKKPLPECAKALRELGMN 160





Betula pendula (taxid: 3505)
>sp|Q8LGR0|CHE1_CHEAL Pollen allergen Che a 1 OS=Chenopodium album PE=1 SV=1 Back     alignment and function description
>sp|Q7M1X5|LOLB_LOLPR Major pollen allergen Lol p 11 OS=Lolium perenne PE=1 SV=1 Back     alignment and function description
>sp|Q8H6L7|PHLB_PHLPR Pollen allergen Phl p 11 OS=Phleum pratense PE=1 SV=1 Back     alignment and function description
>sp|P13447|LAT52_SOLLC Anther-specific protein LAT52 OS=Solanum lycopersicum GN=LAT52 PE=2 SV=1 Back     alignment and function description
>sp|P19963|ALL1_OLEEU Major pollen allergen Ole e 1 OS=Olea europaea PE=1 SV=2 Back     alignment and function description
>sp|P33050|PSC13_MAIZE Pollen-specific protein C13 OS=Zea mays GN=MGS1 PE=2 SV=1 Back     alignment and function description
>sp|O82015|LIGV1_LIGVU Major pollen allergen Lig v 1 OS=Ligustrum vulgare PE=1 SV=2 Back     alignment and function description
>sp|P82242|PLAL1_PLALA Major pollen allergen Pla l 1 OS=Plantago lanceolata PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query107
28380096166 RecName: Full=Olee1-like protein; Flags: 0.971 0.626 0.490 2e-19
224114822164 predicted protein [Populus trichocarpa] 0.943 0.615 0.509 5e-18
388519607168 unknown [Lotus japonicus] 0.962 0.613 0.495 9e-18
449451773166 PREDICTED: olee1-like protein-like [Cucu 0.943 0.608 0.5 2e-17
255564768166 Anther-specific protein LAT52 precursor, 0.925 0.596 0.470 1e-16
224157197109 predicted protein [Populus trichocarpa] 0.943 0.926 0.471 4e-16
224121526161 predicted protein [Populus trichocarpa] 0.943 0.627 0.471 5e-16
449452765165 PREDICTED: olee1-like protein-like [Cucu 0.943 0.612 0.462 2e-15
62720370168 pollen allergen Cro s 1 [Crocus sativus] 0.981 0.625 0.431 6e-15
47605504168 RecName: Full=Pollen allergen Che a 1; A 0.981 0.625 0.431 7e-15
>gi|28380096|sp|O49813.1|OLEE1_BETVE RecName: Full=Olee1-like protein; Flags: Precursor gi|2765366|emb|CAA74365.1| putative Ole e 1 protein [Betula pendula] Back     alignment and taxonomy information
 Score = 99.4 bits (246), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 76/110 (69%), Gaps = 6/110 (5%)

Query: 1   ATVALECKESEGGEVVYSREVVSDQSGTYKIPIEGCH-AKLCQVRLVKSPKPECSEIVAD 59
           A V+LEC+  EGG ++YS +  +D+SGTY+IP++G H  ++C++ L KS  P+CSE+  D
Sbjct: 53  AKVSLECRNREGGTLIYSSDSETDKSGTYRIPVDGDHEEEICEIALKKSSDPDCSEVSKD 112

Query: 60  GL--SSARIDLTPSVG-SDPELIRYANDLGFMKKESLPECAKVLEEMFIH 106
                SARI LT + G S P  +R AN LGFMKK+ LPECAK L E+ ++
Sbjct: 113 PFLKKSARISLTKNNGISTP--VRLANPLGFMKKKPLPECAKALRELGMN 160




Source: Betula pendula

Species: Betula pendula

Genus: Betula

Family: Betulaceae

Order: Fagales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224114822|ref|XP_002316866.1| predicted protein [Populus trichocarpa] gi|222859931|gb|EEE97478.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|388519607|gb|AFK47865.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|449451773|ref|XP_004143635.1| PREDICTED: olee1-like protein-like [Cucumis sativus] gi|449523377|ref|XP_004168700.1| PREDICTED: olee1-like protein-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|255564768|ref|XP_002523378.1| Anther-specific protein LAT52 precursor, putative [Ricinus communis] gi|223537328|gb|EEF38957.1| Anther-specific protein LAT52 precursor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|224157197|ref|XP_002337814.1| predicted protein [Populus trichocarpa] gi|222869851|gb|EEF06982.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224121526|ref|XP_002330722.1| predicted protein [Populus trichocarpa] gi|222872498|gb|EEF09629.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449452765|ref|XP_004144129.1| PREDICTED: olee1-like protein-like [Cucumis sativus] gi|449529385|ref|XP_004171680.1| PREDICTED: olee1-like protein-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|62720370|gb|AAX93750.1| pollen allergen Cro s 1 [Crocus sativus] Back     alignment and taxonomy information
>gi|47605504|sp|Q8LGR0.1|CHE1_CHEAL RecName: Full=Pollen allergen Che a 1; AltName: Allergen=Che a 1; Flags: Precursor gi|22074346|gb|AAL07319.1| Che a 1 allergen precursor [Chenopodium album] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query107
TAIR|locus:2030022171 AT1G29140 "AT1G29140" [Arabido 0.981 0.614 0.379 2.3e-14
TAIR|locus:2152440174 AT5G45880 "AT5G45880" [Arabido 0.981 0.603 0.333 9.1e-13
TAIR|locus:505006434159 SAH7 "AT4G08685" [Arabidopsis 0.906 0.610 0.365 1.9e-12
TAIR|locus:505006496172 AT4G18596 "AT4G18596" [Arabido 0.981 0.610 0.333 3.9e-12
TAIR|locus:2029456171 AT1G78040 "AT1G78040" [Arabido 0.962 0.602 0.358 7.4e-11
TAIR|locus:2184103164 AT5G10130 "AT5G10130" [Arabido 0.971 0.634 0.321 2.6e-06
TAIR|locus:2030022 AT1G29140 "AT1G29140" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 184 (69.8 bits), Expect = 2.3e-14, P = 2.3e-14
 Identities = 41/108 (37%), Positives = 63/108 (58%)

Query:     1 ATVALECKESEGGEVVYSREVVSDQSGTYKIPIEGCHAK-LCQVRLVKSPKPECSEIVAD 59
             A V LECK  E   V  ++E V+D +G Y++ + G H + +C++ L++SP PEC ++   
Sbjct:    58 AKVKLECKGRENQTVTLTKEAVTDNAGNYQMEVMGDHEEEVCEIVLLQSPDPECGDVNNQ 117

Query:    60 GL--SSARIDLTPSVGSDPELIRYANDLGFMKKESLPECAKVLEEMFI 105
                 ++ARI LT + G      R  N LGFM+K  L EC +V +E+ I
Sbjct:   118 EFLRNAARISLTANDGIVSNETRTINPLGFMRKTPLAECPQVFKELGI 165




GO:0003674 "molecular_function" evidence=ND
GO:0005576 "extracellular region" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0048046 "apoplast" evidence=IDA
GO:0009827 "plant-type cell wall modification" evidence=RCA
GO:0009860 "pollen tube growth" evidence=RCA
TAIR|locus:2152440 AT5G45880 "AT5G45880" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:505006434 SAH7 "AT4G08685" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:505006496 AT4G18596 "AT4G18596" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2029456 AT1G78040 "AT1G78040" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2184103 AT5G10130 "AT5G10130" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query107
pfam0119095 pfam01190, Pollen_Ole_e_I, Pollen proteins Ole e I 1e-12
>gnl|CDD|216353 pfam01190, Pollen_Ole_e_I, Pollen proteins Ole e I like Back     alignment and domain information
 Score = 58.2 bits (141), Expect = 1e-12
 Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 7/72 (9%)

Query: 1  ATVALECKESEGGEVVYSREVVSDQSGTYKIPIEG------CHAKLCQVRLVKSPKPECS 54
          A V +ECK    G VV S E V+D+ G +K+ + G           C+ +LV SP   CS
Sbjct: 25 AKVKIECK-DGDGRVVTSAEAVTDEKGYFKVELPGDPSSLTHLESACRAKLVSSPDSACS 83

Query: 55 EIVADGLSSARI 66
                 + A +
Sbjct: 84 APTNGNGTGAPL 95


Length = 95

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 107
PF0119097 Pollen_Ole_e_I: Pollen proteins Ole e I like; Inte 99.7
PF09103118 BRCA-2_OB1: BRCA2, oligonucleotide/oligosaccharide 81.9
>PF01190 Pollen_Ole_e_I: Pollen proteins Ole e I like; InterPro: IPR006041 Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms Back     alignment and domain information
Probab=99.70  E-value=9.2e-17  Score=108.39  Aligned_cols=57  Identities=39%  Similarity=0.753  Sum_probs=53.1

Q ss_pred             CeEEEEeecCCCCceEEEEEEEcCCCeeEEEEecCC------Cc-ceeEEEEeeCCCccCccccc
Q 045131            1 ATVALECKESEGGEVVYSREVVSDQSGTYKIPIEGC------HA-KLCQVRLVKSPKPECSEIVA   58 (107)
Q Consensus         1 A~V~leCk~~~~~~v~~~~e~~TD~~G~y~i~v~~d------h~-~~C~V~LvsSP~~~C~~~~~   58 (107)
                      |+|+|+|++. .+.+.++.+++||++|+|+|+++++      |. +.|.|+|++||++.|+.+++
T Consensus        24 A~V~v~C~~~-~~~~~~~~~~~Td~~G~F~i~l~~~~~~~~~~~~~~C~v~l~~sp~~~C~~~~~   87 (97)
T PF01190_consen   24 AKVSVECKDG-NGGVVFSAEAKTDENGYFSIELPSDPGSSSPHLSSSCRVKLVSSPDPSCNVPTN   87 (97)
T ss_pred             CEEEEECCCC-CCCcEEEEEEEeCCCCEEEEEecCccccccCCCCCCcEEEEeCCCcCcCCCCcC
Confidence            7999999995 5559999999999999999999986      67 99999999999999999998



A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation. The allergens in this family include allergens with the following designations: Ole e 1. A number of plant pollen proteins, whose biological function is not yet known, are structurally related []. These proteins are most probably secreted and consist of about 145 residues. There are six cysteines which are conserved in the sequence of these proteins. They seem to be involved in disulphide bonds.

>PF09103 BRCA-2_OB1: BRCA2, oligonucleotide/oligosaccharide-binding, domain 1; InterPro: IPR015187 This domain assumes an OB fold, which consists of a highly curved five-stranded beta-sheet that closes on itself to form a beta-barrel Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query107
d1iyjb3133 OB-fold domains of BRCA2 {Rat (Rattus norvegicus) 91.14
>d1iyjb3 b.40.4.3 (B:2599-2731) OB-fold domains of BRCA2 {Rat (Rattus norvegicus) [TaxId: 10116]} Back     information, alignment and structure
class: All beta proteins
fold: OB-fold
superfamily: Nucleic acid-binding proteins
family: Single strand DNA-binding domain, SSB
domain: OB-fold domains of BRCA2
species: Rat (Rattus norvegicus) [TaxId: 10116]
Probab=91.14  E-value=0.042  Score=37.20  Aligned_cols=63  Identities=24%  Similarity=0.392  Sum_probs=44.1

Q ss_pred             CeeEEEEecCC---------------Cc-ceeEEEEeeCCCccCcccccCCCceeeEEeecCCCCCCCceeeecCCCccc
Q 045131           26 SGTYKIPIEGC---------------HA-KLCQVRLVKSPKPECSEIVADGLSSARIDLTPSVGSDPELIRYANDLGFMK   89 (107)
Q Consensus        26 ~G~y~i~v~~d---------------h~-~~C~V~LvsSP~~~C~~~~~~~r~~a~V~Lt~nnGi~s~~~R~aN~lgF~~   89 (107)
                      +|||.|...=|               +. -+|-+.|+.+++ .|+..+.  .++..+.|+ -|++.  ..|+.-.|||.+
T Consensus        55 DGWYsi~a~lD~~L~~~l~~grl~vGqKL~i~GA~L~g~~~-~~sPLE~--~~s~~L~i~-~Nstr--~a~w~~kLGf~~  128 (133)
T d1iyjb3          55 DGWYAVKAQLDPPLLALVKSGRLTVGQKIITQGAELVGSPD-ACAPLEA--PDSLRLKIS-ANSTR--PARWHSKLGFFH  128 (133)
T ss_dssp             CSSCEEEECCCHHHHHHHHHTCSCTTCEEEESCCEEESCCS-CCCGGGC--CSSCEEECC-TTTEE--ECCSSSCSEECS
T ss_pred             cCCEEEEEEcCHHHHHHHHcCCccccceEEEECcEeecCCC-CcCcccc--cCccEEEEe-eccee--eccccccCcccC
Confidence            59999997644               23 789999999864 4887765  334444444 56653  367778999999


Q ss_pred             cCCchh
Q 045131           90 KESLPE   95 (107)
Q Consensus        90 ~~pl~~   95 (107)
                      + |+|+
T Consensus       129 ~-p~pf  133 (133)
T d1iyjb3         129 D-PRPF  133 (133)
T ss_dssp             C-CSCC
T ss_pred             C-CCCC
Confidence            6 7763