Citrus Sinensis ID: 037924


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------16
MARLLLLFALVALLLVELVNAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTAYPGCALILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKKYLLQNEIKFI
cHHHHHHHHHHHHHHHHHHcccccEEEEEEEEcccccccccccccEEccccEEEEEEEcccccEEEEEEEEEcccccEEEEEEccccccccEEEEEcccccccccccccccEEEEEccccccccEEEcccccccccccccccHHHHHHHccccccccc
cHHHHHHHHHHHHHHHHHccccccEEEEEEEEEcccccccccccccEccccEEEEEEEcccccEEEEEEEEEEccccEEEEEEcccccccEEEEEEEEcccccccccccccEEEEEcccccccccEEccccccccccccHHHHHHHHHHccccccccc
MARLLLLFALVALLLVELVnagdpfhirgrvycdtcrcgfetsaTTYIQGARVRIeckdrnslnlkysvdgdtdstgtynihvdgdhqdQICYVKLIsssladcktaypgcaliltrsngvvsnlHFANalgflknrplafYPELLKKYLLQNEIKFI
MARLLLLFALVALLLVELVNAGDPFHIRGRVYCDTCRCGFETSattyiqgarvrieckdrnslNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTAYPGCALILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKKYLlqneikfi
MARllllfalvalllvelvnaGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTAYPGCALILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKKYLLQNEIKFI
***LLLLFALVALLLVELVNAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTAYPGCALILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKKYLLQNEI***
MARLLLLFALVALLLVELVNAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKL**********************NGVVSNLHFANALGFLKNRPLAFYPELLKKYLLQN*****
MARLLLLFALVALLLVELVNAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTAYPGCALILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKKYLLQNEIKFI
MARLLLLFALVALLLVELVNAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTAYPGCALILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKKYLLQN*****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MARLLLLFALVALLLVELVNAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTAYPGCALILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKKYLLQNEIKFI
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query158 2.2.26 [Sep-21-2011]
O49813166 Olee1-like protein OS=Bet N/A no 0.911 0.867 0.393 1e-25
P33050170 Pollen-specific protein C N/A no 0.810 0.752 0.437 2e-24
Q8LGR0168 Pollen allergen Che a 1 O N/A no 0.930 0.875 0.376 7e-24
P13447161 Anther-specific protein L N/A no 0.924 0.906 0.335 5e-19
O82015145 Major pollen allergen Lig N/A no 0.765 0.834 0.369 3e-18
P19963145 Major pollen allergen Ole N/A no 0.715 0.779 0.377 6e-18
Q8H6L7143 Pollen allergen Phl p 11 N/A no 0.848 0.937 0.347 1e-17
Q7M1X5134 Major pollen allergen Lol N/A no 0.822 0.970 0.328 1e-15
P82242131 Major pollen allergen Pla N/A no 0.487 0.587 0.395 4e-09
>sp|O49813|OLEE1_BETPN Olee1-like protein OS=Betula pendula PE=2 SV=1 Back     alignment and function desciption
 Score =  115 bits (288), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 90/155 (58%), Gaps = 11/155 (7%)

Query: 1   MARLLLL--FALVALLLVELVNAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECK 58
           MA+ +++   AL  L L+    +   F + G+VYCD CR  F T  +TY++GA+V +EC+
Sbjct: 1   MAKSIIIQAPALCFLSLLGFAYSESRFFVEGKVYCDNCRTQFVTKLSTYMKGAKVSLECR 60

Query: 59  DRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTA------YPGCA 112
           +R    L YS D +TD +GTY I VDGDH+++IC + L  SS  DC              
Sbjct: 61  NREGGTLIYSSDSETDKSGTYRIPVDGDHEEEICEIALKKSSDPDCSEVSKDPFLKKSAR 120

Query: 113 LILTRSNGVVSNLHFANALGFLKNRPLAFYPELLK 147
           + LT++NG+ + +  AN LGF+K +PL   PE  K
Sbjct: 121 ISLTKNNGISTPVRLANPLGFMKKKPL---PECAK 152





Betula pendula (taxid: 3505)
>sp|P33050|PSC13_MAIZE Pollen-specific protein C13 OS=Zea mays GN=MGS1 PE=2 SV=1 Back     alignment and function description
>sp|Q8LGR0|CHE1_CHEAL Pollen allergen Che a 1 OS=Chenopodium album PE=1 SV=1 Back     alignment and function description
>sp|P13447|LAT52_SOLLC Anther-specific protein LAT52 OS=Solanum lycopersicum GN=LAT52 PE=2 SV=1 Back     alignment and function description
>sp|O82015|LIGV1_LIGVU Major pollen allergen Lig v 1 OS=Ligustrum vulgare PE=1 SV=2 Back     alignment and function description
>sp|P19963|ALL1_OLEEU Major pollen allergen Ole e 1 OS=Olea europaea PE=1 SV=2 Back     alignment and function description
>sp|Q8H6L7|PHLB_PHLPR Pollen allergen Phl p 11 OS=Phleum pratense PE=1 SV=1 Back     alignment and function description
>sp|Q7M1X5|LOLB_LOLPR Major pollen allergen Lol p 11 OS=Lolium perenne PE=1 SV=1 Back     alignment and function description
>sp|P82242|PLAL1_PLALA Major pollen allergen Pla l 1 OS=Plantago lanceolata PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query158
255584845161 Pollen-specific protein C13 precursor, p 0.974 0.956 0.596 1e-47
224083735157 predicted protein [Populus trichocarpa] 0.974 0.980 0.585 9e-45
224096219158 predicted protein [Populus trichocarpa] 0.968 0.968 0.584 1e-44
86156028159 pollen-specific protein [Vitis pseudoret 0.930 0.924 0.564 2e-41
359492834159 PREDICTED: pollen-specific protein C13-l 0.930 0.924 0.557 1e-40
255545650159 Pollen-specific protein C13 precursor, p 0.943 0.937 0.532 1e-38
225469438159 PREDICTED: pollen-specific protein C13 i 0.943 0.937 0.564 3e-38
6561156159 allergen-like protein BRSn20 [Sambucus n 0.974 0.968 0.525 3e-36
449465966159 PREDICTED: pollen-specific protein C13-l 0.943 0.937 0.467 6e-34
357463405160 Pollen-specific protein C13 [Medicago tr 0.981 0.968 0.481 3e-33
>gi|255584845|ref|XP_002533139.1| Pollen-specific protein C13 precursor, putative [Ricinus communis] gi|223527067|gb|EEF29251.1| Pollen-specific protein C13 precursor, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  194 bits (492), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 96/161 (59%), Positives = 121/161 (75%), Gaps = 7/161 (4%)

Query: 1   MAR-LLLLFALVALLLVELVNA---GDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIE 56
           MAR L+L FAL   +L  LV+A   G PFHI+GRVYCDTCRCGFET  TTYI GARVRIE
Sbjct: 1   MARQLVLFFALSLCVLPSLVSARLVGRPFHIKGRVYCDTCRCGFETKKTTYIPGARVRIE 60

Query: 57  CKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTAYPG---CAL 113
           C+DR +L LKYSV+G+TD TGTYNI V+ DHQDQICY  L+SS    C TA PG     +
Sbjct: 61  CEDRTTLRLKYSVEGETDETGTYNILVEDDHQDQICYAALVSSPWPSCNTADPGRSRAHV 120

Query: 114 ILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKKYLLQNE 154
           ++TR+NG +S+LHFAN++GFL+N+ ++   EL+K+ L  +E
Sbjct: 121 VVTRNNGAISDLHFANSMGFLRNQAMSGCTELVKQLLQSDE 161




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224083735|ref|XP_002307104.1| predicted protein [Populus trichocarpa] gi|222856553|gb|EEE94100.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224096219|ref|XP_002310579.1| predicted protein [Populus trichocarpa] gi|222853482|gb|EEE91029.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|86156028|gb|ABC86745.1| pollen-specific protein [Vitis pseudoreticulata] Back     alignment and taxonomy information
>gi|359492834|ref|XP_003634471.1| PREDICTED: pollen-specific protein C13-like [Vitis vinifera] gi|147801295|emb|CAN61642.1| hypothetical protein VITISV_029644 [Vitis vinifera] gi|302141929|emb|CBI19132.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|255545650|ref|XP_002513885.1| Pollen-specific protein C13 precursor, putative [Ricinus communis] gi|223546971|gb|EEF48468.1| Pollen-specific protein C13 precursor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|225469438|ref|XP_002268073.1| PREDICTED: pollen-specific protein C13 isoform 1 [Vitis vinifera] gi|359496552|ref|XP_003635264.1| PREDICTED: pollen-specific protein C13 isoform 2 [Vitis vinifera] gi|296086870|emb|CBI33037.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|6561156|gb|AAF16869.1|AF109693_1 allergen-like protein BRSn20 [Sambucus nigra] Back     alignment and taxonomy information
>gi|449465966|ref|XP_004150698.1| PREDICTED: pollen-specific protein C13-like [Cucumis sativus] gi|449508472|ref|XP_004163322.1| PREDICTED: pollen-specific protein C13-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|357463405|ref|XP_003601984.1| Pollen-specific protein C13 [Medicago truncatula] gi|355491032|gb|AES72235.1| Pollen-specific protein C13 [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query158
TAIR|locus:2184103164 AT5G10130 "AT5G10130" [Arabido 0.867 0.835 0.5 2.3e-30
TAIR|locus:505006434159 SAH7 "AT4G08685" [Arabidopsis 0.803 0.798 0.484 4.4e-29
TAIR|locus:2030022171 AT1G29140 "AT1G29140" [Arabido 0.854 0.789 0.359 1.9e-21
TAIR|locus:2152440174 AT5G45880 "AT5G45880" [Arabido 0.854 0.775 0.359 3.5e-20
TAIR|locus:505006496172 AT4G18596 "AT4G18596" [Arabido 0.854 0.784 0.359 5.7e-20
TAIR|locus:2029456171 AT1G78040 "AT1G78040" [Arabido 0.810 0.748 0.356 3.6e-18
TAIR|locus:2143241 401 AT5G15780 "AT5G15780" [Arabido 0.683 0.269 0.364 1.8e-07
TAIR|locus:504955667171 AT3G09925 "AT3G09925" [Arabido 0.531 0.491 0.311 7e-06
TAIR|locus:2184103 AT5G10130 "AT5G10130" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 335 (123.0 bits), Expect = 2.3e-30, P = 2.3e-30
 Identities = 70/140 (50%), Positives = 86/140 (61%)

Query:    22 GDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNI 81
             G PFHI G VYCDTCR GFET AT YI+GARVRI CKDR +L  +      T   G Y +
Sbjct:    25 GTPFHIEGSVYCDTCRFGFETIATQYIRGARVRIVCKDRVTLKSELVGVAVTGPDGKYKV 84

Query:    82 HVDGDHQDQICYVKLISSSLADCKTAYPG---CALILTRSNGVVSNLHFANALGFLKNRP 138
              V GD QDQ C  +L+ S L+ C+ A PG     +ILTRSNG  S  H+ANA+GF ++ P
Sbjct:    85 AVRGDRQDQQCLAELVHSPLSRCQEADPGRSTATVILTRSNGAASTRHYANAMGFFRDEP 144

Query:   139 LAFYPELLKKYLLQNEIKFI 158
             L     L K+YL   + + I
Sbjct:   145 LRGCAALRKRYLADGDNRAI 164




GO:0003674 "molecular_function" evidence=ND
GO:0005576 "extracellular region" evidence=ISM
GO:0005615 "extracellular space" evidence=IEA
GO:0008150 "biological_process" evidence=ND
TAIR|locus:505006434 SAH7 "AT4G08685" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2030022 AT1G29140 "AT1G29140" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2152440 AT5G45880 "AT5G45880" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:505006496 AT4G18596 "AT4G18596" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2029456 AT1G78040 "AT1G78040" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2143241 AT5G15780 "AT5G15780" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504955667 AT3G09925 "AT3G09925" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00070565
hypothetical protein (158 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query158
pfam0119095 pfam01190, Pollen_Ole_e_I, Pollen proteins Ole e I 2e-21
>gnl|CDD|216353 pfam01190, Pollen_Ole_e_I, Pollen proteins Ole e I like Back     alignment and domain information
 Score = 82.5 bits (204), Expect = 2e-21
 Identities = 29/84 (34%), Positives = 42/84 (50%), Gaps = 6/84 (7%)

Query: 27  IRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVDGD 86
           + G VYCDTCR      +   + GA+V+IECKD +      S +  TD  G + + + GD
Sbjct: 1   VEGSVYCDTCRASGFELSAYPLPGAKVKIECKDGDGRV-VTSAEAVTDEKGYFKVELPGD 59

Query: 87  -----HQDQICYVKLISSSLADCK 105
                H +  C  KL+SS  + C 
Sbjct: 60  PSSLTHLESACRAKLVSSPDSACS 83


Length = 95

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 158
PF0119097 Pollen_Ole_e_I: Pollen proteins Ole e I like; Inte 99.94
PF1362082 CarboxypepD_reg: Carboxypeptidase regulatory-like 96.42
PF1371588 DUF4480: Domain of unknown function (DUF4480) 95.85
PF0106080 DUF290: Transthyretin-like family; InterPro: IPR00 95.37
PF1197497 MG1: Alpha-2-macroglobulin MG1 domain; InterPro: I 93.91
PF10670215 DUF4198: Domain of unknown function (DUF4198) 93.14
PRK15036137 hydroxyisourate hydrolase; Provisional 92.31
PF0721085 DUF1416: Protein of unknown function (DUF1416); In 92.31
PF08400134 phage_tail_N: Prophage tail fibre N-terminal; Inte 91.09
PF0573870 Cna_B: Cna protein B-type domain; InterPro: IPR008 89.93
cd03858374 M14_CP_N-E_like Carboxypeptidase (CP) N/E-like sub 87.95
COG5266264 CbiK ABC-type Co2+ transport system, periplasmic c 85.26
cd03863375 M14_CPD_II The second carboxypeptidase (CP)-like d 85.0
cd03866376 M14_CPM Peptidase M14 Carboxypeptidase (CP) M (CPM 84.99
KOG1948 1165 consensus Metalloproteinase-related collagenase pM 83.74
PF0819436 DIM: DIM protein; InterPro: IPR013172 Drosophila i 82.79
cd03868372 M14_CPD_I The first carboxypeptidase (CP)-like dom 82.42
cd03464220 3,4-PCD_beta Protocatechuate 3,4-dioxygenase (3,4- 82.35
cd03463185 3,4-PCD_alpha Protocatechuate 3,4-dioxygenase (3,4 81.93
TIGR02962112 hdxy_isourate hydroxyisourate hydrolase. Members o 81.46
TIGR02438281 catachol_actin catechol 1,2-dioxygenase, Actinobac 81.42
COG3485226 PcaH Protocatechuate 3,4-dioxygenase beta subunit 81.31
cd00421146 intradiol_dioxygenase Intradiol dioxygenases catal 80.4
>PF01190 Pollen_Ole_e_I: Pollen proteins Ole e I like; InterPro: IPR006041 Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms Back     alignment and domain information
Probab=99.94  E-value=9.7e-27  Score=166.62  Aligned_cols=86  Identities=36%  Similarity=0.719  Sum_probs=77.7

Q ss_pred             EEEEEEcccCCCCCcCCCCccCCCCEEEEEecCCCCceeEEEEEeEcCCCceEEEEecCC------CCCcceEEEEccCC
Q 037924           27 IRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVDGD------HQDQICYVKLISSS  100 (158)
Q Consensus        27 V~G~VyCd~C~~~~~t~~s~pi~GA~V~v~C~~~~~~~v~~~~~~~TD~~G~F~i~l~~~------~~~~~C~v~LvsSP  100 (158)
                      |+|+||||+|+.++. ..++||+||+|+|+|+++ ++.+.++.+++||++|+|+|++|+.      +..+.|+|+|++||
T Consensus         1 V~G~V~C~~C~~~~~-~~~~~l~GA~V~v~C~~~-~~~~~~~~~~~Td~~G~F~i~l~~~~~~~~~~~~~~C~v~l~~sp   78 (97)
T PF01190_consen    1 VEGVVYCDDCSSGFS-RAAKPLPGAKVSVECKDG-NGGVVFSAEAKTDENGYFSIELPSDPGSSSPHLSSSCRVKLVSSP   78 (97)
T ss_pred             CEEEEEeCCCCCCcc-ccCccCCCCEEEEECCCC-CCCcEEEEEEEeCCCCEEEEEecCccccccCCCCCCcEEEEeCCC
Confidence            799999999999665 889999999999999998 4557889999999999999999874      67899999999999


Q ss_pred             ccCCCCCCCCceeeeeecCCc
Q 037924          101 LADCKTAYPGCALILTRSNGV  121 (158)
Q Consensus       101 ~~~C~~~~~~~~i~L~~~nGi  121 (158)
                      ++.|++++       +.++|+
T Consensus        79 ~~~C~~~~-------~~~~G~   92 (97)
T PF01190_consen   79 DPSCNVPT-------NSNGGR   92 (97)
T ss_pred             cCcCCCCc-------CCCCCc
Confidence            99999999       556776



A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation. The allergens in this family include allergens with the following designations: Ole e 1. A number of plant pollen proteins, whose biological function is not yet known, are structurally related []. These proteins are most probably secreted and consist of about 145 residues. There are six cysteines which are conserved in the sequence of these proteins. They seem to be involved in disulphide bonds.

>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A Back     alignment and domain information
>PF13715 DUF4480: Domain of unknown function (DUF4480) Back     alignment and domain information
>PF01060 DUF290: Transthyretin-like family; InterPro: IPR001534 This new apparently nematode-specific protein family has been called family 2 [] Back     alignment and domain information
>PF11974 MG1: Alpha-2-macroglobulin MG1 domain; InterPro: IPR021868 This is the N-terminal MG1 domain from alpha-2-macroglobulin [] Back     alignment and domain information
>PF10670 DUF4198: Domain of unknown function (DUF4198) Back     alignment and domain information
>PRK15036 hydroxyisourate hydrolase; Provisional Back     alignment and domain information
>PF07210 DUF1416: Protein of unknown function (DUF1416); InterPro: IPR010814 This family consists of several hypothetical bacterial proteins of around 100 residues in length Back     alignment and domain information
>PF08400 phage_tail_N: Prophage tail fibre N-terminal; InterPro: IPR013609 This entry represents the N terminus of phage 933W tail fibre protein Back     alignment and domain information
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains Back     alignment and domain information
>cd03858 M14_CP_N-E_like Carboxypeptidase (CP) N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs) Back     alignment and domain information
>COG5266 CbiK ABC-type Co2+ transport system, periplasmic component [Inorganic ion transport and metabolism] Back     alignment and domain information
>cd03863 M14_CPD_II The second carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3 Back     alignment and domain information
>cd03866 M14_CPM Peptidase M14 Carboxypeptidase (CP) M (CPM) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs) Back     alignment and domain information
>KOG1948 consensus Metalloproteinase-related collagenase pM5 [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information
>PF08194 DIM: DIM protein; InterPro: IPR013172 Drosophila immune-induced molecules (DIMs) are short proteins induced during the immune response of Drosophila [] Back     alignment and domain information
>cd03868 M14_CPD_I The first carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3 Back     alignment and domain information
>cd03464 3,4-PCD_beta Protocatechuate 3,4-dioxygenase (3,4-PCD) , beta subunit Back     alignment and domain information
>cd03463 3,4-PCD_alpha Protocatechuate 3,4-dioxygenase (3,4-PCD) , alpha subunit Back     alignment and domain information
>TIGR02962 hdxy_isourate hydroxyisourate hydrolase Back     alignment and domain information
>TIGR02438 catachol_actin catechol 1,2-dioxygenase, Actinobacterial Back     alignment and domain information
>COG3485 PcaH Protocatechuate 3,4-dioxygenase beta subunit [Secondary metabolites biosynthesis, transport, and catabolism] Back     alignment and domain information
>cd00421 intradiol_dioxygenase Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query158
3uaf_A117 TTR-52; beta barrel/sandwich, cell engulfment, sec 94.44
3irp_X429 URO-adherence factor A; DEV-IGG fold, cell WALL, h 93.81
3e8v_A82 Possible transglutaminase-family protein; structur 93.32
3kcp_A321 Cellulosomal-scaffolding protein A; dockerin, X-mo 90.43
1uwy_A426 Carboxypeptidase M; metallopeptidase, GPI-anchor, 84.73
3mn8_A435 LP15968P; catalytic domain of alpha/beta-hydrolase 84.36
2boy_A254 3-chlorocatechol 1,2-dioxygenase; beta barrel, oxi 81.23
1h8l_A380 Carboxypeptidase GP180 residues 503-882; hydrolase 80.69
>3uaf_A TTR-52; beta barrel/sandwich, cell engulfment, secreted, protein BIN; 2.01A {Caenorhabditis elegans} Back     alignment and structure
Probab=94.44  E-value=0.022  Score=41.31  Aligned_cols=52  Identities=10%  Similarity=0.034  Sum_probs=33.0

Q ss_pred             CCEEEEEEEEcccCCCCCcCCCCccCC--CCEEEEEecCCCC-ceeEEEEEeEcCCCceEEEEec
Q 037924           23 DPFHIRGRVYCDTCRCGFETSATTYIQ--GARVRIECKDRNS-LNLKYSVDGDTDSTGTYNIHVD   84 (158)
Q Consensus        23 ~~~~V~G~VyCd~C~~~~~t~~s~pi~--GA~V~v~C~~~~~-~~v~~~~~~~TD~~G~F~i~l~   84 (158)
                      ..+.|.|+..|..          .|-+  +++|+|-=.+... ..-....+..||++|+|+|.-.
T Consensus         3 qsV~V~G~L~C~~----------~Pa~~~~V~V~L~d~~~~~~d~Ddll~~~~Td~~G~F~l~G~   57 (117)
T 3uaf_A            3 SCLMATGVLKCPT----------DPEAVKKVHIDLWDAAAAAAESDDLMGRTWSDRNGNFQVTGC   57 (117)
T ss_dssp             EEEEEEEEEECTT----------CGGGGSSCEEEEEEHHHHHTTSCCEEEEEECCTTSEEEEEEE
T ss_pred             EEEEEEEEEEeCC----------ccCCCCCEEEEEEecccccCCchHhheeeEECCCCEEEEEEE
Confidence            3589999999942          4644  5666554211100 0112457889999999999854



>3irp_X URO-adherence factor A; DEV-IGG fold, cell WALL, hemagglutinin, peptidoglycan-anchor secreted, virulence, cell adhesion; 1.50A {Staphylococcus saprophyticus subsp} PDB: 3irz_A 3is0_X 3is1_X Back     alignment and structure
>3e8v_A Possible transglutaminase-family protein; structural genomics, unknown function, PSI-2, protein structure initiative; 2.40A {Bacteroides fragilis nctc 9343} Back     alignment and structure
>3kcp_A Cellulosomal-scaffolding protein A; dockerin, X-module, carbohydrate metabolism, cell WALL biogenesis/degradation; 1.94A {Clostridium thermocellum atcc 27405} PDB: 3p0d_C 4fl4_C 2b59_B Back     alignment and structure
>1uwy_A Carboxypeptidase M; metallopeptidase, GPI-anchor, metalloprotease, zinc, lipoprotein, hydrolase, structural proteomics in europe, spine; HET: NAG; 3.0A {Homo sapiens} SCOP: b.3.2.1 c.56.5.1 Back     alignment and structure
>3mn8_A LP15968P; catalytic domain of alpha/beta-hydrolase fold, C-terminal, A transthyretin-like domain, hydrolase; HET: NAG GEM; 2.70A {Drosophila melanogaster} Back     alignment and structure
>2boy_A 3-chlorocatechol 1,2-dioxygenase; beta barrel, oxidoreductase; HET: BHO LPP; 1.9A {Rhodococcus opacus} Back     alignment and structure
>1h8l_A Carboxypeptidase GP180 residues 503-882; hydrolase, zinc-dependent protease; HET: NAG BMA NDG GEM; 2.6A {Lophonetta specularioides} SCOP: b.3.2.1 c.56.5.1 PDB: 1qmu_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query158
d1vlfn179 Transhydroxylase beta subunit, BthL, C-terminal do 96.19
d1h8la179 Carboxypeptidase D C-terminal domain {Crested duck 95.91
d1uwya1107 Carboxypeptidase M C-terminal domain {Human (Homo 92.86
d2b59b296 Cellulosomal scaffolding protein A {Clostridium th 82.36
d3pccm_236 Protocatechuate-3,4-dioxygenase, beta chain {Pseud 82.1
d2burb1238 Protocatechuate-3,4-dioxygenase, beta chain {Acine 81.6
>d1vlfn1 b.3.5.1 (N:196-274) Transhydroxylase beta subunit, BthL, C-terminal domain {Pelobacter acidigallici [TaxId: 35816]} Back     information, alignment and structure
class: All beta proteins
fold: Prealbumin-like
superfamily: Cna protein B-type domain
family: Cna protein B-type domain
domain: Transhydroxylase beta subunit, BthL, C-terminal domain
species: Pelobacter acidigallici [TaxId: 35816]
Probab=96.19  E-value=0.0061  Score=39.44  Aligned_cols=44  Identities=16%  Similarity=0.323  Sum_probs=32.0

Q ss_pred             CEEEEEEEEcccCCCCCcCCCCccCCCCEEEEEecCCCCceeEEEEEeEcCCCceEEEE-ec
Q 037924           24 PFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIH-VD   84 (158)
Q Consensus        24 ~~~V~G~VyCd~C~~~~~t~~s~pi~GA~V~v~C~~~~~~~v~~~~~~~TD~~G~F~i~-l~   84 (158)
                      +++|.|.|.     .      ..||+||.|.|.  +. + ..  ..++.||++|.|+++ ++
T Consensus         2 n~~~~gi~~-----~------G~~v~gA~V~L~--~~-~-~~--v~~t~Td~~G~F~f~~l~   46 (79)
T d1vlfn1           2 NYVTAGILV-----Q------GDCFEGAKVVLK--SG-G-KE--VASAETNFFGEFKFDALD   46 (79)
T ss_dssp             EEEEEEEEE-----T------TEECTTCEEEEE--ET-T-EE--EEEEECCTTSEEEEEEEC
T ss_pred             CcEEeeEEE-----C------CccccccEEEEE--CC-C-Ce--EEeeEECCCCcEEEEecC
Confidence            367888875     1      259999999995  32 2 22  246789999999997 54



>d1h8la1 b.3.2.1 (A:305-383) Carboxypeptidase D C-terminal domain {Crested duck (Lophonetta specularioides) [TaxId: 8836]} Back     information, alignment and structure
>d1uwya1 b.3.2.1 (A:297-403) Carboxypeptidase M C-terminal domain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2b59b2 b.3.2.2 (B:8-103) Cellulosomal scaffolding protein A {Clostridium thermocellum [TaxId: 1515]} Back     information, alignment and structure
>d3pccm_ b.3.6.1 (M:) Protocatechuate-3,4-dioxygenase, beta chain {Pseudomonas putida [TaxId: 303]} Back     information, alignment and structure
>d2burb1 b.3.6.1 (B:303-540) Protocatechuate-3,4-dioxygenase, beta chain {Acinetobacter calcoaceticus, adp1 [TaxId: 471]} Back     information, alignment and structure