Citrus Sinensis ID: 048602


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170------
MLSMALMLSAAFFMGCTNLAVANYGDASAGVIHVGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMCYKSDETDEQGQFYMTVDRIINGKELKAKLCSVRLVSSTHADCSIATNFAGGKTGVKLMRSPTSMFRDIIKYRVGPFYYTTPMCEKPDTNDSQSDEDQGSYY
cHHHHHHHHHHHHHHHHHHHcccccccccccEEEEEEEEcccccccccccccccccccccEEEEEcccccccEEEEEEEEcccccEEEEEEccEEccccccccccEEEEccccccccccccccccccccccEEccccccccccEEEEEEccccccccccccccccccccccccccc
cHHHHHHHHHHHHHHHHHHHHcccccccccEEEEEEEEEEccccccccccccccccccccEEEEEEEccccEEEEEEEEEcccccEEEEEEccEEccccccccEEEEEEEEcccccccccccccccccccccccccccccccEEEEEEccEEEccccccccccccccccccccccc
MLSMALMLSAAFFMGCTNLAVAnygdasagvihVGGQVLCQNCFKSYKewvngsnplkgvTVSLTcmddrsramcyksdetdeqgqfYMTVDRIINGKELKAKLCSVRlvssthadcsiatnfaggktgvklmrsptsmfrdiikyrvgpfyyttpmcekpdtndsqsdedqgsyy
MLSMALMLSAAFFMGCTNLAVANYGDASAGVIHVGGQVLCQNCFKSYKEwvngsnplkgVTVSLTCMDDRSRAMcyksdetdeqgqfyMTVDRIINGKELKAKLCSVRLVSsthadcsiatnfaggktgvklmrsptsmfRDIIKYRVGPFYYttpmcekpdtndsqsdedqgsyy
MLSMALMLSAAFFMGCTNLAVANYGDASAGVIHVGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMCYKSDETDEQGQFYMTVDRIINGKELKAKLCSVRLVSSTHADCSIATNFAGGKTGVKLMRSPTSMFRDIIKYRVGPFYYTTPMCEKPDTNDSQSDEDQGSYY
****ALMLSAAFFMGCTNLAVANYGDASAGVIHVGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMCYKSDETDEQGQFYMTVDRIINGKELKAKLCSVRLVSSTHADCSIATNFAGGKTGVKLMRSPTSMFRDIIKYRVGPFYYTTPMC******************
*LSMALMLSAAFFMGCTNLAVANYGDASAGVIHVGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMCYKSDETDEQGQFYMTVDRIINGKELKAKLCSVRLVSSTHADCSIATNFA****************RDIIKYRVGPFYYTTPM*******************
MLSMALMLSAAFFMGCTNLAVANYGDASAGVIHVGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMCYKSDETDEQGQFYMTVDRIINGKELKAKLCSVRLVSSTHADCSIATNFAGGKTGVKLMRSPTSMFRDIIKYRVGPFYYTTPMCEK****************
MLSMALMLSAAFFMGCTNLAVANYGDASAGVIHVGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMCYKSDETDEQGQFYMTVDRIINGKELKAKLCSVRLVSSTHADCSIATNFAGGKTGVKLMRSPTSMFRDIIKYRVGPFYYTTPMCE*****************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MLSMALMLSAAFFMGCTNLAVANYGDASAGVIHVGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMCYKSDETDEQGQFYMTVDRIINGKELKAKLCSVRLVSSTHADCSIATNFAGGKTGVKLMRSPTSMFRDIIKYRVGPFYYTTPMCEKPDTNDSQSDEDQGSYY
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query176 2.2.26 [Sep-21-2011]
Q9FZ35335 Proline-rich protein 1 OS no no 0.579 0.304 0.310 2e-07
Q9LZJ7313 Proline-rich protein 3 OS no no 0.619 0.348 0.312 6e-06
Q03211426 Pistil-specific extensin- N/A no 0.636 0.262 0.293 0.0002
>sp|Q9FZ35|PRP1_ARATH Proline-rich protein 1 OS=Arabidopsis thaliana GN=PRP1 PE=2 SV=1 Back     alignment and function desciption
 Score = 55.1 bits (131), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 62/129 (48%), Gaps = 27/129 (20%)

Query: 34  VGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMD------DRSRAMCYKSDETDEQGQF 87
           VGG +LC+N +++Y        P++G    + C +       ++  + Y SD TD +G F
Sbjct: 214 VGGIILCKNGYETY--------PIQGAKAKIVCSERGSYEKSKNEVVIY-SDPTDFKGYF 264

Query: 88  YMTVDRIINGKELKAKLCSVRLVSSTHADCSIATNFAGGKTGVKLMRSPTSMFRD--IIK 145
           ++ +  I N        C V+L +S    C   TN   G TGV     P SM+ D  +  
Sbjct: 265 HVVLTHIKNLSN-----CRVKLYTSPVETCKNPTNVNKGLTGV-----PFSMYSDKNLKL 314

Query: 146 YRVGPFYYT 154
           + VGPFY+T
Sbjct: 315 FNVGPFYFT 323




May contribute to cell wall structure in root hairs.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9LZJ7|PRP3_ARATH Proline-rich protein 3 OS=Arabidopsis thaliana GN=PRP3 PE=2 SV=1 Back     alignment and function description
>sp|Q03211|PEXLP_TOBAC Pistil-specific extensin-like protein OS=Nicotiana tabacum PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query176
255556806169 conserved hypothetical protein [Ricinus 0.863 0.899 0.645 7e-51
224069703173 predicted protein [Populus trichocarpa] 0.943 0.959 0.552 4e-48
449440443158 PREDICTED: uncharacterized protein LOC10 0.897 1.0 0.529 8e-45
356505291176 PREDICTED: pistil-specific extensin-like 0.943 0.943 0.548 3e-44
297833766171 pollen ole e 1 allergen and extensin fam 0.897 0.923 0.512 1e-43
351724337175 uncharacterized protein LOC100527270 pre 0.914 0.92 0.535 2e-43
22330940171 pollen Ole e 1 allergen and extensin fam 0.875 0.900 0.524 1e-42
357510897182 Pistil-specific extensin-like protein [M 0.988 0.956 0.508 3e-42
115483240168 Os10g0546100 [Oryza sativa Japonica Grou 0.789 0.827 0.368 1e-18
297740015134 unnamed protein product [Vitis vinifera] 0.392 0.514 0.619 6e-18
>gi|255556806|ref|XP_002519436.1| conserved hypothetical protein [Ricinus communis] gi|223541299|gb|EEF42850.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  205 bits (521), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 104/161 (64%), Positives = 125/161 (77%), Gaps = 9/161 (5%)

Query: 16  CTNLAVANYGDASAGVIHVGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMC 75
           CTNLAVA  G    GVIHVGG+V+CQ+C K Y +WVNG  P+KG  VSLTCMD+R R M 
Sbjct: 18  CTNLAVAWEG---TGVIHVGGKVMCQDCTKGYHDWVNGDRPIKGCKVSLTCMDERRRVMY 74

Query: 76  YKSDETDEQGQFYMTVDRIINGKELKAKLCSVRLVSSTHADCSIATNFAGGKTGVKLMRS 135
           Y+SD TDE+GQF MTV + INGKELK KLCSVRLVSS    C+I T+FAGGK+GVKL R 
Sbjct: 75  YESDITDEEGQFDMTVSKYINGKELKEKLCSVRLVSSPDQTCNILTDFAGGKSGVKL-RR 133

Query: 136 PTSMFRDIIKYRVGPFYYTTPMCEKPDTNDSQSDEDQGSYY 176
           PT ++RD++KY + PFY+TTPMCE+PDT  SQSD   G++Y
Sbjct: 134 PTLIYRDMVKYMLQPFYFTTPMCEEPDT--SQSD---GTHY 169




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224069703|ref|XP_002326398.1| predicted protein [Populus trichocarpa] gi|222833591|gb|EEE72068.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449440443|ref|XP_004137994.1| PREDICTED: uncharacterized protein LOC101220111 [Cucumis sativus] gi|449521215|ref|XP_004167625.1| PREDICTED: uncharacterized protein LOC101225099 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356505291|ref|XP_003521425.1| PREDICTED: pistil-specific extensin-like protein-like [Glycine max] Back     alignment and taxonomy information
>gi|297833766|ref|XP_002884765.1| pollen ole e 1 allergen and extensin family protein [Arabidopsis lyrata subsp. lyrata] gi|297330605|gb|EFH61024.1| pollen ole e 1 allergen and extensin family protein [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|351724337|ref|NP_001237310.1| uncharacterized protein LOC100527270 precursor [Glycine max] gi|255631922|gb|ACU16328.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|22330940|ref|NP_683542.1| pollen Ole e 1 allergen and extensin family protein [Arabidopsis thaliana] gi|332641315|gb|AEE74836.1| pollen Ole e 1 allergen and extensin family protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|357510897|ref|XP_003625737.1| Pistil-specific extensin-like protein [Medicago truncatula] gi|355500752|gb|AES81955.1| Pistil-specific extensin-like protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|115483240|ref|NP_001065213.1| Os10g0546100 [Oryza sativa Japonica Group] gi|13357245|gb|AAK20042.1|AC025783_2 putative pollen-related protein [Oryza sativa Japonica Group] gi|31433368|gb|AAP54887.1| Pollen proteins Ole e I family protein [Oryza sativa Japonica Group] gi|113639822|dbj|BAF27127.1| Os10g0546100 [Oryza sativa Japonica Group] gi|125532846|gb|EAY79411.1| hypothetical protein OsI_34543 [Oryza sativa Indica Group] gi|125575588|gb|EAZ16872.1| hypothetical protein OsJ_32350 [Oryza sativa Japonica Group] gi|215768694|dbj|BAH00923.1| unnamed protein product [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|297740015|emb|CBI30197.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query176
TAIR|locus:504955667171 AT3G09925 "AT3G09925" [Arabido 0.875 0.900 0.524 3.2e-42
TAIR|locus:2169682183 MOP10 "AT5G05500" [Arabidopsis 0.698 0.672 0.285 2.6e-08
TAIR|locus:2010990335 PRP1 "AT1G54970" [Arabidopsis 0.630 0.331 0.318 5.9e-08
TAIR|locus:2081730313 PRP3 "AT3G62680" [Arabidopsis 0.619 0.348 0.312 1.1e-06
TAIR|locus:2032180359 AGP31 "arabinogalactan protein 0.704 0.345 0.262 3.5e-06
TAIR|locus:2062035184 AT2G47530 "AT2G47530" [Arabido 0.676 0.646 0.294 3.8e-06
TAIR|locus:2143241 401 AT5G15780 "AT5G15780" [Arabido 0.494 0.216 0.311 6.1e-06
TAIR|locus:2061554175 AT2G34700 "AT2G34700" [Arabido 0.727 0.731 0.266 1.1e-05
TAIR|locus:2057604239 AGP30 "AT2G33790" [Arabidopsis 0.704 0.518 0.284 1.2e-05
TAIR|locus:504955667 AT3G09925 "AT3G09925" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 447 (162.4 bits), Expect = 3.2e-42, P = 3.2e-42
 Identities = 86/164 (52%), Positives = 115/164 (70%)

Query:     3 SMALMLSAAFFMGC----TNLAVANYGDASAGVIHVGGQVLCQNCFKSYKEWVNGSNPLK 58
             ++ L++   F + C    T  A  N GD     IHV G+V+CQ+C  +Y EW+NGS P+K
Sbjct:     7 TLVLLIEILFLVSCVTHITTTAAYNNGDK----IHVAGKVMCQDCSLNYDEWINGSEPIK 62

Query:    59 GVTVSLTCMDDRSRAMCYKSDETDEQGQFYMTVDRIIN-GKELKAKLCSVRLVSSTHADC 117
             G  VS+TCMD+R R   Y SD TDE+GQF + V+++++ GK LK +LC+VRLVSS    C
Sbjct:    63 GAVVSITCMDERRRVSYYGSDLTDERGQFDLMVNKVLSHGKVLKPQLCNVRLVSSPDLSC 122

Query:   118 SIATNFAGGKTGVKLMRSPTSMFRDIIKYRVGPFYYTTPMCEKP 161
             +I TNF  G+TGVKL+R P ++FRD++KY VGPFYYTTPMCE P
Sbjct:   123 NIPTNFGNGQTGVKLVR-PFTVFRDLVKYVVGPFYYTTPMCETP 165




GO:0003674 "molecular_function" evidence=ND
GO:0005576 "extracellular region" evidence=ISM
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2169682 MOP10 "AT5G05500" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2010990 PRP1 "AT1G54970" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2081730 PRP3 "AT3G62680" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2032180 AGP31 "arabinogalactan protein 31" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2062035 AT2G47530 "AT2G47530" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2143241 AT5G15780 "AT5G15780" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2061554 AT2G34700 "AT2G34700" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2057604 AGP30 "AT2G33790" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00280266
hypothetical protein (173 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query176
pfam0119095 pfam01190, Pollen_Ole_e_I, Pollen proteins Ole e I 3e-17
>gnl|CDD|216353 pfam01190, Pollen_Ole_e_I, Pollen proteins Ole e I like Back     alignment and domain information
 Score = 72.5 bits (178), Expect = 3e-17
 Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 4/99 (4%)

Query: 34  VGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMCYKSDETDEQGQFYMTVDR 93
           V G V C  C      +   + PL G  V + C D   R +      TDE+G F + +  
Sbjct: 1   VEGSVYCDTC--RASGFELSAYPLPGAKVKIECKDGDGRVVTSAEAVTDEKGYFKVELPG 58

Query: 94  IINGKELKAKLCSVRLVSSTHADCSIATNFAGGKTGVKL 132
             +        C  +LVSS  + CS  TN     TG  L
Sbjct: 59  DPSSLTHLESACRAKLVSSPDSACSAPTNG--NGTGAPL 95


Length = 95

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 176
PF0119097 Pollen_Ole_e_I: Pollen proteins Ole e I like; Inte 99.95
PF1362082 CarboxypepD_reg: Carboxypeptidase regulatory-like 96.33
PF1197497 MG1: Alpha-2-macroglobulin MG1 domain; InterPro: I 96.17
PF1371588 DUF4480: Domain of unknown function (DUF4480) 95.75
PF0106080 DUF290: Transthyretin-like family; InterPro: IPR00 95.48
PF10670215 DUF4198: Domain of unknown function (DUF4198) 92.14
PRK15036137 hydroxyisourate hydrolase; Provisional 91.26
PF0721085 DUF1416: Protein of unknown function (DUF1416); In 91.16
PF0573870 Cna_B: Cna protein B-type domain; InterPro: IPR008 89.01
PF08400134 phage_tail_N: Prophage tail fibre N-terminal; Inte 88.5
cd03858374 M14_CP_N-E_like Carboxypeptidase (CP) N/E-like sub 88.01
cd03866376 M14_CPM Peptidase M14 Carboxypeptidase (CP) M (CPM 85.2
COG5266264 CbiK ABC-type Co2+ transport system, periplasmic c 84.54
TIGR02438281 catachol_actin catechol 1,2-dioxygenase, Actinobac 84.09
PF15240179 Pro-rich: Proline-rich 83.7
PF00775183 Dioxygenase_C: Dioxygenase; InterPro: IPR000627 Th 83.53
cd03464220 3,4-PCD_beta Protocatechuate 3,4-dioxygenase (3,4- 83.2
cd03463185 3,4-PCD_alpha Protocatechuate 3,4-dioxygenase (3,4 83.06
cd03459158 3,4-PCD Protocatechuate 3,4-dioxygenase (3,4-PCD) 82.75
cd00421146 intradiol_dioxygenase Intradiol dioxygenases catal 82.43
TIGR02422220 protocat_beta protocatechuate 3,4-dioxygenase, bet 81.65
PF0717295 GRP: Glycine rich protein family; InterPro: IPR010 80.2
>PF01190 Pollen_Ole_e_I: Pollen proteins Ole e I like; InterPro: IPR006041 Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms Back     alignment and domain information
Probab=99.95  E-value=3.7e-28  Score=176.62  Aligned_cols=96  Identities=33%  Similarity=0.589  Sum_probs=86.3

Q ss_pred             EEEEEEcccCCCCCccccCCCccCCCCEEEEEecCCCCceEEEEEeEcCCCceEEEEEccccccC-CCcccceEEEEccC
Q 048602           34 VGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMCYKSDETDEQGQFYMTVDRIINGK-ELKAKLCSVRLVSS  112 (176)
Q Consensus        34 V~G~VyCd~C~~~~~~~~~~s~pi~GA~V~v~C~~~~~~~~~~~~~~TD~~G~F~i~l~~~~~~~-~~~~~~C~v~L~sS  112 (176)
                      |+|+||||+|+++ ..  ..++||+||+|+|+|+++++.+.+.++++||++|+|+|+||....+. .+..+.|+|+|++|
T Consensus         1 V~G~V~C~~C~~~-~~--~~~~~l~GA~V~v~C~~~~~~~~~~~~~~Td~~G~F~i~l~~~~~~~~~~~~~~C~v~l~~s   77 (97)
T PF01190_consen    1 VEGVVYCDDCSSG-FS--RAAKPLPGAKVSVECKDGNGGVVFSAEAKTDENGYFSIELPSDPGSSSPHLSSSCRVKLVSS   77 (97)
T ss_pred             CEEEEEeCCCCCC-cc--ccCccCCCCEEEEECCCCCCCcEEEEEEEeCCCCEEEEEecCccccccCCCCCCcEEEEeCC
Confidence            7999999999994 33  78899999999999999887777789999999999999999875432 46799999999999


Q ss_pred             CCcCCcccccCCCCccccee
Q 048602          113 THADCSIATNFAGGKTGVKL  132 (176)
Q Consensus       113 p~~~C~~~t~~n~g~sg~~L  132 (176)
                      |++.|+++++.|+|++|++|
T Consensus        78 p~~~C~~~~~~~~G~~ga~l   97 (97)
T PF01190_consen   78 PDPSCNVPTNSNGGRTGAKL   97 (97)
T ss_pred             CcCcCCCCcCCCCCccCCcC
Confidence            99999999999999999986



A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation. The allergens in this family include allergens with the following designations: Ole e 1. A number of plant pollen proteins, whose biological function is not yet known, are structurally related []. These proteins are most probably secreted and consist of about 145 residues. There are six cysteines which are conserved in the sequence of these proteins. They seem to be involved in disulphide bonds.

>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A Back     alignment and domain information
>PF11974 MG1: Alpha-2-macroglobulin MG1 domain; InterPro: IPR021868 This is the N-terminal MG1 domain from alpha-2-macroglobulin [] Back     alignment and domain information
>PF13715 DUF4480: Domain of unknown function (DUF4480) Back     alignment and domain information
>PF01060 DUF290: Transthyretin-like family; InterPro: IPR001534 This new apparently nematode-specific protein family has been called family 2 [] Back     alignment and domain information
>PF10670 DUF4198: Domain of unknown function (DUF4198) Back     alignment and domain information
>PRK15036 hydroxyisourate hydrolase; Provisional Back     alignment and domain information
>PF07210 DUF1416: Protein of unknown function (DUF1416); InterPro: IPR010814 This family consists of several hypothetical bacterial proteins of around 100 residues in length Back     alignment and domain information
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains Back     alignment and domain information
>PF08400 phage_tail_N: Prophage tail fibre N-terminal; InterPro: IPR013609 This entry represents the N terminus of phage 933W tail fibre protein Back     alignment and domain information
>cd03858 M14_CP_N-E_like Carboxypeptidase (CP) N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs) Back     alignment and domain information
>cd03866 M14_CPM Peptidase M14 Carboxypeptidase (CP) M (CPM) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs) Back     alignment and domain information
>COG5266 CbiK ABC-type Co2+ transport system, periplasmic component [Inorganic ion transport and metabolism] Back     alignment and domain information
>TIGR02438 catachol_actin catechol 1,2-dioxygenase, Actinobacterial Back     alignment and domain information
>PF15240 Pro-rich: Proline-rich Back     alignment and domain information
>PF00775 Dioxygenase_C: Dioxygenase; InterPro: IPR000627 This entry represents the C-terminal domain common to several intradiol ring-cleavage dioxygenases Back     alignment and domain information
>cd03464 3,4-PCD_beta Protocatechuate 3,4-dioxygenase (3,4-PCD) , beta subunit Back     alignment and domain information
>cd03463 3,4-PCD_alpha Protocatechuate 3,4-dioxygenase (3,4-PCD) , alpha subunit Back     alignment and domain information
>cd03459 3,4-PCD Protocatechuate 3,4-dioxygenase (3,4-PCD) catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate Back     alignment and domain information
>cd00421 intradiol_dioxygenase Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates Back     alignment and domain information
>TIGR02422 protocat_beta protocatechuate 3,4-dioxygenase, beta subunit Back     alignment and domain information
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query176
3irp_X429 URO-adherence factor A; DEV-IGG fold, cell WALL, h 93.73
3uaf_A117 TTR-52; beta barrel/sandwich, cell engulfment, sec 92.94
3e8v_A82 Possible transglutaminase-family protein; structur 92.44
3kcp_A321 Cellulosomal-scaffolding protein A; dockerin, X-mo 87.64
4ank_A147 Transthyretin; hormone binding protein, thyroxine 86.8
2bum_B241 Protocatechuate 3,4-dioxygenase beta chain; oxidor 80.83
3mn8_A435 LP15968P; catalytic domain of alpha/beta-hydrolase 80.78
2boy_A254 3-chlorocatechol 1,2-dioxygenase; beta barrel, oxi 80.31
>3irp_X URO-adherence factor A; DEV-IGG fold, cell WALL, hemagglutinin, peptidoglycan-anchor secreted, virulence, cell adhesion; 1.50A {Staphylococcus saprophyticus subsp} PDB: 3irz_A 3is0_X 3is1_X Back     alignment and structure
Probab=93.73  E-value=0.23  Score=43.81  Aligned_cols=58  Identities=21%  Similarity=0.341  Sum_probs=44.4

Q ss_pred             CceEEEEEEEcccCCCCCccccCCCccCCCCEEEEEecCCCCceEEEEEeEcCCCceEEEE-Ecc
Q 048602           30 GVIHVGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMCYKSDETDEQGQFYMT-VDR   93 (176)
Q Consensus        30 ~~v~V~G~VyCd~C~~~~~~~~~~s~pi~GA~V~v~C~~~~~~~~~~~~~~TD~~G~F~i~-l~~   93 (176)
                      ....|.|.||-|.=+.+-..  .+..+|+|++|.|.  +..|.++  ...+||++|.|.+. |+.
T Consensus       304 G~~~igg~Vw~D~n~nG~~D--~~e~gl~gv~v~L~--d~~G~~v--~tttTd~~G~Y~F~~L~~  362 (429)
T 3irp_X          304 GFGGFGGYVWFDKNNDGVQN--DSNAAAAGITVNLL--DPTGIRL--ATTTTDITGHYNFDNLTN  362 (429)
T ss_dssp             CEEEEEEEEEECTTSSSSSC--CSSCBCTTCEEEEE--CTTSCEE--EEEECCTTSEEEEEEEES
T ss_pred             ceEEEccEEEEeCCCCCCcC--cCCCCcCCeEEEEE--cCCCCEE--EEEEeCCCCeEEeCCCCC
Confidence            34788999999985555444  45579999999997  5556654  68899999999986 444



>3uaf_A TTR-52; beta barrel/sandwich, cell engulfment, secreted, protein BIN; 2.01A {Caenorhabditis elegans} Back     alignment and structure
>3e8v_A Possible transglutaminase-family protein; structural genomics, unknown function, PSI-2, protein structure initiative; 2.40A {Bacteroides fragilis nctc 9343} Back     alignment and structure
>3kcp_A Cellulosomal-scaffolding protein A; dockerin, X-module, carbohydrate metabolism, cell WALL biogenesis/degradation; 1.94A {Clostridium thermocellum atcc 27405} PDB: 3p0d_C 4fl4_C 2b59_B Back     alignment and structure
>4ank_A Transthyretin; hormone binding protein, thyroxine transport protein, hormon amyloidosis inhibition; 1.70A {Homo sapiens} Back     alignment and structure
>2bum_B Protocatechuate 3,4-dioxygenase beta chain; oxidoreductase, aromatic degradation, non-heme iron, beta-SA mixed alpha/beta structure; 1.80A {Acinetobacter SP} SCOP: b.3.6.1 PDB: 1eo2_B 1eoa_B 1eob_B* 1eoc_B* 1eo9_B 2buq_B 2bur_B* 2bux_B 2buy_B 2buz_B* 2bv0_B* 2buu_B* 2but_B* 2buv_B* 2buw_B* Back     alignment and structure
>3mn8_A LP15968P; catalytic domain of alpha/beta-hydrolase fold, C-terminal, A transthyretin-like domain, hydrolase; HET: NAG GEM; 2.70A {Drosophila melanogaster} Back     alignment and structure
>2boy_A 3-chlorocatechol 1,2-dioxygenase; beta barrel, oxidoreductase; HET: BHO LPP; 1.9A {Rhodococcus opacus} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query176
d1vlfn179 Transhydroxylase beta subunit, BthL, C-terminal do 95.91
d1h8la179 Carboxypeptidase D C-terminal domain {Crested duck 94.62
d1uwya1107 Carboxypeptidase M C-terminal domain {Human (Homo 89.96
d3pccm_236 Protocatechuate-3,4-dioxygenase, beta chain {Pseud 89.26
d2burb1238 Protocatechuate-3,4-dioxygenase, beta chain {Acine 80.8
>d1vlfn1 b.3.5.1 (N:196-274) Transhydroxylase beta subunit, BthL, C-terminal domain {Pelobacter acidigallici [TaxId: 35816]} Back     information, alignment and structure
class: All beta proteins
fold: Prealbumin-like
superfamily: Cna protein B-type domain
family: Cna protein B-type domain
domain: Transhydroxylase beta subunit, BthL, C-terminal domain
species: Pelobacter acidigallici [TaxId: 35816]
Probab=95.91  E-value=0.008  Score=39.36  Aligned_cols=46  Identities=24%  Similarity=0.351  Sum_probs=32.8

Q ss_pred             ceEEEEEEEcccCCCCCccccCCCccCCCCEEEEEecCCCCceEEEEEeEcCCCceEEEE-Eccc
Q 048602           31 VIHVGGQVLCQNCFKSYKEWVNGSNPLKGVTVSLTCMDDRSRAMCYKSDETDEQGQFYMT-VDRI   94 (176)
Q Consensus        31 ~v~V~G~VyCd~C~~~~~~~~~~s~pi~GA~V~v~C~~~~~~~~~~~~~~TD~~G~F~i~-l~~~   94 (176)
                      .++|.|.|- |            ..||+||.|.|.  +. +..+  ....||++|.|+++ ++..
T Consensus         2 n~~~~gi~~-~------------G~~v~gA~V~L~--~~-~~~v--~~t~Td~~G~F~f~~l~~G   48 (79)
T d1vlfn1           2 NYVTAGILV-Q------------GDCFEGAKVVLK--SG-GKEV--ASAETNFFGEFKFDALDNG   48 (79)
T ss_dssp             EEEEEEEEE-T------------TEECTTCEEEEE--ET-TEEE--EEEECCTTSEEEEEEECSE
T ss_pred             CcEEeeEEE-C------------CccccccEEEEE--CC-CCeE--EeeEECCCCcEEEEecCCC
Confidence            457778774 2            279999999995  32 3333  46789999999997 5554



>d1h8la1 b.3.2.1 (A:305-383) Carboxypeptidase D C-terminal domain {Crested duck (Lophonetta specularioides) [TaxId: 8836]} Back     information, alignment and structure
>d1uwya1 b.3.2.1 (A:297-403) Carboxypeptidase M C-terminal domain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d3pccm_ b.3.6.1 (M:) Protocatechuate-3,4-dioxygenase, beta chain {Pseudomonas putida [TaxId: 303]} Back     information, alignment and structure
>d2burb1 b.3.6.1 (B:303-540) Protocatechuate-3,4-dioxygenase, beta chain {Acinetobacter calcoaceticus, adp1 [TaxId: 471]} Back     information, alignment and structure