Citrus Sinensis ID: 032308


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140---
MLSRIRQALQWLSFLLFISQLICNTSFADVGTAARYGPPFLPTTCYGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISASEPGTCEPEQTIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANSTATLINVEFQQV
cHHHHHHHHHHHHHHHHHHHHHHccccccEEEEEEEcccccccccccccccccccccEEEEEccccccccccccccEEEEEccccccccccccccEEEEEEEccccccccccccccEEEccHHHHHHHHcccccEEEEEEEEc
cHHHHHHHHHHHHHHHHHHHHHHHHHcccccEEEEEcccccccccccccccccccccEEEEEcHHHcccccccccEEEEEEEccccccccccccEEEEEEEccccccccccccccccccccHHHHHHHHcccccEEEEEEEEc
MLSRIRQALQWLSFLLFISQLICntsfadvgtaarygppflpttcygndptqfpssnlfaaagdgiwdngascgRQYLVRCisasepgtcepeqtIQIRIVDYAlqlestpsvsgttiVLSETAFRTIANSTATLINVEFQQV
MLSRIRQALQWLSFLLFISQLICNTSFADVGTAARYGPPFLPTTCYGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISAsepgtcepeqtIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANSTATLINVEFQQV
MLSRIRQALQWLSFLLFISQLICNTSFADVGTAARYGPPFLPTTCYGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISASEPGTCEPEQTIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANSTATLINVEFQQV
*****RQALQWLSFLLFISQLICNTSFADVGTAARYGPPFLPTTCYGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISASEPGTCEPEQTIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANSTATLINVE****
*********QWLSFLLFISQLICNTSFADVGTAARYGPPFLPTTCYGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISASEPGTCEPEQTIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANSTATLINVEFQQV
MLSRIRQALQWLSFLLFISQLICNTSFADVGTAARYGPPFLPTTCYGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISASEPGTCEPEQTIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANSTATLINVEFQQV
*LSRIRQALQWLSFLLFISQLICNTSFADVGTAARYGPPFLPTTCYGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISASEPGTCEPEQTIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANSTATLINVEFQQV
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSiiiiHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MLSRIRQALQWLSFLLFISQLICNTSFADVGTAARYGPPFLPTTCYGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISASEPGTCEPEQTIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANSTATLINVEFQQV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query143 2.2.26 [Sep-21-2011]
Q9ZP41131 EG45-like domain containi N/A no 0.769 0.839 0.373 2e-13
Q9ZV52130 EG45-like domain containi yes no 0.867 0.953 0.304 3e-11
Q9M0C2123 Putative EG45-like domain no no 0.517 0.601 0.356 2e-08
>sp|Q9ZP41|EGC_CITJA EG45-like domain containing protein OS=Citrus jambhiri GN=CjBAp12 PE=1 SV=1 Back     alignment and function desciption
 Score = 74.7 bits (182), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/126 (37%), Positives = 67/126 (53%), Gaps = 16/126 (12%)

Query: 21  LICNTSFADVGTAARYGPPFLPTTC--YGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYL 78
           LI + ++A  GTA  Y PP++P+ C  Y ND        + AAA   IW+NGA C + + 
Sbjct: 17  LISSAAYASEGTATFYTPPYVPSACNGYKNDGV------MIAAASYAIWNNGAVCNKSFR 70

Query: 79  VRCISASEPGTCEPEQ--TIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANSTATLI 136
           V+C  A+  GT  P +  ++ ++IVD        P+    TI LS+ AF  IAN  A  I
Sbjct: 71  VKCTGATNQGTPHPCRGGSVLVKIVDLC------PAGCQATIDLSQEAFSQIANPDAGKI 124

Query: 137 NVEFQQ 142
            +EF Q
Sbjct: 125 KIEFNQ 130




Might have a systemic role in water and solute homeostasis. Has no expansin-like activity.
Citrus jambhiri (taxid: 64884)
>sp|Q9ZV52|EGC2_ARATH EG45-like domain containing protein 2 OS=Arabidopsis thaliana GN=EGC2 PE=2 SV=2 Back     alignment and function description
>sp|Q9M0C2|EGC1_ARATH Putative EG45-like domain containing protein 1 OS=Arabidopsis thaliana GN=EGC1 PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query143
225445398144 PREDICTED: EG45-like domain containing p 0.979 0.972 0.713 2e-50
255576970189 conserved hypothetical protein [Ricinus 0.902 0.682 0.689 8e-47
351723313144 uncharacterized protein LOC100306194 pre 0.895 0.888 0.676 4e-45
356515476140 PREDICTED: EG45-like domain containing p 0.979 1.0 0.640 5e-45
351724325144 uncharacterized protein LOC100306489 pre 0.993 0.986 0.631 8e-45
224143084157 predicted protein [Populus trichocarpa] 0.818 0.745 0.675 1e-39
224091381151 predicted protein [Populus trichocarpa] 0.825 0.781 0.677 3e-39
449443119148 PREDICTED: EG45-like domain containing p 0.993 0.959 0.550 4e-39
359476094 266 PREDICTED: uncharacterized protein LOC10 0.811 0.436 0.663 7e-39
224092422139 predicted protein [Populus trichocarpa] 0.818 0.841 0.666 2e-38
>gi|225445398|ref|XP_002284999.1| PREDICTED: EG45-like domain containing protein-like isoform 3 [Vitis vinifera] gi|225445400|ref|XP_002284997.1| PREDICTED: EG45-like domain containing protein-like isoform 2 [Vitis vinifera] gi|225445402|ref|XP_002284996.1| PREDICTED: EG45-like domain containing protein-like isoform 1 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  203 bits (517), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 102/143 (71%), Positives = 115/143 (80%), Gaps = 3/143 (2%)

Query: 4   RIRQALQWLSFLL---FISQLICNTSFADVGTAARYGPPFLPTTCYGNDPTQFPSSNLFA 60
           RI + L+   FLL   FIS    N+   DVGTAA+Y PP+LPT CYGND ++FPSSNLFA
Sbjct: 2   RIIETLRCCLFLLDFLFISVFFINSCQGDVGTAAQYNPPYLPTICYGNDASEFPSSNLFA 61

Query: 61  AAGDGIWDNGASCGRQYLVRCISASEPGTCEPEQTIQIRIVDYALQLESTPSVSGTTIVL 120
           AAGDGIWDNGASCGRQYLVRCISA++PGTC P+QTIQI++VDYA    STPS  GTTIVL
Sbjct: 62  AAGDGIWDNGASCGRQYLVRCISATQPGTCVPDQTIQIKVVDYAPSAPSTPSADGTTIVL 121

Query: 121 SETAFRTIANSTATLINVEFQQV 143
           SETAF  IANSTAT IN+EFQQV
Sbjct: 122 SETAFGIIANSTATAINIEFQQV 144




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255576970|ref|XP_002529370.1| conserved hypothetical protein [Ricinus communis] gi|223531190|gb|EEF33037.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|351723313|ref|NP_001235227.1| uncharacterized protein LOC100306194 precursor [Glycine max] gi|255627831|gb|ACU14260.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356515476|ref|XP_003526426.1| PREDICTED: EG45-like domain containing protein-like [Glycine max] Back     alignment and taxonomy information
>gi|351724325|ref|NP_001235774.1| uncharacterized protein LOC100306489 precursor [Glycine max] gi|255628697|gb|ACU14693.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|224143084|ref|XP_002324844.1| predicted protein [Populus trichocarpa] gi|222866278|gb|EEF03409.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224091381|ref|XP_002309240.1| predicted protein [Populus trichocarpa] gi|222855216|gb|EEE92763.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449443119|ref|XP_004139328.1| PREDICTED: EG45-like domain containing protein 2-like isoform 1 [Cucumis sativus] gi|449443121|ref|XP_004139329.1| PREDICTED: EG45-like domain containing protein 2-like isoform 2 [Cucumis sativus] gi|449509459|ref|XP_004163595.1| PREDICTED: EG45-like domain containing protein 2-like isoform 1 [Cucumis sativus] gi|449509463|ref|XP_004163596.1| PREDICTED: EG45-like domain containing protein 2-like isoform 2 [Cucumis sativus] Back     alignment and taxonomy information
>gi|359476094|ref|XP_002282215.2| PREDICTED: uncharacterized protein LOC100259398 [Vitis vinifera] Back     alignment and taxonomy information
>gi|224092422|ref|XP_002309602.1| predicted protein [Populus trichocarpa] gi|222855578|gb|EEE93125.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query143
TAIR|locus:2053983130 PNP-A "plant natriuretic pepti 0.888 0.976 0.312 1.2e-12
TAIR|locus:2118656123 AT4G30380 "AT4G30380" [Arabido 0.797 0.926 0.325 4.1e-10
TAIR|locus:2053983 PNP-A "plant natriuretic peptide A" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 168 (64.2 bits), Expect = 1.2e-12, P = 1.2e-12
 Identities = 44/141 (31%), Positives = 74/141 (52%)

Query:     5 IRQALQWLSFLLFISQLICNTSFADVGTAARYGPPFLPTTCYGNDPTQFPSSNLFAAAGD 64
             I+ A++++  ++  +Q++   + A  G A  Y PP+  + CYG   TQ     L     +
Sbjct:     2 IKMAVKFVVVMIVFAQILAPIAEAAQGKAVYYDPPYTRSACYG---TQ--RETLVVGVKN 56

Query:    65 GIWDNGASCGRQYLVRCISASE--PGTCEPEQTIQIRIVDYALQLESTPSVSGTTIVLSE 122
              +W NG +CGR+Y VRCI A+      C   +T+ +++VD+  +    P  +G  + LS 
Sbjct:    57 NLWQNGRACGRRYRVRCIGATYNFDRACTG-RTVDVKVVDFCRE----PC-NGD-LNLSR 109

Query:   123 TAFRTIANSTATLINVEFQQV 143
              AFR IAN+ A  I V +  +
Sbjct:   110 DAFRVIANTDAGNIRVVYTPI 130




GO:0005576 "extracellular region" evidence=ISM;ISS
GO:0005618 "cell wall" evidence=IDA
GO:0048046 "apoplast" evidence=IDA
GO:0010230 "alternative respiration" evidence=IDA
GO:0009627 "systemic acquired resistance" evidence=RCA
GO:0034976 "response to endoplasmic reticulum stress" evidence=RCA
TAIR|locus:2118656 AT4G30380 "AT4G30380" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q9ZV52EGC2_ARATHNo assigned EC number0.30430.86710.9538yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
fgenesh4_pg.C_LG_XVIII000365
hypothetical protein (157 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query143
pfam0333077 pfam03330, DPBB_1, Rare lipoprotein A (RlpA)-like 2e-13
PLN03024125 PLN03024, PLN03024, Putative EG45-like domain cont 3e-13
>gnl|CDD|217497 pfam03330, DPBB_1, Rare lipoprotein A (RlpA)-like double-psi beta-barrel Back     alignment and domain information
 Score = 61.0 bits (149), Expect = 2e-13
 Identities = 24/82 (29%), Positives = 41/82 (50%), Gaps = 7/82 (8%)

Query: 60  AAAGDGIWDNGASCGRQYLVRCISASEP-GTCEPEQTIQIRIVDYALQLESTPSVSGTTI 118
           AA    ++++G +CG  Y V+C++A+ P GTC   +++ + I D        P       
Sbjct: 2   AAGSASLYNDGTACGECYQVKCLTAAHPTGTCRVLRSVTVTITDRC------PFPPRRHF 55

Query: 119 VLSETAFRTIANSTATLINVEF 140
            LS  AF  +A   A ++ VE+
Sbjct: 56  DLSGPAFEALAKPRAGIVPVEY 77


Rare lipoprotein A (RlpA) contains a conserved region that has the double-psi beta-barrel (DPBB) fold. The function of RlpA is not well understood, but it has been shown to act as a prc mutant suppressor in Escherichia coli. The DPBB fold is often an enzymatic domain. The members of this family are quite diverse, and if catalytic this family may contain several different functions. Another example of this domain is found in the N terminus of pollen allergen. Length = 77

>gnl|CDD|178595 PLN03024, PLN03024, Putative EG45-like domain containing protein 1; Provisional Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 143
PLN03024125 Putative EG45-like domain containing protein 1; Pr 100.0
PLN00193256 expansin-A; Provisional 99.97
PLN00050247 expansin A; Provisional 99.97
PLN03023 247 Expansin-like B1; Provisional 99.97
smart0083787 DPBB_1 Rare lipoprotein A (RlpA)-like double-psi b 99.93
PF0333078 DPBB_1: Rare lipoprotein A (RlpA)-like double-psi 99.87
COG4305 232 Endoglucanase C-terminal domain/subunit and relate 99.41
TIGR00413 208 rlpA rare lipoprotein A. This is a family of proka 99.24
COG0797233 RlpA Lipoproteins [Cell envelope biogenesis, outer 99.23
PF00967119 Barwin: Barwin family; InterPro: IPR001153 Barwin 99.16
PRK10672 361 rare lipoprotein A; Provisional 99.0
PF07249119 Cerato-platanin: Cerato-platanin; InterPro: IPR010 98.72
PF02015201 Glyco_hydro_45: Glycosyl hydrolase family 45; Inte 96.02
>PLN03024 Putative EG45-like domain containing protein 1; Provisional Back     alignment and domain information
Probab=100.00  E-value=3.3e-35  Score=215.50  Aligned_cols=119  Identities=29%  Similarity=0.613  Sum_probs=101.8

Q ss_pred             HHHHHHHHHHhhhccCCceeeEEEeCCCCCCCCcccCCCCCCCCCCeEEEeCCCCCCCCcccCCeEEEEeccCC--CCCc
Q 032308           12 LSFLLFISQLICNTSFADVGTAARYGPPFLPTTCYGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISAS--EPGT   89 (143)
Q Consensus        12 ~~~~~~~~~~~~~~a~~~~G~aT~Y~~~~~~GaC~g~~~~~~p~~~~iaA~s~~~~~~g~~CG~c~~V~c~~~~--~~~~   89 (143)
                      +||++.+++.+.+++.++.|+||||++ +.+|+||++.    +.+.++||+|+.+|++|+.||+||||+|.+..  .+.+
T Consensus         5 ~~~~~~~~~~~~~~~~~~~G~AT~Y~~-~~~gAC~~~~----~~g~~iaAls~~lf~~G~~CG~c~~V~C~~~~~~~~~~   79 (125)
T PLN03024          5 ILIFSTVLVFLFSVSYATPGIATFYTS-YTPSACYRGT----SFGVMIAAASDSLWNNGRVCGKMFTVKCKGPRNAVPHP   79 (125)
T ss_pred             eHHHHHHHHHHhhhhcccceEEEEeCC-CCCccccCCC----CCCCEeEEeCHHHcCCCcccCceEEEEECCCCcccccc
Confidence            456666777888999999999999996 5779997663    36899999999999999999999999997532  1347


Q ss_pred             cCCCCeEEEEEeecCCCCCCCCCCCCceEEeCHHHHHHhhcCCCceEEEEEEE
Q 032308           90 CEPEQTIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANSTATLINVEFQQ  142 (143)
Q Consensus        90 C~~g~sV~V~V~D~Cp~c~~~~~~~~~~lDLS~~AF~~ia~~~~G~i~V~w~~  142 (143)
                      |. +++|+|+|+|+||++      +++|||||++||++||+++.|+|+|+|++
T Consensus        80 c~-gksV~V~VtD~CP~~------C~~~~DLS~~AF~~iA~~~aG~v~V~y~~  125 (125)
T PLN03024         80 CT-GKSVTVKIVDHCPSG------CASTLDLSREAFAQIANPVAGIINIDYIP  125 (125)
T ss_pred             cc-CCeEEEEEEcCCCCC------CCCceEcCHHHHHHhcCccCCEEEEEEeC
Confidence            86 789999999999952      23699999999999999999999999985



>PLN00193 expansin-A; Provisional Back     alignment and domain information
>PLN00050 expansin A; Provisional Back     alignment and domain information
>PLN03023 Expansin-like B1; Provisional Back     alignment and domain information
>smart00837 DPBB_1 Rare lipoprotein A (RlpA)-like double-psi beta-barrel Back     alignment and domain information
>PF03330 DPBB_1: Rare lipoprotein A (RlpA)-like double-psi beta-barrel; InterPro: IPR009009 Beta barrels are commonly observed in protein structures Back     alignment and domain information
>COG4305 Endoglucanase C-terminal domain/subunit and related proteins [Carbohydrate transport and metabolism] Back     alignment and domain information
>TIGR00413 rlpA rare lipoprotein A Back     alignment and domain information
>COG0797 RlpA Lipoproteins [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>PF00967 Barwin: Barwin family; InterPro: IPR001153 Barwin is a basic protein isolated from aqueous extracts of barley seeds Back     alignment and domain information
>PRK10672 rare lipoprotein A; Provisional Back     alignment and domain information
>PF07249 Cerato-platanin: Cerato-platanin; InterPro: IPR010829 Cerato-platanin (CP) is the first member of the cerato-platanin family Back     alignment and domain information
>PF02015 Glyco_hydro_45: Glycosyl hydrolase family 45; InterPro: IPR000334 O-Glycosyl hydrolases 3 Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query143
3d30_A 208 YOAJ, expansin like protein; peptidoglycan associa 3e-07
2hcz_X 245 Beta-expansin 1A; domain 1 is A beta barrel and do 2e-06
1n10_A 241 PHL P I, pollen allergen PHL P 1; plant allergen, 3e-06
>3d30_A YOAJ, expansin like protein; peptidoglycan associated protei unknown function, MLTA, bacteria autolysis, peptidoglycan-B protein; 1.90A {Bacillus subtilis} PDB: 2bh0_A Length = 208 Back     alignment and structure
 Score = 46.8 bits (111), Expect = 3e-07
 Identities = 23/117 (19%), Positives = 35/117 (29%), Gaps = 23/117 (19%)

Query: 31  GTAARYGPPFLPTTCYGNDPTQFPSSNLFAAAGDGIWDNG----ASCGRQYLVRCISASE 86
           G A   G  +       +     PS     A      + G    A  G    V       
Sbjct: 9   GYATYTGSGYSGGAFLLDPI---PSDMEITAINPADLNYGGVKAALAGSYLEVEG----- 60

Query: 87  PGTCEPEQTIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANSTATLINVEFQQV 143
                P+    + + D        P  +   + LS  AFR I N     IN++++ V
Sbjct: 61  -----PKGKTTVYVTD------LYPEGARGALDLSPNAFRKIGNMKDGKINIKWRVV 106


>2hcz_X Beta-expansin 1A; domain 1 is A beta barrel and domain 2 is A immunoglobulin L sandwich, allergen; HET: NAG MAN FCA; 2.75A {Zea mays} Length = 245 Back     alignment and structure
>1n10_A PHL P I, pollen allergen PHL P 1; plant allergen, expansin, immunoglobulin-like fold, PSI beta barrel, structural genomics; HET: NAG; 2.90A {Phleum pratense} SCOP: b.7.3.1 b.52.1.3 Length = 241 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query143
2hcz_X 245 Beta-expansin 1A; domain 1 is A beta barrel and do 99.97
1n10_A 241 PHL P I, pollen allergen PHL P 1; plant allergen, 99.97
3d30_A 208 YOAJ, expansin like protein; peptidoglycan associa 99.96
1wc2_A181 Endoglucanase; hydrolase, cellulase, cellulose, do 99.94
4avr_A95 PA4485; unknown function, GRAM-negative bacteria, 99.75
1bw3_A125 Barwin, basic barley SEED protein; lectin; NMR {Ho 99.73
2kqa_A129 Cerato-platanin; elicitor, secreted, toxin; NMR {C 98.69
3m3g_A120 EPL1 protein; fungal, plant defense, fungus, polys 98.54
3suj_A127 Cerato-platanin 1; double PSI beta barrel, unknown 98.49
3sul_A122 Cerato-platanin-like protein; double PSI beta barr 98.49
3suk_A125 Cerato-platanin-like protein; double PSI beta barr 98.47
3sum_A136 Cerato-platanin-like protein; double PSI beta barr 98.35
2eng_A210 Endoglucanase V; cellulose degradation, hydrolase 96.58
>2hcz_X Beta-expansin 1A; domain 1 is A beta barrel and domain 2 is A immunoglobulin L sandwich, allergen; HET: NAG MAN FCA; 2.75A {Zea mays} Back     alignment and structure
Probab=99.97  E-value=3.2e-31  Score=212.38  Aligned_cols=108  Identities=27%  Similarity=0.439  Sum_probs=94.3

Q ss_pred             cCCceeeEEEeCCCC------CCCCc-ccCCCCCCCCCCeEEEeCCCCCCCCcccCCeEEEEeccCCCCCccCCCCeEEE
Q 032308           26 SFADVGTAARYGPPF------LPTTC-YGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISASEPGTCEPEQTIQI   98 (143)
Q Consensus        26 a~~~~G~aT~Y~~~~------~~GaC-~g~~~~~~p~~~~iaA~s~~~~~~g~~CG~c~~V~c~~~~~~~~C~~g~sV~V   98 (143)
                      ..+..|+||||++++      ..||| |++. .+.|.+.++||+|+.+|++|..||+||||+|.+   +..|. +++|+|
T Consensus        17 ~~w~~g~AT~Yg~~~~~gt~~~gGACGyg~~-~~~p~~~~~aAls~~lf~~G~~CG~Cy~V~c~~---~~~C~-~~sv~V   91 (245)
T 2hcz_X           17 GKWLTARATWYGQPNGAGAPDNGGACGIKNV-NLPPYSGMTACGNVPIFKDGKGCGSCYEVRCKE---KPECS-GNPVTV   91 (245)
T ss_dssp             CCCEEEEEEECSCTTSCSSTTSCCTTCCCCT-TSTTTTTCEEEECHHHHGGGTSTTCEEEEECCS---SSSBC-SSCEEE
T ss_pred             CceeeeEEEEeCCCCCCCcCCCCccCCCCCC-CcCCCCCEEEEeCHHHcCCchhcCCeEEEEeCC---CCccC-CCCEEE
Confidence            356799999999865      37999 9984 356788999999999999999999999999985   45797 689999


Q ss_pred             EEeecCCCCCCCCCCCCceEEeCHHHHHHhhc-------CCCceEEEEEEEC
Q 032308           99 RIVDYALQLESTPSVSGTTIVLSETAFRTIAN-------STATLINVEFQQV  143 (143)
Q Consensus        99 ~V~D~Cp~c~~~~~~~~~~lDLS~~AF~~ia~-------~~~G~i~V~w~~V  143 (143)
                      +|||+||++     .+..|||||+.||++||+       .+.|+|+|+||+|
T Consensus        92 ~VtD~C~C~-----~~~~hfDLS~~AF~~iA~~g~~~~L~~~Gii~V~yrrV  138 (245)
T 2hcz_X           92 YITDMNYEP-----IAPYHFDLSGKAFGSLAKPGLNDKIRHCGIMDVEFRRV  138 (245)
T ss_dssp             EEEEECCCT-----TSSSEEEECHHHHHHTBCTTCHHHHTTTCCEEEEEEEE
T ss_pred             EEEeccCCC-----CCCccEEcCHHHHHHHhccccccccccCCEEEEEEEEE
Confidence            999999853     267999999999999999       6899999999997



>1n10_A PHL P I, pollen allergen PHL P 1; plant allergen, expansin, immunoglobulin-like fold, PSI beta barrel, structural genomics; HET: NAG; 2.90A {Phleum pratense} SCOP: b.7.3.1 b.52.1.3 Back     alignment and structure
>3d30_A YOAJ, expansin like protein; peptidoglycan associated protei unknown function, MLTA, bacteria autolysis, peptidoglycan-B protein; 1.90A {Bacillus subtilis} PDB: 2bh0_A Back     alignment and structure
>1wc2_A Endoglucanase; hydrolase, cellulase, cellulose, double-PSI fold, glycoside hydrolase; 1.2A {Mytilus edulis} SCOP: b.52.1.1 Back     alignment and structure
>4avr_A PA4485; unknown function, GRAM-negative bacteria, infectious disease structure-based inhibitor design; 1.08A {Pseudomonas aeruginosa PA01} Back     alignment and structure
>1bw3_A Barwin, basic barley SEED protein; lectin; NMR {Hordeum vulgare} SCOP: b.52.1.2 PDB: 1bw4_A Back     alignment and structure
>2kqa_A Cerato-platanin; elicitor, secreted, toxin; NMR {Ceratocystis platani} Back     alignment and structure
>3m3g_A EPL1 protein; fungal, plant defense, fungus, polysaccharide-binding protei; 1.39A {Hypocrea virens} Back     alignment and structure
>3suj_A Cerato-platanin 1; double PSI beta barrel, unknown function; 1.34A {Moniliophthora perniciosa} Back     alignment and structure
>3sul_A Cerato-platanin-like protein; double PSI beta barrel, unknown function; 1.63A {Moniliophthora perniciosa} Back     alignment and structure
>3suk_A Cerato-platanin-like protein; double PSI beta barrel, unknown function; 1.34A {Moniliophthora perniciosa} Back     alignment and structure
>3sum_A Cerato-platanin-like protein; double PSI beta barrel, unknown function; 1.87A {Moniliophthora perniciosa} Back     alignment and structure
>2eng_A Endoglucanase V; cellulose degradation, hydrolase (endoglucanase), glycosidase; 1.50A {Humicola insolens} SCOP: b.52.1.1 PDB: 3eng_A* 4eng_A* 1hd5_A 1oa7_A* 1oa9_A 1l8f_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 143
d1n10a2143 b.52.1.3 (A:1003-1145) Pollen allergen PHL P 1 N-t 1e-07
d1wc2a1180 b.52.1.1 (A:1-180) Endoglucanase (CMCase) {Blue mu 1e-07
>d1n10a2 b.52.1.3 (A:1003-1145) Pollen allergen PHL P 1 N-terminal domain {Timothy grass (Phleum pratense) [TaxId: 15957]} Length = 143 Back     information, alignment and structure

class: All beta proteins
fold: Double psi beta-barrel
superfamily: Barwin-like endoglucanases
family: Pollen allergen PHL P 1 N-terminal domain
domain: Pollen allergen PHL P 1 N-terminal domain
species: Timothy grass (Phleum pratense) [TaxId: 15957]
 Score = 45.8 bits (108), Expect = 1e-07
 Identities = 23/126 (18%), Positives = 47/126 (37%), Gaps = 22/126 (17%)

Query: 31  GTAARYGPPFLPTTCYGN------DPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISA 84
             +  YG P               D  + P S +       I+ +G  CG  + ++C   
Sbjct: 20  AKSTWYGKPTGAGPKDNGGACGYKDVDKPPFSGMTGCGNTPIFKSGRGCGSCFEIKC--- 76

Query: 85  SEPGTCEPEQTIQIRIVDYALQLESTPSVSGTTIVLSETAFRTIANS-------TATLIN 137
           ++P  C   + + + I D     ++   ++     LS  AF  +A         +A  + 
Sbjct: 77  TKPEACSG-EPVVVHITD-----DNEEPIAPYHFDLSGHAFGAMAKKGDEQKLRSAGELE 130

Query: 138 VEFQQV 143
           ++F++V
Sbjct: 131 LQFRRV 136


>d1wc2a1 b.52.1.1 (A:1-180) Endoglucanase (CMCase) {Blue mussel (Mytilus edulis) [TaxId: 6550]} Length = 180 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query143
d1n10a2143 Pollen allergen PHL P 1 N-terminal domain {Timothy 99.97
d1wc2a1180 Endoglucanase (CMCase) {Blue mussel (Mytilus eduli 99.96
d1bw3a_125 Barwin {Barley (Hordeum vulgare) [TaxId: 4513]} 98.84
d2enga_210 Endoglucanase V (Eng V) {Humicola insolens [TaxId: 96.4
>d1n10a2 b.52.1.3 (A:1003-1145) Pollen allergen PHL P 1 N-terminal domain {Timothy grass (Phleum pratense) [TaxId: 15957]} Back     information, alignment and structure
class: All beta proteins
fold: Double psi beta-barrel
superfamily: Barwin-like endoglucanases
family: Pollen allergen PHL P 1 N-terminal domain
domain: Pollen allergen PHL P 1 N-terminal domain
species: Timothy grass (Phleum pratense) [TaxId: 15957]
Probab=99.97  E-value=1.5e-32  Score=202.66  Aligned_cols=108  Identities=22%  Similarity=0.436  Sum_probs=92.5

Q ss_pred             cCCceeeEEEeCCCC------CCCCc-ccCCCCCCCCCCeEEEeCCCCCCCCcccCCeEEEEeccCCCCCccCCCCeEEE
Q 032308           26 SFADVGTAARYGPPF------LPTTC-YGNDPTQFPSSNLFAAAGDGIWDNGASCGRQYLVRCISASEPGTCEPEQTIQI   98 (143)
Q Consensus        26 a~~~~G~aT~Y~~~~------~~GaC-~g~~~~~~p~~~~iaA~s~~~~~~g~~CG~c~~V~c~~~~~~~~C~~g~sV~V   98 (143)
                      ..|..|+||||+.++      ..||| |++.. +.|.+.++||+|+.+|++|..||+||||+|.+   +.+|. +++|+|
T Consensus        15 ~~w~~g~AT~Yg~~~~~~~~~~gGACGyg~~~-~~p~g~~~aAls~~l~~~g~~CG~cyev~c~~---~~~c~-~~sv~V   89 (143)
T d1n10a2          15 DKWLDAKSTWYGKPTGAGPKDNGGACGYKDVD-KPPFSGMTGCGNTPIFKSGRGCGSCFEIKCTK---PEACS-GEPVVV   89 (143)
T ss_dssp             CSCEEEEEEEEC----------CCTTCCCCTT-STTTTTCEEEECHHHHGGGTTSSCEEEEEECS---STTBC-SCCEEE
T ss_pred             CCcEeeEEeEeCCCCCCCCCCCcccCCCCCcc-cCCCcceEEEeCHhHccCCCcCCcceEeEcCC---CCccc-CCCEEE
Confidence            468899999998542      36999 98854 45788999999999999999999999999984   56886 789999


Q ss_pred             EEeecCCCCCCCCCCCCceEEeCHHHHHHhhcC-------CCceEEEEEEEC
Q 032308           99 RIVDYALQLESTPSVSGTTIVLSETAFRTIANS-------TATLINVEFQQV  143 (143)
Q Consensus        99 ~V~D~Cp~c~~~~~~~~~~lDLS~~AF~~ia~~-------~~G~i~V~w~~V  143 (143)
                      +|||.||++     ++.+|||||+.||++||+.       +.|+|+|+||||
T Consensus        90 ~vtd~c~~~-----~~~~hfDLS~~AF~~iA~~~~~~~~~~~Giv~V~yRrV  136 (143)
T d1n10a2          90 HITDDNEEP-----IAPYHFDLSGHAFGAMAKKGDEQKLRSAGELELQFRRV  136 (143)
T ss_dssp             EEEEECSSC-----SSSSEEEEEHHHHHTTBSTTCHHHHHTTCSEEEEEEEC
T ss_pred             EEEecccCC-----CCCccccCCHHHHhhhccccchhhheecCccceEEEEE
Confidence            999999975     3678999999999999996       479999999998



>d1wc2a1 b.52.1.1 (A:1-180) Endoglucanase (CMCase) {Blue mussel (Mytilus edulis) [TaxId: 6550]} Back     information, alignment and structure
>d1bw3a_ b.52.1.2 (A:) Barwin {Barley (Hordeum vulgare) [TaxId: 4513]} Back     information, alignment and structure
>d2enga_ b.52.1.1 (A:) Endoglucanase V (Eng V) {Humicola insolens [TaxId: 34413]} Back     information, alignment and structure