Citrus Sinensis ID: 034497


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90---
MASKFSNSFLLLRFCRRSNAVAAGNVGGKQPAAAEVMRKMTDMKAESVCGVEKEEFWMRDPNTGNWIPESQFNQIDAADLRDQLLPKIKKNDS
ccccHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccHHHHHHHHccccccccc
ccccccHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHcccccEEEEccccccccccccccccccHHHHHHHHHHHcccccc
MASKFSNSFLLLRFCRRSNAvaagnvggkqpAAAEVMRKMTDMkaesvcgvekeefwmrdpntgnwipesqfnqidaADLRDQllpkikknds
MASKFSNSFLLLRFCRRSNAvaagnvggkqpAAAEVMRKMTDMKAESVCGVEKEEFWMRDPNTGNWIPESQFNQIDAADLRDqllpkikknds
MASKFSNSFLLLRFCRRSNAVAAGNVGGKQPAAAEVMRKMTDMKAESVCGVEKEEFWMRDPNTGNWIPESQFNQIDAADLRDQLLPKIKKNDS
*******SFLLLRFCRRSNAVAAGN*********************SVCGVEKEEFWMRDPNTGNWIPESQFN********************
********FLLLRFCRRSNA************************************WMRDPNTGNWIPESQFNQIDAADLR************
********FLLLRFCRRSNAVAAGNVGGKQPAAAEVMRKMTDMKAESVCGVEKEEFWMRDPNTGNWIPESQFNQIDAADLRDQLLPKIKKNDS
****FSNSFLLLRFCRRSNAVAAGNVGGK***********************KEEFWMRDPNTGNWIPESQFNQIDAADLRDQLLPK******
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
SSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MASKFSNSFLLLRFCRRSNAVAAGNVGGKQPAAAEVMRKMTDMKAESVCGVEKEEFWMRDPNTGNWIPESQFNQIDAADLRDQLLPKIKKNDS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query93
25558444891 conserved hypothetical protein [Ricinus 0.860 0.879 0.5 6e-12
35948289793 PREDICTED: uncharacterized protein LOC10 0.827 0.827 0.435 4e-11
35172087693 uncharacterized protein LOC100305512 [Gl 0.666 0.666 0.478 4e-10
35714261886 PREDICTED: late embryogenesis abundant p 0.731 0.790 0.428 2e-08
24203553595 hypothetical protein SORBIDRAFT_01g03307 0.913 0.894 0.369 1e-07
22407100957 predicted protein [Populus trichocarpa] 0.548 0.894 0.538 2e-07
35711968088 PREDICTED: uncharacterized protein LOC10 0.365 0.386 0.676 3e-07
125561745105 hypothetical protein OsI_29438 [Oryza sa 0.763 0.676 0.472 4e-07
42407625105 hypothetical protein [Oryza sativa Japon 0.763 0.676 0.472 8e-07
35751429397 hypothetical protein MTR_8g022990 [Medic 0.838 0.804 0.392 9e-07
>gi|255584448|ref|XP_002532955.1| conserved hypothetical protein [Ricinus communis] gi|223527284|gb|EEF29439.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score = 74.7 bits (182), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 43/86 (50%), Positives = 57/86 (66%), Gaps = 6/86 (6%)

Query: 7  NSFLLLRFCRRSNAVAAGNVGGKQPAAAEVMRKMTDMKAESVC--GVEKEEFWMRDPNTG 64
          ++FLLL   +RS AVA  ++   QP  + VMRK  ++ AE +    V+ E FWMRDP TG
Sbjct: 7  SNFLLL--SKRSYAVATESLNV-QPLLS-VMRKAQELAAEELAEKKVKDETFWMRDPKTG 62

Query: 65 NWIPESQFNQIDAADLRDQLLPKIKK 90
          NWIPES F +IDAADLR + L + +K
Sbjct: 63 NWIPESHFGEIDAADLRRKFLSRKEK 88




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359482897|ref|XP_003632857.1| PREDICTED: uncharacterized protein LOC100855396 [Vitis vinifera] Back     alignment and taxonomy information
>gi|351720876|ref|NP_001236423.1| uncharacterized protein LOC100305512 [Glycine max] gi|255625741|gb|ACU13215.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|357142618|ref|XP_003572634.1| PREDICTED: late embryogenesis abundant protein Lea5-like [Brachypodium distachyon] Back     alignment and taxonomy information
>gi|242035535|ref|XP_002465162.1| hypothetical protein SORBIDRAFT_01g033070 [Sorghum bicolor] gi|241919016|gb|EER92160.1| hypothetical protein SORBIDRAFT_01g033070 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|224071009|ref|XP_002303328.1| predicted protein [Populus trichocarpa] gi|222840760|gb|EEE78307.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|357119680|ref|XP_003561563.1| PREDICTED: uncharacterized protein LOC100826408 [Brachypodium distachyon] Back     alignment and taxonomy information
>gi|125561745|gb|EAZ07193.1| hypothetical protein OsI_29438 [Oryza sativa Indica Group] Back     alignment and taxonomy information
>gi|42407625|dbj|BAD08740.1| hypothetical protein [Oryza sativa Japonica Group] gi|125603617|gb|EAZ42942.1| hypothetical protein OsJ_27532 [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|357514293|ref|XP_003627435.1| hypothetical protein MTR_8g022990 [Medicago truncatula] gi|355521457|gb|AET01911.1| hypothetical protein MTR_8g022990 [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query93
TAIR|locus:213327997 SAG21 "AT4G02380" [Arabidopsis 0.860 0.824 0.359 6.3e-05
TAIR|locus:2133279 SAG21 "AT4G02380" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 95 (38.5 bits), Expect = 6.3e-05, P = 6.3e-05
 Identities = 32/89 (35%), Positives = 45/89 (50%)

Query:     1 MASKFSNSFLLLRFCRRSNAVAA--GNV--GGKQPAAAEVMRKMTDMKAESVCGVEKEEF 56
             + S F +  L     RR  A  A  G+V  GG+  A A  + K   ++ ES   +     
Sbjct:    10 IVSAFVSRELSNAIFRRGYAATAAQGSVSSGGRSGAVASAVMKKKGVE-ESTQKIS---- 64

Query:    57 WMRDPNTGNWIPESQFNQIDAADLRDQLL 85
             W+ DP TG + PE+  N+IDAA+LR  LL
Sbjct:    65 WVPDPKTGYYRPETGSNEIDAAELRAALL 93


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.317   0.129   0.384    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0       93        93   0.00091  102 3  11 22  0.36    30
                                                     29  0.41    31


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  548 (58 KB)
  Total size of DFA:  111 KB (2074 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  9.19u 0.17s 9.36t   Elapsed:  00:00:00
  Total cpu time:  9.19u 0.17s 9.36t   Elapsed:  00:00:00
  Start:  Thu May  9 17:17:01 2013   End:  Thu May  9 17:17:01 2013


GO:0003674 "molecular_function" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0009790 "embryo development" evidence=ISS
GO:0000302 "response to reactive oxygen species" evidence=IEP
GO:0009409 "response to cold" evidence=IEP;RCA
GO:0006979 "response to oxidative stress" evidence=RCA;IMP
GO:0042631 "cellular response to water deprivation" evidence=IEP;RCA
GO:0000165 "MAPK cascade" evidence=RCA
GO:0006612 "protein targeting to membrane" evidence=RCA
GO:0007154 "cell communication" evidence=RCA
GO:0009611 "response to wounding" evidence=RCA
GO:0009738 "abscisic acid mediated signaling pathway" evidence=RCA
GO:0009862 "systemic acquired resistance, salicylic acid mediated signaling pathway" evidence=RCA
GO:0009867 "jasmonic acid mediated signaling pathway" evidence=RCA
GO:0010363 "regulation of plant-type hypersensitive response" evidence=RCA
GO:0016036 "cellular response to phosphate starvation" evidence=RCA
GO:0019375 "galactolipid biosynthetic process" evidence=RCA
GO:0030968 "endoplasmic reticulum unfolded protein response" evidence=RCA
GO:0031348 "negative regulation of defense response" evidence=RCA
GO:0043069 "negative regulation of programmed cell death" evidence=RCA
GO:0050832 "defense response to fungus" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query93
pfam0324294 pfam03242, LEA_3, Late embryogenesis abundant prot 2e-15
>gnl|CDD|217449 pfam03242, LEA_3, Late embryogenesis abundant protein Back     alignment and domain information
 Score = 65.2 bits (159), Expect = 2e-15
 Identities = 30/96 (31%), Positives = 38/96 (39%), Gaps = 14/96 (14%)

Query: 1  MASKFSNSFLLLRF---------CRR--SNAVAAGNVGGKQPAAAEVMRKMTDMKAESVC 49
          MA   SN+ LL             RR  + A AA   G  +  AA            S  
Sbjct: 1  MARSLSNAKLLSALVADGVSAAVHRRGYAAAAAAMRSGATRGGAAREAEMRPKAGEGSG- 59

Query: 50 GVEKEEFWMRDPNTGNWIPESQFNQIDAADLRDQLL 85
                 W+ DP TG + PE+  N+ID A+LR  LL
Sbjct: 60 --AATSSWVPDPVTGYYRPENHANEIDPAELRAMLL 93


Members of this family are similar to late embryogenesis abundant proteins. Members of the family have been isolated in a number of different screens. However, the molecular function of these proteins remains obscure. Length = 94

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 93
PF0324293 LEA_3: Late embryogenesis abundant protein; InterP 100.0
PF0273642 Myosin_N: Myosin N-terminal SH3-like domain; Inter 86.61
>PF03242 LEA_3: Late embryogenesis abundant protein; InterPro: IPR004926 Late-embryogenesis abundant (LEA) genes encode a diverse group of proteins that accumulate to high levels during the maturation phase of seed development [] Back     alignment and domain information
Probab=100.00  E-value=1.1e-38  Score=216.88  Aligned_cols=83  Identities=36%  Similarity=0.500  Sum_probs=61.4

Q ss_pred             Ccccchh---------hhhhhhhhhhhhhHhhhcccCCCchHH-HHHHHhhhhhhhhccccccccccccCCCCccccCCC
Q 034497            1 MASKFSN---------SFLLLRFCRRSNAVAAGNVGGKQPAAA-EVMRKMTDMKAESVCGVEKEEFWMRDPNTGNWIPES   70 (93)
Q Consensus         1 MArs~sn---------~~lsl~~~RRgYaaAA~~~a~~~~~~~-~~a~~~~~~~~~~~~~~~~~~~WmpDP~TG~y~Pen   70 (93)
                      |||||||         +.|+++++||||+++++.  +.+++.. .+.++. +...+.++..++++|||||||||||||||
T Consensus         1 MArsls~ak~lsal~~~~~s~~~~rRgYaaaA~~--~~~sa~r~g~~~~~-~~~~~~~~~~~~~~~W~pDPvTGyyrPen   77 (93)
T PF03242_consen    1 MARSLSNAKVLSALVSDAFSSLISRRGYAAAAAA--VRSSAARGGAAGKA-KMASKAGEDSKEKSSWMPDPVTGYYRPEN   77 (93)
T ss_pred             CchhHHHHHHHHHHHHHHHHHHHHhhhHHHHHHH--HHhhcccccccccc-hhhhhcccccccccccccCCCCccccCCC
Confidence            9999999         567888899999999984  2221100 011111 11122233478899999999999999999


Q ss_pred             CcCcCCHHHHHHhhCC
Q 034497           71 QFNQIDAADLRDQLLP   86 (93)
Q Consensus        71 ~~~e~D~aeLR~~lL~   86 (93)
                      ||+|||+||||++||+
T Consensus        78 ~~~EiD~AeLR~~lL~   93 (93)
T PF03242_consen   78 HFGEIDAAELRAKLLK   93 (93)
T ss_pred             CCCCCCHHHHHHHHhC
Confidence            9999999999999996



This group includes LEA-5 [], whose expression is induced by salt, drought and heat stress [], and related proteins. ; GO: 0006950 response to stress

>PF02736 Myosin_N: Myosin N-terminal SH3-like domain; InterPro: IPR004009 This domain has an SH3-like fold Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query93
d2mysa146 Myosin S1 fragment, N-terminal domain {Chicken (Ga 83.88
>d2mysa1 b.34.3.1 (A:34-79) Myosin S1 fragment, N-terminal domain {Chicken (Gallus gallus), pectoral muscle [TaxId: 9031]} Back     information, alignment and structure
class: All beta proteins
fold: SH3-like barrel
superfamily: Myosin S1 fragment, N-terminal domain
family: Myosin S1 fragment, N-terminal domain
domain: Myosin S1 fragment, N-terminal domain
species: Chicken (Gallus gallus), pectoral muscle [TaxId: 9031]
Probab=83.88  E-value=0.14  Score=28.94  Aligned_cols=14  Identities=14%  Similarity=0.249  Sum_probs=11.7

Q ss_pred             cccccccCCCCccc
Q 034497           53 KEEFWMRDPNTGNW   66 (93)
Q Consensus        53 ~~~~WmpDP~TG~y   66 (93)
                      ++..|||||+-||=
T Consensus         2 Kk~vwVpD~ke~yv   15 (46)
T d2mysa1           2 KSSVFVVHPKQSFV   15 (46)
T ss_dssp             TTEEEEECSSSSEE
T ss_pred             CceEEcccCcccEE
Confidence            46679999999984