Citrus Sinensis ID: 039446


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140------
PNHSSVSASNIITSLHLNKQMGKHQSRSLMQGHLKKAVKKLHMILSRIKLSKWNLASICRAGFSQSRRSLSFNDRLGLQYCIVDRDPDEGSSVRVLQRTKSCAYDGDDDIDRRAEIFITNFRRQLLYERQVSLELRYRRGNSFDEY
ccccccccccHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHccccHHHHHHHHHHcccccccccccccccccEEEEccccccccccccEEEEEEccccccccHHHHHHHHHHHHHHHHHHHHHHccEEEEEEcccccccc
cccccccHHHHHHHHHHHHHccHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccEEEcccccccccccccccccccccccEEEEEcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHEEHHHHccccHccc
PNHSSVSASNIITSLHLNKQMGKHQSRSLMQGHLKKAVKKLHMILSRIKLSKWNLASICRAgfsqsrrslsfndrlglqycivdrdpdegssvRVLQRTkscaydgdddiDRRAEIFITNFRRQLLYERQVSLELRYRRGNSFDEY
PNHSSVSASNIITSLHLNKQMGKHQSRSLMQGHLKKAVKKLHMILSRIKLSKWNLASICRAGfsqsrrslsfndRLGLQYCIvdrdpdegssvrvlqrtkscaydgdddidRRAEIFITnfrrqllyerqvslelryrrgnsfdey
PNHSSVSASNIITSLHLNKQMGKHQSRSLMQGHLKKAVKKLHMILSRIKLSKWNLASICRAGFSQSRRSLSFNDRLGLQYCIVDRDPDEGSSVRVLQRTKSCAYDGDDDIDRRAEIFITNFRRQLLYERQVSLELRYRRGNSFDEY
*********************************LKKAVKKLHMILSRIKLSKWNLASICRAGFSQSRRSLSFNDRLGLQYCIVDRD******VRVLQRTKSCAYDGDDDIDRRAEIFITNFRRQLLYERQVSLELRY*********
**********IIT********************LKKAVKKLHMILSRIKLSKWNL********************************************************RRAEIFITNFRRQLLYERQVSLELRYRRG******
*********NIITSLHLN***********MQGHLKKAVKKLHMILSRIKLSKWNLASICRAGFSQSRRSLSFNDRLGLQYCIVDRDPDEGSSVRVLQRTKSCAYDGDDDIDRRAEIFITNFRRQLLYERQVSLELRYRRGNSFDEY
*****VSASNIITSLHLNKQMGKHQSRSLMQGHLKKAVKKLHMILSRIKLSKWNLASICRAGFSQSRRSLSFNDRLGLQYCIV**********RVLQRTKSCAYDGDDDIDRRAEIFITNFRRQLLYERQVSLELRYRRGNSF***
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
PNHSSVSASNIITSLHLNKQMGKHQSRSLMQGHLKKAVKKLHMILSRIKLSKWNLASICRAGFSQSRRSLSFNDRLGLQYCIVDRDPDEGSSVRVLQRTKSCAYDGDDDIDRRAEIFITNFRRQLLYERQVSLELRYRRGNSFDEY
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query146
255548568126 conserved hypothetical protein [Ricinus 0.821 0.952 0.568 3e-25
356562451134 PREDICTED: uncharacterized protein LOC10 0.815 0.888 0.484 4e-21
225446076128 PREDICTED: uncharacterized protein LOC10 0.815 0.929 0.472 8e-19
356547332132 PREDICTED: uncharacterized protein LOC10 0.719 0.795 0.508 2e-18
357479721132 hypothetical protein MTR_4g128470 [Medic 0.773 0.856 0.495 7e-18
388514623132 unknown [Medicago truncatula] 0.698 0.772 0.523 2e-17
359485093128 PREDICTED: uncharacterized protein LOC10 0.815 0.929 0.464 3e-17
224121176128 predicted protein [Populus trichocarpa] 0.815 0.929 0.515 3e-17
297796651130 hypothetical protein ARALYDRAFT_918928 [ 0.828 0.930 0.455 2e-16
255646825132 unknown [Glycine max] 0.719 0.795 0.491 7e-16
>gi|255548568|ref|XP_002515340.1| conserved hypothetical protein [Ricinus communis] gi|223545284|gb|EEF46789.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  119 bits (299), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/125 (56%), Positives = 85/125 (68%), Gaps = 5/125 (4%)

Query: 21  MGKH-QSRSLMQGHLKKAVKKLHMILSRIKLSKWNLASICRAGFSQSRRSL-SFNDRLGL 78
           MG+  Q    + GHLKKAVKKL+ +L R  L  W +ASI R    + +  L SFNDRLGL
Sbjct: 1   MGRQRQQNWTVLGHLKKAVKKLNFLL-RFNLRGWRIASIVRNVSKRPQLRLKSFNDRLGL 59

Query: 79  QYCIVDRDPDEGSSVRVLQRTKSCAYDGDDDIDRRAEIFITNFRRQLLYERQVSLELRYR 138
             CI D + D    V+ LQRT+S  Y  D+DID+RAEIFI NFRRQLL ERQVSL++RY 
Sbjct: 60  HGCIEDLESDRSEMVKTLQRTRS--YASDEDIDQRAEIFIANFRRQLLLERQVSLQVRYY 117

Query: 139 RGNSF 143
           RGNSF
Sbjct: 118 RGNSF 122




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356562451|ref|XP_003549485.1| PREDICTED: uncharacterized protein LOC100798434 [Glycine max] Back     alignment and taxonomy information
>gi|225446076|ref|XP_002272519.1| PREDICTED: uncharacterized protein LOC100264996 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356547332|ref|XP_003542068.1| PREDICTED: uncharacterized protein LOC100814899 [Glycine max] Back     alignment and taxonomy information
>gi|357479721|ref|XP_003610146.1| hypothetical protein MTR_4g128470 [Medicago truncatula] gi|355511201|gb|AES92343.1| hypothetical protein MTR_4g128470 [Medicago truncatula] Back     alignment and taxonomy information
>gi|388514623|gb|AFK45373.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|359485093|ref|XP_003633214.1| PREDICTED: uncharacterized protein LOC100259802 [Vitis vinifera] Back     alignment and taxonomy information
>gi|224121176|ref|XP_002330762.1| predicted protein [Populus trichocarpa] gi|224121184|ref|XP_002330764.1| predicted protein [Populus trichocarpa] gi|222872564|gb|EEF09695.1| predicted protein [Populus trichocarpa] gi|222872566|gb|EEF09697.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297796651|ref|XP_002866210.1| hypothetical protein ARALYDRAFT_918928 [Arabidopsis lyrata subsp. lyrata] gi|297312045|gb|EFH42469.1| hypothetical protein ARALYDRAFT_918928 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|255646825|gb|ACU23884.1| unknown [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query146
TAIR|locus:2174537130 AT5G57510 "AT5G57510" [Arabido 0.821 0.923 0.451 5.1e-19
TAIR|locus:2174537 AT5G57510 "AT5G57510" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 228 (85.3 bits), Expect = 5.1e-19, P = 5.1e-19
 Identities = 56/124 (45%), Positives = 74/124 (59%)

Query:    21 MGKHQSRSLMQGHLKKAVKKLHMILSRIKL-SKWNLASICRAGFSQSRRSLSFNDRLGLQ 79
             M K +  SL+   L++AV K+  +LS  K+ S W L  +  +  S S   LSFNDR GL 
Sbjct:     1 MTKSKDGSLLN-RLRQAVNKVKFVLS-FKINSLWGLVPMLGSSSSSSSLRLSFNDRPGLT 58

Query:    80 YCIVDRDPDE-GSSVRVLQRTKSCAYDGDDDIDRRAEIFITNFRRQLLYERQVSLELRYR 138
                 + +PD  GSS   L RT S     D+DID +AE+FI NF RQL  ERQ+SLEL+Y 
Sbjct:    59 AAFTENEPDSTGSSRGALYRTVSYDQSSDEDIDNKAEMFIANFYRQLKIERQISLELKYC 118

Query:   139 RGNS 142
             +GN+
Sbjct:   119 QGNN 122


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.321   0.134   0.387    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      146       146   0.00079  103 3  11 22  0.42    31
                                                     30  0.49    33


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  577 (61 KB)
  Total size of DFA:  138 KB (2086 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  13.72u 0.17s 13.89t   Elapsed:  00:00:01
  Total cpu time:  13.72u 0.17s 13.89t   Elapsed:  00:00:01
  Start:  Sat May 11 02:01:30 2013   End:  Sat May 11 02:01:31 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0071456 "cellular response to hypoxia" evidence=IEP
GO:0010200 "response to chitin" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query146
pfam0555338 pfam05553, DUF761, Cotton fibre expressed protein 1e-05
>gnl|CDD|218633 pfam05553, DUF761, Cotton fibre expressed protein Back     alignment and domain information
 Score = 39.5 bits (93), Expect = 1e-05
 Identities = 19/34 (55%), Positives = 24/34 (70%), Gaps = 1/34 (2%)

Query: 106 GDDDIDRRAEIFITNFRRQLLYERQVSLELRYRR 139
            DD++DRRAE FI  FR QL  +RQ SL+ RY+ 
Sbjct: 1   SDDEVDRRAEEFIARFREQLRLQRQESLK-RYQE 33


This family consists of several plant proteins of unknown function. Three of the sequences (from Gossypium hirsutum) in this family are described as cotton fibre expressed proteins. The remaining sequences, found in Arabidopsis thaliana, are uncharacterized. Length = 38

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 146
PF0555338 DUF761: Cotton fibre expressed protein; InterPro: 99.33
>PF05553 DUF761: Cotton fibre expressed protein; InterPro: IPR008480 This family consists of several plant proteins of unknown function Back     alignment and domain information
Probab=99.33  E-value=4.4e-13  Score=85.08  Aligned_cols=34  Identities=56%  Similarity=0.830  Sum_probs=30.8

Q ss_pred             CCcchhHHHHHHHHHHHHHHhHhhhhccceeeecC
Q 039446          106 GDDDIDRRAEIFITNFRRQLLYERQVSLELRYRRG  140 (146)
Q Consensus       106 ~d~dID~rAe~FIanFr~QlklqRQ~SLqLrY~~~  140 (146)
                      ++++||++||+||++||+||+||||.|+ .+|+++
T Consensus         1 ~~~evd~rAe~FI~~f~~qlrlqr~~S~-~ry~em   34 (38)
T PF05553_consen    1 SDDEVDRRAEEFIAKFREQLRLQRQESL-QRYQEM   34 (38)
T ss_pred             CchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH
Confidence            3679999999999999999999999999 688764



Three of the sequences from Gossypium hirsutum (Upland cotton) in this family are described as G. hirsutum fibre expressed proteins []. The remaining sequences, found in Arabidopsis thaliana, are uncharacterised.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00