Analysis of Proteins in the Citrus sinensis Genome
Welcome to our database of protein analyses for the Citrus sinensis genome. The protein sequences were downloaded from the Citrus Genome Database.
We first performed a simple analysis of the Citrus sinensis genome assembly (v1.0), which is 319 Mb in size. The assembly contains 12,574 scaffolds, none shorter than 1,000 bp: the shortest scaffold is 1,992 bp and the longest is 5,927,163 bp. The N50 statistic is 250,548 bp. There are 46,147 transcripts, including 20,771 alternative transcripts.
We further analyzed these transcripts by predicting local sequence features, detecting sequence similarity to entries in known databases, predicting Gene Ontology (GO) terms and Enzyme Commission (EC) classes, and predicting structure. For each protein, we generated a webpage that presents the complete analysis. (For example, see the results for protein 1g002345m, predicted to be a small RNA 2'-O-methyltransferase.)
We summarize the annotation results for all proteins to make the whole genome easier to browse, and searching by name or keyword is also supported. In addition, we grouped the Citrus sinensis proteins by GO categorization and by Clusters of Orthologous Groups (COG) classification. These pages will be updated as the analysis proceeds. We hope these results and summaries will be helpful for your research.
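For readers unfamiliar with the N50 statistic quoted above, the following is a minimal sketch of how it is conventionally computed: sort the scaffold lengths from longest to shortest and report the length at which the running total first reaches half of the assembly size. The function name `n50` and the toy lengths are illustrative assumptions, not the actual Citrus sinensis scaffold data or the pipeline used for this database.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths (in bp).

    N50 is the length L such that scaffolds of length >= L together
    cover at least half of the total assembly size.
    """
    lengths = list(lengths)
    if not lengths:
        raise ValueError("need at least one scaffold length")
    total = sum(lengths)
    running = 0
    # Walk from the longest scaffold down, accumulating covered bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy example (hypothetical lengths, not the real assembly).
toy_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(toy_lengths))  # prints 8: 10 + 9 + 8 = 27, half of the total 54
```

Applied to the real list of scaffold lengths, the same calculation would be expected to give the N50 value reported on this page.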
| Protein annotation | Keyword Search | GO Categorization | Functional Classification |