Gupta Vikas, Estrada April D, Blakley Ivory, Reid Rob, Patel Ketan, Meyer Mason D, Andersen Stig Uggerhøj, Brown Allan F, Lila Mary Ann, Loraine Ann E
Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, North Carolina Research Campus, Kannapolis, NC 28081 USA ; Centre for Carbohydrate Recognition and Signaling, Department of Molecular Biology and Genetics, Aarhus University, Gustav Wieds Vej 10, 8000 Aarhus C, Denmark.
Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, North Carolina Research Campus, Kannapolis, NC 28081 USA.
Gigascience. 2015 Feb 13;4:5. doi: 10.1186/s13742-015-0046-9. eCollection 2015.
Blueberries are a rich source of antioxidants and other beneficial compounds that can protect against disease. Identifying genes involved in synthesis of bioactive compounds could enable the breeding of berry varieties with enhanced health benefits.
Toward this end, we annotated a previously sequenced draft blueberry genome assembly using RNA-Seq data from five stages of berry fruit development and ripening. Genome-guided assembly of RNA-Seq read alignments combined with output from ab initio gene finders produced around 60,000 gene models, of which more than half were similar to proteins from other species, typically the grape Vitis vinifera. Comparison of gene models to the PlantCyc database of metabolic pathway enzymes identified candidate genes involved in synthesis of bioactive compounds, including bixin, an apocarotenoid with potential disease-fighting properties, and defense-related cyanogenic glycosides, which are toxic. Cyanogenic glycoside (CG) biosynthetic enzymes were highly expressed in green fruit, and a candidate CG detoxification enzyme was up-regulated during fruit ripening. Candidate genes for ethylene, anthocyanin, and 400 other biosynthetic pathways were also identified. Homology-based annotation using Blast2GO and InterPro assigned Gene Ontology terms to around 15,000 genes. RNA-Seq expression profiling showed that blueberry growth, maturation, and ripening involve dynamic gene expression changes, including coordinated up- and down-regulation of metabolic pathway enzymes and transcriptional regulators. Analysis of RNA-seq alignments identified developmentally regulated alternative splicing, promoter use, and 3' end formation.
We report genome sequence, gene models, functional annotations, and RNA-Seq expression data that provide an important new resource enabling high throughput studies in blueberry.
蓝莓富含抗氧化剂和其他有益化合物,能够预防疾病。鉴定参与生物活性化合物合成的基因有助于培育具有更高健康效益的浆果品种。
为此,我们利用浆果果实发育和成熟五个阶段的RNA-Seq数据,对之前测序的蓝莓基因组草图进行了注释。RNA-Seq读段比对的基因组引导组装与从头基因预测工具的输出结果相结合,产生了约60,000个基因模型,其中一半以上与其他物种的蛋白质相似,通常是葡萄(Vitis vinifera)。将基因模型与代谢途径酶的PlantCyc数据库进行比较,鉴定出参与生物活性化合物合成的候选基因,包括具有潜在抗病特性的类胡萝卜素酸二异戊烯酯(胭脂树橙),以及与防御相关的生氰糖苷,后者具有毒性。生氰糖苷(CG)生物合成酶在绿色果实中高度表达,一种候选的CG解毒酶在果实成熟过程中上调。还鉴定出了乙烯、花青素和其他400条生物合成途径的候选基因。使用Blast2GO和InterPro基于同源性的注释为约15,000个基因赋予了基因本体论术语。RNA-Seq表达谱分析表明,蓝莓的生长、成熟和成熟过程涉及动态的基因表达变化,包括代谢途径酶和转录调节因子的协同上调和下调。对RNA-seq比对的分析确定了发育调控的可变剪接、启动子使用和3'端形成。
我们报告了基因组序列、基因模型、功能注释和RNA-Seq表达数据,这些为蓝莓的高通量研究提供了重要的新资源。