Suppr超能文献

利用 RNA-seq 构建改良的苹果参考转录组。

Towards an improved apple reference transcriptome using RNA-seq.

机构信息

Department of Horticulture, Cornell University, New York State Agricultural Experiment Station, Geneva, NY, 14456, USA.

出版信息

Mol Genet Genomics. 2014 Jun;289(3):427-38. doi: 10.1007/s00438-014-0819-3. Epub 2014 Feb 16.

Abstract

The reference genome of apple (Malus × domestica) has been available since 2010. Despite being a milestone in apple genomics, the reference genome is difficult to be used as a reference in RNA-seq (RNA sequencing) analysis, a widespread technology in transcriptomic studies. One of the major limitations appears to be the low coverage of the reference transcriptome in RNA-seq mapping of reads. To improve the reference transcriptome, we obtained 14 sets of strand-specific RNA-seq data of 168.5 million reads in total from fruit of Golden Delicious (GD, the source of the reference genome) in varying growth and developmental stages. Using a combination of genome-guided assembly and de novo assembly, the apple reference transcriptome was improved to a collection of 71,178 genes or transcripts, which includes 53,654 genes predicted originally (with MDP prefixed in their IDs) and 17,524 novel transcripts. Of these novel transcripts, 8,144 were identified from reads directly mapped to the reference genome while the remaining 9,380 were extracted from de novo assemblies of reads that could not be initially mapped to the reference genome. Evaluating the improved apple reference transcriptome with reads from Golden Delicious and other genotypes used in this and other studies showed that it allowed 62.5 ± 9.3-82.3 ± 2.7 % of reads to be mapped, a marked increase from the low rates of 37.4 ± 7.7-46.6 ± 7.1 % offered by the original reference transcriptome. The improved reference transcriptome therefore represents a step forward towards a complete reference transcriptome in apple.

摘要

苹果(Malus × domestica)的参考基因组自 2010 年以来就已经可用。尽管这是苹果基因组学的一个里程碑,但该参考基因组在 RNA-seq(RNA 测序)分析中很难作为参考,而 RNA-seq 是转录组研究中广泛使用的技术。其主要限制之一似乎是参考转录本在 RNA-seq 读段映射中的覆盖度低。为了改进参考转录本,我们从不同生长和发育阶段的 Golden Delicious(GD,参考基因组的来源)果实中总共获得了 14 组 1.685 亿条定向 RNA-seq 数据。我们使用基因组指导组装和从头组装的组合,将苹果参考转录本改进为一个包含 71178 个基因或转录本的集合,其中包括最初预测的 53654 个基因(其 ID 以 MDP 为前缀)和 17524 个新转录本。在这些新转录本中,有 8144 个是从直接映射到参考基因组的读段中鉴定出来的,而其余 9380 个是从最初无法映射到参考基因组的读段的从头组装中提取出来的。使用来自 Golden Delicious 和本研究和其他研究中使用的其他基因型的读段评估改进后的苹果参考转录本表明,它允许 62.5±9.3-82.3±2.7%的读段被映射,这比原始参考转录本提供的低比率 37.4±7.7-46.6±7.1%有明显提高。因此,改进后的参考转录本代表了苹果完整参考转录本的一个进步。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验