Suppr超能文献

从头组装中读取长度和转录组复杂性的影响:一项模拟研究。

The impacts of read length and transcriptome complexity for de novo assembly: a simulation study.

作者信息

Chang Zheng, Wang Zhenjia, Li Guojun

机构信息

School of Mathematics, Shandong University, Jinan, Shandong, China.

出版信息

PLoS One. 2014 Apr 15;9(4):e94825. doi: 10.1371/journal.pone.0094825. eCollection 2014.

Abstract

Transcriptome assembly using RNA-seq data - particularly in non-model organisms has been dramatically improved, but only recently have the pre-assembly procedures, such as sequencing depth and error correction, been studied. Increasing read length is viewed as a crucial condition to further improve transcriptome assembly, but it is unknown whether the read length really matters. In addition, though many assembly tools are available now, it is unclear whether the existing assemblers perform well enough for all data with different transcriptome complexities. In this paper, we studied these two open problems using two high-performing assemblers, Velvet/Oases and Trinity, on several simulated datasets of human, mouse and S.cerevisiae. The results suggest that (1) the read length of paired reads does not matter once it exceeds a certain threshold, and interestingly, the threshold is distinct in different organisms; (2) the quality of de novo assembly decreases sharply with the increase of transcriptome complexity, all existing de novo assemblers tend to corrupt whenever the genes contain a large number of alternative splicing events.

摘要

使用RNA测序数据进行转录组组装——尤其是在非模式生物中,已经有了显著改进,但直到最近才开始研究诸如测序深度和错误校正等预组装程序。增加读长被视为进一步改善转录组组装的关键条件,但读长是否真的重要尚不清楚。此外,尽管现在有许多组装工具,但目前尚不清楚现有的组装程序对于所有具有不同转录组复杂性的数据是否都能表现良好。在本文中,我们使用两个高性能组装程序Velvet/Oases和Trinity,在几个关于人类、小鼠和酿酒酵母的模拟数据集上研究了这两个未解决的问题。结果表明:(1)一旦双端读长超过某个阈值,其长度就无关紧要了,有趣的是,不同生物体中的阈值各不相同;(2)随着转录组复杂性的增加,从头组装的质量会急剧下降,每当基因包含大量可变剪接事件时,所有现有的从头组装程序都容易出错。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f38/3988101/f4d88a6b6486/pone.0094825.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验