Improved transcriptome assembly using a hybrid of long and short reads with StringTie.

Affiliations

Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America.

Center for Computational Biology, Johns Hopkins University, Baltimore, Maryland, United States of America.

Publication information

PLoS Comput Biol. 2022 Jun 1;18(6):e1009730. doi: 10.1371/journal.pcbi.1009730. eCollection 2022 Jun.

Abstract

Short-read RNA sequencing and long-read RNA sequencing each have their strengths and weaknesses for transcriptome assembly. While short reads are highly accurate, they are rarely able to span multiple exons. Long-read technology can capture full-length transcripts, but its relatively high error rate often leads to mis-identified splice sites. Here we present a new release of StringTie that performs hybrid-read assembly. By taking advantage of the strengths of both long and short reads, hybrid-read assembly with StringTie is more accurate than long-read only or short-read only assembly, and on some datasets it can more than double the number of correctly assembled transcripts, while obtaining substantially higher precision than the long-read data assembly alone. Here we demonstrate the improved accuracy on simulated data and real data from Arabidopsis thaliana, Mus musculus, and human. We also show that hybrid-read assembly is more accurate than correcting long reads prior to assembly while also being substantially faster. StringTie is freely available as open source software at https://github.com/gpertea/stringtie.
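The abstract reports results as the number of correctly assembled transcripts and as precision. The minimal sketch below shows one common way such transcript-level metrics can be computed. It assumes that an assembled multi-exon transcript counts as correct when its intron chain exactly matches a reference transcript; the abstract does not state the exact matching rule used in the paper. The function names and the toy coordinates are hypothetical and serve only as illustration.

```python
from typing import Iterable, List, Tuple

# Assumed representation: (chromosome, strand, ordered intron intervals).
IntronChain = Tuple[str, str, Tuple[Tuple[int, int], ...]]


def intron_chain(chrom: str, strand: str,
                 exons: List[Tuple[int, int]]) -> IntronChain:
    """Derive the intron chain from 1-based inclusive exon coordinates."""
    exons = sorted(exons)
    introns = tuple(
        (exons[i][1] + 1, exons[i + 1][0] - 1) for i in range(len(exons) - 1)
    )
    return (chrom, strand, introns)


def assembly_accuracy(reference: Iterable[IntronChain],
                      assembled: Iterable[IntronChain]):
    """Return (correct, sensitivity, precision) over multi-exon transcripts.

    Assumption: a transcript is correct only if every splice site matches
    a reference transcript exactly, so a single mis-identified splice site
    makes the whole transcript count as wrong.
    """
    ref = {c for c in reference if c[2]}   # drop single-exon transcripts
    asm = {c for c in assembled if c[2]}
    correct = len(ref & asm)
    sensitivity = correct / len(ref) if ref else 0.0
    precision = correct / len(asm) if asm else 0.0
    return correct, sensitivity, precision


# Toy data (hypothetical coordinates).
reference = [
    intron_chain("chr1", "+", [(100, 200), (300, 400), (500, 600)]),
    intron_chain("chr1", "+", [(100, 200), (500, 600)]),
]
assembled = [
    # Exact match to the first reference transcript.
    intron_chain("chr1", "+", [(100, 200), (300, 400), (500, 600)]),
    # Same structure, but one splice site is shifted by 3 bases.
    intron_chain("chr1", "+", [(100, 203), (300, 400), (500, 600)]),
]

correct, sensitivity, precision = assembly_accuracy(reference, assembled)
print(correct, sensitivity, precision)
```

Under this assumed rule, the second assembled transcript lowers precision even though it differs from the reference by a single splice site, which is the failure mode the abstract attributes to the higher error rate of long reads.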

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d31/9191730/f562931afedc/pcbi.1009730.g001.jpg
