Suppr超能文献

使用 Bambu 从长读 RNA-seq 数据中进行上下文感知的转录本定量。

Context-aware transcript quantification from long-read RNA-seq data with Bambu.

机构信息

Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore.

Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Republic of Singapore.

出版信息

Nat Methods. 2023 Aug;20(8):1187-1195. doi: 10.1038/s41592-023-01908-w. Epub 2023 Jun 12.

Abstract

Most approaches to transcript quantification rely on fixed reference annotations; however, the transcriptome is dynamic and depending on the context, such static annotations contain inactive isoforms for some genes, whereas they are incomplete for others. Here we present Bambu, a method that performs machine-learning-based transcript discovery to enable quantification specific to the context of interest using long-read RNA-sequencing. To identify novel transcripts, Bambu estimates the novel discovery rate, which replaces arbitrary per-sample thresholds with a single, interpretable, precision-calibrated parameter. Bambu retains the full-length and unique read counts, enabling accurate quantification in presence of inactive isoforms. Compared to existing methods for transcript discovery, Bambu achieves greater precision without sacrificing sensitivity. We show that context-aware annotations improve quantification for both novel and known transcripts. We apply Bambu to quantify isoforms from repetitive HERVH-LTR7 retrotransposons in human embryonic stem cells, demonstrating the ability for context-specific transcript expression analysis.

摘要

大多数转录本定量方法都依赖于固定的参考注释; 然而,转录组是动态的,并且根据上下文,这些静态注释对于某些基因包含非活性异构体,而对于其他基因则不完整。在这里,我们介绍了 Bambu,这是一种基于机器学习的转录本发现方法,可使用长读 RNA 测序实现针对感兴趣上下文的定量分析。为了识别新的转录本,Bambu 估计了新的发现率,该方法用一个可解释的、经过精确校准的参数替代了任意的每个样本阈值。Bambu 保留了全长和唯一的读取计数,可在存在非活性异构体的情况下实现准确的定量。与现有的转录本发现方法相比,Bambu 在不牺牲敏感性的情况下实现了更高的精度。我们表明,上下文感知注释可提高新型和已知转录本的定量分析。我们应用 Bambu 对人类胚胎干细胞中重复的 HERVH-LTR7 逆转录转座子的异构体进行定量,展示了针对特定上下文的转录本表达分析的能力。

相似文献

7
Transcript Identification Through Long-Read Sequencing.通过长读测序进行转录本鉴定。
Methods Mol Biol. 2021;2284:531-541. doi: 10.1007/978-1-0716-1307-8_29.

引用本文的文献

本文引用的文献

2
Accurate isoform discovery with IsoQuant using long reads.利用长读长 IsoQuant 进行准确的异构体发现。
Nat Biotechnol. 2023 Jul;41(7):915-918. doi: 10.1038/s41587-022-01565-y. Epub 2023 Jan 2.
7
LIQA: long-read isoform quantification and analysis.LIQA:长读 isoform 定量分析。
Genome Biol. 2021 Jun 17;22(1):182. doi: 10.1186/s13059-021-02399-8.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验