Suppr超能文献

贝叶斯非参数发现同种型和个体特异性定量。

Bayesian nonparametric discovery of isoforms and individual specific quantification.

机构信息

Department of Computer Science, Princeton University, Princeton, NJ, 08540, USA.

Department of Electrical Engineering, Princeton University, Princeton, NJ, 08540, USA.

出版信息

Nat Commun. 2018 Apr 27;9(1):1681. doi: 10.1038/s41467-018-03402-w.

Abstract

Most human protein-coding genes can be transcribed into multiple distinct mRNA isoforms. These alternative splicing patterns encourage molecular diversity, and dysregulation of isoform expression plays an important role in disease etiology. However, isoforms are difficult to characterize from short-read RNA-seq data because they share identical subsequences and occur in different frequencies across tissues and samples. Here, we develop BIISQ, a Bayesian nonparametric model for isoform discovery and individual specific quantification from short-read RNA-seq data. BIISQ does not require isoform reference sequences but instead estimates an isoform catalog shared across samples. We use stochastic variational inference for efficient posterior estimates and demonstrate superior precision and recall for simulations compared to state-of-the-art isoform reconstruction methods. BIISQ shows the most gains for low abundance isoforms, with 36% more isoforms correctly inferred at low coverage versus a multi-sample method and 170% more versus single-sample methods. We estimate isoforms in the GEUVADIS RNA-seq data and validate inferred isoforms by associating genetic variants with isoform ratios.

摘要

大多数人类蛋白编码基因可以转录为多个不同的 mRNA 亚型。这些选择性剪接模式促进了分子多样性,并且亚型表达的失调在疾病发病机制中起着重要作用。然而,由于亚型在不同组织和样本中的出现频率不同,并且具有相同的子序列,因此从短读长 RNA-seq 数据中很难对其进行特征描述。在这里,我们开发了 BIISQ,这是一种用于从短读长 RNA-seq 数据中发现亚型和个体特异性定量的贝叶斯非参数模型。BIISQ 不需要亚型参考序列,而是估计跨样本共享的亚型目录。我们使用随机变分推断进行有效的后验估计,并证明与最先进的亚型重构方法相比,模拟具有更高的精度和召回率。BIISQ 对低丰度亚型的增益最大,在低覆盖度下正确推断的亚型比多样本方法多 36%,比单样本方法多 170%。我们在 GEUVADIS RNA-seq 数据中估计亚型,并通过将遗传变异与亚型比率相关联来验证推断的亚型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/5923247/68502dcd634d/41467_2018_3402_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验