• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于从RNA测序数据中识别差异表达转录本的贝叶斯模型选择方法。

A Bayesian model selection approach for identifying differentially expressed transcripts from RNA sequencing data.

作者信息

Papastamoulis Panagiotis, Rattray Magnus

机构信息

University of Manchester UK.

出版信息

J R Stat Soc Ser C Appl Stat. 2018 Jan;67(1):3-23. doi: 10.1111/rssc.12213. Epub 2017 Feb 7.

DOI:10.1111/rssc.12213
PMID:29353941
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5763373/
Abstract

Recent advances in molecular biology allow the quantification of the transcriptome and scoring transcripts as differentially or equally expressed between two biological conditions. Although these two tasks are closely linked, the available inference methods treat them separately: a primary model is used to estimate expression and its output is post processed by using a differential expression model. In the paper, both issues are simultaneously addressed by proposing the joint estimation of expression levels and differential expression: the unknown relative abundance of each transcript can either be equal or not between two conditions. A hierarchical Bayesian model builds on the BitSeq framework and the posterior distribution of transcript expression and differential expression is inferred by using Markov chain Monte Carlo sampling. It is shown that the model proposed enjoys conjugacy for fixed dimension variables; thus the full conditional distributions are analytically derived. Two samplers are constructed, a reversible jump Markov chain Monte Carlo sampler and a collapsed Gibbs sampler, and the latter is found to perform better. A cluster representation of the aligned reads to the transcriptome is introduced, allowing parallel estimation of the marginal posterior distribution of subsets of transcripts under reasonable computing time. Under a fixed prior probability of differential expression the clusterwise sampler has the same marginal posterior distributions as the raw sampler, but a more general prior structure is also employed. The algorithm proposed is benchmarked against alternative methods by using synthetic data sets and applied to real RNA sequencing data. Source code is available on line from https://github.com/mqbssppe/cjBitSeq.

摘要

分子生物学的最新进展使得对转录组进行定量分析,并对转录本在两种生物学条件下的差异表达或等量表达进行评分成为可能。尽管这两项任务紧密相关,但现有的推理方法将它们分开处理:使用一个主模型来估计表达量,其输出结果再通过差异表达模型进行后处理。在本文中,通过提出对表达水平和差异表达的联合估计,同时解决了这两个问题:每个转录本在两种条件下的未知相对丰度可能相等,也可能不相等。一个分层贝叶斯模型建立在BitSeq框架之上,通过马尔可夫链蒙特卡罗采样来推断转录本表达和差异表达的后验分布。结果表明,所提出的模型对于固定维度变量具有共轭性;因此可以解析推导完整的条件分布。构建了两个采样器,一个可逆跳跃马尔可夫链蒙特卡罗采样器和一个塌缩吉布斯采样器,发现后者性能更好。引入了比对到转录组的 reads 的聚类表示,使得在合理的计算时间内能够并行估计转录本子集的边际后验分布。在差异表达的固定先验概率下,聚类采样器与原始采样器具有相同的边际后验分布,但也采用了更一般的先验结构。所提出的算法通过使用合成数据集与其他方法进行基准测试,并应用于真实的 RNA 测序数据。源代码可从 https://github.com/mqbssppe/cjBitSeq 在线获取。

相似文献

1
A Bayesian model selection approach for identifying differentially expressed transcripts from RNA sequencing data.一种用于从RNA测序数据中识别差异表达转录本的贝叶斯模型选择方法。
J R Stat Soc Ser C Appl Stat. 2018 Jan;67(1):3-23. doi: 10.1111/rssc.12213. Epub 2017 Feb 7.
2
Identifying differentially expressed transcripts from RNA-seq data with biological variation.从具有生物学变异的 RNA-seq 数据中鉴定差异表达的转录本。
Bioinformatics. 2012 Jul 1;28(13):1721-8. doi: 10.1093/bioinformatics/bts260. Epub 2012 May 3.
3
Performance of Hamiltonian Monte Carlo and No-U-Turn Sampler for estimating genetic parameters and breeding values.汉密尔顿蒙特卡罗法和无回转抽样器在估计遗传参数和育种值中的性能。
Genet Sel Evol. 2019 Dec 10;51(1):73. doi: 10.1186/s12711-019-0515-1.
4
Particle Gibbs sampling for Bayesian phylogenetic inference.粒子 Gibbs 抽样贝叶斯系统发育推断。
Bioinformatics. 2021 May 5;37(5):642-649. doi: 10.1093/bioinformatics/btaa867.
5
Fast and accurate approximate inference of transcript expression from RNA-seq data.从RNA测序数据中快速准确地进行转录本表达的近似推断。
Bioinformatics. 2015 Dec 15;31(24):3881-9. doi: 10.1093/bioinformatics/btv483. Epub 2015 Aug 26.
6
Bayesian Inference for Mixed Model-Based Genome-Wide Analysis of Expression Quantitative Trait Loci by Gibbs Sampling.基于吉布斯抽样的混合模型全基因组表达数量性状位点分析的贝叶斯推断
Front Genet. 2019 Mar 22;10:199. doi: 10.3389/fgene.2019.00199. eCollection 2019.
7
Use of the reversible jump Markov chain Monte Carlo algorithm to select multiplicative terms in the AMMI-Bayesian model.使用可交换跳跃马尔可夫链蒙特卡罗算法在 AMMI-Bayesian 模型中选择乘法项。
PLoS One. 2023 Jan 3;18(1):e0279537. doi: 10.1371/journal.pone.0279537. eCollection 2023.
8
Improved variational Bayes inference for transcript expression estimation.用于转录本表达估计的改进变分贝叶斯推理
Stat Appl Genet Mol Biol. 2014 Apr 1;13(2):203-16. doi: 10.1515/sagmb-2013-0054.
9
A Gibbs Sampler for Learning DAGs.一种用于学习有向无环图的吉布斯采样器。
J Mach Learn Res. 2016 Apr;17(30):1-39.
10
Estimating CDMs Using the Slice-Within-Gibbs Sampler.使用吉布斯切片采样器估计CDM
Front Psychol. 2020 Sep 25;11:2260. doi: 10.3389/fpsyg.2020.02260. eCollection 2020.

引用本文的文献

1
BANDITS: Bayesian differential splicing accounting for sample-to-sample variability and mapping uncertainty.BANDITS:贝叶斯差异剪接考虑了样本间的可变性和映射不确定性。
Genome Biol. 2020 Mar 16;21(1):69. doi: 10.1186/s13059-020-01967-8.
2
The rise of the distributions: why non-normality is important for understanding the transcriptome and beyond.分布的兴起:为何非正态性对于理解转录组及其他方面至关重要。
Biophys Rev. 2019 Feb;11(1):89-94. doi: 10.1007/s12551-018-0494-4. Epub 2019 Jan 7.

本文引用的文献

1
Fast and accurate approximate inference of transcript expression from RNA-seq data.从RNA测序数据中快速准确地进行转录本表达的近似推断。
Bioinformatics. 2015 Dec 15;31(24):3881-9. doi: 10.1093/bioinformatics/btv483. Epub 2015 Aug 26.
2
MetaDiff: differential isoform expression analysis using random-effects meta-regression.MetaDiff:使用随机效应元回归进行差异异构体表达分析。
BMC Bioinformatics. 2015 Jul 2;16:208. doi: 10.1186/s12859-015-0623-z.
3
BADGE: a novel Bayesian model for accurate abundance quantification and differential analysis of RNA-Seq data.
标记:一种用于 RNA-Seq 数据精确丰度定量和差异分析的新型贝叶斯模型。
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S6. doi: 10.1186/1471-2105-15-S9-S6. Epub 2014 Sep 10.
4
QUANTIFYING ALTERNATIVE SPLICING FROM PAIRED-END RNA-SEQUENCING DATA.从双端RNA测序数据中定量可变剪接
Ann Appl Stat. 2014 Mar;8(1):309-330. doi: 10.1214/13-aoas687.
5
Improved variational Bayes inference for transcript expression estimation.用于转录本表达估计的改进变分贝叶斯推理
Stat Appl Genet Mol Biol. 2014 Apr 1;13(2):203-16. doi: 10.1515/sagmb-2013-0054.
6
Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki).利用剪接分析试剂盒(Spanki)对果蝇头部 RNA-Seq 数据进行事后过滤的 RNA 剪接分析零模型设计。
BMC Bioinformatics. 2013 Nov 9;14:320. doi: 10.1186/1471-2105-14-320.
7
TIGAR: transcript isoform abundance estimation method with gapped alignment of RNA-Seq data by variational Bayesian inference.TIGAR:一种通过变分贝叶斯推断进行 RNA-Seq 数据缺口对齐的转录本丰度估计方法。
Bioinformatics. 2013 Sep 15;29(18):2292-9. doi: 10.1093/bioinformatics/btt381. Epub 2013 Jul 2.
8
EBSeq: an empirical Bayes hierarchical model for inference in RNA-seq experiments.EBSeq:RNA-seq 实验中用于推理的经验贝叶斯层次模型。
Bioinformatics. 2013 Apr 15;29(8):1035-43. doi: 10.1093/bioinformatics/btt087. Epub 2013 Feb 21.
9
Differential analysis of gene regulation at transcript resolution with RNA-seq.基于 RNA-seq 的转录分辨率下基因调控的差异分析。
Nat Biotechnol. 2013 Jan;31(1):46-53. doi: 10.1038/nbt.2450. Epub 2012 Dec 9.
10
Identifying differentially expressed transcripts from RNA-seq data with biological variation.从具有生物学变异的 RNA-seq 数据中鉴定差异表达的转录本。
Bioinformatics. 2012 Jul 1;28(13):1721-8. doi: 10.1093/bioinformatics/bts260. Epub 2012 May 3.