• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

方差自适应收缩(vash):方差的灵活经验贝叶斯估计

Variance adaptive shrinkage (vash): flexible empirical Bayes estimation of variances.

作者信息

Lu Mengyin, Stephens Matthew

机构信息

Department of Statistics, University of Chicago, Chicago, 60637, USA.

Department of Human Genetics, University of Chicago, Chicago, 60637, USA.

出版信息

Bioinformatics. 2016 Nov 15;32(22):3428-3434. doi: 10.1093/bioinformatics/btw483. Epub 2016 Jul 19.

DOI:10.1093/bioinformatics/btw483
PMID:27436563
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5181563/
Abstract

MOTIVATION

Genomic studies often involve estimation of variances of thousands of genes (or other genomic units) from just a few measurements on each. For example, variance estimation is an important step in gene expression analyses aimed at identifying differentially expressed genes. A common approach to this problem is to use an Empirical Bayes (EB) method that assumes the variances among genes follow an inverse-gamma distribution. This distributional assumption is relatively inflexible; for example, it may not capture 'outlying' genes whose variances are considerably bigger than usual. Here we describe a more flexible EB method, capable of capturing a much wider range of distributions. Indeed, the main assumption is that the distribution of the variances is unimodal (or, as an alternative, that the distribution of the precisions is unimodal). We argue that the unimodal assumption provides an attractive compromise between flexibility, computational tractability and statistical efficiency.

RESULTS

We show that this more flexible approach provides competitive performance with existing methods when the variances truly come from an inverse-gamma distribution, and can outperform them when the distribution of the variances is more complex. In analyses of several human gene expression datasets from the Genotype Tissues Expression consortium, we find that our more flexible model often fits the data appreciably better than the single inverse gamma distribution. At the same time we find that in these data this improved model fit leads to only small improvements in variance estimates and detection of differentially expressed genes.

AVAILABILITY AND IMPLEMENTATION

Our methods are implemented in an R package vashr available from http://github.com/mengyin/vashr CONTACT: mstephens@uchicago.eduSupplementary information: Supplementary data are available at Bioinformatics online.

摘要

动机

基因组研究通常涉及从对每个基因(或其他基因组单元)仅进行几次测量来估计数千个基因的方差。例如,方差估计是旨在识别差异表达基因的基因表达分析中的一个重要步骤。解决这个问题的一种常用方法是使用经验贝叶斯(EB)方法,该方法假设基因间的方差服从逆伽马分布。这种分布假设相对缺乏灵活性;例如,它可能无法捕捉方差比通常大得多的“异常”基因。在这里,我们描述一种更灵活的EB方法,它能够捕捉更广泛的分布范围。实际上,主要假设是方差的分布是单峰的(或者,作为一种替代,精度的分布是单峰的)。我们认为单峰假设在灵活性、计算易处理性和统计效率之间提供了一个有吸引力的折衷方案。

结果

我们表明,当方差真正来自逆伽马分布时,这种更灵活的方法与现有方法具有竞争力的性能,并且当方差分布更复杂时,它可以优于现有方法。在对来自基因型组织表达联盟的几个人类基因表达数据集的分析中,我们发现我们更灵活的模型通常比单一逆伽马分布能更好地拟合数据。同时我们发现,在这些数据中,这种改进的模型拟合仅导致方差估计和差异表达基因检测方面的小幅改进。

可用性和实现

我们的方法在一个R包vashr中实现,可从http://github.com/mengyin/vashr获取 联系方式:mstephens@uchicago.edu 补充信息:补充数据可在《生物信息学》在线获取。

相似文献

1
Variance adaptive shrinkage (vash): flexible empirical Bayes estimation of variances.方差自适应收缩(vash):方差的灵活经验贝叶斯估计
Bioinformatics. 2016 Nov 15;32(22):3428-3434. doi: 10.1093/bioinformatics/btw483. Epub 2016 Jul 19.
2
Flexible Signal Denoising via Flexible Empirical Bayes Shrinkage.通过灵活经验贝叶斯收缩实现灵活信号去噪
J Mach Learn Res. 2021 Jan-Dec;22.
3
Priors for genotyping polyploids.多倍体基因型的先验信息。
Bioinformatics. 2020 Mar 1;36(6):1795-1800. doi: 10.1093/bioinformatics/btz852.
4
Intensity-based hierarchical Bayes method improves testing for differentially expressed genes in microarray experiments.基于强度的分层贝叶斯方法改进了微阵列实验中差异表达基因的检测。
BMC Bioinformatics. 2006 Dec 19;7:538. doi: 10.1186/1471-2105-7-538.
5
Bootstrapping and Empirical Bayes Methods Improve Rhythm Detection in Sparsely Sampled Data.自举法和经验贝叶斯方法可改善稀疏采样数据中的节律检测。
J Biol Rhythms. 2018 Aug;33(4):339-349. doi: 10.1177/0748730418789536.
6
Flexible empirical Bayes models for differential gene expression.用于差异基因表达的灵活经验贝叶斯模型。
Bioinformatics. 2007 Feb 1;23(3):328-35. doi: 10.1093/bioinformatics/btl612. Epub 2006 Nov 30.
7
A full Bayesian hierarchical mixture model for the variance of gene differential expression.用于基因差异表达方差的全贝叶斯分层混合模型。
BMC Bioinformatics. 2007 Apr 17;8:124. doi: 10.1186/1471-2105-8-124.
8
False discovery rates: a new deal.错误发现率:一项新举措。
Biostatistics. 2017 Apr 1;18(2):275-294. doi: 10.1093/biostatistics/kxw041.
9
Borrowing information across genes and experiments for improved error variance estimation in microarray data analysis.跨基因和实验借用信息以改进微阵列数据分析中的误差方差估计。
Stat Appl Genet Mol Biol. 2012;11(3):Article 12. doi: 10.1515/1544-6115.1806.
10
Empirical Bayes screening of many p-values with applications to microarray studies.用于微阵列研究的多p值经验贝叶斯筛选。
Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2.

引用本文的文献

1
A comparison of gene expression and DNA methylation patterns across tissues and species.比较不同组织和物种的基因表达和 DNA 甲基化模式。
Genome Res. 2020 Feb;30(2):250-262. doi: 10.1101/gr.254904.119. Epub 2020 Jan 17.
2
Bootstrapping and Empirical Bayes Methods Improve Rhythm Detection in Sparsely Sampled Data.自举法和经验贝叶斯方法可改善稀疏采样数据中的节律检测。
J Biol Rhythms. 2018 Aug;33(4):339-349. doi: 10.1177/0748730418789536.
3
False discovery rates: a new deal.错误发现率:一项新举措。
Biostatistics. 2017 Apr 1;18(2):275-294. doi: 10.1093/biostatistics/kxw041.

本文引用的文献

1
ROBUST HYPERPARAMETER ESTIMATION PROTECTS AGAINST HYPERVARIABLE GENES AND IMPROVES POWER TO DETECT DIFFERENTIAL EXPRESSION.稳健的超参数估计可抵御高变异性基因,并提高检测差异表达的能力。
Ann Appl Stat. 2016 Jun;10(2):946-963. doi: 10.1214/16-AOAS920. Epub 2016 Jul 22.
2
False discovery rates: a new deal.错误发现率:一项新举措。
Biostatistics. 2017 Apr 1;18(2):275-294. doi: 10.1093/biostatistics/kxw041.
3
voom: Precision weights unlock linear model analysis tools for RNA-seq read counts.voom:精确权重为RNA测序读数计数解锁线性模型分析工具。
Genome Biol. 2014 Feb 3;15(2):R29. doi: 10.1186/gb-2014-15-2-r29.
4
The Genotype-Tissue Expression (GTEx) project.基因型-组织表达 (GTEx) 项目。
Nat Genet. 2013 Jun;45(6):580-5. doi: 10.1038/ng.2653.
5
Comparison of small n statistical tests of differential expression applied to microarrays.应用于微阵列的差异表达小样本量统计检验的比较。
BMC Bioinformatics. 2009 Feb 3;10:45. doi: 10.1186/1471-2105-10-45.
6
Linear models and empirical bayes methods for assessing differential expression in microarray experiments.用于评估微阵列实验中差异表达的线性模型和经验贝叶斯方法。
Stat Appl Genet Mol Biol. 2004;3:Article3. doi: 10.2202/1544-6115.1027. Epub 2004 Feb 12.
7
Statistical methods for ranking differentially expressed genes.对差异表达基因进行排名的统计方法。
Genome Biol. 2003;4(6):R41. doi: 10.1186/gb-2003-4-6-r41. Epub 2003 May 29.
8
A Bayesian framework for the analysis of microarray expression data: regularized t -test and statistical inferences of gene changes.用于分析微阵列表达数据的贝叶斯框架:正则化t检验与基因变化的统计推断
Bioinformatics. 2001 Jun;17(6):509-19. doi: 10.1093/bioinformatics/17.6.509.
9
Significance analysis of microarrays applied to the ionizing radiation response.应用于电离辐射反应的微阵列显著性分析。
Proc Natl Acad Sci U S A. 2001 Apr 24;98(9):5116-21. doi: 10.1073/pnas.091062498. Epub 2001 Apr 17.