Suppr超能文献

基于阴性对照去除不必要变异的统一和通用方法

UNIFYING AND GENERALIZING METHODS FOR REMOVING UNWANTED VARIATION BASED ON NEGATIVE CONTROLS.

作者信息

Gerard David, Stephens Matthew

机构信息

Department of Mathematics and Statistics, American University, Washington, DC 20016, USA.

Departments of Human Genetics and Statistics, University of Chicago, Chicago, IL 60637, USA.

出版信息

Stat Sin. 2021 Jul;31(3):1145-1166. doi: 10.5705/ss.202018.0345.

Abstract

Unwanted variation, including hidden confounding, is a well-known problem in many fields, but particularly in large-scale gene expression studies. Recent proposals to use control genes, genes assumed to be unassociated with the covariates of interest, have led to new methods to deal with this problem. Several versions of these removing unwanted variation (RUV) methods have been proposed, including RUV1, RUV2, RUV4, RUVinv, RUVrinv, and RUVfun. Here, we introduce a general framework, RUV*, that both unites and generalizes these approaches. This unifying framework helps clarify the connections between existing methods. In particular, we provide conditions under which RUV2 and RUV4 are equivalent. The RUV* framework preserves an advantage of the RUV approaches, namely, their modularity, which facilitates the development of novel methods based on existing matrix imputation algorithms. We illustrate this by implementing RUVB, a version of RUV* based on Bayesian factor analysis. In realistic simulations based on real data, we found RUVB to be competitive with existing methods in terms of both power and calibration. However, providing a consistently reliable calibration among the data sets remains challenging.

摘要

不必要的变异,包括隐藏的混杂因素,在许多领域都是一个众所周知的问题,尤其是在大规模基因表达研究中。最近提出的使用对照基因(即假定与感兴趣的协变量不相关的基因)的建议,催生了处理这一问题的新方法。已经提出了这些去除不必要变异(RUV)方法的几个版本,包括RUV1、RUV2、RUV4、RUVinv、RUVrinv和RUVfun。在此,我们引入了一个通用框架RUV*,它统一并概括了这些方法。这个统一框架有助于阐明现有方法之间的联系。特别是,我们给出了RUV2和RUV4等效的条件。RUV框架保留了RUV方法的一个优点,即其模块化,这有利于基于现有矩阵插补算法开发新方法。我们通过实现RUVB(一种基于贝叶斯因子分析的RUV版本)来说明这一点。在基于真实数据的实际模拟中,我们发现RUVB在功效和校准方面与现有方法具有竞争力。然而,在各数据集中提供始终可靠的校准仍然具有挑战性。

相似文献

3
7
Blind estimation and correction of microarray batch effect.盲估计和校正微阵列批次效应。
PLoS One. 2020 Apr 9;15(4):e0231446. doi: 10.1371/journal.pone.0231446. eCollection 2020.
10
CONFOUNDER ADJUSTMENT IN MULTIPLE HYPOTHESIS TESTING.多重假设检验中的混杂因素调整
Ann Stat. 2017 Oct;45(5):1863-1894. doi: 10.1214/16-AOS1511. Epub 2017 Oct 31.

本文引用的文献

3
CONFOUNDER ADJUSTMENT IN MULTIPLE HYPOTHESIS TESTING.多重假设检验中的混杂因素调整
Ann Stat. 2017 Oct;45(5):1863-1894. doi: 10.1214/16-AOS1511. Epub 2017 Oct 31.
7
False discovery rates: a new deal.错误发现率:一项新举措。
Biostatistics. 2017 Apr 1;18(2):275-294. doi: 10.1093/biostatistics/kxw041.
9
A reanalysis of mouse ENCODE comparative gene expression data.小鼠ENCODE比较基因表达数据的重新分析。
F1000Res. 2015 May 19;4:121. doi: 10.12688/f1000research.6536.1. eCollection 2015.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验