Modeling Persistent Trends in Distributions.

Authors

Mueller Jonas, Jaakkola Tommi, Gifford David

Affiliation

MIT Computer Science & Artificial Intelligence Laboratory Cambridge, MA 02139.

Publication

J Am Stat Assoc. 2018;113(523):1296-1310. doi: 10.1080/01621459.2017.1341412. Epub 2018 Jun 12.

Abstract

We present a nonparametric framework to model a short sequence of probability distributions that vary both due to underlying effects of sequential progression and confounding noise. To distinguish between these two types of variation and estimate the sequential-progression effects, our approach leverages an assumption that these effects follow a persistent trend. This work is motivated by the recent rise of single-cell RNA-sequencing experiments over a brief time course, which aim to identify genes relevant to the progression of a particular biological process across diverse cell populations. While classical statistical tools focus on scalar-response regression or order-agnostic differences between distributions, it is desirable in this setting to consider both the full distributions as well as the structure imposed by their ordering. We introduce a new regression model for ordinal covariates where responses are univariate distributions and the underlying relationship reflects consistent changes in the distributions over increasing levels of the covariate. This concept is formalized as a trend in distributions, which we define as an evolution that is linear under the Wasserstein metric. Implemented via a fast alternating projections algorithm, our method exhibits numerous strengths in simulations and analyses of single-cell gene expression data.
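
The phrase "linear under the Wasserstein metric" can be illustrated with a small sketch. For univariate distributions, the 2-Wasserstein distance equals the L2 distance between quantile functions, so one simple reading of a linear evolution is that each quantile changes linearly with the covariate level. The Python below is a minimal sketch under that assumption and is not the authors' alternating projections algorithm: the helper names (`quantile_curve`, `wasserstein2`, `fit_linear_quantile_trend`), the probability grid, and the toy data are all hypothetical, and the per-quantile least-squares fit does not enforce that fitted curves remain valid (non-decreasing) quantile functions.

```python
import numpy as np

def quantile_curve(samples, probs):
    """Empirical quantile function of one sample, evaluated on a fixed grid."""
    return np.quantile(np.asarray(samples, dtype=float), probs)

def wasserstein2(q_a, q_b):
    """Approximate 2-Wasserstein distance between two univariate distributions,
    given their quantile functions on the same uniform probability grid."""
    return float(np.sqrt(np.mean((q_a - q_b) ** 2)))

def fit_linear_quantile_trend(levels, curves):
    """Ordinary least-squares fit of q_level(p) ~ intercept(p) + slope(p) * level,
    solved independently at every grid point p (illustrative assumption only)."""
    levels = np.asarray(levels, dtype=float)
    stacked = np.vstack(curves)  # shape: (n_levels, n_probs)
    design = np.column_stack([np.ones_like(levels), levels])
    coef, *_ = np.linalg.lstsq(design, stacked, rcond=None)
    intercept, slope = coef
    return intercept, slope

# Toy data (made up): the same base sample shifted by its ordinal level.
probs = np.linspace(0.05, 0.95, 19)
levels = [0, 1, 2, 3]
base = np.linspace(-1.0, 1.0, 201)
curves = [quantile_curve(base + level, probs) for level in levels]

intercept, slope = fit_linear_quantile_trend(levels, curves)
distance_0_to_2 = wasserstein2(curves[0], curves[2])
```

In this toy setup every quantile shifts by one unit per level, so the fitted slope is constant across the grid and the distance between two levels grows in proportion to the gap between them. Separating such a trend from confounding noise, which is the focus of the abstract, is not attempted here.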
