Suppr超能文献

用于高维比较研究的点间距离测试。

Interpoint distance tests for high-dimensional comparison studies.

作者信息

Marozzi Marco, Mukherjee Amitava, Kalina Jan

机构信息

Ca' Foscari University of Venice, Venice, Italy.

XLRI-Xavier School of Management, Jamshedpur, India.

出版信息

J Appl Stat. 2019 Jul 31;47(4):653-665. doi: 10.1080/02664763.2019.1649374. eCollection 2020.

Abstract

Modern data collection techniques allow to analyze a very large number of endpoints. In biomedical research, for example, expressions of thousands of genes are commonly measured only on a small number of subjects. In these situations, traditional methods for comparison studies are not applicable. Moreover, the assumption of normal distribution is often questionable for high-dimensional data, and some variables may be at the same time highly correlated with others. Hypothesis tests based on interpoint distances are very appealing for studies involving the comparison of means, because they do not assume data to come from normally distributed populations and comprise tests that are distribution free, unbiased, consistent, and computationally feasible, even if the number of endpoints is much larger than the number of subjects. New tests based on interpoint distances are proposed for multivariate studies involving simultaneous comparison of means and variability, or the whole distribution shapes. The tests are shown to perform well in terms of power, when the endpoints have complex dependence relations, such as in genomic and metabolomic studies. A practical application to a genetic cardiovascular case-control study is discussed.

摘要

现代数据收集技术使得能够分析大量的终点指标。例如,在生物医学研究中,通常仅对少数受试者测量数千个基因的表达。在这些情况下,传统的比较研究方法并不适用。此外,对于高维数据,正态分布的假设往往存在疑问,并且一些变量可能同时与其他变量高度相关。基于点间距离的假设检验对于涉及均值比较的研究非常有吸引力,因为它们不假定数据来自正态分布总体,并且包括无分布、无偏、一致且计算可行的检验,即使终点指标的数量远大于受试者的数量。本文提出了基于点间距离的新检验方法,用于涉及均值和变异性同时比较或整个分布形状的多变量研究。当终点指标具有复杂的依赖关系时,如在基因组和代谢组学研究中,这些检验在功效方面表现良好。本文还讨论了在遗传性心血管病例对照研究中的实际应用。

相似文献

1
Interpoint distance tests for high-dimensional comparison studies.用于高维比较研究的点间距离测试。
J Appl Stat. 2019 Jul 31;47(4):653-665. doi: 10.1080/02664763.2019.1649374. eCollection 2020.
2
Tests for comparison of multiple endpoints with application to omics data.用于多终点比较并应用于组学数据的检验。
Stat Appl Genet Mol Biol. 2018 Jan 30;17(1):sagmb-2017-0033. doi: 10.1515/sagmb-2017-0033.
6
A High-Dimensional Nonparametric Multivariate Test for Mean Vector.均值向量的高维非参数多元检验
J Am Stat Assoc. 2015;110(512):1658-1669. doi: 10.1080/01621459.2014.988215. Epub 2016 Jan 15.

引用本文的文献

1
Testing exchangeability of multivariate distributions.检验多元分布的可交换性。
J Appl Stat. 2022 Jul 26;50(15):3142-3156. doi: 10.1080/02664763.2022.2102158. eCollection 2023.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验