• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于高维比较研究的点间距离测试。

Interpoint distance tests for high-dimensional comparison studies.

作者信息

Marozzi Marco, Mukherjee Amitava, Kalina Jan

机构信息

Ca' Foscari University of Venice, Venice, Italy.

XLRI-Xavier School of Management, Jamshedpur, India.

出版信息

J Appl Stat. 2019 Jul 31;47(4):653-665. doi: 10.1080/02664763.2019.1649374. eCollection 2020.

DOI:10.1080/02664763.2019.1649374
PMID:35707487
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9042018/
Abstract

Modern data collection techniques allow to analyze a very large number of endpoints. In biomedical research, for example, expressions of thousands of genes are commonly measured only on a small number of subjects. In these situations, traditional methods for comparison studies are not applicable. Moreover, the assumption of normal distribution is often questionable for high-dimensional data, and some variables may be at the same time highly correlated with others. Hypothesis tests based on interpoint distances are very appealing for studies involving the comparison of means, because they do not assume data to come from normally distributed populations and comprise tests that are distribution free, unbiased, consistent, and computationally feasible, even if the number of endpoints is much larger than the number of subjects. New tests based on interpoint distances are proposed for multivariate studies involving simultaneous comparison of means and variability, or the whole distribution shapes. The tests are shown to perform well in terms of power, when the endpoints have complex dependence relations, such as in genomic and metabolomic studies. A practical application to a genetic cardiovascular case-control study is discussed.

摘要

现代数据收集技术使得能够分析大量的终点指标。例如,在生物医学研究中,通常仅对少数受试者测量数千个基因的表达。在这些情况下,传统的比较研究方法并不适用。此外,对于高维数据,正态分布的假设往往存在疑问,并且一些变量可能同时与其他变量高度相关。基于点间距离的假设检验对于涉及均值比较的研究非常有吸引力,因为它们不假定数据来自正态分布总体,并且包括无分布、无偏、一致且计算可行的检验,即使终点指标的数量远大于受试者的数量。本文提出了基于点间距离的新检验方法,用于涉及均值和变异性同时比较或整个分布形状的多变量研究。当终点指标具有复杂的依赖关系时,如在基因组和代谢组学研究中,这些检验在功效方面表现良好。本文还讨论了在遗传性心血管病例对照研究中的实际应用。

相似文献

1
Interpoint distance tests for high-dimensional comparison studies.用于高维比较研究的点间距离测试。
J Appl Stat. 2019 Jul 31;47(4):653-665. doi: 10.1080/02664763.2019.1649374. eCollection 2020.
2
Tests for comparison of multiple endpoints with application to omics data.用于多终点比较并应用于组学数据的检验。
Stat Appl Genet Mol Biol. 2018 Jan 30;17(1):sagmb-2017-0033. doi: 10.1515/sagmb-2017-0033.
3
Multivariate tests based on interpoint distances with application to magnetic resonance imaging.基于点间距离的多元检验及其在磁共振成像中的应用。
Stat Methods Med Res. 2016 Dec;25(6):2593-2610. doi: 10.1177/0962280214529104. Epub 2014 Apr 16.
4
Multivariate multidistance tests for high-dimensional low sample size case-control studies.高维小样本病例对照研究的多变量多距离检验
Stat Med. 2015 Apr 30;34(9):1511-26. doi: 10.1002/sim.6418. Epub 2015 Jan 29.
5
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
6
A High-Dimensional Nonparametric Multivariate Test for Mean Vector.均值向量的高维非参数多元检验
J Am Stat Assoc. 2015;110(512):1658-1669. doi: 10.1080/01621459.2014.988215. Epub 2016 Jan 15.
7
A new efficient statistical test for detecting variability in the gene expression data.一种用于检测基因表达数据变异性的新型高效统计检验方法。
Stat Methods Med Res. 2008 Aug;17(4):405-19. doi: 10.1177/0962280206078643. Epub 2007 Aug 14.
8
Statistical group comparison of diffusion tensors via multivariate hypothesis testing.通过多变量假设检验对扩散张量进行统计组比较。
Magn Reson Med. 2007 Jun;57(6):1065-74. doi: 10.1002/mrm.21229.
9
The misuse of distributional assumptions in functional class scoring gene-set and pathway analysis.功能分类评分基因集和通路分析中分布假设的误用。
G3 (Bethesda). 2022 Jan 4;12(1). doi: 10.1093/g3journal/jkab365.
10
Interpoint squared distance as a measure of spatial clustering.点间平方距离作为空间聚类的一种度量。
Soc Sci Med. 1993 Apr;36(8):1011-6. doi: 10.1016/0277-9536(93)90118-n.

引用本文的文献

1
Testing exchangeability of multivariate distributions.检验多元分布的可交换性。
J Appl Stat. 2022 Jul 26;50(15):3142-3156. doi: 10.1080/02664763.2022.2102158. eCollection 2023.

本文引用的文献

1
Distance-based analysis of variance for brain connectivity.基于距离的脑连接方差分析。
Biometrics. 2020 Mar;76(1):257-269. doi: 10.1111/biom.13123. Epub 2019 Sep 30.
2
A Robust Supervised Variable Selection for Noisy High-Dimensional Data.一种针对有噪声高维数据的稳健监督变量选择方法。
Biomed Res Int. 2015;2015:320385. doi: 10.1155/2015/320385. Epub 2015 Jun 2.
3
A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.一种结合基因集和单基因的方法,用于利用基因表达数据预测生存风险。
PLoS One. 2015 May 1;10(5):e0122103. doi: 10.1371/journal.pone.0122103. eCollection 2015.
4
Multivariate tests based on interpoint distances with application to magnetic resonance imaging.基于点间距离的多元检验及其在磁共振成像中的应用。
Stat Methods Med Res. 2016 Dec;25(6):2593-2610. doi: 10.1177/0962280214529104. Epub 2014 Apr 16.
5
Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data.RNA测序数据差异基因表达分析方法的综合评估
Genome Biol. 2013;14(9):R95. doi: 10.1186/gb-2013-14-9-r95.