• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

球形散度:非参数双样本检验

BALL DIVERGENCE: NONPARAMETRIC TWO SAMPLE TEST.

作者信息

Pan Wenliang, Tian Yuan, Wang Xueqin, Zhang Heping

机构信息

Sun Yat-sen University.

Yale University.

出版信息

Ann Stat. 2018 Jun;46(3):1109-1137. doi: 10.1214/17-AOS1579.

DOI:10.1214/17-AOS1579
PMID:30344356
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6192286/
Abstract

In this paper, we first introduce Ball Divergence, a novel measure of the difference between two probability measures in separable Banach spaces, and show that the Ball Divergence of two probability measures is zero if and only if these two probability measures are identical without any moment assumption. Using Ball Divergence, we present a metric rank test procedure to detect the equality of distribution measures underlying independent samples. It is therefore robust to outliers or heavy-tail data. We show that this multivariate two sample test statistic is consistent with the Ball Divergence, and it converges to a mixture of χ distributions under the null hypothesis and a normal distribution under the alternative hypothesis. Importantly, we prove its consistency against a general alternative hypothesis. Moreover, this result does not depend on the ratio of the two imbalanced sample sizes, ensuring that can be applied to imbalanced data. Numerical studies confirm that our test is superior to several existing tests in terms of Type I error and power. We conclude our paper with two applications of our method: one is for virtual screening in drug development process and the other is for genome wide expression analysis in hormone replacement therapy.

摘要

在本文中,我们首先引入球散度,这是一种用于衡量可分巴拿赫空间中两个概率测度差异的新方法,并表明在没有任何矩假设的情况下,当且仅当这两个概率测度相同时,它们的球散度为零。利用球散度,我们提出了一种度量秩检验程序,用于检测独立样本背后分布测度的相等性。因此,它对异常值或重尾数据具有鲁棒性。我们表明,这个多变量两样本检验统计量与球散度一致,并且在原假设下它收敛到χ分布的混合,在备择假设下收敛到正态分布。重要的是,我们证明了它针对一般备择假设的一致性。此外,该结果不依赖于两个不平衡样本量的比例,确保其可应用于不平衡数据。数值研究证实,我们的检验在一类错误和检验功效方面优于几种现有检验。我们在论文结尾给出了该方法的两个应用:一个用于药物开发过程中的虚拟筛选,另一个用于激素替代疗法中的全基因组表达分析。

相似文献

1
BALL DIVERGENCE: NONPARAMETRIC TWO SAMPLE TEST.球形散度:非参数双样本检验
Ann Stat. 2018 Jun;46(3):1109-1137. doi: 10.1214/17-AOS1579.
2
Ball Covariance: A Generic Measure of Dependence in Banach Space.球协方差:巴拿赫空间中相依性的一种通用度量。
J Am Stat Assoc. 2020;115(529):307-317. doi: 10.1080/01621459.2018.1543600. Epub 2019 Apr 11.
3
Ball divergence for the equality test of crossing survival curves.球差发散用于交叉生存曲线的均等性检验。
Stat Med. 2023 Dec 20;42(29):5353-5368. doi: 10.1002/sim.9914. Epub 2023 Sep 26.
4
An exact projection pursuit-based algorithm for multivariate two-sample nonparametric testing applicable to retrospective and group sequential studies.一种基于精确投影寻踪的多变量两样本非参数检验算法,适用于回顾性研究和序贯分组研究。
J Appl Stat. 2023 Nov 6;51(11):2214-2231. doi: 10.1080/02664763.2023.2277118. eCollection 2024.
5
Probability binning comparison: a metric for quantitating multivariate distribution differences.概率区间比较:一种用于量化多元分布差异的指标。
Cytometry. 2001 Sep 1;45(1):47-55. doi: 10.1002/1097-0320(20010901)45:1<47::aid-cyto1143>3.0.co;2-a.
6
A comparison of likelihood ratio tests and Rao's score test for three separable covariance matrix structures.三种可分离协方差矩阵结构的似然比检验与拉奥得分检验的比较。
Biom J. 2017 Jan;59(1):192-215. doi: 10.1002/bimj.201600044. Epub 2016 Oct 24.
7
The Chi-Square Test of Distance Correlation.距离相关性的卡方检验。
J Comput Graph Stat. 2022;31(1):254-262. doi: 10.1080/10618600.2021.1938585. Epub 2021 Jul 19.
8
Testing Equality of Multiple Population Means under Contaminated Normal Model Using the Density Power Divergence.在污染正态模型下使用密度幂散度检验多个总体均值的相等性。
Entropy (Basel). 2022 Aug 25;24(9):1189. doi: 10.3390/e24091189.
9
To permute or not to permute.是否进行置换。
Bioinformatics. 2006 Sep 15;22(18):2244-8. doi: 10.1093/bioinformatics/btl383. Epub 2006 Jul 26.
10
On the finite sample distribution of the likelihood ratio statistic for testing heterogeneity in meta-analysis.关于荟萃分析中用于检验异质性的似然比统计量的有限样本分布。
Biom J. 2020 Dec;62(8):1986-1996. doi: 10.1002/bimj.201900400. Epub 2020 Aug 5.

引用本文的文献

1
Nonparametric Statistical Inference via Metric Distribution Function in Metric Spaces.度量空间中基于度量分布函数的非参数统计推断
J Am Stat Assoc. 2024;119(548):2772-2784. doi: 10.1080/01621459.2023.2277417. Epub 2023 Dec 26.
2
Nonparametric two-sample tests of high dimensional mean vectors via random integration.基于随机积分的高维均值向量非参数双样本检验
J Am Stat Assoc. 2024;119(545):701-714. doi: 10.1080/01621459.2022.2141636. Epub 2022 Dec 12.
3
Use of random integration to test equality of high dimensional covariance matrices.

本文引用的文献

1
Hormone replacement therapy and breast cancer: heterogeneous risks by race, weight, and breast density.激素替代疗法与乳腺癌:按种族、体重和乳腺密度划分的异质风险。
J Natl Cancer Inst. 2013 Sep 18;105(18):1365-72. doi: 10.1093/jnci/djt207. Epub 2013 Sep 3.
2
Hormone replacement therapy increases the risk of cranial meningioma.激素替代疗法会增加颅膜瘤的风险。
Eur J Cancer. 2013 Oct;49(15):3303-10. doi: 10.1016/j.ejca.2013.05.026. Epub 2013 Jun 22.
3
Detecting novel associations in large data sets.在大型数据集 中检测新的关联。
使用随机积分来检验高维协方差矩阵的相等性。
Stat Sin. 2023 Oct;33(4):2359-2380. doi: 10.5705/ss.202020.0486.
4
Erector Spinae Plane Block for Perioperative Pain Control and Short-term Outcomes in Lumbar Laminoplasty: A Randomized Clinical Trial.竖脊肌平面阻滞用于腰椎椎板成形术围手术期疼痛控制及短期预后:一项随机临床试验
J Pain Res. 2021 Sep 3;14:2717-2727. doi: 10.2147/JPR.S321514. eCollection 2021.
5
Identifying genetic risk variants associated with brain volumetric phenotypes via K-sample Ball Divergence method.通过 K 样本球距法鉴定与脑容量表型相关的遗传风险变异。
Genet Epidemiol. 2021 Oct;45(7):710-720. doi: 10.1002/gepi.22423. Epub 2021 Jun 29.
6
Ball Covariance: A Generic Measure of Dependence in Banach Space.球协方差:巴拿赫空间中相依性的一种通用度量。
J Am Stat Assoc. 2020;115(529):307-317. doi: 10.1080/01621459.2018.1543600. Epub 2019 Apr 11.
7
Distance-based analysis of variance for brain connectivity.基于距离的脑连接方差分析。
Biometrics. 2020 Mar;76(1):257-269. doi: 10.1111/biom.13123. Epub 2019 Sep 30.
Science. 2011 Dec 16;334(6062):1518-24. doi: 10.1126/science.1205438.
4
The hormone replacement therapy (HRT) of menopause: focus on cardiovascular implications.更年期的激素替代疗法(HRT):关注对心血管的影响
Acta Biomed. 2010;81 Suppl 1:73-6.
5
Virtual screening of bioassay data.生物测定数据的虚拟筛选。
J Cheminform. 2009 Dec 22;1:21. doi: 10.1186/1758-2946-1-21.
6
Gene expression profiling of whole-blood samples from women exposed to hormone replacement therapy.
Mol Cancer Ther. 2006 Apr;5(4):868-76. doi: 10.1158/1535-7163.MCT-05-0329.
7
Gene expression analysis with the parametric bootstrap.
Biostatistics. 2001 Dec;2(4):445-61. doi: 10.1093/biostatistics/2.4.445.
8
A generalized two-sample Wilcoxon test for doubly censored data.针对双重删失数据的广义双样本威尔科克森检验。
Biometrika. 1965 Dec;52(3):650-3.