• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

柯西组合检验:一种在任意相依结构下具有解析值计算功能的强大检验。

Cauchy combination test: a powerful test with analytic -value calculation under arbitrary dependency structures.

作者信息

Liu Yaowu, Xie Jun

机构信息

Department of Biostatistics, Harvard School of Public Health.

Department of Statistics, Purdue University.

出版信息

J Am Stat Assoc. 2020;115(529):393-402. doi: 10.1080/01621459.2018.1554485. Epub 2019 Apr 25.

DOI:10.1080/01621459.2018.1554485
PMID:33012899
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7531765/
Abstract

Combining individual -values to aggregate multiple small effects has a long-standing interest in statistics, dating back to the classic Fisher's combination test. In modern large-scale data analysis, correlation and sparsity are common features and efficient computation is a necessary requirement for dealing with massive data. To overcome these challenges, we propose a new test that takes advantage of the Cauchy distribution. Our test statistic has a simple form and is defined as a weighted sum of Cauchy transformation of individual -values. We prove a non-asymptotic result that the tail of the null distribution of our proposed test statistic can be well approximated by a Cauchy distribution under arbitrary dependency structures. Based on this theoretical result, the -value calculation of our proposed test is not only accurate, but also as simple as the classic -test or -test, making our test well suited for analyzing massive data. We further show that the power of the proposed test is asymptotically optimal in a strong sparsity setting. Extensive simulations demonstrate that the proposed test has both strong power against sparse alternatives and a good accuracy with respect to -value calculations, especially for very small -values. The proposed test has also been applied to a genome-wide association study of Crohn's disease and compared with several existing tests.

摘要

将个体值组合起来以汇总多个小效应在统计学中一直备受关注,可追溯到经典的费舍尔组合检验。在现代大规模数据分析中,相关性和稀疏性是常见特征,高效计算是处理海量数据的必要条件。为克服这些挑战,我们提出一种利用柯西分布的新检验方法。我们的检验统计量具有简单形式,被定义为个体值的柯西变换的加权和。我们证明了一个非渐近结果,即在任意依赖结构下,我们提出的检验统计量的零分布尾部可以很好地用柯西分布近似。基于这一理论结果,我们提出的检验的p值计算不仅准确,而且与经典的t检验或z检验一样简单,这使得我们的检验非常适合分析海量数据。我们进一步表明,在强稀疏性设置下,所提出检验的功效是渐近最优的。大量模拟表明,所提出的检验对于稀疏备择假设具有强大功效,并且在p值计算方面具有良好的准确性,特别是对于非常小的p值。所提出的检验还已应用于克罗恩病的全基因组关联研究,并与几种现有检验进行了比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6112/7531765/33564a758596/nihms-1520325-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6112/7531765/4a3d34019d97/nihms-1520325-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6112/7531765/bb98ac08a2f5/nihms-1520325-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6112/7531765/33564a758596/nihms-1520325-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6112/7531765/4a3d34019d97/nihms-1520325-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6112/7531765/bb98ac08a2f5/nihms-1520325-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6112/7531765/33564a758596/nihms-1520325-f0003.jpg

相似文献

1
Cauchy combination test: a powerful test with analytic -value calculation under arbitrary dependency structures.柯西组合检验:一种在任意相依结构下具有解析值计算功能的强大检验。
J Am Stat Assoc. 2020;115(529):393-402. doi: 10.1080/01621459.2018.1554485. Epub 2019 Apr 25.
2
Accurate and Efficient -value Calculation via Gaussian Approximation: a Novel Monte-Carlo Method.通过高斯近似进行准确高效的价值计算:一种新型蒙特卡罗方法。
J Am Stat Assoc. 2019;114(525):384-392. doi: 10.1080/01621459.2017.1407776. Epub 2018 Jun 28.
3
Robust tests for combining p-values under arbitrary dependency structures.在任意依赖结构下结合 p 值的稳健检验。
Sci Rep. 2022 Feb 24;12(1):3158. doi: 10.1038/s41598-022-07094-7.
4
Analytic P-value calculation for the higher criticism test in finite problems.有限问题中高阶检验的解析P值计算。
Biometrika. 2014;101(4):964-970. doi: 10.1093/biomet/asu033.
5
The generalized Fisher's combination and accurate p-value calculation under dependence.相依情形下广义 Fisher 组合检验与精确检验 p 值的计算
Biometrics. 2023 Jun;79(2):1159-1172. doi: 10.1111/biom.13634. Epub 2022 Mar 9.
6
The intermediates take it all: asymptotics of higher criticism statistics and a powerful alternative based on equal local levels.中间结果包含了一切:高阶检验统计量的渐近性质以及基于相等局部水平的一个有力替代方法。
Biom J. 2015 Jan;57(1):159-80. doi: 10.1002/bimj.201300255. Epub 2014 Jun 10.
7
Global and Simultaneous Hypothesis Testing for High-Dimensional Logistic Regression Models.高维逻辑回归模型的全局和同步假设检验
J Am Stat Assoc. 2021;116(534):984-998. doi: 10.1080/01621459.2019.1699421. Epub 2020 Jan 21.
8
Power Enhancement in High Dimensional Cross-Sectional Tests.高维横截面测试中的功效增强
Econometrica. 2015 Jul 1;83(4):1497-1541. doi: 10.3982/ECTA12749.
9
A Weighted Rank-Sum Procedure for Comparing Samples with Multiple Endpoints.一种用于比较具有多个终点的样本的加权秩和程序。
Stat Interface. 2009 Jan 1;2(2):197-201. doi: 10.4310/sii.2009.v2.n2.a9.
10
Testing generalized linear models with high-dimensional nuisance parameter.检验具有高维干扰参数的广义线性模型。
Biometrika. 2023 Mar;110(1):83-99. doi: 10.1093/biomet/asac021. Epub 2022 Apr 5.

引用本文的文献

1
A novel two-sample Mendelian randomization framework integrating common and rare variants: application to assess the effect of HDL-C on preeclampsia risk.一种整合常见和罕见变异的新型两样本孟德尔随机化框架:用于评估高密度脂蛋白胆固醇对先兆子痫风险影响的应用。
medRxiv. 2025 Aug 24:2025.08.20.25334100. doi: 10.1101/2025.08.20.25334100.
2
Integrative analysis of microbial 16S gene and shotgun metagenomic sequencing data improves statistical efficiency in testing differential abundance.微生物16S基因与鸟枪法宏基因组测序数据的整合分析提高了差异丰度检测的统计效率。
J Am Stat Assoc. 2025 Aug 5. doi: 10.1080/01621459.2025.2516205.
3

本文引用的文献

1
Accurate and Efficient -value Calculation via Gaussian Approximation: a Novel Monte-Carlo Method.通过高斯近似进行准确高效的价值计算:一种新型蒙特卡罗方法。
J Am Stat Assoc. 2019;114(525):384-392. doi: 10.1080/01621459.2017.1407776. Epub 2018 Jun 28.
2
Estimation of the false discovery proportion with unknown dependence.在依赖关系未知的情况下对错误发现比例的估计。
J R Stat Soc Series B Stat Methodol. 2017 Sep;79(4):1143-1164. doi: 10.1111/rssb.12204. Epub 2016 Sep 26.
3
The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies.
Transcription factor BACH2 shapes tissue-resident memory T cell programs to promote HIV-1 persistence.
转录因子BACH2塑造组织驻留记忆T细胞程序以促进HIV-1持续存在。
Immunity. 2025 Aug 16. doi: 10.1016/j.immuni.2025.07.022.
4
TrimNN: characterizing cellular community motifs for studying multicellular topological organization in complex tissues.TrimNN:表征细胞群落基序以研究复杂组织中的多细胞拓扑组织
Nat Commun. 2025 Aug 19;16(1):7737. doi: 10.1038/s41467-025-63141-7.
5
Genetic Modulation of Lifespan: Dynamic Effects, Sex Differences, and Body Weight Trade-offs.寿命的基因调控:动态效应、性别差异及体重权衡
bioRxiv. 2025 Jul 21:2025.04.27.649857. doi: 10.1101/2025.04.27.649857.
6
Joint, multifaceted genomic analysis enables diagnosis of diverse, ultra-rare monogenic presentations.联合多层面基因组分析能够诊断多种极其罕见的单基因疾病表现。
Nat Commun. 2025 Aug 7;16(1):7267. doi: 10.1038/s41467-025-61712-2.
7
Adolescent dietary patterns and methyl-donor nutrient intakes in relation to blood leukocyte DNA methylation of circadian genes.青少年饮食模式和甲基供体营养素摄入量与昼夜节律基因的血液白细胞DNA甲基化的关系。
Chronobiol Int. 2025 Jul 23:1-16. doi: 10.1080/07420528.2025.2532796.
8
Multi-omics Integrative Analysis for Incomplete Data Using Weighted -Value Adjustment Approaches.使用加权值调整方法对不完整数据进行多组学综合分析。
J Agric Biol Environ Stat. 2025;30(3):601-617. doi: 10.1007/s13253-024-00603-3. Epub 2024 Feb 28.
9
Regenie.QRS: computationally efficient whole-genome quantile regression at biobank scale.Regenie.QRS:生物样本库规模下计算效率高的全基因组分位数回归
bioRxiv. 2025 May 7:2025.05.02.651730. doi: 10.1101/2025.05.02.651730.
10
Coconut: covariate-assisted composite null hypothesis testing with applications to replicability analysis of high-throughput experimental data.椰子:协变量辅助复合零假设检验及其在高通量实验数据可重复性分析中的应用
BMC Bioinformatics. 2025 Jul 1;26(1):163. doi: 10.1186/s12859-025-06163-8.
用于基因关联研究中检测单核苷酸多态性(SNP)集效应的广义高阶检验法
J Am Stat Assoc. 2017;112(517):64-76. doi: 10.1080/01621459.2016.1192039. Epub 2017 May 3.
4
Partitioning heritability by functional annotation using genome-wide association summary statistics.利用全基因组关联研究汇总统计数据,通过功能注释对遗传力进行划分。
Nat Genet. 2015 Nov;47(11):1228-35. doi: 10.1038/ng.3404. Epub 2015 Sep 28.
5
Exact meta-analysis approach for discrete data and its application to 2 × 2 tables with rare events.离散数据的精确元分析方法及其在具有罕见事件的2×2表中的应用。
J Am Stat Assoc. 2014 Oct;109(508):1450-1465. doi: 10.1080/01621459.2014.946318.
6
JEPEG: a summary statistics based tool for gene-level joint testing of functional variants.JEPEG:一种基于汇总统计量的功能变异基因水平联合检测工具。
Bioinformatics. 2015 Apr 15;31(8):1176-82. doi: 10.1093/bioinformatics/btu816. Epub 2014 Dec 12.
7
Rare-variant association analysis: study designs and statistical tests.罕见变异关联分析:研究设计与统计检验。
Am J Hum Genet. 2014 Jul 3;95(1):5-23. doi: 10.1016/j.ajhg.2014.06.009.
8
Estimating False Discovery Proportion Under Arbitrary Covariance Dependence.任意协方差依赖下错误发现比例的估计
J Am Stat Assoc. 2012;107(499):1019-1035. doi: 10.1080/01621459.2012.720478.
9
Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits.条件和联合多位点 GWAS 汇总统计分析确定了影响复杂性状的其他变体。
Nat Genet. 2012 Mar 18;44(4):369-75, S1-3. doi: 10.1038/ng.2213.
10
Rare-variant association testing for sequencing data with the sequence kernel association test.基于序列核关联检验的测序数据罕见变异关联分析
Am J Hum Genet. 2011 Jul 15;89(1):82-93. doi: 10.1016/j.ajhg.2011.05.029. Epub 2011 Jul 7.