Fast and powerful conditional randomization testing via distillation.

Authors

Liu Molei, Katsevich Eugene, Janson Lucas, Ramdas Aaditya

Affiliations

Department of Biostatistics, Harvard Chan School of Public Health, 677 Huntington Avenue, Boston, Massachusetts 02115, U.S.A.

Department of Statistics and Data Science, Wharton School of the University of Pennsylvania, 265 South 37th Street, Philadelphia, Pennsylvania 19104, U.S.A.

Publication information

Biometrika. 2022 Jun;109(2):277-293. doi: 10.1093/biomet/asab039. Epub 2021 Jul 8.

DOI:10.1093/biomet/asab039
PMID:37416628
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10323874/

Abstract

We consider the problem of conditional independence testing: given a response $Y$ and covariates $(X, Z)$, we test the null hypothesis that $Y \perp\!\!\!\perp X \mid Z$. The conditional randomization test was recently proposed as a way to use distributional information about $X \mid Z$ to exactly and nonasymptotically control Type-I error using any test statistic in any dimensionality without assuming anything about $Y \mid (X, Z)$. This flexibility, in principle, allows one to derive powerful test statistics from complex prediction algorithms while maintaining statistical validity. Yet the direct use of such advanced test statistics in the conditional randomization test is prohibitively computationally expensive, especially with multiple testing, because the test statistic must be recomputed many times on resampled data. We propose the distilled conditional randomization test, a novel approach to using state-of-the-art machine learning algorithms in the conditional randomization test while drastically reducing the number of times those algorithms need to be run, thereby taking advantage of their power and the conditional randomization test's statistical guarantees without suffering the usual computational expense. In addition to distillation, we propose a number of other tricks, like screening and recycling computations, to further speed up the conditional randomization test without sacrificing its high power and exact validity. Indeed, we show in simulations that all our proposals combined lead to a test that has similar power to the most powerful existing conditional randomization test implementations, but requires orders of magnitude less computation, making it a practical tool even for large datasets. We demonstrate these benefits on a breast cancer dataset by identifying biomarkers related to cancer stage.
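
The distillation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on assumptions that are not in the abstract: the conditional distribution of the tested covariate given the others is taken to be Gaussian with a linear mean, that mean and the noise scale are estimated by ordinary least squares rather than known exactly, and ordinary least squares stands in for the machine learning algorithm used in the distillation step. The function name `dcrt_pvalue` and its parameters are hypothetical. The point it shows is that the expensive fit of the response on the other covariates runs once, while each resample only costs an inner product.

```python
import numpy as np


def dcrt_pvalue(y, x, Z, n_resamples=500, seed=0):
    """Sketch of a distilled CRT p-value for H0: y independent of x given Z.

    Assumptions of this sketch (not taken from the paper's abstract):
    x | Z is Gaussian with a linear mean, fitted here by least squares,
    and least squares replaces the ML model in the distillation step.
    """
    rng = np.random.default_rng(seed)
    n = len(y)
    Z1 = np.column_stack([np.ones(n), Z])  # add an intercept column

    # Distillation: fit y on Z once and keep only the residual.
    # In practice this single fit is the expensive ML step.
    beta_y, *_ = np.linalg.lstsq(Z1, y, rcond=None)
    r_y = y - Z1 @ beta_y

    # Model for x | Z: estimated conditional mean and noise scale.
    beta_x, *_ = np.linalg.lstsq(Z1, x, rcond=None)
    mu_x = Z1 @ beta_x
    sigma = np.std(x - mu_x, ddof=Z1.shape[1])

    # Cheap statistic: |inner product| of the x residual and the y residual.
    t_obs = abs(np.dot(x - mu_x, r_y))

    # Resample x from the assumed x | Z model; no refitting is needed.
    x_tilde = mu_x + sigma * rng.standard_normal((n_resamples, n))
    t_null = np.abs((x_tilde - mu_x) @ r_y)

    # Standard randomization p-value with the +1 correction.
    return (1 + np.sum(t_null >= t_obs)) / (n_resamples + 1)
```

Calling `dcrt_pvalue(y, x, Z)` with a length-`n` response `y`, a length-`n` covariate `x`, and an `n`-by-`p` matrix `Z` returns a p-value in (0, 1]. Because the conditional model is estimated here instead of known, the exact Type-I error guarantee described in the abstract holds only approximately for this sketch.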
