• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

克服模型不确定性——等效性检验如何从模型平均中受益。

Overcoming Model Uncertainty - How Equivalence Tests Can Benefit From Model Averaging.

作者信息

Hagemann Niklas, Möllenhoff Kathrin

机构信息

Institute of Medical Statistics and Computational Biology (IMSB), Faculty of Medicine, University of Cologne, Cologne, Germany.

出版信息

Stat Med. 2025 Mar 15;44(6):e10309. doi: 10.1002/sim.10309.

DOI:10.1002/sim.10309
PMID:40108784
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11923417/
Abstract

A common problem in numerous research areas, particularly in clinical trials, is to test whether the effect of an explanatory variable on an outcome variable is equivalent across different groups. In practice, these tests are frequently used to compare the effect between patient groups, for example, based on gender, age, or treatments. Equivalence is usually assessed by testing whether the difference between the groups does not exceed a pre-specified equivalence threshold. Classical approaches are based on testing the equivalence of single quantities, for example, the mean, the area under the curve or other values of interest. However, when differences depending on a particular covariate are observed, these approaches can turn out to be not very accurate. Instead, whole regression curves over the entire covariate range, describing for instance the time window or a dose range, are considered and tests are based on a suitable distance measure of two such curves, as, for example, the maximum absolute distance between them. In this regard, a key assumption is that the true underlying regression models are known, which is rarely the case in practice. However, misspecification can lead to severe problems as inflated type I errors or, on the other hand, conservative test procedures. In this paper, we propose a solution to this problem by introducing a flexible extension of such an equivalence test using model averaging in order to overcome this assumption and making the test applicable under model uncertainty. Precisely, we introduce model averaging based on smooth Bayesian information criterion weights and we propose a testing procedure which makes use of the duality between confidence intervals and hypothesis testing. We demonstrate the validity of our approach by means of a simulation study and illustrate its practical relevance considering a time-response case study with toxicological gene expression data.

摘要

在众多研究领域,尤其是临床试验中,一个常见的问题是检验解释变量对结果变量的影响在不同组间是否等同。在实际操作中,这些检验常被用于比较患者组之间的效应,例如基于性别、年龄或治疗方法。等同性通常通过检验组间差异是否不超过预先设定的等同性阈值来评估。经典方法基于检验单个量的等同性,例如均值、曲线下面积或其他感兴趣的值。然而,当观察到依赖于特定协变量的差异时,这些方法可能会变得不太准确。相反,会考虑在整个协变量范围内的完整回归曲线,例如描述时间窗或剂量范围,并且检验基于两条这样的曲线的合适距离度量,例如它们之间的最大绝对距离。在这方面,一个关键假设是真实的潜在回归模型是已知的,而这在实际中很少成立。然而,模型设定错误可能会导致严重问题,如第一类错误膨胀,或者另一方面导致保守的检验程序。在本文中,我们提出了一个解决这个问题的方案,即通过引入一种使用模型平均的等同性检验的灵活扩展,以克服这个假设并使检验在模型不确定性下适用。具体而言,我们引入基于平滑贝叶斯信息准则权重的模型平均,并提出一种利用置信区间和假设检验之间对偶性的检验程序。我们通过模拟研究证明了我们方法的有效性,并通过考虑一个有毒理学基因表达数据的时间响应案例研究来说明其实际相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/640ad1f9f136/SIM-44-0-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/f10ed71d7baa/SIM-44-0-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/2f922b6662a9/SIM-44-0-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/b865fd08df46/SIM-44-0-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/e5d8d76ca672/SIM-44-0-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/01bbfa075dad/SIM-44-0-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/640ad1f9f136/SIM-44-0-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/f10ed71d7baa/SIM-44-0-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/2f922b6662a9/SIM-44-0-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/b865fd08df46/SIM-44-0-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/e5d8d76ca672/SIM-44-0-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/01bbfa075dad/SIM-44-0-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1516/11923417/640ad1f9f136/SIM-44-0-g004.jpg

相似文献

1
Overcoming Model Uncertainty - How Equivalence Tests Can Benefit From Model Averaging.克服模型不确定性——等效性检验如何从模型平均中受益。
Stat Med. 2025 Mar 15;44(6):e10309. doi: 10.1002/sim.10309.
2
Testing for similarity of multivariate mixed outcomes using generalized joint regression models with application to efficacy-toxicity responses.使用广义联合回归模型对多元混合结局进行相似性检验及其在疗效-毒性反应中的应用。
Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae077.
3
Model averaging for treatment effect estimation in subgroups.亚组中治疗效果估计的模型平均法。
Pharm Stat. 2017 Mar;16(2):133-142. doi: 10.1002/pst.1796. Epub 2016 Dec 9.
4
Equivalence of regression curves sharing common parameters.回归曲线的参数共享等效性。
Biometrics. 2020 Jun;76(2):518-529. doi: 10.1111/biom.13149. Epub 2019 Nov 1.
5
Model selection in medical research: a simulation study comparing Bayesian model averaging and stepwise regression.医学研究中的模型选择:贝叶斯模型平均与逐步回归比较的模拟研究。
BMC Med Res Methodol. 2010 Dec 6;10:108. doi: 10.1186/1471-2288-10-108.
6
An Approach to Addressing Multiple Imputation Model Uncertainty Using Bayesian Model Averaging.贝叶斯模型平均处理多重插补模型不确定性的方法。
Multivariate Behav Res. 2020 Jul-Aug;55(4):553-567. doi: 10.1080/00273171.2019.1657790. Epub 2019 Sep 20.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
Investigating non-inferiority or equivalence in time-to-event data under non-proportional hazards.在非比例风险下,对时依事件数据进行非劣效性或等效性研究。
Lifetime Data Anal. 2023 Jul;29(3):483-507. doi: 10.1007/s10985-023-09589-5. Epub 2023 Jan 28.
9
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].[氯化乙酰甲胆碱支气管激发试验标准技术规范(2023年)]
Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247.
10
Bayesian Hodges-Lehmann tests for statistical equivalence in the two-sample setting: Power analysis, type I error rates and equivalence boundary selection in biomedical research.贝叶斯霍奇斯-莱曼检验在两样本设置中的统计等效性:生物医学研究中的功效分析、Ⅰ类错误率和等效边界选择。
BMC Med Res Methodol. 2021 Aug 17;21(1):171. doi: 10.1186/s12874-021-01341-7.

本文引用的文献

1
Testing Similarity of Parametric Competing Risks Models for Identifying Potentially Similar Pathways in Healthcare.测试参数竞争风险模型在识别医疗保健中潜在相似途径的相似性。
Stat Med. 2024 Dec 10;43(28):5316-5330. doi: 10.1002/sim.10243. Epub 2024 Oct 12.
2
Testing for similarity of multivariate mixed outcomes using generalized joint regression models with application to efficacy-toxicity responses.使用广义联合回归模型对多元混合结局进行相似性检验及其在疗效-毒性反应中的应用。
Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae077.
3
Benefit of using interaction effects for the analysis of high-dimensional time-response or dose-response data for two-group comparisons.
在两样本比较中分析高维时间响应或剂量反应数据时使用交互作用的好处。
Sci Rep. 2023 Nov 27;13(1):20804. doi: 10.1038/s41598-023-47057-0.
4
Guidance for statistical design and analysis of toxicological dose-response experiments, based on a comprehensive literature review.基于全面文献回顾的毒理学剂量-反应实验统计设计与分析指南。
Arch Toxicol. 2023 Oct;97(10):2741-2761. doi: 10.1007/s00204-023-03561-w. Epub 2023 Aug 12.
5
Impact of model misspecification on model-based tests in PK studies with parallel design: real case and simulation studies.模型误设定对平行设计 PK 研究中基于模型检验的影响:真实案例与模拟研究。
J Pharmacokinet Pharmacodyn. 2022 Oct;49(5):557-577. doi: 10.1007/s10928-022-09821-z. Epub 2022 Sep 16.
6
Model selection characteristics when using MCP-Mod for dose-response gene expression data.使用 MCP-Mod 对剂量反应基因表达数据进行模型选择的特点。
Biom J. 2022 Jun;64(5):883-897. doi: 10.1002/bimj.202000250. Epub 2022 Feb 20.
7
Spatio-Temporal Multiscale Analysis of Western Diet-Fed Mice Reveals a Translationally Relevant Sequence of Events during NAFLD Progression.高脂肪西式饮食喂养小鼠的时空多尺度分析揭示了 NAFLD 进展过程中具有翻译相关性的一系列事件。
Cells. 2021 Sep 23;10(10):2516. doi: 10.3390/cells10102516.
8
Errors in Statistical Inference Under Model Misspecification: Evidence, Hypothesis Testing, and AIC.模型设定错误下统计推断中的误差:证据、假设检验与赤池信息准则
Front Ecol Evol. 2019;7. doi: 10.3389/fevo.2019.00372. Epub 2019 Oct 21.
9
Testing for similarity of binary efficacy-toxicity responses.检测二进制疗效-毒性反应的相似性。
Biostatistics. 2022 Jul 18;23(3):949-966. doi: 10.1093/biostatistics/kxaa058.
10
Efficient model-based bioequivalence testing.基于模型的高效生物等效性测试。
Biostatistics. 2022 Jan 13;23(1):314-327. doi: 10.1093/biostatistics/kxaa026.