Hripcsak George, Zhang Linying, Chen Yong, Li Kelly, Suchard Marc A, Ryan Patrick B, Schuemie Martijn J
Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA.
Observational Health Data Sciences and Informatics, New York, NY, USA.
medRxiv. 2025 Feb 21:2024.04.23.24306230. doi: 10.1101/2024.04.23.24306230.
Propensity score adjustment addresses confounding by balancing covariates across treatment groups through matching, stratification, or weighting, and diagnostics test whether the adjustment succeeded. For example, if the standardized mean difference (SMD) for a relevant covariate exceeds a threshold such as 0.1, the covariate is considered imbalanced and the study may be judged invalid. Unfortunately, for studies with small or moderate numbers of subjects, the probability of falsely rejecting the validity of a study because of chance imbalance (that is, of asserting imbalance via an SMD cutoff when no underlying imbalance exists) can be grossly larger than a nominal level such as 0.05. In this paper, we illustrate that chance imbalance is operative in real-world settings even at moderate sample sizes around 2000. We identify a previously unrecognized challenge: as meta-analysis increases the precision of an effect estimate, the diagnostics must also undergo meta-analysis for a corresponding increase in precision. We propose an alternative diagnostic that tests whether the standardized mean difference statistically significantly exceeds the threshold. Through simulation and real-world data, we find that this diagnostic achieves a better trade-off between type I error rate and power than either the standard nominal-threshold test or forgoing the test, for sample sizes from 250 to 4000 and for 20 to 100,000 covariates. We confirm that in network studies, meta-analysis of effect estimates must be accompanied by meta-analysis of the diagnostics, or else systematic confounding may overwhelm the estimated effect. Our procedure supports the review of large numbers of covariates, enabling more rigorous diagnostics.