• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用大规模倾向评分匹配和基数匹配在观察性研究中的因果推断的比较。

Applied comparison of large-scale propensity score matching and cardinality matching for causal inference in observational research.

机构信息

Janssen R&D, LLC, Raritan, NJ, USA.

Johnson & Johnson, New Brunswick, NJ, USA.

出版信息

BMC Med Res Methodol. 2021 May 24;21(1):109. doi: 10.1186/s12874-021-01282-1.

DOI:10.1186/s12874-021-01282-1
PMID:34030640
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8146256/
Abstract

BACKGROUND

Cardinality matching (CM), a novel matching technique, finds the largest matched sample meeting prespecified balance criteria thereby overcoming limitations of propensity score matching (PSM) associated with limited covariate overlap, which are especially pronounced in studies with small sample sizes. The current study proposes a framework for large-scale CM (LS-CM); and compares large-scale PSM (LS-PSM) and LS-CM in terms of post-match sample size, covariate balance and residual confounding at progressively smaller sample sizes.

METHODS

Evaluation of LS-PSM and LS-CM within a comparative cohort study of new users of angiotensin-converting enzyme inhibitor (ACEI) and thiazide or thiazide-like diuretic monotherapy identified from a U.S. insurance claims database. Candidate covariates included patient demographics, and all observed prior conditions, drug exposures and procedures. Propensity scores were calculated using LASSO regression, and candidate covariates with non-zero beta coefficients in the propensity model were defined as matching covariates for use in LS-CM. One-to-one matching was performed using progressively tighter parameter settings. Covariate balance was assessed using standardized mean differences. Hazard ratios for negative control outcomes perceived as unassociated with treatment (i.e., true hazard ratio of 1) were estimated using unconditional Cox models. Residual confounding was assessed using the expected systematic error of the empirical null distribution of negative control effect estimates compared to the ground truth. To simulate diverse research conditions, analyses were repeated within 10 %, 1 and 0.5 % subsample groups with increasingly limited covariate overlap.

RESULTS

A total of 172,117 patients (ACEI: 129,078; thiazide: 43,039) met the study criteria. As compared to LS-PSM, LS-CM was associated with increased sample retention. Although LS-PSM achieved balance across all matching covariates within the full study population, substantial matching covariate imbalance was observed within the 1 and 0.5 % subsample groups. Meanwhile, LS-CM achieved matching covariate balance across all analyses. LS-PSM was associated with better candidate covariate balance within the full study population. Otherwise, both matching techniques achieved comparable candidate covariate balance and expected systematic error.

CONCLUSIONS

LS-CM found the largest matched sample meeting prespecified balance criteria while achieving comparable candidate covariate balance and residual confounding. We recommend LS-CM as an alternative to LS-PSM in studies with small sample sizes or limited covariate overlap.

摘要

背景

基数匹配(CM)是一种新颖的匹配技术,它找到满足预定平衡标准的最大匹配样本,从而克服了倾向评分匹配(PSM)的局限性,后者与有限的协变量重叠有关,在样本量较小的研究中尤为明显。本研究提出了一种大规模基数匹配(LS-CM)框架;并在一项来自美国保险索赔数据库的新型血管紧张素转换酶抑制剂(ACEI)和噻嗪或噻嗪类利尿剂单药治疗新使用者的比较队列研究中,比较了 LS-PSM 和 LS-CM 在匹配后样本量、协变量平衡和残差混杂方面的差异,样本量逐渐减小。

方法

使用 LASSO 回归计算倾向得分,并将倾向模型中具有非零β系数的候选协变量定义为用于 LS-CM 的匹配协变量。使用逐渐严格的参数设置进行一对一匹配。使用标准化均数差评估协变量平衡。使用无条件 Cox 模型估计被认为与治疗无关的负对照结果(即真实危险比为 1)的危险比。使用负对照效果估计值的经验零分布与实际值的预期系统误差评估残余混杂。为了模拟不同的研究条件,在 10%、1%和 0.5%的子样本组内重复进行分析,其中协变量重叠越来越有限。

结果

共有 172117 名患者(ACEI:129078;噻嗪:43039)符合研究标准。与 LS-PSM 相比,LS-CM 与样本保留增加有关。虽然 LS-PSM 在整个研究人群中实现了所有匹配协变量的平衡,但在 1%和 0.5%子样本组中观察到了大量的匹配协变量不平衡。同时,LS-CM 实现了所有分析的匹配协变量平衡。LS-PSM 在整个研究人群中与更好的候选协变量平衡相关。否则,两种匹配技术都实现了相当的候选协变量平衡和预期的系统误差。

结论

LS-CM 找到了满足预定平衡标准的最大匹配样本,同时实现了相当的候选协变量平衡和残余混杂。我们建议在样本量较小或协变量重叠有限的研究中,LS-CM 作为 LS-PSM 的替代方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/8146256/5e1320d7da94/12874_2021_1282_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/8146256/34344d9ee2f8/12874_2021_1282_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/8146256/609d9a5be9b5/12874_2021_1282_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/8146256/4ac1c68da678/12874_2021_1282_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/8146256/85d3232a0bce/12874_2021_1282_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/8146256/5e1320d7da94/12874_2021_1282_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/8146256/34344d9ee2f8/12874_2021_1282_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/8146256/609d9a5be9b5/12874_2021_1282_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/8146256/4ac1c68da678/12874_2021_1282_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/8146256/85d3232a0bce/12874_2021_1282_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/8146256/5e1320d7da94/12874_2021_1282_Fig5_HTML.jpg

相似文献

1
Applied comparison of large-scale propensity score matching and cardinality matching for causal inference in observational research.应用大规模倾向评分匹配和基数匹配在观察性研究中的因果推断的比较。
BMC Med Res Methodol. 2021 May 24;21(1):109. doi: 10.1186/s12874-021-01282-1.
2
Indirect covariate balance and residual confounding: An applied comparison of propensity score matching and cardinality matching.间接协变量平衡与残余混杂:倾向得分匹配与基数匹配的应用比较
Pharmacoepidemiol Drug Saf. 2022 Dec;31(12):1242-1252. doi: 10.1002/pds.5510. Epub 2022 Jul 20.
3
Evaluation of subset matching methods and forms of covariate balance.子集匹配方法及协变量平衡形式的评估。
Stat Med. 2016 Nov 30;35(27):4961-4979. doi: 10.1002/sim.7036. Epub 2016 Jul 21.
4
Evaluating the comparability of osteoporosis treatments using propensity score and negative control outcome methods in UK and Denmark electronic health record databases.使用倾向评分和阴性对照结局方法评估英国和丹麦电子健康记录数据库中骨质疏松症治疗的可比性。
J Bone Miner Res. 2024 Aug 5;39(7):844-854. doi: 10.1093/jbmr/zjae059.
5
Five Steps to Successfully Implement and Evaluate Propensity Score Matching in Clinical Research Studies.成功实施和评估临床研究中倾向评分匹配的五个步骤。
Anesth Analg. 2018 Oct;127(4):1066-1073. doi: 10.1213/ANE.0000000000002787.
6
Double-adjustment in propensity score matching analysis: choosing a threshold for considering residual imbalance.倾向得分匹配分析中的双重调整:选择一个用于考虑残余不平衡的阈值。
BMC Med Res Methodol. 2017 Apr 28;17(1):78. doi: 10.1186/s12874-017-0338-0.
7
Application and comparison of generalized propensity score matching versus pairwise propensity score matching.广义倾向得分匹配与成对倾向得分匹配的应用及比较
J Comp Eff Res. 2018 Sep;7(9):923-934. doi: 10.2217/cer-2018-0030. Epub 2018 Jun 21.
8
Assessing the impact of propensity score estimation and implementation on covariate balance and confounding control within and across important subgroups in comparative effectiveness research.评估倾向评分估计和实施对比较有效性研究中重要亚组内和跨亚组的协变量平衡和混杂控制的影响。
Med Care. 2014 Mar;52(3):280-7. doi: 10.1097/MLR.0000000000000064.
9
A comparison of machine learning algorithms and covariate balance measures for propensity score matching and weighting.用于倾向得分匹配和加权的机器学习算法与协变量平衡度量的比较
Biom J. 2019 Jul;61(4):1049-1072. doi: 10.1002/bimj.201800132. Epub 2019 May 14.
10
Cholesterol-lowering effect of statin therapy in a clinical HIV cohort: an application of double propensity score adjustment.他汀类药物治疗对临床 HIV 队列中胆固醇降低的影响:双重倾向评分调整的应用。
Ann Epidemiol. 2020 Apr;44:8-15. doi: 10.1016/j.annepidem.2020.02.005. Epub 2020 Mar 2.

引用本文的文献

1
Competing risk analysis of surgical resection radiofrequency ablation in early-stage small hepatocellular carcinoma: a SEER-based study.早期小肝细胞癌手术切除与射频消融的竞争风险分析:一项基于监测、流行病学和最终结果(SEER)数据库的研究
Transl Cancer Res. 2025 Jun 30;14(6):3565-3576. doi: 10.21037/tcr-2024-2550. Epub 2025 Jun 27.
2
Large-scale examination of early-age sex differences in neurotypical toddlers and those with autism spectrum disorder or other developmental conditions.对神经发育正常的幼儿以及患有自闭症谱系障碍或其他发育障碍的幼儿的早期性别差异进行大规模研究。
Nat Hum Behav. 2025 May 26. doi: 10.1038/s41562-025-02132-6.
3

本文引用的文献

1
Comprehensive comparative effectiveness and safety of first-line antihypertensive drug classes: a systematic, multinational, large-scale analysis.一线降压药类别全面比较效果和安全性:系统的、多国的、大规模分析。
Lancet. 2019 Nov 16;394(10211):1816-1826. doi: 10.1016/S0140-6736(19)32317-7. Epub 2019 Oct 24.
2
2017 ACC/AHA/AAPA/ABC/ACPM/AGS/APhA/ASH/ASPC/NMA/PCNA Guideline for the Prevention, Detection, Evaluation, and Management of High Blood Pressure in Adults: Executive Summary: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines.2017美国心脏病学会/美国心脏协会/美国医师助理学会/美国心脏病学学会/美国预防医学学院/美国老年病学会/美国药剂师协会/美国血液学会/美国预防心脏病学会/美国国家医学协会/美国初级保健医师学会成人高血压预防、检测、评估和管理指南:执行摘要:美国心脏病学会/美国心脏协会临床实践指南工作组报告
Circulation. 2018 Oct 23;138(17):e426-e483. doi: 10.1161/CIR.0000000000000597.
3
Cardiometabolic index as a predictor of gallstone incidence in U.S. adults: insights from NHANES 2017-2020.
心脏代谢指数作为美国成年人胆结石发病率的预测指标:来自2017 - 2020年美国国家健康与营养检查调查(NHANES)的见解
BMC Gastroenterol. 2025 Jan 29;25(1):45. doi: 10.1186/s12876-025-03642-3.
4
miMatch: a microbial metabolic background matching tool for mitigating host confounding in metagenomics research.miMatch:一种微生物代谢背景匹配工具,用于减轻宏基因组研究中的宿主混杂。
Gut Microbes. 2024 Jan-Dec;16(1):2434029. doi: 10.1080/19490976.2024.2434029. Epub 2024 Nov 27.
5
Cardinality matching versus propensity score matching for addressing cluster-level residual confounding in implantable medical device and surgical epidemiology: a parametric and plasmode simulation study.基于参数和等离子体模拟的研究:针对医疗器械和外科流行病学中基于簇的残余混杂问题,采用配比法和倾向评分匹配法的比较。
BMC Med Res Methodol. 2024 Nov 22;24(1):289. doi: 10.1186/s12874-024-02406-z.
6
Clinico-pathological study of esophageal mucoepidermoid carcinoma: a 10-year survival from a single center.食管黏液表皮样癌的临床病理研究:单中心 10 年生存分析。
BMC Gastroenterol. 2024 May 8;24(1):156. doi: 10.1186/s12876-024-03215-w.
7
Treatment modalities and outcomes of granular cell tumors and spindle cell oncocytomas of the pituitary gland: an analysis of two national cancer databases.垂体颗粒细胞肿瘤和梭形细胞嗜酸细胞瘤的治疗方式和结局:两个国家癌症数据库的分析。
Acta Neurochir (Wien). 2024 Apr 5;166(1):169. doi: 10.1007/s00701-024-06054-6.
8
Augmenting external control arms using Bayesian borrowing: a case study in first-line non-small cell lung cancer.使用贝叶斯借用增强外部对照臂:一线非小细胞肺癌的案例研究
J Comp Eff Res. 2024 May;13(5):e230175. doi: 10.57264/cer-2023-0175. Epub 2024 Apr 4.
9
Relationship between weight-adjusted waist circumference index and prevalence of gallstones in U.S. adults: a study based on the NHANES 2017-2020.体重调整腰围指数与美国成年人胆囊结石患病率的关系:基于 NHANES 2017-2020 的研究。
Front Endocrinol (Lausanne). 2023 Oct 27;14:1276465. doi: 10.3389/fendo.2023.1276465. eCollection 2023.
10
COVID-19 vaccination effectiveness rates by week and sources of bias: a retrospective cohort study.按周和偏倚来源划分的 COVID-19 疫苗有效性率:一项回顾性队列研究。
BMJ Open. 2022 Aug 23;12(8):e061126. doi: 10.1136/bmjopen-2022-061126.
Evaluating large-scale propensity score performance through real-world and synthetic data experiments.通过真实数据和合成数据实验评估大规模倾向评分性能。
Int J Epidemiol. 2018 Dec 1;47(6):2005-2014. doi: 10.1093/ije/dyy120.
4
Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data.基于观察性医疗保健数据的人群效应估计研究的经验置信区间校准。
Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):2571-2577. doi: 10.1073/pnas.1708282114.
5
Accuracy of an automated knowledge base for identifying drug adverse reactions.用于识别药物不良反应的自动化知识库的准确性。
J Biomed Inform. 2017 Feb;66:72-81. doi: 10.1016/j.jbi.2016.12.005. Epub 2016 Dec 16.
6
Robust empirical calibration of p-values using observational data.使用观测数据对p值进行稳健的经验校准。
Stat Med. 2016 Sep 30;35(22):3883-8. doi: 10.1002/sim.6977.
7
Evaluation of subset matching methods and forms of covariate balance.子集匹配方法及协变量平衡形式的评估。
Stat Med. 2016 Nov 30;35(27):4961-4979. doi: 10.1002/sim.7036. Epub 2016 Jul 21.
8
Interpreting observational studies: why empirical calibration is needed to correct p-values.解读观察性研究:为何需要经验校准来修正 p 值。
Stat Med. 2014 Jan 30;33(2):209-18. doi: 10.1002/sim.5925. Epub 2013 Jul 30.
9
Evaluation of the propensity score methods for estimating marginal odds ratios in case of small sample size.评价在小样本量情况下估计边缘比值比的倾向评分方法。
BMC Med Res Methodol. 2012 May 30;12:70. doi: 10.1186/1471-2288-12-70.
10
Refining clinical risk stratification for predicting stroke and thromboembolism in atrial fibrillation using a novel risk factor-based approach: the euro heart survey on atrial fibrillation.采用新型基于风险因素的方法对房颤患者的卒中与血栓栓塞风险进行临床分层的研究:房颤的欧洲心脏调查。
Chest. 2010 Feb;137(2):263-72. doi: 10.1378/chest.09-1584. Epub 2009 Sep 17.