Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:1354-1357. doi: 10.1109/EMBC48229.2022.9871333.
Propensity score matching (PSM) is a technique used in retrospective investigation of cohort matching as an alternative approach to the prospective matching that is typically used by a randomized control trial (RCT). The process of selecting untreated cases that are the best match to the treated cases is the focus of this research. We created a PSM package for the python environment, termed PsmPy, to carry out this task. The PsmPy package debuted and proposed here is based on a logistic regression logit score where a match is selected using k-nearest neighbors (k-NN). Additional plotting and arguments are available to the user and are also described. To benchmark our method, we compared it with the existing R package, MatchIt, and evaluated our covariates' residual effect sizes with respect to the treatment condition before and after matching. Using a Mann-Whitney statistical test, we showed that our method significantly outperformed MatchIt in cohort matching (U=49, p<0.0001) when comparing residual effect sizes of the covariates. The PsmPy demonstrated a 10-fold average improvement in residual effect sizes amongst covariates when compared with the package MatchIt, suggesting that it is a viable alternative for use in propensity matching studies.
倾向评分匹配(PSM)是一种在回顾性队列匹配研究中使用的技术,作为随机对照试验(RCT)中常用的前瞻性匹配的替代方法。该研究的重点是选择与治疗病例最佳匹配的未经治疗的病例。我们为 Python 环境创建了一个 PSM 包,称为 PsmPy,以执行此任务。本文提出的 PsmPy 包是基于逻辑回归 logit 得分的,其中使用 k-最近邻(k-NN)选择匹配。向用户提供了其他绘图和参数,并对其进行了描述。为了基准测试我们的方法,我们将其与现有的 R 包 MatchIt 进行了比较,并评估了匹配前后治疗条件下协变量的残差效应大小。使用 Mann-Whitney 统计检验,我们表明在比较协变量的残差效应大小时,我们的方法在队列匹配方面明显优于 MatchIt(U=49,p<0.0001)。与 MatchIt 包相比,PsmPy 显示在协变量的残差效应大小方面平均提高了 10 倍,这表明它是一种可行的倾向匹配研究替代方法。