Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK.
Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA.
Pharmacoepidemiol Drug Saf. 2022 Apr;31(4):411-423. doi: 10.1002/pds.5412. Epub 2022 Feb 12.
The high-dimensional propensity score (HDPS) is a semi-automated procedure for confounder identification, prioritisation and adjustment in large healthcare databases that requires investigators to specify data dimensions, prioritisation strategy and tuning parameters. In practice, reporting of these decisions is inconsistent and this can undermine the transparency, and reproducibility of results obtained. We illustrate reporting tools, graphical displays and sensitivity analyses to increase transparency and facilitate evaluation of the robustness of analyses involving HDPS.
Using a study from the UK Clinical Practice Research Datalink that implemented HDPS we demonstrate the application of the proposed recommendations.
We identify seven considerations surrounding the implementation of HDPS, such as the identification of data dimensions, method for code prioritisation and number of variables selected. Graphical diagnostic tools include assessing the balance of key confounders before and after adjusting for empirically selected HDPS covariates and the identification of potentially influential covariates. Sensitivity analyses include varying the number of covariates selected and assessing the impact of covariates behaving empirically as instrumental variables. In our example, results were robust to both the number of covariates selected and the inclusion of potentially influential covariates. Furthermore, our HDPS models achieved good balance in key confounders.
The data-adaptive approach of HDPS and the resulting benefits have led to its popularity as a method for confounder adjustment in pharmacoepidemiological studies. Reporting of HDPS analyses in practice may be improved by the considerations and tools proposed here to increase the transparency and reproducibility of study results.
高维倾向评分(HDPS)是一种用于大型医疗保健数据库中混杂因素识别、优先级排序和调整的半自动方法,要求研究人员指定数据维度、优先级排序策略和调整参数。在实践中,这些决策的报告不一致,这可能会破坏结果的透明度和可重复性。我们展示了报告工具、图形显示和敏感性分析,以提高透明度并促进评估涉及 HDPS 的分析的稳健性。
使用来自英国临床实践研究数据链接的一项实施 HDPS 的研究,我们演示了拟议建议的应用。
我们确定了围绕 HDPS 实施的七个考虑因素,例如数据维度的识别、代码优先级排序方法和选择的变量数量。图形诊断工具包括评估在根据经验选择的 HDPS 协变量进行调整前后关键混杂因素的平衡情况,以及识别潜在的有影响的协变量。敏感性分析包括改变选择的协变量数量以及评估协变量表现为工具变量的影响。在我们的示例中,结果对选择的协变量数量和包含潜在有影响的协变量都是稳健的。此外,我们的 HDPS 模型在关键混杂因素方面实现了良好的平衡。
HDPS 的数据自适应方法及其带来的好处使其成为药物流行病学研究中混杂因素调整的一种流行方法。通过这里提出的考虑因素和工具,可以改进 HDPS 分析的报告,以提高研究结果的透明度和可重复性。