高维倾向评分分析的透明度：诊断和报告指南。

Transparency of high-dimensional propensity score analyses: Guidance for diagnostics and reporting.

机构信息

Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK.

Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA.

出版信息

Pharmacoepidemiol Drug Saf. 2022 Apr;31(4):411-423. doi: 10.1002/pds.5412. Epub 2022 Feb 12.

DOI:10.1002/pds.5412

PMID:35092316

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9305520/

Abstract

PURPOSE

The high-dimensional propensity score (HDPS) is a semi-automated procedure for confounder identification, prioritisation and adjustment in large healthcare databases that requires investigators to specify data dimensions, prioritisation strategy and tuning parameters. In practice, reporting of these decisions is inconsistent and this can undermine the transparency, and reproducibility of results obtained. We illustrate reporting tools, graphical displays and sensitivity analyses to increase transparency and facilitate evaluation of the robustness of analyses involving HDPS.

METHODS

Using a study from the UK Clinical Practice Research Datalink that implemented HDPS we demonstrate the application of the proposed recommendations.

RESULTS

We identify seven considerations surrounding the implementation of HDPS, such as the identification of data dimensions, method for code prioritisation and number of variables selected. Graphical diagnostic tools include assessing the balance of key confounders before and after adjusting for empirically selected HDPS covariates and the identification of potentially influential covariates. Sensitivity analyses include varying the number of covariates selected and assessing the impact of covariates behaving empirically as instrumental variables. In our example, results were robust to both the number of covariates selected and the inclusion of potentially influential covariates. Furthermore, our HDPS models achieved good balance in key confounders.

CONCLUSIONS

The data-adaptive approach of HDPS and the resulting benefits have led to its popularity as a method for confounder adjustment in pharmacoepidemiological studies. Reporting of HDPS analyses in practice may be improved by the considerations and tools proposed here to increase the transparency and reproducibility of study results.

摘要

目的

高维倾向评分（HDPS）是一种用于大型医疗保健数据库中混杂因素识别、优先级排序和调整的半自动方法，要求研究人员指定数据维度、优先级排序策略和调整参数。在实践中，这些决策的报告不一致，这可能会破坏结果的透明度和可重复性。我们展示了报告工具、图形显示和敏感性分析，以提高透明度并促进评估涉及 HDPS 的分析的稳健性。

方法

使用来自英国临床实践研究数据链接的一项实施 HDPS 的研究，我们演示了拟议建议的应用。

结果

我们确定了围绕 HDPS 实施的七个考虑因素，例如数据维度的识别、代码优先级排序方法和选择的变量数量。图形诊断工具包括评估在根据经验选择的 HDPS 协变量进行调整前后关键混杂因素的平衡情况，以及识别潜在的有影响的协变量。敏感性分析包括改变选择的协变量数量以及评估协变量表现为工具变量的影响。在我们的示例中，结果对选择的协变量数量和包含潜在有影响的协变量都是稳健的。此外，我们的 HDPS 模型在关键混杂因素方面实现了良好的平衡。

结论

HDPS 的数据自适应方法及其带来的好处使其成为药物流行病学研究中混杂因素调整的一种流行方法。通过这里提出的考虑因素和工具，可以改进 HDPS 分析的报告，以提高研究结果的透明度和可重复性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/9305520/8752931f7ff3/PDS-31-411-g003.jpg

相似文献

Transparency of high-dimensional propensity score analyses: Guidance for diagnostics and reporting.

Pharmacoepidemiol Drug Saf. 2022 Apr;31(4):411-423. doi: 10.1002/pds.5412. Epub 2022 Feb 12.

High-dimensional propensity scores for empirical covariate selection in secondary database studies: Planning, implementation, and reporting.

Pharmacoepidemiol Drug Saf. 2023 Feb;32(2):93-106. doi: 10.1002/pds.5566. Epub 2022 Nov 22.

Combining Super Learner with high-dimensional propensity score to improve confounding adjustment: A real-world application in chronic lymphocytic leukemia.

Pharmacoepidemiol Drug Saf. 2024 Jan;33(1):e5678. doi: 10.1002/pds.5678. Epub 2023 Aug 23.

A methodological review of the high-dimensional propensity score in comparative-effectiveness and safety-of-interventions research finds incomplete reporting relative to algorithm development and robustness.

J Clin Epidemiol. 2024 May;169:111305. doi: 10.1016/j.jclinepi.2024.111305. Epub 2024 Feb 28.

Estimation of high-dimensional propensity scores with multiple exposure levels.

Pharmacoepidemiol Drug Saf. 2020 Jan;29 Suppl 1:53-60. doi: 10.1002/pds.4890. Epub 2019 Sep 30.

Application of High-Dimensional Propensity Score Methods to the National Health and Aging Trends Study.

J Gerontol A Biol Sci Med Sci. 2024 Sep 1;79(9). doi: 10.1093/gerona/glae178.

A comparison of confounder selection and adjustment methods for estimating causal effects using large healthcare databases.

Pharmacoepidemiol Drug Saf. 2022 Apr;31(4):424-433. doi: 10.1002/pds.5403. Epub 2022 Jan 7.

High-dimensional propensity score algorithm in comparative effectiveness research with time-varying interventions.

Stat Med. 2015 Feb 28;34(5):753-81. doi: 10.1002/sim.6377. Epub 2014 Dec 8.

Evaluating the Performance of High-Dimensional Propensity Scores Compared with Standard Propensity Scores for Comparing Antihypertensive Therapies in the CPRD GOLD Database.

Cardiol Ther. 2023 Jun;12(2):393-408. doi: 10.1007/s40119-023-00316-7. Epub 2023 May 5.

引用本文的文献

Simultaneously Dealing With Immortal Time Bias and Residual Confounding: A Case Study of a High-Dimensional Propensity Score Approach With a Nested Case-Control Framework in Multiple Sclerosis Research.

Pharmacoepidemiol Drug Saf. 2025 Jul;34(7):e70174. doi: 10.1002/pds.70174.

Improving ACS prediction in T2DM patients by addressing false records in electronic medical records using propensity score.

Sci Rep. 2025 May 28;15(1):18679. doi: 10.1038/s41598-025-03666-5.

How Effective Are Machine Learning and Doubly Robust Estimators in Incorporating High-Dimensional Proxies to Reduce Residual Confounding?

Pharmacoepidemiol Drug Saf. 2025 May;34(5):e70155. doi: 10.1002/pds.70155.

Identifying markers of health-seeking behaviour and healthcare access in UK electronic health records.

BMJ Open. 2024 Sep 26;14(9):e081781. doi: 10.1136/bmjopen-2023-081781.

High-dimensional Iterative Causal Forest (hdiCF) for Subgroup Identification Using Health Care Claims Data.

Am J Epidemiol. 2024 Sep 5. doi: 10.1093/aje/kwae322.

Application of High-Dimensional Propensity Score Methods to the National Health and Aging Trends Study.

J Gerontol A Biol Sci Med Sci. 2024 Sep 1;79(9). doi: 10.1093/gerona/glae178.

A population-based cohort of drug exposures and adverse pregnancy outcomes in China (DEEP): rationale, design, and baseline characteristics.

Eur J Epidemiol. 2024 Apr;39(4):433-445. doi: 10.1007/s10654-024-01124-6. Epub 2024 Apr 9.

Associations Between Postdischarge Care and Cognitive Impairment-Related Hospital Readmissions for Ketoacidosis and Severe Hypoglycemia in Adults With Diabetes.

High-dimensional propensity scores for empirical covariate selection in secondary database studies: Planning, implementation, and reporting.

Pharmacoepidemiol Drug Saf. 2023 Feb;32(2):93-106. doi: 10.1002/pds.5566. Epub 2022 Nov 22.

Visualizations throughout pharmacoepidemiology study planning, implementation, and reporting.

Pharmacoepidemiol Drug Saf. 2022 Nov;31(11):1140-1152. doi: 10.1002/pds.5529. Epub 2022 Sep 9.

本文引用的文献

Using propensity scores to estimate effects of treatment initiation decisions: State of the science.

Stat Med. 2021 Mar 30;40(7):1718-1735. doi: 10.1002/sim.8866. Epub 2020 Dec 29.

Implementing high-dimensional propensity score principles to improve confounder adjustment in UK electronic health records.

Pharmacoepidemiol Drug Saf. 2020 Nov;29(11):1373-1381. doi: 10.1002/pds.5121. Epub 2020 Sep 14.

A review of the use of propensity score diagnostics in papers published in high-ranking medical journals.

BMC Med Res Methodol. 2020 May 27;20(1):132. doi: 10.1186/s12874-020-00994-0.

Comparing the high-dimensional propensity score for use with administrative data with propensity scores derived from high-quality clinical data.

Stat Methods Med Res. 2020 Feb;29(2):568-588. doi: 10.1177/0962280219842362. Epub 2019 Apr 11.

Theory meets practice: a commentary on VanderWeele's 'principles of confounder selection'.

Eur J Epidemiol. 2019 Mar;34(3):221-222. doi: 10.1007/s10654-019-00495-5. Epub 2019 Mar 6.

The reporting of studies conducted using observational routinely collected health data statement for pharmacoepidemiology (RECORD-PE).

BMJ. 2018 Nov 14;363:k3532. doi: 10.1136/bmj.k3532.

Automated data-adaptive analytics for electronic healthcare data to study causal treatment effects.

Clin Epidemiol. 2018 Jul 6;10:771-788. doi: 10.2147/CLEP.S166545. eCollection 2018.

Erratum: High-dimensional Propensity Score Adjustment in Studies of Treatment Effects Using Health Care Claims Data.

Epidemiology. 2018 Nov;29(6):e63-e64. doi: 10.1097/EDE.0000000000000886.

Evaluating large-scale propensity score performance through real-world and synthetic data experiments.

Int J Epidemiol. 2018 Dec 1;47(6):2005-2014. doi: 10.1093/ije/dyy120.

Can We Train Machine Learning Methods to Outperform the High-dimensional Propensity Score Algorithm?

Epidemiology. 2018 Mar;29(2):191-198. doi: 10.1097/EDE.0000000000000787.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

高维倾向评分分析的透明度：诊断和报告指南。

Transparency of high-dimensional propensity score analyses: Guidance for diagnostics and reporting.

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSIONS

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献