用于评估复杂医疗保健数据库中药物流行病学方法的血浆模式模拟。

Plasmode simulation for the evaluation of pharmacoepidemiologic methods in complex healthcare databases.

作者信息

Franklin Jessica M, Schneeweiss Sebastian, Polinski Jennifer M, Rassen Jeremy A

机构信息

Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine Brigham and Women's Hospital and Harvard Medical School 1620 Tremont St., Suite 3030, Boston, MA 02120, USA.

出版信息

Comput Stat Data Anal. 2014 Apr;72:219-226. doi: 10.1016/j.csda.2013.10.018.

DOI:10.1016/j.csda.2013.10.018

PMID:24587587

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3935334/

Abstract

Longitudinal healthcare claims databases are frequently used for studying the comparative safety and effectiveness of medications, but results from these studies may be biased due to residual confounding. It is unclear whether methods for confounding adjustment that have been shown to perform well in small, simple nonrandomized studies are applicable to the large, complex pharmacoepidemiologic studies created from secondary healthcare data. Ordinary simulation approaches for evaluating the performance of statistical methods do not capture important features of healthcare claims. A statistical framework for creating replicated simulation datasets from an empirical cohort study in electronic healthcare claims data is developed and validated. The approach relies on resampling from the observed covariate and exposure data without modification in all simulated datasets to preserve the associations among these variables. Repeated outcomes are simulated using a true treatment effect of the investigator's choice and the baseline hazard function estimated from the empirical data. As an example, this framework is applied to a study of high versus low-intensity statin use and cardiovascular outcomes. Simulated data is based on real data drawn from Medicare Parts A and B linked with a prescription drug insurance claims database maintained by Caremark. Properties of the data simulated using this framework are compared with the empirical data on which the simulations were based. In addition, the simulated datasets are used to compare variable selection strategies for confounder adjustmentvia the propensity score, including high-dimensional approaches that could not be evaluated with ordinary simulation methods. The simulated datasets are found to closely resemble the observed complex data structure but have the advantage of an investigator-specified exposure effect.

摘要

纵向医疗保健索赔数据库经常用于研究药物的比较安全性和有效性，但这些研究的结果可能因残余混杂因素而产生偏差。尚不清楚在小型、简单的非随机研究中表现良好的混杂因素调整方法是否适用于从二级医疗保健数据创建的大型、复杂的药物流行病学研究。用于评估统计方法性能的普通模拟方法无法捕捉医疗保健索赔的重要特征。开发并验证了一种从电子医疗保健索赔数据中的实证队列研究创建复制模拟数据集的统计框架。该方法依赖于从观察到的协变量和暴露数据中进行重采样，在所有模拟数据集中不做修改，以保留这些变量之间的关联。使用研究者选择的真实治疗效果和根据实证数据估计的基线风险函数来模拟重复结果。例如，该框架应用于一项关于高强度与低强度他汀类药物使用及心血管结局的研究。模拟数据基于从医疗保险A部分和B部分提取的真实数据，并与Caremark维护的处方药保险索赔数据库相链接。将使用该框架模拟的数据的属性与模拟所基于的实证数据进行比较。此外，模拟数据集用于比较通过倾向得分进行混杂因素调整的变量选择策略，包括普通模拟方法无法评估的高维方法。发现模拟数据集与观察到的复杂数据结构非常相似，但具有研究者指定的暴露效应这一优势。

相似文献

Plasmode simulation for the evaluation of pharmacoepidemiologic methods in complex healthcare databases.

Comput Stat Data Anal. 2014 Apr;72:219-226. doi: 10.1016/j.csda.2013.10.018.

Regularized Regression Versus the High-Dimensional Propensity Score for Confounding Adjustment in Secondary Database Analyses.

Am J Epidemiol. 2015 Oct 1;182(7):651-9. doi: 10.1093/aje/kwv108. Epub 2015 Aug 1.

Evaluating the Utility of Coarsened Exact Matching for Pharmacoepidemiology Using Real and Simulated Claims Data.

Am J Epidemiol. 2020 Jun 1;189(6):613-622. doi: 10.1093/aje/kwz268.

Comparing the performance of propensity score methods in healthcare database studies with rare outcomes.

Stat Med. 2017 May 30;36(12):1946-1963. doi: 10.1002/sim.7250. Epub 2017 Feb 16.

Machine learning for improving high-dimensional proxy confounder adjustment in healthcare database studies: An overview of the current literature.

Pharmacoepidemiol Drug Saf. 2022 Sep;31(9):932-943. doi: 10.1002/pds.5500. Epub 2022 Jul 5.

Studies with many covariates and few outcomes: selecting covariates and implementing propensity-score-based confounding adjustments.

Epidemiology. 2014 Mar;25(2):268-78. doi: 10.1097/EDE.0000000000000069.

Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation.

Epidemiology. 2018 Jan;29(1):96-106. doi: 10.1097/EDE.0000000000000762.

Synthetic Negative Controls: Using Simulation to Screen Large-scale Propensity Score Analyses.

Epidemiology. 2022 Jul 1;33(4):541-550. doi: 10.1097/EDE.0000000000001482. Epub 2022 Apr 12.

引用本文的文献

Beware of counter-intuitive levels of false discoveries in datasets with strong intra-correlations.

Genome Biol. 2025 Aug 18;26(1):249. doi: 10.1186/s13059-025-03734-z.

A Framework for Generating Realistic Synthetic Tabular Data in a Randomized Controlled Trial Setting.

Stat Med. 2025 Aug;44(18-19):e70227. doi: 10.1002/sim.70227.

High-Dimensional Disease Risk Score for Dealing With Residual Confounding Bias in Estimating Treatment Effects With a Survival Outcome.

Pharmacoepidemiol Drug Saf. 2025 Jul;34(7):e70172. doi: 10.1002/pds.70172.

Computing True Parameter Values in Simulation Studies Using Monte Carlo Integration.

Epidemiology. 2025 Sep 1;36(5):690-693. doi: 10.1097/EDE.0000000000001873. Epub 2025 Jun 13.

Generating synthetic electronic health record data: a methodological scoping review with benchmarking on phenotype data and open-source software.

J Am Med Inform Assoc. 2025 Jul 1;32(7):1227-1240. doi: 10.1093/jamia/ocaf082.

Use of Machine Learning to Compare Disease Risk Scores and Propensity Scores Across Complex Confounding Scenarios: A Simulation Study.

Pharmacoepidemiol Drug Saf. 2025 Jun;34(6):e70165. doi: 10.1002/pds.70165.

Is there a competitive advantage to using multivariate statistical or machine learning methods over the Bross formula in the hdPS framework for bias and variance estimation?

PLoS One. 2025 May 28;20(5):e0324639. doi: 10.1371/journal.pone.0324639. eCollection 2025.

How Effective Are Machine Learning and Doubly Robust Estimators in Incorporating High-Dimensional Proxies to Reduce Residual Confounding?

Pharmacoepidemiol Drug Saf. 2025 May;34(5):e70155. doi: 10.1002/pds.70155.

Semisynthetic simulation for microbiome data analysis.

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf051.

Identification of confounders and estimating the causal effect of place of birth on age-specific childhood vaccination.

BMC Med Inform Decis Mak. 2024 Dec 27;24(1):406. doi: 10.1186/s12911-024-02827-2.

本文引用的文献

Design and validation of a data simulation model for longitudinal healthcare data.

AMIA Annu Symp Proc. 2011;2011:1176-85. Epub 2011 Oct 22.

Covariate selection in high-dimensional propensity score analyses of treatment effects in small samples.

Am J Epidemiol. 2011 Jun 15;173(12):1404-13. doi: 10.1093/aje/kwr001. Epub 2011 May 20.

Estimating the unknown parameters of the natural history of metachronous colorectal cancer using discrete-event simulation.

Med Decis Making. 2011 Jul-Aug;31(4):611-24. doi: 10.1177/0272989X10391809. Epub 2011 Jan 6.

A basic study design for expedited safety signal evaluation based on electronic healthcare data.

Pharmacoepidemiol Drug Saf. 2010 Aug;19(8):858-68. doi: 10.1002/pds.1926.

Confounding control in healthcare database research: challenges and potential approaches.

Med Care. 2010 Jun;48(6 Suppl):S114-20. doi: 10.1097/MLR.0b013e3181dbebe3.

The use of plasmodes as a supplement to simulations: A simple example evaluating individual admixture estimation methodologies.

Comput Stat Data Anal. 2009 Mar 15;53(5):1755-1766. doi: 10.1016/j.csda.2008.02.032.

FluTE, a publicly available stochastic influenza epidemic simulation model.

PLoS Comput Biol. 2010 Jan 29;6(1):e1000656. doi: 10.1371/journal.pcbi.1000656.

Missing data in randomized clinical trials for weight loss: scope of the problem, state of the field, and performance of statistical methods.

PLoS One. 2009 Aug 13;4(8):e6624. doi: 10.1371/journal.pone.0006624.

High-dimensional propensity score adjustment in studies of treatment effects using health care claims data.

Epidemiology. 2009 Jul;20(4):512-22. doi: 10.1097/EDE.0b013e3181a663cc.

A simulation model for diarrhoea and other common recurrent infections: a tool for exploring epidemiological methods.

Epidemiol Infect. 2009 May;137(5):644-53. doi: 10.1017/S095026880800143X. Epub 2008 Oct 8.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于评估复杂医疗保健数据库中药物流行病学方法的血浆模式模拟。

Plasmode simulation for the evaluation of pharmacoepidemiologic methods in complex healthcare databases.

作者信息

Franklin Jessica M, Schneeweiss Sebastian, Polinski Jennifer M, Rassen Jeremy A

机构信息

Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine Brigham and Women's Hospital and Harvard Medical School 1620 Tremont St., Suite 3030, Boston, MA 02120, USA.

出版信息

Comput Stat Data Anal. 2014 Apr;72:219-226. doi: 10.1016/j.csda.2013.10.018.

DOI:10.1016/j.csda.2013.10.018

PMID:24587587

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3935334/

Abstract

摘要

用于评估复杂医疗保健数据库中药物流行病学方法的血浆模式模拟。

Plasmode simulation for the evaluation of pharmacoepidemiologic methods in complex healthcare databases.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于评估复杂医疗保健数据库中药物流行病学方法的血浆模式模拟。

Plasmode simulation for the evaluation of pharmacoepidemiologic methods in complex healthcare databases.

作者信息

机构信息

出版信息