• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从医疗健康记录中生成用于药物安全性和有效性研究的合成数据。

Generating synthetic data from administrative health records for drug safety and effectiveness studies.

机构信息

Department of Community Health Sciences, University of Manitoba, Winnipeg, Canada.

Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, Montreal, Canada.

出版信息

Int J Popul Data Sci. 2023 Nov 27;8(1):2176. doi: 10.23889/ijpds.v8i1.2176. eCollection 2023.

DOI:10.23889/ijpds.v8i1.2176
PMID:38414538
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10898503/
Abstract

INTRODUCTION

Administrative health records (AHRs) are used to conduct population-based post-market drug safety and comparative effectiveness studies to inform healthcare decision making. However, the cost of data extraction, and the challenges associated with privacy and securing approvals can make it challenging for researchers to conduct methodological research in a timely manner using real data. Generating synthetic AHRs that reasonably represent the real-world data are beneficial for developing analytic methods and training analysts to rapidly implement study protocols. We generated synthetic AHRs using two methods and compared these synthetic AHRs to real-world AHRs. We described the challenges associated with using synthetic AHRs for real-world study.

METHODS

The real-world AHRs comprised prescription drug records for individuals with healthcare insurance coverage in the Population Research Data Repository (PRDR) from Manitoba, Canada for the 10-year period from 2008 to 2017. Synthetic data were generated using the Observational Medical Dataset Simulator II (OSIM2) and a modification (ModOSIM). Synthetic and real-world data were described using frequencies and percentages. Agreement of prescription drug use measures in PRDR, OSIM2 and ModOSIM was estimated with the concordance coefficient.

RESULTS

The PRDR cohort included 169,586,633 drug records and 1,395 drug types for 1,604,734 individuals. Synthetic data for 1,000,000 individuals were generated using OSIM2 and ModOSIM. Sex and age group distributions were similar in the real-world and synthetic AHRs. However, there were significant differences in the number of drug records and number of unique drugs per person for OSIM2 and ModOSIM when compared with PRDR. For the average number of days of drug use, concordance with the PRDR was 16% (95% confidence interval [CI]: 12%-19%) for OSIM2 and 88% (95% CI: 87%-90%) for ModOSIM.

CONCLUSIONS

ModOSIM data were more similar to PRDR than OSIM2 data on many measures. Synthetic AHRs consistent with those found in real-world settings can be generated using ModOSIM. Synthetic data will benefit rapid implementation of methodological studies and data analyst training.

摘要

简介

行政健康记录(AHR)用于进行基于人群的药物上市后安全性和比较有效性研究,以为医疗保健决策提供信息。然而,数据提取的成本以及与隐私相关的挑战和获得批准可能会使研究人员难以及时使用真实数据进行方法学研究。生成合理代表真实世界数据的合成 AHR 有利于开发分析方法和培训分析师快速实施研究方案。我们使用两种方法生成了合成 AHR,并将这些合成 AHR 与真实世界的 AHR 进行了比较。我们描述了使用合成 AHR 进行真实世界研究所面临的挑战。

方法

真实世界的 AHR 包括来自加拿大马尼托巴省人口研究数据存储库(PRDR)的 2008 年至 2017 年 10 年间个人医疗保险覆盖范围内的处方药记录。使用观察性医疗数据集模拟器 II(OSIM2)和修改版(ModOSIM)生成合成数据。使用频率和百分比描述合成和真实世界数据。使用一致性系数估计 PRDR、OSIM2 和 ModOSIM 中处方药使用措施的一致性。

结果

PRDR 队列包括 169586633 份药物记录和 1395 种药物,涉及 1604734 个人。使用 OSIM2 和 ModOSIM 为 1000000 个人生成了合成数据。真实世界和合成 AHR 中的性别和年龄组分布相似。然而,与 PRDR 相比,OSIM2 和 ModOSIM 的药物记录数量和每人使用的独特药物数量存在显著差异。对于药物使用天数的平均值,与 PRDR 的一致性为 OSIM2 的 16%(95%置信区间[CI]:12%-19%)和 ModOSIM 的 88%(95% CI:87%-90%)。

结论

在许多措施上,ModOSIM 数据比 OSIM2 数据更接近 PRDR。可以使用 ModOSIM 生成与真实世界设置一致的合成 AHR。合成数据将有利于快速实施方法学研究和数据分析师培训。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/530f/10898503/e88d3a8e421d/ijpds-08-2176-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/530f/10898503/8037cf9f2ea7/ijpds-08-2176-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/530f/10898503/6a58efbb662c/ijpds-08-2176-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/530f/10898503/e88d3a8e421d/ijpds-08-2176-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/530f/10898503/8037cf9f2ea7/ijpds-08-2176-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/530f/10898503/6a58efbb662c/ijpds-08-2176-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/530f/10898503/e88d3a8e421d/ijpds-08-2176-g003.jpg

相似文献

1
Generating synthetic data from administrative health records for drug safety and effectiveness studies.从医疗健康记录中生成用于药物安全性和有效性研究的合成数据。
Int J Popul Data Sci. 2023 Nov 27;8(1):2176. doi: 10.23889/ijpds.v8i1.2176. eCollection 2023.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
4
Healthcare outcomes assessed with observational study designs compared with those assessed in randomized trials.与随机试验中评估的医疗保健结果相比,观察性研究设计评估的医疗保健结果。
Cochrane Database Syst Rev. 2014 Apr 29;2014(4):MR000034. doi: 10.1002/14651858.MR000034.pub2.
5
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
6
Rapid, point-of-care antigen tests for diagnosis of SARS-CoV-2 infection.用于 SARS-CoV-2 感染诊断的快速、即时抗原检测。
Cochrane Database Syst Rev. 2022 Jul 22;7(7):CD013705. doi: 10.1002/14651858.CD013705.pub3.
7
Comparison of cellulose, modified cellulose and synthetic membranes in the haemodialysis of patients with end-stage renal disease.纤维素、改性纤维素和合成膜在终末期肾病患者血液透析中的比较。
Cochrane Database Syst Rev. 2001(3):CD003234. doi: 10.1002/14651858.CD003234.
8
Antiretroviral post-exposure prophylaxis (PEP) for occupational HIV exposure.职业性HIV暴露后的抗逆转录病毒暴露后预防(PEP)。
Cochrane Database Syst Rev. 2007 Jan 24;2007(1):CD002835. doi: 10.1002/14651858.CD002835.pub3.
9
Intravenous magnesium sulphate and sotalol for prevention of atrial fibrillation after coronary artery bypass surgery: a systematic review and economic evaluation.静脉注射硫酸镁和索他洛尔预防冠状动脉搭桥术后房颤:系统评价与经济学评估
Health Technol Assess. 2008 Jun;12(28):iii-iv, ix-95. doi: 10.3310/hta12280.
10
Consequences, costs and cost-effectiveness of workforce configurations in English acute hospitals.英国急症医院劳动力配置的后果、成本及成本效益
Health Soc Care Deliv Res. 2025 Jul;13(25):1-107. doi: 10.3310/ZBAR9152.

引用本文的文献

1
Exploring the Utilization of Synthetic Data in Unsupervised Clustering for Opioid Misuse Analysis.探索合成数据在阿片类药物滥用分析的无监督聚类中的应用。
AMIA Annu Symp Proc. 2025 May 22;2024:1313-1322. eCollection 2024.
2
Leveraging Administrative Health Databases to Address Health Challenges in Farming Populations: Scoping Review and Bibliometric Analysis (1975-2024).利用行政健康数据库应对农业人口的健康挑战:范围综述与文献计量分析(1975 - 2024年)
JMIR Public Health Surveill. 2025 Jan 9;11:e62939. doi: 10.2196/62939.

本文引用的文献

1
An overview of synthetic administrative data for research.合成行政数据研究概述。
Int J Popul Data Sci. 2022 May 23;7(1):1727. doi: 10.23889/ijpds.v7i1.1727. eCollection 2022.
2
Constructing a toolkit to evaluate quality of state and local administrative data.构建一个用于评估州和地方行政数据质量的工具包。
Int J Popul Data Sci. 2019 Jan 31;4(1):937. doi: 10.23889/ijpds.v4i1.937.
3
Generation and evaluation of synthetic patient data.生成和评估合成患者数据。
BMC Med Res Methodol. 2020 May 7;20(1):108. doi: 10.1186/s12874-020-00977-1.
4
Association Between Incretin-Based Drugs and the Risk of Acute Pancreatitis.基于肠促胰岛素的药物与急性胰腺炎风险的关联。
JAMA Intern Med. 2016 Oct 1;176(10):1464-1473. doi: 10.1001/jamainternmed.2016.1522.
5
An Introduction to Health Care Administrative Data.医疗保健管理数据简介。
Can J Hosp Pharm. 2015 May-Jun;68(3):232-7. doi: 10.4212/cjhp.v68i3.1457.
6
Higher potency statins and the risk of new diabetes: multicentre, observational study of administrative databases.高剂量他汀类药物与新发糖尿病风险:行政数据库的多中心观察性研究。
BMJ. 2014 May 29;348:g3244. doi: 10.1136/bmj.g3244.
7
Plasmode simulation for the evaluation of pharmacoepidemiologic methods in complex healthcare databases.用于评估复杂医疗保健数据库中药物流行病学方法的血浆模式模拟。
Comput Stat Data Anal. 2014 Apr;72:219-226. doi: 10.1016/j.csda.2013.10.018.
8
Proton pump inhibitors and the risk of hospitalisation for community-acquired pneumonia: replicated cohort studies with meta-analysis.质子泵抑制剂与社区获得性肺炎住院风险:荟萃分析复制队列研究。
Gut. 2014 Apr;63(4):552-8. doi: 10.1136/gutjnl-2013-304738. Epub 2013 Jul 15.
9
CNODES: the Canadian Network for Observational Drug Effect Studies.CNODES:加拿大药物效应观察研究网络。
Open Med. 2012 Oct 30;6(4):e134-40. Print 2012.
10
Use of high potency statins and rates of admission for acute kidney injury: multicenter, retrospective observational analysis of administrative databases.使用强效他汀类药物与急性肾损伤入院率:行政数据库的多中心回顾性观察性分析。
BMJ. 2013 Mar 18;346:f880. doi: 10.1136/bmj.f880.