Walonoski Jason, Kramer Mark, Nichols Joseph, Quina Andre, Moesel Chris, Hall Dylan, Duffett Carlton, Dube Kudakwashe, Gallagher Thomas, McLachlan Scott
The MITRE Corporation, Bedford, MA, USA.
HIKER Group, Massey University, Palmerston North, New Zealand.
J Am Med Inform Assoc. 2018 Mar 1;25(3):230-238. doi: 10.1093/jamia/ocx079.
Our objective is to create a source of synthetic electronic health records that is readily available; suited to industrial, innovation, research, and educational uses; and free of legal, privacy, security, and intellectual property restrictions.
We developed Synthea, an open-source software package that simulates the lifespans of synthetic patients, modeling the 10 most frequent reasons for primary care encounters and the 10 chronic conditions with the highest morbidity in the United States.
Synthea adheres to a previously developed conceptual framework, scales via open-source deployment on the Internet, and may be extended with additional disease and treatment modules developed by its user community. One million synthetic patient records are now freely available online, encoded in standard formats (eg, Health Level-7 [HL7] Fast Healthcare Interoperability Resources [FHIR] and Consolidated Clinical Document Architecture [C-CDA]), and accessible through an HL7 FHIR application programming interface.
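To illustrate what working with records in a standard format can look like, the following is a minimal Python sketch that summarizes a FHIR Bundle by resource type. The bundle is hand-written for illustration and is not actual Synthea output; the helper names `count_resource_types` and `condition_labels` are hypothetical. Only the generic FHIR JSON layout is assumed (a `Bundle` whose `entry` items each wrap a `resource` carrying a `resourceType`).

```python
import json
from collections import Counter

# Hypothetical, hand-written FHIR Bundle used only for illustration.
# It is NOT real Synthea output; ids and values are made up.
SAMPLE_BUNDLE = json.loads("""
{
  "resourceType": "Bundle",
  "type": "collection",
  "entry": [
    {"resource": {"resourceType": "Patient", "id": "example-1",
                  "gender": "female", "birthDate": "1975-04-12"}},
    {"resource": {"resourceType": "Encounter", "id": "enc-1",
                  "status": "finished"}},
    {"resource": {"resourceType": "Condition", "id": "cond-1",
                  "code": {"text": "Hypertension"}}},
    {"resource": {"resourceType": "Condition", "id": "cond-2",
                  "code": {"text": "Prediabetes"}}}
  ]
}
""")


def count_resource_types(bundle):
    """Count how many resources of each type a FHIR Bundle contains."""
    if bundle.get("resourceType") != "Bundle":
        raise ValueError("expected a FHIR Bundle")
    return Counter(
        entry["resource"]["resourceType"] for entry in bundle.get("entry", [])
    )


def condition_labels(bundle):
    """Return the free-text label of every Condition resource in the bundle."""
    return [
        entry["resource"].get("code", {}).get("text", "")
        for entry in bundle.get("entry", [])
        if entry["resource"]["resourceType"] == "Condition"
    ]


if __name__ == "__main__":
    print(dict(count_resource_types(SAMPLE_BUNDLE)))
    print(condition_labels(SAMPLE_BUNDLE))
```

A real record would typically carry coded values (for example, under `code.coding`) rather than only free text, so a production reader would need to handle more fields than this sketch does.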
Health care lags other industries in information technology, data exchange, and interoperability. The lack of freely distributable health records has long hindered innovation in health care. Approaches and tools are available to inexpensively generate synthetic health records at scale without accidental disclosure risk, lowering current barriers to entry for promising early-stage developments. With the engagement of a growing community of users, the synthetic data generated will become increasingly comprehensive, detailed, and realistic over time.
Synthetic patients can be simulated with models of disease progression and corresponding standards of care to produce risk-free, realistic synthetic health care records at scale.
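The idea of pairing a disease progression model with a standard of care can be sketched as a small yearly state machine. This is a toy illustration and not Synthea's implementation: the states, the transition probabilities, the care rules, and the names `TRANSITIONS`, `CARE`, `Patient`, `step`, and `simulate` are all assumptions invented for this example and have no clinical validity.

```python
import random
from dataclasses import dataclass, field

# Hypothetical yearly transition probabilities, invented for illustration.
# They are NOT clinical estimates and are not taken from Synthea.
TRANSITIONS = {
    "healthy": [("prediabetes", 0.05)],
    "prediabetes": [("diabetes", 0.10), ("healthy", 0.02)],
    "diabetes": [],  # absorbing state in this toy model
}

# Hypothetical standard-of-care rule: one encounter per year spent in a state.
CARE = {
    "prediabetes": "lifestyle counseling",
    "diabetes": "HbA1c check",
}


@dataclass
class Patient:
    age: int = 0
    state: str = "healthy"
    record: list = field(default_factory=list)  # (age, event kind, detail)


def step(patient, rng):
    """Advance one simulated year: maybe change state, then apply care."""
    roll = rng.random()
    cumulative = 0.0
    for next_state, probability in TRANSITIONS[patient.state]:
        cumulative += probability
        if roll < cumulative:
            patient.record.append((patient.age, "transition", next_state))
            patient.state = next_state
            break
    care = CARE.get(patient.state)
    if care is not None:
        patient.record.append((patient.age, "encounter", care))
    patient.age += 1


def simulate(seed, years=80):
    """Simulate one synthetic lifespan; the same seed gives the same record."""
    rng = random.Random(seed)
    patient = Patient()
    for _ in range(years):
        step(patient, rng)
    return patient


if __name__ == "__main__":
    patient = simulate(seed=1)
    print(patient.state, len(patient.record))
```

Seeding the random number generator per patient keeps each synthetic record reproducible, which makes it easier to compare outputs when a module's rules change.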