Suppr超能文献

使用生成对抗网络生成逼真的合成验证医疗数据集。

Generation of Realistic Synthetic Validation Healthcare Datasets Using Generative Adversarial Networks.

作者信息

Bilici Ozyigit Eda, Arvanitis Theodoros N, Despotou George

机构信息

Institute of Digital Healthcare, WMG, University of Warwick, UK.

出版信息

Stud Health Technol Inform. 2020 Jun 26;272:322-325. doi: 10.3233/SHTI200560.

Abstract

BACKGROUND

Assurance of digital health interventions involves, amongst others, clinical validation, which requires large datasets to test the application in realistic clinical scenarios. Development of such datasets is time consuming and challenging in terms of maintaining patient anonymity and consent.

OBJECTIVE

The development of synthetic datasets that maintain the statistical properties of the real datasets.

METHOD

An artificial neural network based, generative adversarial network was implemented and trained, using numerical and categorical variables, including ICD-9 codes from the MIMIC III dataset, to produce a synthetic dataset.

RESULTS

The synthetic dataset, exhibits a correlation matrix highly similar to the real dataset, good Jaccard similarity and passing the KS test.

CONCLUSIONS

The proof of concept was successful with the approach being promising for further work.

摘要

背景

数字健康干预措施的验证包括临床验证等,这需要大型数据集来测试其在实际临床场景中的应用。开发此类数据集既耗时,又在维护患者匿名性和同意方面具有挑战性。

目的

开发能保持真实数据集统计特性的合成数据集。

方法

使用数值变量和分类变量(包括来自MIMIC III数据集的ICD - 9编码)实现并训练了一个基于人工神经网络的生成对抗网络,以生成合成数据集。

结果

合成数据集呈现出与真实数据集高度相似的相关矩阵、良好的杰卡德相似度并通过了KS检验。

结论

概念验证取得成功,该方法有望用于进一步的研究工作。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验