Generating Synthetic Data for Medical Imaging.

Author Affiliations

From the Delft University of Technology, Delft, the Netherlands (L.R.K.); Segmed, 3790 El Camino Real #810, Palo Alto, CA 94306 (J.W., A.L., M.C., W.A.K., J.P., M.J.W.); Department of Radiology, University of Washington, Seattle, Wash (D.M.); Department of Radiology, OncoRad/Tumor Imaging Metrics Core, Seattle, Wash (D.M.); Harvard University, Cambridge, Mass (J.P.); Department of Radiology, Stanford University School of Medicine, Palo Alto, Calif (A.S.C.); Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, Calif (A.S.C.); Department of Biomedical Informatics, Harvard Medical School, Boston, Mass (P.R.); Microsoft, Redmond, Wash (M.P.L.); and Department of Radiology and Biomedical Imaging, University of California San Francisco, San Francisco, Calif (M.P.L.).

Publication Information

Radiology. 2024 Sep;312(3):e232471. doi: 10.1148/radiol.232471.

Abstract

Artificial intelligence (AI) models for medical imaging tasks, such as classification or segmentation, require large and diverse datasets of images. However, due to privacy and ethical issues, as well as data sharing infrastructure barriers, these datasets are scarce and difficult to assemble. Synthetic medical imaging data generated by AI from existing data could address this challenge by augmenting and anonymizing real imaging data. In addition, synthetic data enable new applications, including modality translation, contrast synthesis, and professional training for radiologists. However, the use of synthetic data also poses technical and ethical challenges. These challenges include ensuring the realism and diversity of the synthesized images while keeping data unidentifiable, evaluating the performance and generalizability of models trained on synthetic data, and high computational costs. Since existing regulations are not sufficient to guarantee the safe and ethical use of synthetic images, it becomes evident that updated laws and more rigorous oversight are needed. Regulatory bodies, physicians, and AI developers should collaborate to develop, maintain, and continually refine best practices for synthetic data. This review aims to provide an overview of the current knowledge of synthetic data in medical imaging and highlights current key challenges in the field to guide future research and development.

