关于用于合成医学文本、时间序列和纵向数据的生成式人工智能模型的综述。

A review on generative AI models for synthetic medical text, time series, and longitudinal data.

作者信息

Loni Mohammad, Poursalim Fatemeh, Asadi Mehdi, Gharehbaghi Arash

机构信息

School of Innovation, Design and Engineering, Mälardalen University, Västerås, Sweden.

Servicehälsan Familjeläkare i Västerås AB, Västerås, Sweden.

出版信息

NPJ Digit Med. 2025 May 15;8(1):281. doi: 10.1038/s41746-024-01409-w.

DOI:10.1038/s41746-024-01409-w

PMID:40374917

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12081667/

Abstract

This paper presents the results of a novel scoping review on the practical models for generating three different types of synthetic health records (SHRs): medical text, time series, and longitudinal data. The innovative aspects of the review, which incorporate study objectives, data modality, and research methodology of the reviewed studies, uncover the importance and the scope of the topic for the digital medicine context. In total, 52 publications met the eligibility criteria for generating medical time series (22), longitudinal data (17), and medical text (13). Privacy preservation was found to be the main research objective of the studied papers, along with class imbalance, data scarcity, and data imputation as the other objectives. The adversarial network-based, probabilistic, and large language models exhibited superiority for generating synthetic longitudinal data, time series, and medical texts, respectively. Finding a reliable performance measure to quantify SHR re-identification risk is the major research gap of the topic.

摘要

本文介绍了一项新颖的范围综述结果，该综述针对生成三种不同类型的合成健康记录（SHR）的实用模型展开：医学文本、时间序列和纵向数据。该综述的创新之处在于纳入了所审查研究的研究目标、数据模态和研究方法，揭示了该主题在数字医学背景下的重要性和范围。总共有52篇出版物符合生成医学时间序列（22篇）、纵向数据（17篇）和医学文本（13篇）的纳入标准。研究发现隐私保护是所研究论文的主要研究目标，此外还有类别不平衡、数据稀缺和数据插补等其他目标。基于对抗网络、概率和大语言模型分别在生成合成纵向数据、时间序列和医学文本方面表现出优势。找到一种可靠的性能度量来量化SHR重新识别风险是该主题的主要研究空白。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e68/12081667/d808b507f828/41746_2024_1409_Fig1_HTML.jpg

相似文献

A review on generative AI models for synthetic medical text, time series, and longitudinal data.

NPJ Digit Med. 2025 May 15;8(1):281. doi: 10.1038/s41746-024-01409-w.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Generative artificial intelligence to produce high-fidelity blastocyst-stage embryo images.

Hum Reprod. 2024 Jun 3;39(6):1197-1207. doi: 10.1093/humrep/deae064.

Generative AI Models in Time-Varying Biomedical Data: Scoping Review.

J Med Internet Res. 2025 Mar 10;27:e59792. doi: 10.2196/59792.

Addressing 6 challenges in generative AI for digital health: A scoping review.

PLOS Digit Health. 2024 May 23;3(5):e0000503. doi: 10.1371/journal.pdig.0000503. eCollection 2024 May.

Generative AI for synthetic data across multiple medical modalities: A systematic review of recent developments and challenges.

Comput Biol Med. 2025 May;189:109834. doi: 10.1016/j.compbiomed.2025.109834. Epub 2025 Mar 1.

Combating COVID-19 Using Generative Adversarial Networks and Artificial Intelligence for Medical Images: Scoping Review.

JMIR Med Inform. 2022 Jun 29;10(6):e37365. doi: 10.2196/37365.

Generative AI in Medical Practice: In-Depth Exploration of Privacy and Security Challenges.

J Med Internet Res. 2024 Mar 8;26:e53008. doi: 10.2196/53008.

Using Synthetic Health Care Data to Leverage Large Language Models for Named Entity Recognition: Development and Validation Study.

J Med Internet Res. 2025 Mar 18;27:e66279. doi: 10.2196/66279.

Updated Primer on Generative Artificial Intelligence and Large Language Models in Medical Imaging for Medical Professionals.

Korean J Radiol. 2024 Mar;25(3):224-242. doi: 10.3348/kjr.2023.0818.

引用本文的文献

Multimodal integration strategies for clinical application in oncology.

Front Pharmacol. 2025 Aug 20;16:1609079. doi: 10.3389/fphar.2025.1609079. eCollection 2025.

Oxidative Stress and Inflammation in Hypoxemic Respiratory Diseases and Their Comorbidities: Molecular Insights and Diagnostic Advances in Chronic Obstructive Pulmonary Disease and Sleep Apnea.

Antioxidants (Basel). 2025 Jul 8;14(7):839. doi: 10.3390/antiox14070839.

本文引用的文献

PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning.

Proc Conf Empir Methods Nat Lang Process. 2022 Dec;2022:2873-2885. doi: 10.18653/v1/2022.emnlp-main.185.

Artificial intelligence in digital pathology: a systematic review and meta-analysis of diagnostic test accuracy.

NPJ Digit Med. 2024 May 4;7(1):114. doi: 10.1038/s41746-024-01106-8.

Foundation metrics for evaluating effectiveness of healthcare conversations powered by generative AI.

NPJ Digit Med. 2024 Mar 29;7(1):82. doi: 10.1038/s41746-024-01074-z.

Large language models to identify social determinants of health in electronic health records.

NPJ Digit Med. 2024 Jan 11;7(1):6. doi: 10.1038/s41746-023-00970-0.

SleepSIM: Conditional GAN-based non-REM sleep EEG Signal Generator.

Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10341043.

A study of generative large language model for medical research and healthcare.

NPJ Digit Med. 2023 Nov 16;6(1):210. doi: 10.1038/s41746-023-00958-w.

Synthesize high-dimensional longitudinal electronic health records via hierarchical autoregressive language model.

Nat Commun. 2023 Aug 31;14(1):5305. doi: 10.1038/s41467-023-41093-0.

Accurate detection of paroxysmal atrial fibrillation with certified-GAN and neural architecture search.

Sci Rep. 2023 Jul 14;13(1):11378. doi: 10.1038/s41598-023-38541-8.

Diffusion-based conditional ECG generation with structured state space models.

Comput Biol Med. 2023 Sep;163:107115. doi: 10.1016/j.compbiomed.2023.107115. Epub 2023 Jun 7.

Improving an Electronic Health Record-Based Clinical Prediction Model Under Label Deficiency: Network-Based Generative Adversarial Semisupervised Approach.

JMIR Med Inform. 2023 Jun 13;11:e47862. doi: 10.2196/47862.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

关于用于合成医学文本、时间序列和纵向数据的生成式人工智能模型的综述。

A review on generative AI models for synthetic medical text, time series, and longitudinal data.

作者信息

Loni Mohammad, Poursalim Fatemeh, Asadi Mehdi, Gharehbaghi Arash

机构信息

School of Innovation, Design and Engineering, Mälardalen University, Västerås, Sweden.

Servicehälsan Familjeläkare i Västerås AB, Västerås, Sweden.

出版信息

NPJ Digit Med. 2025 May 15;8(1):281. doi: 10.1038/s41746-024-01409-w.

DOI:10.1038/s41746-024-01409-w

PMID:40374917

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12081667/

Abstract

摘要

关于用于合成医学文本、时间序列和纵向数据的生成式人工智能模型的综述。

A review on generative AI models for synthetic medical text, time series, and longitudinal data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

关于用于合成医学文本、时间序列和纵向数据的生成式人工智能模型的综述。

A review on generative AI models for synthetic medical text, time series, and longitudinal data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献