Institute of Digital Technologies for Personalized Healthcare - MeDiTech, Department of Innovative Technologies, University of Applied Sciences and Arts of Southern Switzerland, Via la Santa 1, Lugano 6900, Switzerland; Institute of Informatics, University of Bern, Neubrückstrasse 10, Bern 3012, Switzerland.
Institute of Digital Technologies for Personalized Healthcare - MeDiTech, Department of Innovative Technologies, University of Applied Sciences and Arts of Southern Switzerland, Via la Santa 1, Lugano 6900, Switzerland; Department of Quantitative Biomedicine, University of Zurich, Schmelzbergstrasse 26, Zurich 8091, Switzerland.
Comput Biol Med. 2023 Dec;167:107655. doi: 10.1016/j.compbiomed.2023.107655. Epub 2023 Nov 2.
Large high-quality datasets are essential for building powerful artificial intelligence (AI) algorithms capable of supporting advancement in cardiac clinical research. However, researchers working with electrocardiogram (ECG) signals struggle to get access and/or to build one. The aim of the present work is to shed light on a potential solution to address the lack of large and easily accessible ECG datasets. Firstly, the main causes of such a lack are identified and examined. Afterward, the potentials and limitations of cardiac data generation via deep generative models (DGMs) are deeply analyzed. These very promising algorithms have been found capable not only of generating large quantities of ECG signals but also of supporting data anonymization processes, to simplify data sharing while respecting patients' privacy. Their application could help research progress and cooperation in the name of open science. However several aspects, such as a standardized synthetic data quality evaluation and algorithm stability, need to be further explored.
大型高质量数据集对于构建能够支持心脏临床研究进展的强大人工智能 (AI) 算法至关重要。然而,研究心电图 (ECG) 信号的研究人员很难获得和/或构建一个数据集。本研究旨在探讨一种潜在的解决方案,以解决缺乏大型且易于访问的 ECG 数据集的问题。首先,确定并检查了造成这种缺乏的主要原因。之后,深入分析了通过深度生成模型 (DGM) 生成心脏数据的潜力和局限性。这些非常有前途的算法不仅能够生成大量的 ECG 信号,还能够支持数据匿名化过程,从而在尊重患者隐私的同时简化数据共享。它们的应用有助于以开放科学的名义推进研究进展和合作。然而,还需要进一步探索一些方面,例如标准化的合成数据质量评估和算法稳定性。