Gottfried Schatz Research Center: Division of Medical Physics and Biophysics, Medical University of Graz, Graz, Austria.
BioTechMed-Graz, Graz, Austria.
Sci Data. 2023 Aug 8;10(1):531. doi: 10.1038/s41597-023-02416-4.
Mechanistic cardiac electrophysiology models allow for personalized simulations of the electrical activity in the heart and the ensuing electrocardiogram (ECG) on the body surface. As such, synthetic signals possess known ground truth labels of the underlying disease and can be employed for validation of machine learning ECG analysis tools in addition to clinical signals. Recently, synthetic ECGs were used to enrich sparse clinical data or even replace them completely during training leading to improved performance on real-world clinical test data. We thus generated a novel synthetic database comprising a total of 16,900 12 lead ECGs based on electrophysiological simulations equally distributed into healthy control and 7 pathology classes. The pathological case of myocardial infraction had 6 sub-classes. A comparison of extracted features between the virtual cohort and a publicly available clinical ECG database demonstrated that the synthetic signals represent clinical ECGs for healthy and pathological subpopulations with high fidelity. The ECG database is split into training, validation, and test folds for development and objective assessment of novel machine learning algorithms.
机制心脏电生理学模型允许对心脏的电活动和体表的心电图 (ECG) 进行个性化模拟。因此,合成信号具有潜在疾病的已知真实标签,除了临床信号外,还可用于验证机器学习 ECG 分析工具。最近,合成 ECG 被用于在训练期间丰富稀疏的临床数据,甚至完全替代它们,从而提高真实世界临床测试数据的性能。因此,我们生成了一个新的合成数据库,包含总共 16900 个 12 导联 ECG,基于电生理学模拟,在健康对照组和 7 种病理类中均匀分布。心肌梗死的病理情况有 6 个亚类。在虚拟队列和公开可用的临床 ECG 数据库之间提取特征的比较表明,合成信号代表健康和病理亚群的临床 ECG,具有高度的逼真度。ECG 数据库分为训练、验证和测试折,用于开发和客观评估新的机器学习算法。