Department of Biomedical Informatics, Emory University, United States of America.
Department of Epidemiology, Emory University, United States of America.
Physiol Meas. 2022 Aug 26;43(8). doi: 10.1088/1361-6579/ac79fd.
The standard twelve-lead electrocardiogram (ECG) is a widely used tool for monitoring cardiac function and diagnosing cardiac disorders. The development of smaller, lower-cost, and easier-to-use ECG devices may improve access to cardiac care in lower-resource environments, but the diagnostic potential of these devices is unclear. This work explores these issues through a public competition: the 2021 PhysioNet Challenge. In addition, we explore the potential for performance boosting through a meta-learning approach.We sourced 131,149 twelve-lead ECG recordings from ten international sources. We posted 88,253 annotated recordings as public training data and withheld the remaining recordings as hidden validation and test data. We challenged teams to submit containerized, open-source algorithms for diagnosing cardiac abnormalities using various ECG lead combinations, including the code for training their algorithms. We designed and scored the algorithms using an evaluation metric that captures the risks of different misdiagnoses for 30 conditions. After the Challenge, we implemented a semi-consensus voting model on all working algorithms.A total of 68 teams submitted 1,056 algorithms during the Challenge, providing a variety of automated approaches from both academia and industry. The performance differences across the different lead combinations were smaller than the performance differences across the different test databases, showing that generalizability posed a larger challenge to the algorithms than the choice of ECG leads. A voting model improved performance by 3.5%.The use of different ECG lead combinations allowed us to assess the diagnostic potential of reduced-lead ECG recordings, and the use of different data sources allowed us to assess the generalizability of the algorithms to diverse institutions and populations. The submission of working, open-source code for both training and testing and the use of a novel evaluation metric improved the reproducibility, generalizability, and applicability of the research conducted during the Challenge.
标准的 12 导联心电图(ECG)是监测心脏功能和诊断心脏疾病的常用工具。更小、成本更低、更易于使用的 ECG 设备的发展可能会改善资源较少环境中的心脏护理的可及性,但这些设备的诊断潜力尚不清楚。这项工作通过一个公开竞赛来探讨这些问题:2021 年 PhysioNet 挑战赛。此外,我们还通过元学习方法来探索提高性能的潜力。我们从十个国际来源获取了 131149 份 12 导联心电图记录。我们将 88253 份标注记录作为公共训练数据发布,并将其余记录作为隐藏验证和测试数据保留。我们向团队提出了挑战,要求他们使用各种 ECG 导联组合(包括训练算法的代码)提交容器化、开源算法,以诊断心脏异常。我们使用一种评估指标来设计和评分算法,该指标可以捕获 30 种情况下不同误诊的风险。挑战赛结束后,我们对所有工作算法实施了半共识投票模型。共有 68 个团队在挑战赛期间提交了 1056 个算法,提供了来自学术界和工业界的各种自动化方法。不同导联组合之间的性能差异小于不同测试数据库之间的性能差异,这表明算法的泛化能力比 ECG 导联的选择对算法构成更大的挑战。投票模型使性能提高了 3.5%。使用不同的 ECG 导联组合使我们能够评估减少导联 ECG 记录的诊断潜力,使用不同的数据源使我们能够评估算法对不同机构和人群的泛化能力。提交用于训练和测试的工作的、开源代码以及使用新的评估指标提高了挑战赛期间进行的研究的可重复性、泛化能力和适用性。