Suppr超能文献

一种用于在动态咳嗽音频数据的COVID-19检测中维持模型性能的综合漂移自适应框架:模型开发与验证

A Comprehensive Drift-Adaptive Framework for Sustaining Model Performance in COVID-19 Detection From Dynamic Cough Audio Data: Model Development and Validation.

作者信息

Ganitidis Theofanis, Athanasiou Maria, Mitsis Konstantinos, Zarkogianni Konstantia, Nikita Konstantina S

机构信息

School of Electrical and Computer Engineering, National Technical University of Athens, Athens, Greece.

Faculty of Science and Engineering, Maastricht University, Maastricht, The Netherlands.

出版信息

J Med Internet Res. 2025 Jun 3;27:e66919. doi: 10.2196/66919.

Abstract

BACKGROUND

The COVID-19 pandemic has highlighted the need for robust and adaptable diagnostic tools capable of detecting the disease from diverse and continuously evolving data sources. Machine learning models, particularly convolutional neural networks, are promising in this regard. However, the dynamic nature of real-world data can lead to model drift, where the model's performance degrades over time, as the underlying data distribution changes due to evolving disease characteristics, demographic shifts, and variations in recording conditions. Addressing this challenge is crucial to maintaining the accuracy and reliability of these models in ongoing diagnostic applications.

OBJECTIVE

This study aims to develop a comprehensive framework that not only monitors model drift over time but also uses adaptation mechanisms to mitigate performance fluctuations in COVID-19 detection models trained on dynamic cough audio data.

METHODS

Two crowdsourced COVID-19 audio datasets, namely COVID-19 Sounds and Coswara, were used for development and evaluation purposes. Each dataset was divided into 2 distinct periods, namely the development period and postdevelopment period. A baseline convolutional neural network model was initially trained and evaluated using data (ie, coughs from COVID-19 Sounds and shallow coughs from Coswara dataset) from the development period. To detect changes in data distributions and the model's performance between these periods, the maximum mean discrepancy distance was used. Upon detecting significant drift, a retraining procedure was triggered to update the baseline model. The study explored 2 model adaptation approaches, unsupervised domain adaptation and active learning, both of which were comparatively assessed.

RESULTS

The baseline model achieved an area under the receiver operating characteristic curve of 69.13% and a balanced accuracy of 63.38% on the development test set of the COVID-19 Sounds dataset, while for the Coswara dataset, the corresponding values were 66.8% and 61.64%. A decline in performance was observed when the model was evaluated on data from the postdevelopment period, indicating the presence of model drift. The application of the unsupervised domain adaptation approach led to performance improvement in terms of balanced accuracy by up to 22% and 24% for the COVID-19 Sounds and Coswara datasets, respectively. The active learning approach yielded even greater improvement, corresponding to a balanced accuracy increase of up to 30% and 60% for the 2 datasets, respectively.

CONCLUSIONS

The proposed framework successfully addresses the challenge of model drift in COVID-19 detection by enabling continuous adaptation to evolving data distributions. This approach ensures sustained model performance over time, contributing to the development of robust and adaptable diagnostic tools for COVID-19 and potentially other infectious diseases.

摘要

背景

新冠疫情凸显了对强大且适应性强的诊断工具的需求,这些工具能够从多样且不断变化的数据源中检测出该疾病。机器学习模型,尤其是卷积神经网络,在这方面颇具前景。然而,现实世界数据的动态特性可能导致模型漂移,即随着潜在数据分布因疾病特征演变、人口结构变化以及记录条件差异而改变,模型性能会随时间下降。应对这一挑战对于在持续的诊断应用中维持这些模型的准确性和可靠性至关重要。

目的

本研究旨在开发一个全面的框架,该框架不仅能随时间监测模型漂移,还能使用适应机制来减轻基于动态咳嗽音频数据训练的新冠检测模型中的性能波动。

方法

两个众包的新冠音频数据集,即新冠之声(COVID-19 Sounds)和科斯瓦拉(Coswara),用于开发和评估。每个数据集被分为两个不同阶段,即开发阶段和开发后阶段。首先使用来自开发阶段的数据(即新冠之声中的咳嗽声和科斯瓦拉数据集中的浅咳嗽声)训练并评估一个基线卷积神经网络模型。为检测这些阶段之间的数据分布变化和模型性能,使用了最大均值差异距离。一旦检测到显著漂移,就会触发重新训练程序以更新基线模型。该研究探索了两种模型适应方法,即无监督域适应和主动学习,并对两者进行了比较评估。

结果

在新冠之声数据集的开发测试集上,基线模型的受试者工作特征曲线下面积为69.13%,平衡准确率为63.38%,而对于科斯瓦拉数据集,相应的值分别为66.8%和61.64%。当在开发后阶段的数据上评估模型时,观察到性能下降,这表明存在模型漂移。无监督域适应方法的应用分别使新冠之声和科斯瓦拉数据集在平衡准确率方面提高了22%和24%。主动学习方法带来了更大的改进,两个数据集的平衡准确率分别提高了30%和60%。

结论

所提出的框架通过能够持续适应不断变化的数据分布,成功解决了新冠检测中模型漂移的挑战。这种方法确保了模型性能随时间持续稳定,有助于开发用于新冠及潜在其他传染病的强大且适应性强的诊断工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d52/12174887/362b02b186dc/jmir_v27i1e66919_fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验