KU Leuven, Department of Electrical Engineering (ESAT), STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics, Kasteelpark Arenberg 10, B-3001 Leuven, Belgium.
University of Cambridge, Cambridge CB2 1TN, UK.
Comput Biol Med. 2024 Mar;171:108205. doi: 10.1016/j.compbiomed.2024.108205. Epub 2024 Feb 23.
With the increasing prevalence of machine learning in critical fields like healthcare, ensuring the safety and reliability of these systems is crucial. Estimating uncertainty plays a vital role in enhancing reliability by identifying areas of high and low confidence and reducing the risk of errors. This study introduces U-PASS, a specialized human-centered machine learning pipeline tailored for clinical applications, which effectively communicates uncertainty to clinical experts and collaborates with them to improve predictions. U-PASS incorporates uncertainty estimation at every stage of the process, including data acquisition, training, and model deployment. Training is divided into a supervised pre-training step and a semi-supervised recording-wise finetuning step. We apply U-PASS to the challenging task of sleep staging and demonstrate that it systematically improves performance at every stage. By optimizing the training dataset, actively seeking feedback from domain experts for informative samples, and deferring the most uncertain samples to experts, U-PASS achieves an impressive expert-level accuracy of 85% on a challenging clinical dataset of elderly sleep apnea patients. This represents a significant improvement over the starting point at 75% accuracy. The largest improvement gain is due to the deferral of uncertain epochs to a sleep expert. U-PASS presents a promising AI approach to incorporating uncertainty estimation in machine learning pipelines, improving their reliability and unlocking their potential in clinical settings.
随着机器学习在医疗等关键领域的日益普及,确保这些系统的安全性和可靠性至关重要。通过识别高置信度和低置信度区域并降低错误风险,估计不确定性在提高可靠性方面发挥着重要作用。本研究引入了 U-PASS,这是一种专门针对临床应用的以人为中心的机器学习管道,它可以有效地向临床专家传达不确定性,并与他们合作改进预测。U-PASS 在整个过程的每个阶段都进行不确定性估计,包括数据采集、训练和模型部署。训练分为有监督的预训练步骤和半监督的按记录微调步骤。我们将 U-PASS 应用于具有挑战性的睡眠分期任务,并证明它在每个阶段都能系统地提高性能。通过优化训练数据集、积极为有价值的样本寻求领域专家的反馈,以及将最不确定的样本推迟给专家,U-PASS 在一个具有挑战性的老年睡眠呼吸暂停患者临床数据集上实现了令人印象深刻的专家级准确性,达到了 85%。这比起始准确率 75%有了显著提高。最大的改进增益是由于将不确定的时段推迟给睡眠专家。U-PASS 提出了一种有前途的人工智能方法,可以在机器学习管道中纳入不确定性估计,提高其可靠性,并在临床环境中释放其潜力。