Department of Applied Mathematics, Liverpool John Moores University, Liverpool, UK.
Department of Translational Medicine, Artificial Intelligence and Bioinformatics in Cardiothoracic Sciences, Lund University, Lund, Sweden.
Sci Rep. 2022 Nov 14;12(1):19525. doi: 10.1038/s41598-022-23817-2.
The most limiting factor in heart transplantation is the lack of donor organs. With enhanced prediction of outcome, it may be possible to increase the life-years from the organs that become available. Applications of machine learning to tabular data, typical of clinical decision support, pose the practical question of interpretation, which has technical and potential ethical implications. In particular, there is an issue of principle about the predictability of complex data and whether this is inherent in the data or strongly dependent on the choice of machine learning model, leading to the so-called accuracy-interpretability trade-off. We model 1-year mortality in heart transplantation data with a self-explaining neural network, which is benchmarked against a deep learning model on the same development data, in an external validation study with two data sets: (1) UNOS transplants in 2017-2018 (n = 4750) for which the self-explaining and deep learning models are comparable in their AUROC 0.628 [0.602,0.654] cf. 0.635 [0.609,0.662] and (2) Scandinavian transplants during 1997-2018 (n = 2293), showing good calibration with AUROCs of 0.626 [0.588,0.665] and 0.634 [0.570, 0.698], respectively, with and without missing data (n = 982). This shows that for tabular data, predictive models can be transparent and capture important nonlinearities, retaining full predictive performance.
心脏移植的最主要限制因素是供体器官的缺乏。随着对结果预测能力的增强,有可能延长可用器官的使用年限。机器学习在典型的临床决策支持中的表格数据中的应用提出了解释的实际问题,这具有技术和潜在的伦理影响。特别是,关于复杂数据的可预测性存在一个原则问题,即这种可预测性是数据固有的,还是强烈依赖于机器学习模型的选择,这导致了所谓的准确性-可解释性权衡。我们使用可自解释神经网络对心脏移植数据进行了 1 年死亡率建模,并在两项外部验证研究中,将其与同一开发数据上的深度学习模型进行了基准测试:(1)2017-2018 年 UNOS 移植数据(n=4750),可自解释和深度学习模型在 AUROC 0.628[0.602,0.654]上具有可比性,而在 AUROC 0.635[0.609,0.662]上具有可比性;(2)1997-2018 年斯堪的纳维亚移植数据(n=2293),显示出良好的校准效果,AUROCs 分别为 0.626[0.588,0.665]和 0.634[0.570,0.698],有和没有缺失数据(n=982)。这表明,对于表格数据,预测模型可以是透明的,并捕捉到重要的非线性,同时保留全部预测性能。