Billiet Lieven, Van Huffel Sabine, Van Belle Vanya
STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics, Department of Electrical Engineering (ESAT), KU Leuven, Leuven, Belgium.
imec, Leuven, Belgium.
PeerJ Comput Sci. 2018 Apr 2;4:e150. doi: 10.7717/peerj-cs.150. eCollection 2018.
Over the last decades, clinical decision support systems have been gaining importance. They help clinicians to make effective use of the overload of available information to obtain correct diagnoses and appropriate treatments. However, their power often comes at the cost of a black box model which cannot be interpreted easily. This interpretability is of paramount importance in a medical setting with regard to trust and (legal) responsibility. In contrast, existing medical scoring systems are easy to understand and use, but they are often a simplified rule-of-thumb summary of previous medical experience rather than a well-founded system based on available data. Interval Coded Scoring (ICS) connects these two approaches, exploiting the power of sparse optimization to derive scoring systems from training data. The presented toolbox interface makes this theory easily applicable to both small and large datasets. It contains two possible problem formulations based on linear programming or elastic net. Both allow to construct a model for a binary classification problem and establish risk profiles that can be used for future diagnosis. All of this requires only a few lines of code. ICS differs from standard machine learning through its model consisting of interpretable main effects and interactions. Furthermore, insertion of expert knowledge is possible because the training can be semi-automatic. This allows end users to make a trade-off between complexity and performance based on cross-validation results and expert knowledge. Additionally, the toolbox offers an accessible way to assess classification performance via accuracy and the ROC curve, whereas the calibration of the risk profile can be evaluated via a calibration curve. Finally, the colour-coded model visualization has particular appeal if one wants to apply ICS manually on new observations, as well as for validation by experts in the specific application domains. The validity and applicability of the toolbox is demonstrated by comparing it to standard Machine Learning approaches such as Naive Bayes and Support Vector Machines for several real-life datasets. These case studies on medical problems show its applicability as a decision support system. ICS performs similarly in terms of classification and calibration. Its slightly lower performance is countered by its model simplicity which makes it the method of choice if interpretability is a key issue.
在过去几十年中,临床决策支持系统的重要性日益凸显。它们帮助临床医生有效利用大量可用信息,以获得正确诊断和适当治疗。然而,其强大功能往往是以难以解释的黑箱模型为代价的。在医疗环境中,这种可解释性对于信任和(法律)责任至关重要。相比之下,现有的医学评分系统易于理解和使用,但它们通常是以往医学经验的简化经验法则总结,而非基于可用数据的有充分依据的系统。区间编码评分(ICS)将这两种方法联系起来,利用稀疏优化的力量从训练数据中推导评分系统。所展示的工具箱界面使该理论易于应用于小型和大型数据集。它包含基于线性规划或弹性网络的两种可能的问题表述。两者都允许构建二元分类问题的模型,并建立可用于未来诊断的风险概况。所有这些仅需几行代码。ICS与标准机器学习的不同之处在于其模型由可解释的主效应和相互作用组成。此外,由于训练可以是半自动的,因此可以插入专家知识。这使最终用户能够根据交叉验证结果和专家知识在复杂性和性能之间进行权衡。此外,该工具箱提供了一种通过准确率和ROC曲线评估分类性能的便捷方法,而风险概况的校准可以通过校准曲线进行评估。最后,如果想在新观察值上手动应用ICS,以及由特定应用领域的专家进行验证,那么颜色编码的模型可视化具有特别的吸引力。通过将其与朴素贝叶斯和支持向量机等标准机器学习方法针对多个实际数据集进行比较,证明了该工具箱的有效性和适用性。这些关于医疗问题的案例研究展示了其作为决策支持系统的适用性。ICS在分类和校准方面表现相似。其稍低的性能被其模型的简单性所抵消,这使其成为可解释性是关键问题时的首选方法。