Gu Dongxiao, Liang Changyong, Zhao Huimin
School of Management, Hefei University of Technology, 193 Tunxi Road, Hefei, Anhui, 230009, China.
Sheldon B. Lubar School of Business, University of Wisconsin-Milwaukee, 3202 North Maryland Avenue, Milwaukee, WI, 53201, USA.
Artif Intell Med. 2017 Mar;77:31-47. doi: 10.1016/j.artmed.2017.02.003. Epub 2017 Feb 11.
We present the implementation and application of a case-based reasoning (CBR) system for breast cancer related diagnoses. By retrieving similar cases in a breast cancer decision support system, oncologists can obtain powerful information or knowledge, complementing their own experiential knowledge, in their medical decision making.
We observed two problems in applying standard CBR to this context: the abundance of different types of attributes and the difficulty in eliciting appropriate attribute weights from human experts. We therefore used a distance measure named weighted heterogeneous value distance metric, which can better deal with both continuous and discrete attributes simultaneously than the standard Euclidean distance, and a genetic algorithm for learning the attribute weights involved in this distance measure automatically. We evaluated our CBR system in two case studies, related to benign/malignant tumor prediction and secondary cancer prediction, respectively.
Weighted heterogeneous value distance metric with genetic algorithm for weight learning outperformed several alternative attribute matching methods and several classification methods by at least 3.4%, reaching 0.938, 0.883, 0.933, and 0.984 in the first case study, and 0.927, 0.842, 0.939, and 0.989 in the second case study, in terms of accuracy, sensitivity×specificity, F measure, and area under the receiver operating characteristic curve, respectively.
The evaluation result indicates the potential of CBR in the breast cancer diagnosis domain.
我们展示了一个用于乳腺癌相关诊断的基于案例推理(CBR)系统的实现与应用。通过在乳腺癌决策支持系统中检索相似案例,肿瘤学家在医疗决策过程中可以获取有力信息或知识,以补充自身的经验知识。
我们观察到在将标准CBR应用于此情境时存在两个问题:不同类型属性众多,以及难以从人类专家那里获取合适的属性权重。因此,我们使用了一种名为加权异构值距离度量的距离度量方法,它比标准欧几里得距离能更好地同时处理连续和离散属性,还使用了一种遗传算法来自动学习此距离度量中涉及的属性权重。我们在两个案例研究中评估了我们的CBR系统,分别与良性/恶性肿瘤预测和继发性癌症预测相关。
带有用于权重学习的遗传算法的加权异构值距离度量在第一个案例研究中,在准确率、灵敏度×特异度、F度量以及受试者工作特征曲线下面积方面,分别比几种替代的属性匹配方法和几种分类方法至少高出3.4%,达到了0.938、0.883、0.933和0.984;在第二个案例研究中分别达到了0.927、0.842、0.939和0.989。
评估结果表明CBR在乳腺癌诊断领域具有潜力。