Nassif Houssam, Wu Yirong, Page David, Burnside Elizabeth
University of Wisconsin, Madison, USA.
AMIA Annu Symp Proc. 2012;2012:1330-9. Epub 2012 Nov 3.
Overdiagnosis is a phenomenon in which screening identities cancer which may not go on to cause symptoms or death. Women over 65 who develop breast cancer bear the heaviest burden of overdiagnosis. This work introduces novel machine learning algorithms to improve diagnostic accuracy of breast cancer in aging populations. At the same time, we aim at minimizing unnecessary invasive procedures (thus decreasing false positives) and concomitantly addressing overdiagnosis. We develop a novel algorithm. Logical Differential Prediction Bayes Net (LDP-BN), that calculates the risk of breast disease based on mammography findings. LDP-BN uses Inductive Logic Programming (ILP) to learn relational rules, selects older-specific differentially predictive rules, and incorporates them into a Bayes Net, significantly improving its performance. In addition, LDP-BN offers valuable insight into the classification process, revealing novel older-specific rules that link mass presence to invasive, and calcification presence and lack of detectable mass to DCIS.
过度诊断是一种现象,即筛查发现的癌症可能不会发展为出现症状或导致死亡。65岁以上患乳腺癌的女性承受着最重的过度诊断负担。这项工作引入了新颖的机器学习算法,以提高老年人群中乳腺癌的诊断准确性。同时,我们旨在尽量减少不必要的侵入性程序(从而减少假阳性),并同时解决过度诊断问题。我们开发了一种新颖的算法,即逻辑差异预测贝叶斯网络(LDP-BN),它根据乳房X光检查结果计算乳腺疾病风险。LDP-BN使用归纳逻辑编程(ILP)来学习关系规则,选择针对老年人的差异预测规则,并将其纳入贝叶斯网络,从而显著提高其性能。此外,LDP-BN为分类过程提供了有价值的见解,揭示了将肿块存在与浸润性联系起来,以及将钙化存在和未检测到肿块与导管原位癌联系起来的新颖的针对老年人的规则。