Medical Image Analysis Lab, Simon Fraser University, Canada.
Comput Methods Programs Biomed. 2017 Jul;145:85-93. doi: 10.1016/j.cmpb.2017.04.012. Epub 2017 Apr 13.
Feature reduction is an essential stage in computer aided breast cancer diagnosis systems. Multilayer neural networks can be trained to extract relevant features by encoding high-dimensional data into low-dimensional codes. Optimizing traditional auto-encoders works well only if the initial weights are close to a proper solution. They are also trained to only reduce the mean squared reconstruction error (MRE) between the encoder inputs and the decoder outputs, but do not address the classification error. The goal of the current work is to test the hypothesis that extending traditional auto-encoders (which only minimize reconstruction error) to multi-objective optimization for finding Pareto-optimal solutions provides more discriminative features that will improve classification performance when compared to single-objective and other multi-objective approaches (i.e. scalarized and sequential).
In this paper, we introduce a novel multi-objective optimization of deep auto-encoder networks, in which the auto-encoder optimizes two objectives: MRE and mean classification error (MCE) for Pareto-optimal solutions, rather than just MRE. These two objectives are optimized simultaneously by a non-dominated sorting genetic algorithm.
We tested our method on 949 X-ray mammograms categorized into 12 classes. The results show that the features identified by the proposed algorithm allow a classification accuracy of up to 98.45%, demonstrating favourable accuracy over the results of state-of-the-art methods reported in the literature.
We conclude that adding the classification objective to the traditional auto-encoder objective and optimizing for finding Pareto-optimal solutions, using evolutionary multi-objective optimization, results in producing more discriminative features.
特征降维是计算机辅助乳腺癌诊断系统中的一个重要阶段。多层神经网络可以通过将高维数据编码为低维代码来提取相关特征。优化传统的自动编码器只有在初始权重接近适当的解时才有效。它们还被训练成仅降低编码器输入和解码器输出之间的均方重建误差(MRE),但不解决分类误差。目前的工作旨在测试以下假设:将传统的自动编码器(仅最小化重建误差)扩展到多目标优化,以找到帕累托最优解,提供更具判别力的特征,与单目标和其他多目标方法(即标量和顺序)相比,将提高分类性能。
在本文中,我们引入了一种新的深度自动编码器网络的多目标优化,其中自动编码器优化两个目标:MRE 和平均分类误差(MCE),以获得帕累托最优解,而不仅仅是 MRE。这两个目标通过非支配排序遗传算法同时优化。
我们在 949 张 X 射线乳房图像上进行了测试,这些图像分为 12 类。结果表明,所提出的算法所识别的特征可实现高达 98.45%的分类精度,优于文献中报道的最新方法的结果。
我们得出结论,将分类目标添加到传统自动编码器目标中,并通过进化多目标优化来优化寻找帕累托最优解,可产生更具判别力的特征。