Setiono R, Liu H
Dept. of Inf. Syst. and Comput. Sci., Nat. Univ. of Singapore.
IEEE Trans Neural Netw. 1997;8(3):654-62. doi: 10.1109/72.572104.
Feature selection is an integral part of most learning algorithms. Due to the existence of irrelevant and redundant attributes, by selecting only the relevant attributes of the data, higher predictive accuracy can be expected from a machine learning method. In this paper, we propose the use of a three-layer feedforward neural network to select those input attributes that are most useful for discriminating classes in a given set of input patterns. A network pruning algorithm is the foundation of the proposed algorithm. By adding a penalty term to the error function of the network, redundant network connections can be distinguished from those relevant ones by their small weights when the network training process has been completed. A simple criterion to remove an attribute based on the accuracy rate of the network is developed. The network is retrained after removal of an attribute, and the selection process is repeated until no attribute meets the criterion for removal. Our experimental results suggest that the proposed method works very well on a wide variety of classification problems.
特征选择是大多数学习算法不可或缺的一部分。由于存在不相关和冗余的属性,通过仅选择数据的相关属性,可以期望机器学习方法具有更高的预测准确性。在本文中,我们提出使用三层前馈神经网络来选择那些对于区分给定输入模式集中的类别最有用的输入属性。一种网络剪枝算法是所提出算法的基础。通过向网络的误差函数添加一个惩罚项,当网络训练过程完成时,冗余的网络连接可以通过其小权重与那些相关连接区分开来。开发了一个基于网络准确率来移除属性的简单准则。移除一个属性后对网络进行重新训练,并且重复选择过程,直到没有属性满足移除准则。我们的实验结果表明,所提出的方法在各种各样的分类问题上都表现得非常好。