IEEE Trans Cybern. 2015 Apr;45(4):635-46. doi: 10.1109/TCYB.2014.2332037. Epub 2014 Jul 8.
The classification of incomplete patterns is a very challenging task because the object (incomplete pattern) with different possible estimations of missing values may yield distinct classification results. The uncertainty (ambiguity) of classification is mainly caused by the lack of information of the missing data. A new prototype-based credal classification (PCC) method is proposed to deal with incomplete patterns thanks to the belief function framework used classically in evidential reasoning approach. The class prototypes obtained by training samples are respectively used to estimate the missing values. Typically, in a c -class problem, one has to deal with c prototypes, which yield c estimations of the missing values. The different edited patterns based on each possible estimation are then classified by a standard classifier and we can get at most c distinct classification results for an incomplete pattern. Because all these distinct classification results are potentially admissible, we propose to combine them all together to obtain the final classification of the incomplete pattern. A new credal combination method is introduced for solving the classification problem, and it is able to characterize the inherent uncertainty due to the possible conflicting results delivered by different estimations of the missing values. The incomplete patterns that are very difficult to classify in a specific class will be reasonably and automatically committed to some proper meta-classes by PCC method in order to reduce errors. The effectiveness of PCC method has been tested through four experiments with artificial and real data sets.
不完整模式的分类是一项极具挑战性的任务,因为具有不同缺失值估计的目标(不完整模式)可能会产生截然不同的分类结果。分类的不确定性主要是由缺失数据的信息不足引起的。由于证据推理方法中经典使用的置信函数框架,提出了一种新的基于原型的可信分类(PCC)方法来处理不完整模式。通过训练样本获得的类原型分别用于估计缺失值。通常,在 c 类问题中,必须处理 c 个原型,这会产生缺失值的 c 个估计值。然后,基于每个可能的估计值对不同的编辑模式进行分类,对于一个不完整的模式,我们最多可以得到 c 个不同的分类结果。由于所有这些不同的分类结果都是潜在可接受的,因此我们建议将它们全部组合在一起,以获得不完整模式的最终分类。引入了一种新的可信组合方法来解决分类问题,它能够描述由于缺失值的不同估计值产生的可能冲突结果而导致的固有不确定性。PCC 方法能够合理地自动将某些特定类别中难以分类的不完整模式分配到适当的元类别中,以减少错误。通过四个使用人工和真实数据集的实验,验证了 PCC 方法的有效性。