Mansouri Dou El Kefel, Benabdeslem Khalid, Benkabou Seif Eddine, Chaib Souleyman, Chohri Mohamed
IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):9871-9884. doi: 10.1109/TNNLS.2023.3237170. Epub 2024 Jul 8.
In many real-world applications, data are represented by multiple instances and simultaneously associated with multiple labels. These data are always redundant and generally contaminated by different noise levels. As a result, several machine learning models fail to achieve good classification and find an optimal mapping. Feature selection, instance selection, and label selection are three effective dimensionality reduction techniques. Nevertheless, the literature was limited to feature and/or instance selection but has, to some extent, neglected label selection, which also plays an essential role in the preprocessing step, as label noises can adversely affect the performance of the underlying learning algorithms. In this article, we propose a novel framework termed multilabel Feature Instance Label Selection (mFILS) that simultaneously performs feature, instance, and label selections in both convex and nonconvex scenarios. To the best of our knowledge, this article offers, for the first time ever, a study using the triple and simultaneous selection of features, instances, and labels based on convex and nonconvex penalties in a multilabel scenario. Experimental results are built on some known benchmark datasets to validate the effectiveness of the proposed mFILS.
在许多实际应用中,数据由多个实例表示,并同时与多个标签相关联。这些数据总是冗余的,并且通常受到不同噪声水平的污染。因此,一些机器学习模型无法实现良好的分类,也无法找到最优映射。特征选择、实例选择和标签选择是三种有效的降维技术。然而,相关文献仅限于特征和/或实例选择,在某种程度上忽略了标签选择,而标签选择在预处理步骤中也起着至关重要的作用,因为标签噪声会对底层学习算法的性能产生不利影响。在本文中,我们提出了一种新颖的框架,称为多标签特征实例标签选择(mFILS),它在凸和非凸场景中同时执行特征、实例和标签选择。据我们所知,本文首次在多标签场景中基于凸和非凸惩罚对特征、实例和标签进行三重同时选择的研究。实验结果基于一些已知的基准数据集构建,以验证所提出的mFILS的有效性。