• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于条件互信息的多标签特征选择。

Multi-Label Feature Selection with Conditional Mutual Information.

机构信息

Faculty of Information and Technology, Beijing University of Technology, Beijing 100020, China.

Beijing-Dublin International College, Beijing University of Technology, Beijing 100020, China.

出版信息

Comput Intell Neurosci. 2022 Oct 8;2022:9243893. doi: 10.1155/2022/9243893. eCollection 2022.

DOI:10.1155/2022/9243893
PMID:36254206
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9569236/
Abstract

Feature selection is an important way to optimize the efficiency and accuracy of classifiers. However, traditional feature selection methods cannot work with many kinds of data in the real world, such as multi-label data. To overcome this challenge, multi-label feature selection is developed. Multi-label feature selection plays an irreplaceable role in pattern recognition and data mining. This process can improve the efficiency and accuracy of multi-label classification. However, traditional multi-label feature selection based on mutual information does not fully consider the effect of redundancy among labels. The deficiency may lead to repeated computing of mutual information and leave room to enhance the accuracy of multi-label feature selection. To deal with this challenge, this paper proposed a multi-label feature selection based on conditional mutual information among labels (CRMIL). Firstly, we analyze how to reduce the redundancy among features based on existing papers. Secondly, we propose a new approach to diminish the redundancy among labels. This method takes label sets as conditions to calculate the relevance between features and labels. This approach can weaken the impact of the redundancy among labels on feature selection results. Finally, we analyze this algorithm and balance the effects of relevance and redundancy on the evaluation function. For testing CRMIL, we compare it with the other eight multi-label feature selection algorithms on ten datasets and use four evaluation criteria to examine the results. Experimental results illustrate that CRMIL performs better than other existing algorithms.

摘要

特征选择是优化分类器效率和准确性的重要方法。然而,传统的特征选择方法无法处理现实世界中的许多数据,例如多标签数据。为了克服这一挑战,开发了多标签特征选择。多标签特征选择在模式识别和数据挖掘中起着不可替代的作用。这个过程可以提高多标签分类的效率和准确性。然而,传统的基于互信息的多标签特征选择并没有充分考虑标签之间冗余的影响。这种不足可能导致互信息的重复计算,并为提高多标签特征选择的准确性留出空间。为了解决这个挑战,本文提出了一种基于标签间条件互信息的多标签特征选择方法(CRMIL)。首先,我们分析了如何基于现有文献减少特征之间的冗余。其次,我们提出了一种减少标签之间冗余的新方法。该方法将标签集作为条件来计算特征与标签之间的相关性。这种方法可以减弱标签之间冗余对特征选择结果的影响。最后,我们分析了该算法,并平衡了相关性和冗余性对评价函数的影响。为了测试 CRMIL,我们在十个数据集上与其他八个多标签特征选择算法进行了比较,并使用了四个评价标准来检验结果。实验结果表明,CRMIL 比其他现有的算法表现更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/f250d6cb8842/CIN2022-9243893.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/4ef3c85c99f9/CIN2022-9243893.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/30a2cc164853/CIN2022-9243893.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/5c45567499ed/CIN2022-9243893.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/9de5fd69c0dd/CIN2022-9243893.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/eeff73589d9c/CIN2022-9243893.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/defd62a708ee/CIN2022-9243893.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/e33051155ca3/CIN2022-9243893.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/dd370cbf1340/CIN2022-9243893.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/9b0236d649ae/CIN2022-9243893.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/461d25bb8a07/CIN2022-9243893.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/cf29ff40f02c/CIN2022-9243893.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/39ca36d4111c/CIN2022-9243893.012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/7a5ceedf7e8a/CIN2022-9243893.013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/77c025d7fbe9/CIN2022-9243893.014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/8df9fadf1f8a/CIN2022-9243893.015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/1b41402d8044/CIN2022-9243893.016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/7ba395f28e5f/CIN2022-9243893.017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/f250d6cb8842/CIN2022-9243893.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/4ef3c85c99f9/CIN2022-9243893.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/30a2cc164853/CIN2022-9243893.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/5c45567499ed/CIN2022-9243893.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/9de5fd69c0dd/CIN2022-9243893.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/eeff73589d9c/CIN2022-9243893.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/defd62a708ee/CIN2022-9243893.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/e33051155ca3/CIN2022-9243893.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/dd370cbf1340/CIN2022-9243893.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/9b0236d649ae/CIN2022-9243893.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/461d25bb8a07/CIN2022-9243893.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/cf29ff40f02c/CIN2022-9243893.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/39ca36d4111c/CIN2022-9243893.012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/7a5ceedf7e8a/CIN2022-9243893.013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/77c025d7fbe9/CIN2022-9243893.014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/8df9fadf1f8a/CIN2022-9243893.015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/1b41402d8044/CIN2022-9243893.016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/7ba395f28e5f/CIN2022-9243893.017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375a/9569236/f250d6cb8842/CIN2022-9243893.alg.001.jpg

相似文献

1
Multi-Label Feature Selection with Conditional Mutual Information.基于条件互信息的多标签特征选择。
Comput Intell Neurosci. 2022 Oct 8;2022:9243893. doi: 10.1155/2022/9243893. eCollection 2022.
2
A Feature Selection Algorithm Integrating Maximum Classification Information and Minimum Interaction Feature Dependency Information.一种集成最大分类信息和最小交互特征依赖信息的特征选择算法。
Comput Intell Neurosci. 2021 Dec 28;2021:3569632. doi: 10.1155/2021/3569632. eCollection 2021.
3
Multi-Label Feature Selection Combining Three Types of Conditional Relevance.结合三种条件相关性的多标签特征选择
Entropy (Basel). 2021 Dec 1;23(12):1617. doi: 10.3390/e23121617.
4
Multi-Label Feature Selection Based on High-Order Label Correlation Assumption.基于高阶标签相关性假设的多标签特征选择
Entropy (Basel). 2020 Jul 21;22(7):797. doi: 10.3390/e22070797.
5
An evolutionary decomposition-based multi-objective feature selection for multi-label classification.一种基于进化分解的多标签分类多目标特征选择方法。
PeerJ Comput Sci. 2020 Mar 2;6:e261. doi: 10.7717/peerj-cs.261. eCollection 2020.
6
Online Multi-Label Streaming Feature Selection Based on Label Group Correlation and Feature Interaction.基于标签组相关性和特征交互的在线多标签流特征选择
Entropy (Basel). 2023 Jul 17;25(7):1071. doi: 10.3390/e25071071.
7
Comparison of five supervised feature selection algorithms leading to top features and gene signatures from multi-omics data in cancer.比较五种监督特征选择算法,这些算法可从癌症的多组学数据中得到顶级特征和基因特征。
BMC Bioinformatics. 2022 Apr 28;23(Suppl 3):153. doi: 10.1186/s12859-022-04678-y.
8
Computer-assisted lip diagnosis on Traditional Chinese Medicine using multi-class support vector machines.基于多类支持向量机的中医唇诊计算机辅助诊断。
BMC Complement Altern Med. 2012 Aug 16;12:127. doi: 10.1186/1472-6882-12-127.
9
FSCME: A Feature Selection Method Combining Copula Correlation and Maximal Information Coefficient by Entropy Weights.FSCME:一种基于熵权的结合 Copula 相关性和最大信息系数的特征选择方法。
IEEE J Biomed Health Inform. 2024 Sep;28(9):5638-5648. doi: 10.1109/JBHI.2024.3409628. Epub 2024 Sep 5.
10
A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox.基于混合特征选择和多标签驱动的齿轮箱智能故障诊断方法。
Sensors (Basel). 2023 May 16;23(10):4792. doi: 10.3390/s23104792.

本文引用的文献

1
Multi-Population Genetic Algorithm for Multilabel Feature Selection Based on Label Complementary Communication.基于标签互补通信的多群体遗传算法用于多标签特征选择
Entropy (Basel). 2020 Aug 10;22(8):876. doi: 10.3390/e22080876.
2
Multilabel classification for exploiting cross-resistance information in HIV-1 drug resistance prediction.利用 HIV-1 耐药性预测中的交叉耐药信息进行多标签分类。
Bioinformatics. 2013 Aug 15;29(16):1946-52. doi: 10.1093/bioinformatics/btt331. Epub 2013 Jun 21.