• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自适应合成数据和集成学习的医学应用混合多标签分类模型

Hybrid Multi-Label Classification Model for Medical Applications Based on Adaptive Synthetic Data and Ensemble Learning.

作者信息

Priyadharshini M, Banu A Faritha, Sharma Bhisham, Chowdhury Subrata, Rabie Khaled, Shongwe Thokozani

机构信息

Department of Computer Science Engineering, Nalla Malla Reddy Engineering College, Hyderabad 500088, Telangana, India.

Department of Computer Science, Karpagam Academy of Higher Education, Coimbatore 631027, Tamil Nadu, India.

出版信息

Sensors (Basel). 2023 Jul 31;23(15):6836. doi: 10.3390/s23156836.

DOI:10.3390/s23156836
PMID:37571619
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10422387/
Abstract

In recent years, both machine learning and computer vision have seen growth in the use of multi-label categorization. SMOTE is now being utilized in existing research for data balance, and SMOTE does not consider that nearby examples may be from different classes when producing synthetic samples. As a result, there can be more class overlap and more noise. To avoid this problem, this work presented an innovative technique called Adaptive Synthetic Data-Based Multi-label Classification (ASDMLC). Adaptive Synthetic (ADASYN) sampling is a sampling strategy for learning from unbalanced data sets. ADASYN weights minority class instances by learning difficulty. For hard-to-learn minority class cases, synthetic data are created. Their numerical variables are normalized with the help of the Min-Max technique to standardize the magnitude of each variable's impact on the outcomes. The values of the attribute in this work are changed to a new range, from 0 to 1, using the normalization approach. To raise the accuracy of multi-label classification, Velocity-Equalized Particle Swarm Optimization (VPSO) is utilized for feature selection. In the proposed approach, to overcome the premature convergence problem, standard PSO has been improved by equalizing the velocity with each dimension of the problem. To expose the inherent label dependencies, the multi-label classification ensemble of Adaptive Neuro-Fuzzy Inference System (ANFIS), Probabilistic Neural Network (PNN), and Clustering-Based Decision tree methods will be processed based on an averaging method. The following criteria, including precision, recall, accuracy, and error rate, are used to assess performance. The suggested model's multi-label classification accuracy is 90.88%, better than previous techniques, which is PCT, HOMER, and ML-Forest is 65.57%, 70.66%, and 82.29%, respectively.

摘要

近年来,机器学习和计算机视觉在多标签分类的应用方面都有所发展。目前,SMOTE已被用于现有研究中的数据平衡,但SMOTE在生成合成样本时没有考虑到附近的示例可能来自不同的类别。因此,可能会出现更多的类重叠和更多的噪声。为了避免这个问题,这项工作提出了一种创新技术,称为基于自适应合成数据的多标签分类(ASDMLC)。自适应合成(ADASYN)采样是一种从不平衡数据集中学习的采样策略。ADASYN根据学习难度对少数类实例进行加权。对于难以学习的少数类情况,创建合成数据。它们的数值变量借助Min-Max技术进行归一化,以标准化每个变量对结果的影响程度。在这项工作中,使用归一化方法将属性值更改为从0到1的新范围。为了提高多标签分类的准确性,采用速度均衡粒子群优化(VPSO)进行特征选择。在所提出的方法中,为了克服早熟收敛问题,通过使速度与问题的每个维度相等来改进标准粒子群优化。为了揭示固有的标签依赖性,将基于平均方法处理自适应神经模糊推理系统(ANFIS)、概率神经网络(PNN)和基于聚类的决策树方法的多标签分类集成。使用以下标准(包括精度、召回率、准确率和错误率)来评估性能。所提出模型的多标签分类准确率为90.88%,优于先前的技术,即PCT、HOMER和ML-Forest,它们的准确率分别为65.57%、70.66%和82.29%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/470c1ebd05b3/sensors-23-06836-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/3592d0789aa2/sensors-23-06836-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/884a0d3c74f3/sensors-23-06836-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/8b43bea6e7cf/sensors-23-06836-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/9ab315ea7526/sensors-23-06836-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/c1fedf1f7281/sensors-23-06836-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/4ccabb6a7421/sensors-23-06836-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/d7b985e78db4/sensors-23-06836-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/8f47545ff7a2/sensors-23-06836-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/470c1ebd05b3/sensors-23-06836-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/3592d0789aa2/sensors-23-06836-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/884a0d3c74f3/sensors-23-06836-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/8b43bea6e7cf/sensors-23-06836-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/9ab315ea7526/sensors-23-06836-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/c1fedf1f7281/sensors-23-06836-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/4ccabb6a7421/sensors-23-06836-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/d7b985e78db4/sensors-23-06836-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/8f47545ff7a2/sensors-23-06836-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a3d/10422387/470c1ebd05b3/sensors-23-06836-g009.jpg

相似文献

1
Hybrid Multi-Label Classification Model for Medical Applications Based on Adaptive Synthetic Data and Ensemble Learning.基于自适应合成数据和集成学习的医学应用混合多标签分类模型
Sensors (Basel). 2023 Jul 31;23(15):6836. doi: 10.3390/s23156836.
2
Fuzzy-based prediction of solar PV and wind power generation for microgrid modeling using particle swarm optimization.基于模糊理论的太阳能光伏和风力发电预测,用于采用粒子群优化算法的微电网建模
Heliyon. 2023 Jan 5;9(1):e12802. doi: 10.1016/j.heliyon.2023.e12802. eCollection 2023 Jan.
3
Leveraging ANFIS with Adam and PSO optimizers for Parkinson's disease.利用带有Adam和粒子群优化算法优化器的自适应神经模糊推理系统来研究帕金森病。
Heliyon. 2024 Apr 26;10(9):e30241. doi: 10.1016/j.heliyon.2024.e30241. eCollection 2024 May 15.
4
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
5
Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification.基于自适应群体聚类的动态多目标合成少数类过采样技术算法,用于处理生物医学数据分类中的二元不平衡数据集。
BioData Min. 2016 Dec 1;9:37. doi: 10.1186/s13040-016-0117-1. eCollection 2016.
6
A multiple combined method for rebalancing medical data with class imbalances.一种用于平衡具有类别不平衡的医学数据的多重组合方法。
Comput Biol Med. 2021 Jul;134:104527. doi: 10.1016/j.compbiomed.2021.104527. Epub 2021 May 31.
7
Weighted butterfly optimization algorithm with intuitionistic fuzzy Gaussian function based adaptive-neuro fuzzy inference system for covid-19 prediction.基于直觉模糊高斯函数的加权蝴蝶优化算法与自适应神经模糊推理系统用于新冠疫情预测
Mater Today Proc. 2022;56:3317-3324. doi: 10.1016/j.matpr.2021.10.153. Epub 2021 Oct 25.
8
RSMOTE: improving classification performance over imbalanced medical datasets.RSMOTE:提升不平衡医学数据集的分类性能
Health Inf Sci Syst. 2020 Jun 12;8(1):22. doi: 10.1007/s13755-020-00112-w. eCollection 2020 Dec.
9
A PSO-based multi-objective multi-label feature selection method in classification.基于粒子群优化的分类多目标多标签特征选择方法。
Sci Rep. 2017 Mar 23;7(1):376. doi: 10.1038/s41598-017-00416-0.
10
Driver Drowsiness Detection Based on Steering Wheel Data Applying Adaptive Neuro-Fuzzy Feature Selection.基于自适应神经模糊特征选择的方向盘数据驾驶员瞌睡检测。
Sensors (Basel). 2019 Feb 22;19(4):943. doi: 10.3390/s19040943.

引用本文的文献

1
A population based optimization of convolutional neural networks for chronic kidney disease prediction.基于群体的卷积神经网络优化用于慢性肾脏病预测
Sci Rep. 2025 Apr 25;15(1):14500. doi: 10.1038/s41598-025-99270-8.
2
Machine learning-driven identification of key risk factors for predicting depression among nurses.机器学习驱动识别护士中预测抑郁症的关键风险因素。
BMC Nurs. 2025 Apr 3;24(1):368. doi: 10.1186/s12912-025-02957-6.
3
Using machine-learning models to predict extubation failure in neonates with bronchopulmonary dysplasia.

本文引用的文献

1
Explainable diabetes classification using hybrid Bayesian-optimized TabNet architecture.使用混合贝叶斯优化的TabNet架构进行可解释的糖尿病分类
Comput Biol Med. 2022 Dec;151(Pt A):106178. doi: 10.1016/j.compbiomed.2022.106178. Epub 2022 Oct 6.
2
AdaBoost Ensemble Methods Using K-Fold Cross Validation for Survivability with the Early Detection of Heart Disease.基于 K 折交叉验证的 AdaBoost 集成方法在心脏病早期检测中的生存能力研究。
Comput Intell Neurosci. 2022 Apr 18;2022:9005278. doi: 10.1155/2022/9005278. eCollection 2022.
3
Accurate and Efficient Large-Scale Multi-Label Learning With Reduced Feature Broad Learning System Using Label Correlation.
使用机器学习模型预测患有支气管肺发育不良的新生儿拔管失败。
BMC Pulm Med. 2024 Jul 1;24(1):308. doi: 10.1186/s12890-024-03133-3.
4
Deep learning algorithm performance in contouring head and neck organs at risk: a systematic review and single-arm meta-analysis.深度学习算法在勾画头颈部危及器官中的性能:系统评价和单臂荟萃分析。
Biomed Eng Online. 2023 Nov 1;22(1):104. doi: 10.1186/s12938-023-01159-y.
基于标签相关性的特征约简广义学习系统实现准确高效的大规模多标签学习
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10240-10253. doi: 10.1109/TNNLS.2022.3165299. Epub 2023 Nov 30.
4
3D convolutional neural networks-based multiclass classification of Alzheimer's and Parkinson's diseases using PET and SPECT neuroimaging modalities.基于3D卷积神经网络的阿尔茨海默病和帕金森病多类别分类,使用正电子发射断层扫描(PET)和单光子发射计算机断层扫描(SPECT)神经成像模态
Brain Inform. 2021 Nov 2;8(1):23. doi: 10.1186/s40708-021-00144-2.
5
False Negative Rates in Benign Thyroid Nodule Diagnosis: Machine Learning for Detecting Malignancy.良性甲状腺结节诊断中的假阴性率:用于检测恶性肿瘤的机器学习
J Surg Res. 2021 Dec;268:562-569. doi: 10.1016/j.jss.2021.06.076. Epub 2021 Aug 28.
6
Design of intelligent diabetes mellitus detection system using hybrid feature selection based XGBoost classifier.基于混合特征选择的 XGBoost 分类器的智能糖尿病检测系统设计。
Comput Biol Med. 2021 Sep;136:104664. doi: 10.1016/j.compbiomed.2021.104664. Epub 2021 Jul 22.
7
Knowledge and practice of diabetic foot care - A scoping review.糖尿病足护理知识与实践 - 范围综述。
Diabetes Metab Syndr. 2021 May-Jun;15(3):783-793. doi: 10.1016/j.dsx.2021.03.030. Epub 2021 Mar 31.
8
Impact of diabetes mellitus on mortality rates and outcomes in myocardial infarction.糖尿病对心肌梗死死亡率和结局的影响。
Diabetes Metab. 2021 Jul;47(4):101211. doi: 10.1016/j.diabet.2020.11.003. Epub 2020 Nov 28.
9
Intuitionistic based segmentation of thyroid nodules in ultrasound images.基于直觉主义的超声图像中甲状腺结节分割
Comput Biol Med. 2020 Jun;121:103776. doi: 10.1016/j.compbiomed.2020.103776. Epub 2020 May 4.
10
End-to-End Feature-Aware Label Space Encoding for Multilabel Classification With Many Classes.端到端特征感知标签空间编码用于多标签分类的多类。
IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2472-2487. doi: 10.1109/TNNLS.2017.2691545. Epub 2017 May 9.