• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于包装特征选择的新型 C4.5 装袋算法,用于支持明智的临床决策。

A novel bagging C4.5 algorithm based on wrapper feature selection for supporting wise clinical decision making.

机构信息

National Pilot School of Software, Yunnan University, No. 2, Cuihu North Rd., Kunming 650091, China; Queens' College, University of Cambridge, Cambridge CB3 9ET, UK.

National Pilot School of Software, Yunnan University, No. 2, Cuihu North Rd., Kunming 650091, China.

出版信息

J Biomed Inform. 2018 Feb;78:144-155. doi: 10.1016/j.jbi.2017.11.005. Epub 2017 Nov 11.

DOI:10.1016/j.jbi.2017.11.005
PMID:29137965
Abstract

From the perspective of clinical decision-making in a Medical IoT-based healthcare system, achieving effective and efficient analysis of long-term health data for supporting wise clinical decision-making is an extremely important objective, but determining how to effectively deal with the multi-dimensionality and high volume of generated data obtained from Medical IoT-based healthcare systems is an issue of increasing importance in IoT healthcare data exploration and management. A novel classifier or predicator equipped with a good feature selection function contributes effectively to classification and prediction performance. This paper proposes a novel bagging C4.5 algorithm based on wrapper feature selection, for the purpose of supporting wise clinical decision-making in the medical and healthcare fields. In particular, the new proposed sampling method, S-C4.5-SMOTE, is not only able to overcome the problem of data distortion, but also improves overall system performance because its mechanism aims at effectively reducing the data size without distortion, by keeping datasets balanced and technically smooth. This achievement directly supports the Wrapper method of effective feature selection without the need to consider the problem of huge amounts of data; this is a novel innovation in this work.

摘要

从基于医疗物联网的医疗保健系统中的临床决策角度来看,实现对长期健康数据的有效和高效分析,以支持明智的临床决策,是一个极其重要的目标,但确定如何有效地处理从基于医疗物联网的医疗保健系统中获得的多维和大量生成数据,是物联网医疗保健数据探索和管理中日益重要的问题。具有良好特征选择功能的新型分类器或预测器有助于提高分类和预测性能。本文提出了一种基于包装特征选择的新型装袋 C4.5 算法,旨在支持医疗和保健领域的明智临床决策。特别是,新提出的抽样方法 S-C4.5-SMOTE,不仅能够克服数据扭曲的问题,而且还能通过保持数据集平衡和技术平滑来有效地减少数据量,从而提高整体系统性能。这一成果直接支持了有效的特征选择的包装方法,而无需考虑大量数据的问题;这是这项工作的一项新创新。

相似文献

1
A novel bagging C4.5 algorithm based on wrapper feature selection for supporting wise clinical decision making.一种基于包装特征选择的新型 C4.5 装袋算法,用于支持明智的临床决策。
J Biomed Inform. 2018 Feb;78:144-155. doi: 10.1016/j.jbi.2017.11.005. Epub 2017 Nov 11.
2
Clinical data classification using an enhanced SMOTE and chaotic evolutionary feature selection.使用增强型SMOTE和混沌进化特征选择的临床数据分类
Comput Biol Med. 2020 Nov;126:103991. doi: 10.1016/j.compbiomed.2020.103991. Epub 2020 Sep 18.
3
Effective clinical decision support implementation using a multi filter and wrapper optimisation model for Internet of Things based healthcare data.基于物联网的医疗数据的多过滤器和封装器优化模型在有效的临床决策支持实施中的应用。
Sci Rep. 2024 Sep 18;14(1):21820. doi: 10.1038/s41598-024-71726-3.
4
Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method-a Comparative Study.基于集成数据挖掘技术和特征选择方法的皮肤病预测-对比研究。
Appl Biochem Biotechnol. 2020 Feb;190(2):341-359. doi: 10.1007/s12010-019-03093-z. Epub 2019 Jul 27.
5
Intelligent Machine Learning Approach for Effective Recognition of Diabetes in E-Healthcare Using Clinical Data.智能机器学习方法在电子医疗中使用临床数据有效识别糖尿病
Sensors (Basel). 2020 May 6;20(9):2649. doi: 10.3390/s20092649.
6
A Pareto-based Ensemble with Feature and Instance Selection for Learning from Multi-Class Imbalanced Datasets.基于 Pareto 的特征和实例选择集成学习方法在多类不平衡数据集上的应用。
Int J Neural Syst. 2017 Sep;27(6):1750028. doi: 10.1142/S0129065717500289. Epub 2017 Apr 11.
7
Classification of infectious diseases based on chemiluminescent signatures of phagocytes in whole blood.基于全血中吞噬细胞化学发光特征的传染病分类。
Artif Intell Med. 2011 Jul;52(3):153-63. doi: 10.1016/j.artmed.2011.04.001. Epub 2011 May 14.
8
Comprehensible knowledge model creation for cancer treatment decision making.用于癌症治疗决策的可理解知识模型创建。
Comput Biol Med. 2017 Mar 1;82:119-129. doi: 10.1016/j.compbiomed.2017.01.010. Epub 2017 Jan 29.
9
Ensemble Feature Learning of Genomic Data Using Support Vector Machine.使用支持向量机的基因组数据集成特征学习
PLoS One. 2016 Jun 15;11(6):e0157330. doi: 10.1371/journal.pone.0157330. eCollection 2016.
10
Stable feature selection for clinical prediction: exploiting ICD tree structure using Tree-Lasso.用于临床预测的稳定特征选择:利用树套索法挖掘国际疾病分类树结构
J Biomed Inform. 2015 Feb;53:277-90. doi: 10.1016/j.jbi.2014.11.013. Epub 2014 Dec 9.

引用本文的文献

1
An improved Red-billed blue magpie feature selection algorithm for medical data processing.一种用于医学数据处理的改进型红嘴蓝鹊特征选择算法。
PLoS One. 2025 May 22;20(5):e0324866. doi: 10.1371/journal.pone.0324866. eCollection 2025.
2
An Integrated Machine Learning Approach for Congestive Heart Failure Prediction.一种用于充血性心力衰竭预测的集成机器学习方法。
Diagnostics (Basel). 2024 Mar 29;14(7):736. doi: 10.3390/diagnostics14070736.
3
CGUFS: A clustering-guided unsupervised feature selection algorithm for gene expression data.
CGUFS:一种用于基因表达数据的聚类引导无监督特征选择算法。
J King Saud Univ Comput Inf Sci. 2023 Oct;35(9):101731. doi: 10.1016/j.jksuci.2023.101731.
4
A hybrid feature selection algorithm combining information gain and grouping particle swarm optimization for cancer diagnosis.一种结合信息增益和分组粒子群优化的混合特征选择算法用于癌症诊断。
PLoS One. 2024 Mar 11;19(3):e0290332. doi: 10.1371/journal.pone.0290332. eCollection 2024.
5
Diagnosis of COVID-19 Using Chest X-ray Images and Disease Symptoms Based on Stacking Ensemble Deep Learning.基于堆叠集成深度学习的胸部X光图像和疾病症状对COVID-19的诊断
Diagnostics (Basel). 2023 Jun 5;13(11):1968. doi: 10.3390/diagnostics13111968.
6
A hybrid sampling algorithm combining synthetic minority over-sampling technique and edited nearest neighbor for missed abortion diagnosis.一种结合合成少数过采样技术和编辑最近邻的混合采样算法,用于诊断漏诊的流产。
BMC Med Inform Decis Mak. 2022 Dec 29;22(1):344. doi: 10.1186/s12911-022-02075-2.
7
A transfer learning approach based on gradient boosting machine for diagnosis of Alzheimer's disease.一种基于梯度提升机的迁移学习方法用于阿尔茨海默病的诊断。
Front Aging Neurosci. 2022 Oct 5;14:966883. doi: 10.3389/fnagi.2022.966883. eCollection 2022.
8
Mutational Slime Mould Algorithm for Gene Selection.用于基因选择的突变黏菌算法
Biomedicines. 2022 Aug 22;10(8):2052. doi: 10.3390/biomedicines10082052.
9
A Hybrid Method to Predict Postoperative Survival of Lung Cancer Using Improved SMOTE and Adaptive SVM.一种使用改进的 SMOTE 和自适应 SVM 预测肺癌术后生存的混合方法。
Comput Math Methods Med. 2021 Sep 10;2021:2213194. doi: 10.1155/2021/2213194. eCollection 2021.
10
An ensemble-based feature selection framework to select risk factors of childhood obesity for policy decision making.一种基于集成的特征选择框架,用于选择儿童肥胖的风险因素以辅助政策制定。
BMC Med Inform Decis Mak. 2021 Jul 21;21(1):222. doi: 10.1186/s12911-021-01580-0.