• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于淋巴疾病的随机森林分类器。

A random forest classifier for lymph diseases.

机构信息

Faculty of Computers and Information, Benha University, Egypt.

Faculty of Computers and Information, Cairo University, Egypt; Scientific Research Group in Egypt (SRGE), Egypt.

出版信息

Comput Methods Programs Biomed. 2014 Feb;113(2):465-73. doi: 10.1016/j.cmpb.2013.11.004. Epub 2013 Nov 14.

DOI:10.1016/j.cmpb.2013.11.004
PMID:24290902
Abstract

Machine learning-based classification techniques provide support for the decision-making process in many areas of health care, including diagnosis, prognosis, screening, etc. Feature selection (FS) is expected to improve classification performance, particularly in situations characterized by the high data dimensionality problem caused by relatively few training examples compared to a large number of measured features. In this paper, a random forest classifier (RFC) approach is proposed to diagnose lymph diseases. Focusing on feature selection, the first stage of the proposed system aims at constructing diverse feature selection algorithms such as genetic algorithm (GA), Principal Component Analysis (PCA), Relief-F, Fisher, Sequential Forward Floating Search (SFFS) and the Sequential Backward Floating Search (SBFS) for reducing the dimension of lymph diseases dataset. Switching from feature selection to model construction, in the second stage, the obtained feature subsets are fed into the RFC for efficient classification. It was observed that GA-RFC achieved the highest classification accuracy of 92.2%. The dimension of input feature space is reduced from eighteen to six features by using GA.

摘要

基于机器学习的分类技术为医疗保健的许多领域(包括诊断、预后、筛查等)的决策过程提供支持。特征选择(FS)有望提高分类性能,特别是在数据维度高的情况下,由于训练样本相对较少,而测量特征数量较大,导致数据维度高的问题。本文提出了一种随机森林分类器(RFC)方法来诊断淋巴疾病。该系统的第一阶段侧重于特征选择,旨在构建多种特征选择算法,如遗传算法(GA)、主成分分析(PCA)、 Relief-F、Fisher、顺序前向浮动搜索(SFFS)和顺序后向浮动搜索(SBFS),以降低淋巴疾病数据集的维度。从特征选择到模型构建的转换,在第二阶段,将获得的特征子集输入到 RFC 中进行有效分类。结果表明,GA-RFC 实现了最高的分类精度 92.2%。通过使用 GA,输入特征空间的维度从十八个减少到六个。

相似文献

1
A random forest classifier for lymph diseases.用于淋巴疾病的随机森林分类器。
Comput Methods Programs Biomed. 2014 Feb;113(2):465-73. doi: 10.1016/j.cmpb.2013.11.004. Epub 2013 Nov 14.
2
Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms.基于旋转森林的分类器集成构建,以提高机器学习算法的医学诊断性能。
Comput Methods Programs Biomed. 2011 Dec;104(3):443-51. doi: 10.1016/j.cmpb.2011.03.018. Epub 2011 Apr 30.
3
A novel feature selection approach for biomedical data classification.一种用于生物医学数据分类的新特征选择方法。
J Biomed Inform. 2010 Feb;43(1):15-23. doi: 10.1016/j.jbi.2009.07.008. Epub 2009 Jul 30.
4
A classification system based on a new wrapper feature selection algorithm for the diagnosis of primary and secondary polycythemia.基于新包装特征选择算法的原发性和继发性红细胞增多症诊断分类系统。
Comput Biol Med. 2013 Dec;43(12):2118-26. doi: 10.1016/j.compbiomed.2013.09.016. Epub 2013 Sep 28.
5
Rotation forest: A new classifier ensemble method.旋转森林:一种新的分类器集成方法。
IEEE Trans Pattern Anal Mach Intell. 2006 Oct;28(10):1619-30. doi: 10.1109/TPAMI.2006.211.
6
The application of mutual information-based feature selection and fuzzy LS-SVM-based classifier in motion classification.基于互信息的特征选择与基于模糊最小二乘支持向量机的分类器在运动分类中的应用。
Comput Methods Programs Biomed. 2008 Jun;90(3):275-84. doi: 10.1016/j.cmpb.2008.01.003. Epub 2008 Mar 4.
7
Hybrid extreme rotation forest.混合极端旋转森林
Neural Netw. 2014 Apr;52:33-42. doi: 10.1016/j.neunet.2014.01.003. Epub 2014 Jan 13.
8
A new hybrid intelligent system for accurate detection of Parkinson's disease.一种用于准确检测帕金森病的新型混合智能系统。
Comput Methods Programs Biomed. 2014 Mar;113(3):904-13. doi: 10.1016/j.cmpb.2014.01.004. Epub 2014 Jan 9.
9
A hybrid color space for skin detection using genetic algorithm heuristic search and principal component analysis technique.一种使用遗传算法启发式搜索和主成分分析技术的用于皮肤检测的混合颜色空间。
PLoS One. 2015 Aug 12;10(8):e0134828. doi: 10.1371/journal.pone.0134828. eCollection 2015.
10
An efficient statistical feature selection approach for classification of gene expression data.一种用于基因表达数据分类的高效统计特征选择方法。
J Biomed Inform. 2011 Aug;44(4):529-35. doi: 10.1016/j.jbi.2011.01.001. Epub 2011 Jan 15.

引用本文的文献

1
Comprehensive statistical and machine learning framework for identification of metabolomic biomarkers in breast cancer.用于识别乳腺癌代谢组学生物标志物的综合统计和机器学习框架。
Metabolomics. 2025 Jun 14;21(4):78. doi: 10.1007/s11306-025-02265-9.
2
Simplex-structured matrix factorisation: application of soft clustering to metabolomic data.单纯形结构矩阵分解:软聚类在代谢组学数据中的应用。
Sci Rep. 2025 May 22;15(1):17817. doi: 10.1038/s41598-025-02361-9.
3
Deep transfer learning hybrid techniques for precision in breast cancer tumor histopathology classification.
用于乳腺癌肿瘤组织病理学分类精准度的深度迁移学习混合技术
Health Inf Sci Syst. 2025 Feb 11;13(1):20. doi: 10.1007/s13755-025-00337-7. eCollection 2025 Dec.
4
Classification of female MDD patients with and without suicidal ideation using resting-state functional magnetic resonance imaging and machine learning.使用静息态功能磁共振成像和机器学习对有和无自杀意念的女性重度抑郁症患者进行分类。
Front Hum Neurosci. 2025 Jan 8;18:1427532. doi: 10.3389/fnhum.2024.1427532. eCollection 2024.
5
Artificial Intelligence-Based Diagnosis of Gastric Mesenchymal Tumors Using Digital Endosonography Image Analysis.基于人工智能的胃间质瘤数字超声内镜图像分析诊断
J Clin Med. 2024 Jun 26;13(13):3725. doi: 10.3390/jcm13133725.
6
Physical frailty identification using machine learning to explore the 5-item FRAIL scale, Cardiovascular Health Study index, and Study of Osteoporotic Fractures index.使用机器学习识别身体虚弱:探索 5 项 FRAIL 量表、心血管健康研究指数和骨质疏松性骨折研究指数。
Front Public Health. 2024 May 9;12:1303958. doi: 10.3389/fpubh.2024.1303958. eCollection 2024.
7
Prediction of consumers refill frequency of LPG: A study using explainable machine learning.液化石油气消费者再充装频率的预测:一项使用可解释机器学习的研究。
Heliyon. 2023 Dec 18;10(1):e23466. doi: 10.1016/j.heliyon.2023.e23466. eCollection 2024 Jan 15.
8
SSC: The novel self-stack ensemble model for thyroid disease prediction.SSC:用于甲状腺疾病预测的新型自堆叠集成模型。
PLoS One. 2024 Jan 3;19(1):e0295501. doi: 10.1371/journal.pone.0295501. eCollection 2024.
9
Stacking-ac4C: an ensemble model using mixed features for identifying n4-acetylcytidine in mRNA.Stacking-ac4C:一种使用混合特征的集成模型,用于识别 mRNA 中的 N4-乙酰胞苷。
Front Immunol. 2023 Nov 29;14:1267755. doi: 10.3389/fimmu.2023.1267755. eCollection 2023.
10
Classification models for predicting the bioactivity of pan-TRK inhibitors and SAR analysis.用于预测泛TRK抑制剂生物活性的分类模型及构效关系分析
Mol Divers. 2024 Aug;28(4):2077-2097. doi: 10.1007/s11030-023-10735-2. Epub 2023 Nov 1.