• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

三种数据挖掘技术在肺癌诊断中的对比分析。

Comparative analysis of three data mining techniques in diagnosis of lung cancer.

机构信息

College of Public Health, Zhengzhou University.

The First Affiliated Hospital of Zhengzhou University.

出版信息

Eur J Cancer Prev. 2021 Jan;30(1):15-20. doi: 10.1097/CEJ.0000000000000598.

DOI:10.1097/CEJ.0000000000000598
PMID:32868638
Abstract

There is a lot of abnormal information in the development of lung cancer, and how to extract useful knowledge is urgent from massive information. Data mining technology has become a popular tool for medical classification and prediction. However, each technology has its advantage and disadvantage, and several data mining methods will be applied to conduct the in-depth analysis step by step. And the prediction results of different models are compared. A total of 180 lung cancer patients and 243 lung benign individuals were collected from the First Affiliated Hospital of Zhengzhou University from October 2014 to March 2016, and the prediction models based on epidemiological data, clinical features and tumor markers were developed by artificial neural network (ANN), decision tree C5.0 and support vector machine (SVM). The results showed that there were significant differences between the lung cancer group and the lung benign group in terms of seven tumor markers and 10 epidemiological and clinical indicators. The accuracy rates of ANN, C5.0 and SVM were 76.47, 89.92 and 85.71%, respectively. The results of receiver operating characteristic curve (ROC) curve revealed the area under the ROC curve (AUC) of ANN was 0.811 (0.770-0.847), the AUC of C5.0 was 0.897 (0.864-0.924) and the AUC of SVM was 0.878 (0.843-0.908). It was shown that the decision tree C5.0 model has the least error rate and highest accuracy, and it could be used to diagnose lung cancer.

摘要

肺癌的发展存在大量异常信息,如何从海量信息中提取有用的知识迫在眉睫。数据挖掘技术已成为医学分类和预测的热门工具。然而,每种技术都有其优势和劣势,因此将应用几种数据挖掘方法进行深入分析,逐步进行。并比较不同模型的预测结果。共收集 2014 年 10 月至 2016 年 3 月郑州大学第一附属医院的 180 例肺癌患者和 243 例肺部良性个体,基于流行病学数据、临床特征和肿瘤标志物,采用人工神经网络(ANN)、决策树 C5.0 和支持向量机(SVM)构建预测模型。结果表明,在七种肿瘤标志物和十种流行病学及临床指标方面,肺癌组与肺部良性组之间存在显著差异。ANN、C5.0 和 SVM 的准确率分别为 76.47%、89.92%和 85.71%。受试者工作特征曲线(ROC)的结果表明,ANN 的 ROC 曲线下面积(AUC)为 0.811(0.770-0.847),C5.0 的 AUC 为 0.897(0.864-0.924),SVM 的 AUC 为 0.878(0.843-0.908)。结果表明,决策树 C5.0 模型误差率最小,准确率最高,可用于诊断肺癌。

相似文献

1
Comparative analysis of three data mining techniques in diagnosis of lung cancer.三种数据挖掘技术在肺癌诊断中的对比分析。
Eur J Cancer Prev. 2021 Jan;30(1):15-20. doi: 10.1097/CEJ.0000000000000598.
2
Predicting coronary artery disease: a comparison between two data mining algorithms.预测冠状动脉疾病:两种数据挖掘算法的比较。
BMC Public Health. 2019 Apr 29;19(1):448. doi: 10.1186/s12889-019-6721-5.
3
Development of a machine learning-based multimode diagnosis system for lung cancer.基于机器学习的肺癌多模态诊断系统的开发。
Aging (Albany NY). 2020 May 23;12(10):9840-9854. doi: 10.18632/aging.103249.
4
Pre-operative prediction of advanced prostatic cancer using clinical decision support systems: accuracy comparison between support vector machine and artificial neural network.使用临床决策支持系统对晚期前列腺癌进行术前预测:支持向量机和人工神经网络的准确性比较。
Korean J Radiol. 2011 Sep-Oct;12(5):588-94. doi: 10.3348/kjr.2011.12.5.588. Epub 2011 Aug 24.
5
Comparative evaluation of support vector machines for computer aided diagnosis of lung cancer in CT based on a multi-dimensional data set.基于多维数据集的 CT 图像肺癌计算机辅助诊断中支持向量机的比较评估。
Comput Methods Programs Biomed. 2013 Aug;111(2):519-24. doi: 10.1016/j.cmpb.2013.04.016. Epub 2013 May 31.
6
Seminal quality prediction using data mining methods.使用数据挖掘方法进行精液质量预测。
Technol Health Care. 2014;22(4):531-45. doi: 10.3233/THC-140816.
7
Comparison of three data mining models for prediction of advanced schistosomiasis prognosis in the Hubei province.比较三种数据挖掘模型在预测湖北省晚期血吸虫病预后中的应用。
PLoS Negl Trop Dis. 2018 Feb 15;12(2):e0006262. doi: 10.1371/journal.pntd.0006262. eCollection 2018 Feb.
8
Patient classification and outcome prediction in IgA nephropathy.IgA肾病的患者分类与预后预测
Comput Biol Med. 2015 Nov 1;66:278-86. doi: 10.1016/j.compbiomed.2015.09.003. Epub 2015 Sep 25.
9
Osteoporosis risk prediction using machine learning and conventional methods.使用机器学习和传统方法进行骨质疏松症风险预测。
Annu Int Conf IEEE Eng Med Biol Soc. 2013;2013:188-91. doi: 10.1109/EMBC.2013.6609469.
10
Application of knowledge discovery process on the prediction of stroke.知识发现过程在中风预测中的应用。
Comput Methods Programs Biomed. 2015 May;119(3):181-5. doi: 10.1016/j.cmpb.2015.03.002. Epub 2015 Mar 13.

引用本文的文献

1
Treatment Algorithm for the Resorption of Calcific Tendinitis Using Extracorporeal Shockwave Therapy: A Data Mining Study.体外冲击波疗法治疗钙化性肌腱炎吸收的治疗算法:一项数据挖掘研究
Orthop J Sports Med. 2024 Mar 5;12(3):23259671241231609. doi: 10.1177/23259671241231609. eCollection 2024 Mar.
2
Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit.机器学习在预测 ICU 收治的肺癌患者院内死亡率中的应用。
PLoS One. 2023 Jan 26;18(1):e0280606. doi: 10.1371/journal.pone.0280606. eCollection 2023.
3
Utilization of Decision Tree Algorithms for Supporting the Prediction of Intensive Care Unit Admission of Myasthenia Gravis: A Machine Learning-Based Approach.
利用决策树算法支持重症肌无力患者入住重症监护病房的预测:一种基于机器学习的方法。
J Pers Med. 2022 Jan 2;12(1):32. doi: 10.3390/jpm12010032.