• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用真实数据集进行数据挖掘发现乳腺癌患者的隐藏模式

Discovery of Hidden Patterns in Breast Cancer Patients, Using Data Mining on a Real Data Set.

作者信息

Atashi Alireza, Tohidinezhad Fariba, Dorri Sara, Nazeri Najmeh, Ghousi Rouzbeh, Marashi Sina, Hajialiasgari Fatemeh

机构信息

E-Health Department, Virtual School, Tehran University of Medical Sciences, Tehran, Iran.

Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran.

出版信息

Stud Health Technol Inform. 2019 Jul 4;262:142-145. doi: 10.3233/SHTI190037.

DOI:10.3233/SHTI190037
PMID:31349286
Abstract

The aim is to recognize the unknown atterns in a real breast cancer dataset using data mining algorithms as a new method in medicine. Due to excessive missing data in the collection only data on 665 of 809 patients were available. The other missing values were estimated using the EM algorithm in SPSS21 software. Fields have been converted into discrete fields and finally the APRIORI algorithm has been used to analyze and explore the unknown patterns. After the rule extraction, experts in the field of breast cancer eliminated redundant and meaningless relations. 100 association rules with a confidence value of more than 0.9 explored by the APRIORI algorithm and after the clinical expert feedback, 10 clinically meaningful relations have been detected and reported. Due to the high number of risk factors, the use of data mining is effective for cancer data. These patterns provide the future study hypotheses of specific clinical studies.

摘要

目的是使用数据挖掘算法作为医学中的一种新方法,识别真实乳腺癌数据集中的未知模式。由于数据收集过程中存在大量缺失数据,809名患者中仅有665名患者的数据可用。其他缺失值使用SPSS21软件中的期望最大化(EM)算法进行估计。字段已转换为离散字段,最后使用APRIORI算法分析和探索未知模式。在规则提取之后,乳腺癌领域的专家消除了冗余和无意义的关系。APRIORI算法探索出100条置信度值大于0.9的关联规则,经过临床专家反馈,检测并报告了10条具有临床意义的关系。由于风险因素数量众多,数据挖掘在癌症数据方面的应用是有效的。这些模式为特定临床研究提供了未来的研究假设。

相似文献

1
Discovery of Hidden Patterns in Breast Cancer Patients, Using Data Mining on a Real Data Set.利用真实数据集进行数据挖掘发现乳腺癌患者的隐藏模式
Stud Health Technol Inform. 2019 Jul 4;262:142-145. doi: 10.3233/SHTI190037.
2
Decision Support Systems in Health Care - Velocity of Apriori Algorithm.医疗保健中的决策支持系统——Apriori算法的速度
Stud Health Technol Inform. 2017;244:53-57.
3
Mining association rules between stroke risk factors based on the Apriori algorithm.基于Apriori算法挖掘中风危险因素之间的关联规则。
Technol Health Care. 2017 Jul 20;25(S1):197-205. doi: 10.3233/THC-171322.
4
Inferring Intra-Community Microbial Interaction Patterns from Metagenomic Datasets Using Associative Rule Mining Techniques.使用关联规则挖掘技术从宏基因组数据集中推断群落内微生物相互作用模式
PLoS One. 2016 Apr 28;11(4):e0154493. doi: 10.1371/journal.pone.0154493. eCollection 2016.
5
Applying Data Mining Techniques to Extract Hidden Patterns about Breast Cancer Survival in an Iranian Cohort Study.在一项伊朗队列研究中应用数据挖掘技术提取有关乳腺癌生存的隐藏模式。
J Res Health Sci. 2016 Winter;16(1):31-5.
6
Mining association patterns of drug-interactions using post marketing FDA's spontaneous reporting data.利用美国食品药品监督管理局(FDA)上市后自发报告数据挖掘药物相互作用的关联模式。
J Biomed Inform. 2016 Apr;60:294-308. doi: 10.1016/j.jbi.2016.02.009. Epub 2016 Feb 20.
7
Discovering metric temporal constraint networks on temporal databases.发现时态数据库上的度量时态约束网络。
Artif Intell Med. 2013 Jul;58(3):139-54. doi: 10.1016/j.artmed.2013.03.006. Epub 2013 May 6.
8
Use of data-mining to support real-world cost analyses: An example using HER2-positive breast cancer in Iran.利用数据挖掘支持真实世界的成本分析:以伊朗人表皮生长因子受体 2 阳性乳腺癌为例。
PLoS One. 2018 Oct 1;13(10):e0205079. doi: 10.1371/journal.pone.0205079. eCollection 2018.
9
An annotated association mining approach for extracting and visualizing interesting clinical events.一种注释关联挖掘方法,用于提取和可视化有趣的临床事件。
Int J Med Inform. 2021 Apr;148:104366. doi: 10.1016/j.ijmedinf.2020.104366. Epub 2020 Dec 13.
10
RANWAR: rank-based weighted association rule mining from gene expression and methylation data.RANWAR:从基因表达和甲基化数据中进行基于秩的加权关联规则挖掘。
IEEE Trans Nanobioscience. 2015 Jan;14(1):59-66. doi: 10.1109/TNB.2014.2359494. Epub 2014 Sep 23.

引用本文的文献

1
Different Data Mining Approaches Based Medical Text Data.基于医学文本数据的不同数据挖掘方法。
J Healthc Eng. 2021 Dec 6;2021:1285167. doi: 10.1155/2021/1285167. eCollection 2021.