• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

知识发现:数据挖掘和机器学习方法。

Knowledge Discovery: Methods from data mining and machine learning.

机构信息

University of California Davis, USA.

University of California Davis, USA.

出版信息

Soc Sci Res. 2023 Feb;110:102817. doi: 10.1016/j.ssresearch.2022.102817. Epub 2022 Oct 29.

DOI:10.1016/j.ssresearch.2022.102817
PMID:36796993
Abstract

The interdisciplinary field of knowledge discovery and data mining emerged from a necessity of big data requiring new analytical methods beyond the traditional statistical approaches to discover new knowledge from the data mine. This emergent approach is a dialectic research process that is both deductive and inductive. The data mining approach automatically or semi-automatically considers a larger number of joint, interactive, and independent predictors to address causal heterogeneity and improve prediction. Instead of challenging the conventional model-building approach, it plays an important complementary role in improving model goodness of fit, revealing valid and significant hidden patterns in data, identifying nonlinear and non-additive effects, providing insights into data developments, methods, and theory, and enriching scientific discovery. Machine learning builds models and algorithms by learning and improving from data when the explicit model structure is unclear and algorithms with good performance are difficult to attain. The most recent development is to incorporate this new paradigm of predictive modeling with the classical approach of parameter estimation regressions to produce improved models that combine explanation and prediction.

摘要

知识发现和数据挖掘的跨学科领域源于大数据的需求,需要新的分析方法,超越传统的统计方法,从数据矿山中发现新知识。这种新出现的方法是一种辩证的研究过程,既是演绎的,也是归纳的。数据挖掘方法自动或半自动地考虑更多的联合、交互和独立的预测因子,以解决因果异质性并提高预测能力。它不是对传统的模型构建方法提出挑战,而是在提高模型拟合优度、揭示数据中的有效和显著隐藏模式、识别非线性和非可加效应、深入了解数据发展、方法和理论以及丰富科学发现方面发挥着重要的补充作用。当明确的模型结构不清楚且难以获得性能良好的算法时,机器学习通过从数据中学习和改进来构建模型和算法。最新的发展是将这种新的预测建模范式与经典的参数估计回归方法相结合,以产生改进的模型,将解释和预测结合起来。

相似文献

1
Knowledge Discovery: Methods from data mining and machine learning.知识发现:数据挖掘和机器学习方法。
Soc Sci Res. 2023 Feb;110:102817. doi: 10.1016/j.ssresearch.2022.102817. Epub 2022 Oct 29.
2
An Interpretable Data-Driven Medical Knowledge Discovery Pipeline Based on Artificial Intelligence.基于人工智能的可解释数据驱动医学知识发现管道
IEEE J Biomed Health Inform. 2023 Oct;27(10):5099-5109. doi: 10.1109/JBHI.2023.3299339. Epub 2023 Oct 5.
3
Compressive Big Data Analytics: An ensemble meta-algorithm for high-dimensional multisource datasets.压缩大数据分析:一种用于高维多源数据集的集成元算法。
PLoS One. 2020 Aug 28;15(8):e0228520. doi: 10.1371/journal.pone.0228520. eCollection 2020.
4
Knowledge Discovery on Cryptocurrency Exchange Rate Prediction Using Machine Learning Pipelines.使用机器学习管道进行加密货币汇率预测的知识发现。
Sensors (Basel). 2022 Feb 23;22(5):1740. doi: 10.3390/s22051740.
5
A big data approach with artificial neural network and molecular similarity for chemical data mining and endocrine disruption prediction.一种结合人工神经网络和分子相似性的大数据方法用于化学数据挖掘和内分泌干扰预测。
Indian J Pharmacol. 2018 Jul-Aug;50(4):169-176. doi: 10.4103/ijp.IJP_304_17.
6
A novel deep mining model for effective knowledge discovery from omics data.一种用于从组学数据中进行有效知识发现的新型深度挖掘模型。
Artif Intell Med. 2020 Apr;104:101821. doi: 10.1016/j.artmed.2020.101821. Epub 2020 Feb 24.
7
A systematic review of data mining and machine learning for air pollution epidemiology.空气污染流行病学中数据挖掘与机器学习的系统综述。
BMC Public Health. 2017 Nov 28;17(1):907. doi: 10.1186/s12889-017-4914-3.
8
Healthcare pathway discovery and probabilistic machine learning.医疗保健路径发现与概率机器学习。
Int J Med Inform. 2020 May;137:104087. doi: 10.1016/j.ijmedinf.2020.104087. Epub 2020 Feb 24.
9
Joint learning-based causal relation extraction from biomedical literature.基于联合学习的生物医学文献因果关系提取
J Biomed Inform. 2023 Mar;139:104318. doi: 10.1016/j.jbi.2023.104318. Epub 2023 Feb 11.
10
Knowledge Discovery With Machine Learning for Hospital-Acquired Catheter-Associated Urinary Tract Infections.利用机器学习进行医院获得性导尿管相关尿路感染的知识发现
Comput Inform Nurs. 2020 Jan;38(1):28-35. doi: 10.1097/CIN.0000000000000562.

引用本文的文献

1
AI edge cloud service provisioning for knowledge management smart applications.用于知识管理智能应用的人工智能边缘云服务供应
Sci Rep. 2025 Sep 1;15(1):32246. doi: 10.1038/s41598-025-14429-7.
2
Machine learning analysis of greenhouse gas sources impacting Africa's food security nexus.影响非洲粮食安全关系的温室气体来源的机器学习分析
Sci Rep. 2025 Aug 6;15(1):28665. doi: 10.1038/s41598-025-14766-7.
3
Construction and evaluation of a machine learning-based predictive model for enteral nutrition feeding intolerance risk in ICU patients.
基于机器学习的ICU患者肠内营养喂养不耐受风险预测模型的构建与评估
Front Nutr. 2025 Jul 9;12:1600319. doi: 10.3389/fnut.2025.1600319. eCollection 2025.
4
Predicting Visual Acuity after Retinal Vein Occlusion Anti-VEGF Treatment: Development and Validation of an Interpretable Machine Learning Model.预测视网膜静脉阻塞抗VEGF治疗后的视力:一种可解释机器学习模型的开发与验证
J Med Syst. 2025 Apr 29;49(1):57. doi: 10.1007/s10916-025-02190-3.
5
Knowledge discovery of diseases symptoms and rehabilitation measures in Q&A communities.问答社区中疾病症状与康复措施的知识发现
Sci Rep. 2025 Apr 19;15(1):13593. doi: 10.1038/s41598-025-98300-9.
6
Predicting determinants of unimproved water supply in Ethiopia using machine learning analysis of EDHS-2019 data.利用2019年埃塞俄比亚人口与健康调查(EDHS)数据的机器学习分析预测埃塞俄比亚未改善供水的决定因素。
Sci Rep. 2025 Apr 4;15(1):11561. doi: 10.1038/s41598-025-96412-w.
7
Optimizing multi label student performance prediction with GNN-TINet: A contextual multidimensional deep learning framework.使用GNN-TINet优化多标签学生成绩预测:一种上下文多维度深度学习框架。
PLoS One. 2025 Jan 22;20(1):e0314823. doi: 10.1371/journal.pone.0314823. eCollection 2025.
8
A Machine Learning Classification Model for Gastrointestinal Health in Cancer Survivors: Roles of Telomere Length and Social Determinants of Health.一种用于癌症幸存者胃肠道健康的机器学习分类模型:端粒长度和健康的社会决定因素的作用。
Int J Environ Res Public Health. 2024 Dec 19;21(12):1694. doi: 10.3390/ijerph21121694.
9
The KEYWORDS Framework: Standardizing Keyword Selection for Improved Big Data Analytics in Biomedical Literature.关键词框架:规范关键词选择以改进生物医学文献中的大数据分析
J Int Soc Prev Community Dent. 2024 Oct 29;14(5):349-351. doi: 10.4103/jispcd.jispcd_129_24. eCollection 2024 Sep-Oct.
10
Generating Biomedical Knowledge Graphs from Knowledge Bases, Registries, and Multiomic Data.从知识库、注册库和多组学数据生成生物医学知识图谱。
bioRxiv. 2024 Nov 15:2024.11.14.623648. doi: 10.1101/2024.11.14.623648.