• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

临床数据挖掘:转化应用的挑战、机遇和建议。

Clinical data mining: challenges, opportunities, and recommendations for translational applications.

机构信息

Medical Big Data and Bioinformatics Research Centre, First Affiliated Hospital of Gannan Medical University, Ganzhou, China.

School of Public Health and Health Management, Gannan Medical University, Ganzhou, China.

出版信息

J Transl Med. 2024 Feb 20;22(1):185. doi: 10.1186/s12967-024-05005-0.

DOI:10.1186/s12967-024-05005-0
PMID:38378565
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10880222/
Abstract

Clinical data mining of predictive models offers significant advantages for re-evaluating and leveraging large amounts of complex clinical real-world data and experimental comparison data for tasks such as risk stratification, diagnosis, classification, and survival prediction. However, its translational application is still limited. One challenge is that the proposed clinical requirements and data mining are not synchronized. Additionally, the exotic predictions of data mining are difficult to apply directly in local medical institutions. Hence, it is necessary to incisively review the translational application of clinical data mining, providing an analytical workflow for developing and validating prediction models to ensure the scientific validity of analytic workflows in response to clinical questions. This review systematically revisits the purpose, process, and principles of clinical data mining and discusses the key causes contributing to the detachment from practice and the misuse of model verification in developing predictive models for research. Based on this, we propose a niche-targeting framework of four principles: Clinical Contextual, Subgroup-Oriented, Confounder- and False Positive-Controlled (CSCF), to provide guidance for clinical data mining prior to the model's development in clinical settings. Eventually, it is hoped that this review can help guide future research and develop personalized predictive models to achieve the goal of discovering subgroups with varied remedial benefits or risks and ensuring that precision medicine can deliver its full potential.

摘要

临床数据挖掘预测模型为重新评估和利用大量复杂的临床真实世界数据和实验比较数据提供了显著优势,可用于风险分层、诊断、分类和生存预测等任务。然而,其转化应用仍然有限。一个挑战是提出的临床要求与数据挖掘不同步。此外,数据挖掘的奇异预测难以直接应用于当地医疗机构。因此,有必要对临床数据挖掘的转化应用进行深入审查,为开发和验证预测模型提供分析工作流程,以确保分析工作流程针对临床问题的科学有效性。本综述系统地回顾了临床数据挖掘的目的、过程和原则,并讨论了导致其与实践脱节和模型验证在研究中开发预测模型时被滥用的关键原因。在此基础上,我们提出了一个以四个原则为导向的针对性框架:临床语境、亚组导向、混杂因素和假阳性控制(CSCF),以在临床环境中开发模型之前为临床数据挖掘提供指导。最终,希望本综述能够为未来的研究提供指导,并开发个性化的预测模型,以实现发现具有不同治疗效果或风险的亚组的目标,并确保精准医学能够充分发挥其潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/959a/10880222/bd4df5b5f45e/12967_2024_5005_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/959a/10880222/a342b2af938b/12967_2024_5005_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/959a/10880222/bd4df5b5f45e/12967_2024_5005_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/959a/10880222/a342b2af938b/12967_2024_5005_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/959a/10880222/bd4df5b5f45e/12967_2024_5005_Fig2_HTML.jpg

相似文献

1
Clinical data mining: challenges, opportunities, and recommendations for translational applications.临床数据挖掘:转化应用的挑战、机遇和建议。
J Transl Med. 2024 Feb 20;22(1):185. doi: 10.1186/s12967-024-05005-0.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Overview of the BioCreative VI Precision Medicine Track: mining protein interactions and mutations for precision medicine.BioCreative VI 精准医学赛道概述:精准医学中的蛋白质相互作用和突变挖掘。
Database (Oxford). 2019 Jan 1;2019:bay147. doi: 10.1093/database/bay147.
4
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
5
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].[氯化乙酰甲胆碱支气管激发试验标准技术规范(2023年)]
Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247.
6
Avoiding and identifying errors in health technology assessment models: qualitative study and methodological review.避免和识别健康技术评估模型中的错误:定性研究和方法学综述。
Health Technol Assess. 2010 May;14(25):iii-iv, ix-xii, 1-107. doi: 10.3310/hta14250.
7
Exploring conceptual and theoretical frameworks for nurse practitioner education: a scoping review protocol.探索执业护士教育的概念和理论框架:一项范围综述方案
JBI Database System Rev Implement Rep. 2015 Oct;13(10):146-55. doi: 10.11124/jbisrir-2015-2150.
8
Predictive data mining in clinical medicine: current issues and guidelines.临床医学中的预测性数据挖掘:当前问题与指南
Int J Med Inform. 2008 Feb;77(2):81-97. doi: 10.1016/j.ijmedinf.2006.11.006. Epub 2006 Dec 26.
9
Data mining in clinical big data: the frequently used databases, steps, and methodological models.临床大数据中的数据挖掘:常用数据库、步骤和方法学模型。
Mil Med Res. 2021 Aug 11;8(1):44. doi: 10.1186/s40779-021-00338-z.
10
Integration and Visualization of Translational Medicine Data for Better Understanding of Human Diseases.转化医学数据的整合与可视化,以更好地理解人类疾病。
Big Data. 2016 Jun;4(2):97-108. doi: 10.1089/big.2015.0057.

引用本文的文献

1
An improved Red-billed blue magpie feature selection algorithm for medical data processing.一种用于医学数据处理的改进型红嘴蓝鹊特征选择算法。
PLoS One. 2025 May 22;20(5):e0324866. doi: 10.1371/journal.pone.0324866. eCollection 2025.
2
Gene-level connections between anxiety disorders, ADHD, and head and neck cancer: insights from a computational biology approach.焦虑症、注意力缺陷多动障碍与头颈癌之间的基因水平联系:来自计算生物学方法的见解
Front Psychiatry. 2025 Mar 20;16:1552815. doi: 10.3389/fpsyt.2025.1552815. eCollection 2025.
3
The data-intensive research paradigm: challenges and responses in clinical professional graduate education.

本文引用的文献

1
Clonally expanded CD38 cytotoxic CD8 T cells define the T cell infiltrate in checkpoint inhibitor-associated arthritis.克隆扩增的 CD38 细胞毒性 CD8 T 细胞定义了检查点抑制剂相关关节炎中的 T 细胞浸润。
Sci Immunol. 2023 Jul 28;8(85):eadd1591. doi: 10.1126/sciimmunol.add1591.
2
MSBooster: improving peptide identification rates using deep learning-based features.MSBooster:基于深度学习的特征提高肽段鉴定率。
Nat Commun. 2023 Jul 27;14(1):4539. doi: 10.1038/s41467-023-40129-9.
3
Biomarkers Associated With Severe COVID-19 Among Populations With High Cardiometabolic Risk: A 2-Sample Mendelian Randomization Study.
数据密集型研究范式:临床专业研究生教育中的挑战与应对
Front Med (Lausanne). 2025 Feb 7;12:1461863. doi: 10.3389/fmed.2025.1461863. eCollection 2025.
4
A Machine Learning Classification Model for Gastrointestinal Health in Cancer Survivors: Roles of Telomere Length and Social Determinants of Health.一种用于癌症幸存者胃肠道健康的机器学习分类模型:端粒长度和健康的社会决定因素的作用。
Int J Environ Res Public Health. 2024 Dec 19;21(12):1694. doi: 10.3390/ijerph21121694.
5
Optimum tacrolimus trough levels for enhanced graft survival and safety in kidney transplantation: a retrospective multicenter real-world evidence study.肾移植中提高移植物存活率和安全性的他克莫司最佳谷浓度:一项回顾性多中心真实世界证据研究
Int J Surg. 2024 Oct 1;110(10):6711-6722. doi: 10.1097/JS9.0000000000001800.
与高心血管代谢风险人群中重症 COVID-19 相关的生物标志物:两样本 Mendelian 随机研究。
JAMA Netw Open. 2023 Jul 3;6(7):e2325914. doi: 10.1001/jamanetworkopen.2023.25914.
4
Butyrate reverses ferroptosis resistance in colorectal cancer by inducing c-Fos-dependent xCT suppression.丁酸盐通过诱导 c-Fos 依赖性 xCT 抑制逆转结直肠癌细胞中的铁死亡抵抗。
Redox Biol. 2023 Sep;65:102822. doi: 10.1016/j.redox.2023.102822. Epub 2023 Jul 20.
5
Bidirectional Mendelian Randomization and Multiphenotype GWAS Show Causality and Shared Pathophysiology Between Depression and Type 2 Diabetes.双向孟德尔随机化和多表型 GWAS 表明抑郁和 2 型糖尿病之间存在因果关系和共同的病理生理学。
Diabetes Care. 2023 Sep 1;46(9):1707-1714. doi: 10.2337/dc22-2373.
6
Association of Longer Leukocyte Telomere Length With Cardiac Size, Function, and Heart Failure.白细胞端粒长度与心脏大小、功能和心力衰竭的关系。
JAMA Cardiol. 2023 Sep 1;8(9):808-815. doi: 10.1001/jamacardio.2023.2167.
7
Association of diabetes mellitus with early-onset colorectal cancer: A systematic review and meta-analysis of 19 studies including 10 million individuals and 30,000 events.糖尿病与早发性结直肠癌的关联:包含 1000 万人和 30000 例事件的 19 项研究的系统回顾和荟萃分析。
Diabetes Metab Syndr. 2023 Aug;17(8):102828. doi: 10.1016/j.dsx.2023.102828. Epub 2023 Jul 14.
8
Identifying Children Likely to Benefit From Antibiotics for Acute Sinusitis: A Randomized Clinical Trial.识别可能从急性鼻窦炎抗生素治疗中获益的儿童:一项随机临床试验。
JAMA. 2023 Jul 25;330(4):349-358. doi: 10.1001/jama.2023.10854.
9
An immune-related gene prognostic index for predicting prognosis in patients with colorectal cancer.免疫相关基因预后指数预测结直肠癌患者预后。
Front Immunol. 2023 Jul 6;14:1156488. doi: 10.3389/fimmu.2023.1156488. eCollection 2023.
10
Using metabolomics to predict severe traumatic brain injury outcome (GOSE) at 3 and 12 months.运用代谢组学预测严重创伤性脑损伤患者在 3 个月和 12 个月时的格拉斯哥预后评分(GOSE)。
Crit Care. 2023 Jul 22;27(1):295. doi: 10.1186/s13054-023-04573-9.