• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用真实世界的电子健康记录数据预测多病共存情况下12种癌症相关症状的发生发展。

Using real-world electronic health record data to predict the development of 12 cancer-related symptoms in the context of multimorbidity.

作者信息

Bandyopadhyay Anindita, Albashayreh Alaa, Zeinali Nahid, Fan Weiguo, Gilbertson-White Stephanie

机构信息

Department of Business Analytics, University of Iowa, Iowa City, IA 52242, United States.

College of Nursing, University of Iowa, Iowa City, IA 52242, United States.

出版信息

JAMIA Open. 2024 Sep 12;7(3):ooae082. doi: 10.1093/jamiaopen/ooae082. eCollection 2024 Oct.

DOI:10.1093/jamiaopen/ooae082
PMID:39282082
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11397936/
Abstract

OBJECTIVE

This study uses electronic health record (EHR) data to predict 12 common cancer symptoms, assessing the efficacy of machine learning (ML) models in identifying symptom influencers.

MATERIALS AND METHODS

We analyzed EHR data of 8156 adults diagnosed with cancer who underwent cancer treatment from 2017 to 2020. Structured and unstructured EHR data were sourced from the Enterprise Data Warehouse for Research at the University of Iowa Hospital and Clinics. Several predictive models, including logistic regression, random forest (RF), and XGBoost, were employed to forecast symptom development. The performances of the models were evaluated by F1-score and area under the curve (AUC) on the testing set. The SHapley Additive exPlanations framework was used to interpret these models and identify the predictive risk factors associated with fatigue as an exemplar.

RESULTS

The RF model exhibited superior performance with a macro average AUC of 0.755 and an F1-score of 0.729 in predicting a range of cancer-related symptoms. For instance, the RF model achieved an AUC of 0.954 and an F1-score of 0.914 for pain prediction. Key predictive factors identified included clinical history, cancer characteristics, treatment modalities, and patient demographics depending on the symptom. For example, the odds ratio (OR) for fatigue was significantly influenced by allergy (OR = 2.3, 95% CI: 1.8-2.9) and colitis (OR = 1.9, 95% CI: 1.5-2.4).

DISCUSSION

Our research emphasizes the critical integration of multimorbidity and patient characteristics in modeling cancer symptoms, revealing the considerable influence of chronic conditions beyond cancer itself.

CONCLUSION

We highlight the potential of ML for predicting cancer symptoms, suggesting a pathway for integrating such models into clinical systems to enhance personalized care and symptom management.

摘要

目的

本研究使用电子健康记录(EHR)数据预测12种常见癌症症状,评估机器学习(ML)模型在识别症状影响因素方面的有效性。

材料与方法

我们分析了2017年至2020年接受癌症治疗的8156名成年癌症患者的EHR数据。结构化和非结构化EHR数据来源于爱荷华大学医院和诊所的企业研究数据仓库。采用了几种预测模型,包括逻辑回归、随机森林(RF)和XGBoost,来预测症状的发展。通过测试集上的F1分数和曲线下面积(AUC)评估模型的性能。使用SHapley加法解释框架来解释这些模型,并将与疲劳相关的预测风险因素作为示例进行识别。

结果

RF模型在预测一系列癌症相关症状方面表现出卓越性能,宏观平均AUC为0.755,F1分数为0.729。例如,RF模型在疼痛预测方面的AUC为0.954,F1分数为0.914。确定的关键预测因素包括临床病史、癌症特征、治疗方式以及取决于症状的患者人口统计学特征。例如,过敏(优势比[OR]=2.3,95%置信区间[CI]:1.8 - 2.9)和结肠炎(OR = 1.9,95% CI:1.5 - 2.4)对疲劳的优势比有显著影响。

讨论

我们的研究强调了在对癌症症状进行建模时多病症和患者特征的关键整合,揭示了慢性病在癌症本身之外的重大影响。

结论

我们强调了ML在预测癌症症状方面的潜力,提出了将此类模型整合到临床系统中以加强个性化护理和症状管理的途径。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddf1/11397936/c5abe440d94b/ooae082f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddf1/11397936/bed1618510c5/ooae082f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddf1/11397936/fc542678432b/ooae082f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddf1/11397936/c5abe440d94b/ooae082f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddf1/11397936/bed1618510c5/ooae082f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddf1/11397936/fc542678432b/ooae082f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddf1/11397936/c5abe440d94b/ooae082f3.jpg

相似文献

1
Using real-world electronic health record data to predict the development of 12 cancer-related symptoms in the context of multimorbidity.利用真实世界的电子健康记录数据预测多病共存情况下12种癌症相关症状的发生发展。
JAMIA Open. 2024 Sep 12;7(3):ooae082. doi: 10.1093/jamiaopen/ooae082. eCollection 2024 Oct.
2
Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.利用基于深度学习的自然语言处理技术从非结构化电子健康记录中分类社会健康决定因素。
J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.
3
Deep Learning Approaches for Predicting Glaucoma Progression Using Electronic Health Records and Natural Language Processing.使用电子健康记录和自然语言处理的深度学习方法预测青光眼进展
Ophthalmol Sci. 2022 Feb 12;2(2):100127. doi: 10.1016/j.xops.2022.100127. eCollection 2022 Jun.
4
Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.基于 FHIR 的电子健康记录表型框架的开发:以从出院小结中识别肥胖且伴有多种合并症的患者为例。
J Biomed Inform. 2019 Nov;99:103310. doi: 10.1016/j.jbi.2019.103310. Epub 2019 Oct 14.
5
Predicting seizure recurrence after an initial seizure-like episode from routine clinical notes using large language models: a retrospective cohort study.使用大型语言模型从常规临床记录预测首次类似癫痫发作后的癫痫复发:一项回顾性队列研究。
Lancet Digit Health. 2023 Dec;5(12):e882-e894. doi: 10.1016/S2589-7500(23)00179-6.
6
Application of machine learning model in predicting the likelihood of blood transfusion after hip fracture surgery.机器学习模型在预测髋部骨折手术后输血可能性中的应用。
Aging Clin Exp Res. 2023 Nov;35(11):2643-2656. doi: 10.1007/s40520-023-02550-4. Epub 2023 Sep 21.
7
Predicting near-term glaucoma progression: An artificial intelligence approach using clinical free-text notes and data from electronic health records.预测近期青光眼进展:一种使用临床自由文本记录和电子健康记录数据的人工智能方法。
Front Med (Lausanne). 2023 Apr 13;10:1157016. doi: 10.3389/fmed.2023.1157016. eCollection 2023.
8
Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records.基于机器学习的 COVID-19 患者出院预测模型:利用电子健康记录进行开发和评估。
PLoS One. 2023 Oct 20;18(10):e0292888. doi: 10.1371/journal.pone.0292888. eCollection 2023.
9
Symptom-BERT: Enhancing Cancer Symptom Detection in EHR Clinical Notes.症状-BERT:增强电子健康记录临床记录中的癌症症状检测
J Pain Symptom Manage. 2024 Aug;68(2):190-198.e1. doi: 10.1016/j.jpainsymman.2024.05.015. Epub 2024 May 23.
10
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.

引用本文的文献

1
The Use of Machine Learning for Analyzing Real-World Data in Disease Prediction and Management: Systematic Review.机器学习在疾病预测与管理中分析真实世界数据的应用:系统评价
JMIR Med Inform. 2025 Jun 19;13:e68898. doi: 10.2196/68898.
2
Deep Learning Approaches to Forecast Physical and Mental Deterioration During Chemotherapy in Patients with Cancer.深度学习方法预测癌症患者化疗期间身心衰退情况
Diagnostics (Basel). 2025 Apr 9;15(8):956. doi: 10.3390/diagnostics15080956.
3
Non-Invasive Cancer Detection Using Blood Test and Predictive Modeling Approach.

本文引用的文献

1
Natural Language Processing Accurately Differentiates Cancer Symptom Information in Electronic Health Record Narratives.自然语言处理能准确区分电子健康记录中的癌症症状信息。
JCO Clin Cancer Inform. 2024 Aug;8:e2300235. doi: 10.1200/CCI.23.00235.
2
Machine Learning Approaches to Predict Symptoms in People With Cancer: Systematic Review.预测癌症患者症状的机器学习方法:系统综述
JMIR Cancer. 2024 Mar 19;10:e52322. doi: 10.2196/52322.
3
The Iowa Health Data Resource (IHDR): an innovative framework for transforming the clinical health data ecosystem.
使用血液检测和预测建模方法进行非侵入性癌症检测。
Adv Appl Bioinform Chem. 2025 Jan 10;17:159-178. doi: 10.2147/AABC.S488604. eCollection 2024.
爱荷华州健康数据资源(IHDR):一个用于转变临床健康数据生态系统的创新框架。
J Am Med Inform Assoc. 2024 Feb 16;31(3):720-726. doi: 10.1093/jamia/ocad236.
4
Multimorbidity in people living with and beyond cancer: a scoping review.癌症患者及癌症康复者的多种疾病共患情况:一项范围综述
Am J Cancer Res. 2023 Sep 15;13(9):4346-4365. eCollection 2023.
5
A comprehensive analysis of recent advancements in cancer detection using machine learning and deep learning models for improved diagnostics.对使用机器学习和深度学习模型改进癌症检测诊断的最新进展进行全面分析。
J Cancer Res Clin Oncol. 2023 Nov;149(15):14365-14408. doi: 10.1007/s00432-023-05216-w. Epub 2023 Aug 4.
6
Machine learning explainability in nasopharyngeal cancer survival using LIME and SHAP.基于 LIME 和 SHAP 的机器学习在鼻咽癌生存分析中的可解释性。
Sci Rep. 2023 Jun 2;13(1):8984. doi: 10.1038/s41598-023-35795-0.
7
Quality of Life and Side Effects Management in Cancer Treatment-A Cross Sectional Study.癌症治疗中的生活质量和副作用管理——一项横断面研究。
Int J Environ Res Public Health. 2023 Jan 17;20(3):1708. doi: 10.3390/ijerph20031708.
8
Extreme gradient boosting model to assess risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: Individual prediction using SHapley Additive exPlanations.极端梯度提升模型评估甲状腺乳头状癌患者中央区颈淋巴结转移风险:使用SHapley加性解释进行个体预测
Comput Methods Programs Biomed. 2022 Oct;225:107038. doi: 10.1016/j.cmpb.2022.107038. Epub 2022 Jul 23.
9
A prediction model for xerostomia in locoregionally advanced nasopharyngeal carcinoma patients receiving radical radiotherapy.局部晚期鼻咽癌根治性放疗后口干预测模型。
BMC Oral Health. 2022 Jun 17;22(1):239. doi: 10.1186/s12903-022-02269-0.
10
A decision tree prediction model for a short-term outcome of delirium in patients with advanced cancer receiving pharmacological interventions: A secondary analysis of a multicenter and prospective observational study (Phase-R).接受药物干预的晚期癌症患者谵妄短期结局的决策树预测模型:一项多中心前瞻性观察研究(Phase-R)的二次分析
Palliat Support Care. 2022 Apr;20(2):153-158. doi: 10.1017/S1478951521001565.