• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用机器学习模型预测内科门诊预约的爽约情况。

Predicting no-shows at outpatient appointments in internal medicine using machine learning models.

作者信息

Ocampo Osorio Felipe, Pedroza Gomez Santiago, Rebellón Sanchez David Esteban, Ramirez Fernandez Richard, Tabares-Soto Reinel, Bravo-Ortíz Mario Alejandro, Cruz Suarez Gustavo Adolfo

机构信息

Unidad de Inteligencia Artificial, Fundación Valle del Lili, Cali, Valle del Cauca, Colombia.

Centro de Investigaciones Clínicas, Fundación Valle del Lili, Cali, Valle del Cauca, Colombia.

出版信息

PeerJ Comput Sci. 2025 Apr 22;11:e2762. doi: 10.7717/peerj-cs.2762. eCollection 2025.

DOI:10.7717/peerj-cs.2762
PMID:40567710
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12190658/
Abstract

The high prevalence of patient absenteeism in medical appointments poses significant challenges for healthcare providers and patients, causing delays in service delivery and increasing operational inefficiencies. Addressing this issue is crucial in the internal medicine department, a fundamental pillar of comprehensive adult healthcare that manages various chronic and complex conditions. To mitigate absenteeism, we present an innovative application of machine learning models specifically designed to predict the risk of patient absenteeism in the internal medicine department of Fundación Valle del Lili, a high-complexity hospital in Colombia. Leveraging an institutional database, we conducted a statistical analysis to identify critical variables influencing absenteeism risk, including clinical and sociodemographic factors and characteristics of previously attended appointments. Our study evaluated seven distinct machine learning models, explored various data processing techniques, and addressed class imbalance through oversampling and undersampling strategies. Hyperparameter optimization was conducted for each model configuration, culminating in selecting the Bagging RandomForest model, which demonstrated outstanding performance when combined with standardized data and balanced using the Synthetic Minority Oversampling Technique (SMOTE). Additionally, Shapley values (SHAP) were applied to enhance the interpretability of the model, enabling the identification of the most influential variables in predicting medical absenteeism, such as the number of previous absences, the day and month of the appointment, and diagnosed diseases. The selected model achieved a predictive accuracy of 84.80 ± 0.81%, an AUC value of 0.89, an F1-score of 84.75%, and a recall of 83.02% in cross-validation experiments. These results highlight the potential of our experimental approach to identify the most suitable model for proactively predicting patients at high risk of absenteeism, optimizing resource allocation, and improving the quality of medical care in internal medicine in the future. Our methodology provides a foundation for reducing operational inefficiencies and strengthening intervention strategies. This benefits healthcare providers and patients through more timely and effective care. Ultimately, this approach contributes to improving patient outcomes and institutional efficiency.

摘要

患者在医疗预约中缺勤率高,给医疗服务提供者和患者带来了重大挑战,导致服务提供延迟,运营效率低下。解决这一问题在内科部门至关重要,内科是成人综合医疗保健的基本支柱,负责管理各种慢性和复杂病症。为了减少缺勤情况,我们展示了机器学习模型的一种创新应用,该模型专门设计用于预测哥伦比亚高复杂性医院Fundación Valle del Lili内科患者的缺勤风险。利用机构数据库,我们进行了统计分析,以确定影响缺勤风险的关键变量,包括临床和社会人口学因素以及之前就诊预约的特征。我们的研究评估了七种不同的机器学习模型,探索了各种数据处理技术,并通过过采样和欠采样策略解决了类别不平衡问题。对每个模型配置进行了超参数优化,最终选择了Bagging随机森林模型,该模型在与标准化数据结合并使用合成少数过采样技术(SMOTE)进行平衡时表现出色。此外,应用了Shapley值(SHAP)来提高模型的可解释性,从而能够识别预测医疗缺勤中最具影响力的变量,如之前的缺勤次数、预约的日期和月份以及诊断出的疾病。在交叉验证实验中,所选模型的预测准确率为84.80±0.81%,AUC值为0.89,F1分数为84.75%,召回率为83.02%。这些结果凸显了我们实验方法的潜力,即识别最适合主动预测高缺勤风险患者的模型,优化资源分配,并在未来提高内科医疗质量。我们的方法为减少运营效率低下和加强干预策略提供了基础。这通过更及时有效的护理使医疗服务提供者和患者受益。最终,这种方法有助于改善患者预后和机构效率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/e658efc879dd/peerj-cs-11-2762-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/b1af5d9a3c2a/peerj-cs-11-2762-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/09264c031fc4/peerj-cs-11-2762-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/524292d9fcb5/peerj-cs-11-2762-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/bc92ada884ce/peerj-cs-11-2762-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/a5252e9a414c/peerj-cs-11-2762-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/aafed1361240/peerj-cs-11-2762-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/e658efc879dd/peerj-cs-11-2762-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/b1af5d9a3c2a/peerj-cs-11-2762-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/09264c031fc4/peerj-cs-11-2762-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/524292d9fcb5/peerj-cs-11-2762-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/bc92ada884ce/peerj-cs-11-2762-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/a5252e9a414c/peerj-cs-11-2762-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/aafed1361240/peerj-cs-11-2762-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f718/12190658/e658efc879dd/peerj-cs-11-2762-g007.jpg

相似文献

1
Predicting no-shows at outpatient appointments in internal medicine using machine learning models.使用机器学习模型预测内科门诊预约的爽约情况。
PeerJ Comput Sci. 2025 Apr 22;11:e2762. doi: 10.7717/peerj-cs.2762. eCollection 2025.
2
Stakeholders' perceptions and experiences of factors influencing the commissioning, delivery, and uptake of general health checks: a qualitative evidence synthesis.利益相关者对影响一般健康检查的委托、提供和接受因素的看法与体验:一项定性证据综合分析
Cochrane Database Syst Rev. 2025 Mar 20;3(3):CD014796. doi: 10.1002/14651858.CD014796.pub2.
3
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
4
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
5
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
6
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.
7
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
9
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块型银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2022 May 23;5(5):CD011535. doi: 10.1002/14651858.CD011535.pub5.
10
Computer and mobile technology interventions for self-management in chronic obstructive pulmonary disease.用于慢性阻塞性肺疾病自我管理的计算机和移动技术干预措施。
Cochrane Database Syst Rev. 2017 May 23;5(5):CD011425. doi: 10.1002/14651858.CD011425.pub2.

本文引用的文献

1
Processing imbalanced medical data at the data level with assisted-reproduction data as an example.以辅助生殖数据为例,在数据层面处理不平衡的医学数据。
BioData Min. 2024 Sep 4;17(1):29. doi: 10.1186/s13040-024-00384-y.
2
Classification of Alzheimer's disease stages from magnetic resonance images using deep learning.利用深度学习从磁共振图像对阿尔茨海默病阶段进行分类。
PeerJ Comput Sci. 2023 Aug 24;9:e1490. doi: 10.7717/peerj-cs.1490. eCollection 2023.
3
Deep Learning and Machine Learning with Grid Search to Predict Later Occurrence of Breast Cancer Metastasis Using Clinical Data.
利用临床数据,通过深度学习和带网格搜索的机器学习预测乳腺癌转移的后期发生情况。
J Clin Med. 2022 Sep 29;11(19):5772. doi: 10.3390/jcm11195772.
4
A novel multi-class imbalanced EEG signals classification based on the adaptive synthetic sampling (ADASYN) approach.一种基于自适应合成采样(ADASYN)方法的新型多类不平衡脑电信号分类。
PeerJ Comput Sci. 2021 May 14;7:e523. doi: 10.7717/peerj-cs.523. eCollection 2021.
5
Predicting scheduled hospital attendance with artificial intelligence.利用人工智能预测预约就诊情况。
NPJ Digit Med. 2019 Apr 12;2:26. doi: 10.1038/s41746-019-0103-3. eCollection 2019.
6
Prevalence, Predictors, and the Financial Impact of Missed Appointments in an Academic Adolescent Clinic.学术性青少年诊所中预约失约的患病率、预测因素及经济影响
Cureus. 2018 Nov 19;10(11):e3613. doi: 10.7759/cureus.3613.
7
[Absenteeism and associated factors in scheduled visits to a Preventive Medicine outpatient clinic].[预防医学门诊预定就诊中的缺勤情况及相关因素]
J Healthc Qual Res. 2018 Mar-Apr;33(2):82-87. doi: 10.1016/j.cali.2017.12.006. Epub 2018 Mar 9.
8
No-shows in appointment scheduling - a systematic literature review.失约于预约安排 - 系统文献回顾。
Health Policy. 2018 Apr;122(4):412-421. doi: 10.1016/j.healthpol.2018.02.002. Epub 2018 Feb 15.
9
Prevalence and risk factors associated with non-attendance in neurodevelopmental follow-up clinic among infants with CHD.先天性心脏病(CHD)患儿神经发育随访门诊失访的患病率及相关危险因素。
Cardiol Young. 2018 Apr;28(4):554-560. doi: 10.1017/S1047951117002748. Epub 2018 Jan 23.
10
Random Forest.随机森林
J Insur Med. 2017;47(1):31-39. doi: 10.17849/insm-47-01-31-39.1.