• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 Light Gradient Boosting Machine 的室内患者住院时间预测模型。

Length of Stay Prediction Model of Indoor Patients Based on Light Gradient Boosting Machine.

机构信息

School of Computer Sciences, Universiti Sains Malaysia, Penang 11800, Malaysia.

出版信息

Comput Intell Neurosci. 2022 Aug 30;2022:9517029. doi: 10.1155/2022/9517029. eCollection 2022.

DOI:10.1155/2022/9517029
PMID:36082346
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9448550/
Abstract

The influx of hospital patients has become common in recent years. Hospital management departments need to redeploy healthcare resources to meet the massive medical needs of patients. In this process, the hospital length of stay (LOS) of different patients is a crucial reference to the management department. Therefore, building a model to predict LOS is of great significance. Five machine learning (ML) algorithms named Lasso regression (LR), ridge regression (RR), random forest regression (RFR), light gradient boosting machine (LightGBM), and extreme gradient boosting regression (XGBR) and six feature encoding methods named label encoding, count encoding, one-hot encoding, target encoding, leave-one-out encoding, and the proposed encoding method are used to construct the regression prediction model. The Scikit-Learn toolbox on the Python platform builds the prediction model. The input is the dataset named Hospital Inpatient Discharges (SPARCS De-Identified) 2017 with 2343569 instances provided by the New York State Department of Health verify the model after removing 2.2% of the missing data, and the model ultimately uses mean squared error (MSE) and coefficient of determination ( ) as the performance measurement. The results show that the model with the LightGBM algorithm and the proposed encoding method has the best (96.0%) and MSE score (2.231).

摘要

近年来,医院患者人数的涌入已变得很常见。医院管理部门需要重新部署医疗资源,以满足大量患者的医疗需求。在这个过程中,不同患者的医院住院时间(LOS)是管理部门的重要参考。因此,建立一个预测 LOS 的模型具有重要意义。本文使用了五种机器学习(ML)算法,分别是 Lasso 回归(LR)、岭回归(RR)、随机森林回归(RFR)、轻梯度提升机(LightGBM)和极端梯度提升回归(XGBR),以及六种特征编码方法,分别是标签编码、计数编码、独热编码、目标编码、留一法编码和提出的编码方法,来构建回归预测模型。该模型使用 Python 平台上的 Scikit-Learn 工具箱构建。输入数据集名为 Hospital Inpatient Discharges (SPARCS De-Identified) 2017,包含 2343569 个实例,由纽约州卫生部提供。在删除 2.2%的缺失数据后,对模型进行验证,最终模型使用均方误差(MSE)和决定系数( )作为性能度量。结果表明,使用 LightGBM 算法和提出的编码方法的模型具有最佳的 (96.0%)和 MSE 得分(2.231)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41e5/9448550/0267fcb31388/CIN2022-9517029.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41e5/9448550/e8945dd65bcd/CIN2022-9517029.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41e5/9448550/16662502a430/CIN2022-9517029.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41e5/9448550/4f9d94d1cba1/CIN2022-9517029.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41e5/9448550/c1c528384c96/CIN2022-9517029.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41e5/9448550/0267fcb31388/CIN2022-9517029.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41e5/9448550/e8945dd65bcd/CIN2022-9517029.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41e5/9448550/16662502a430/CIN2022-9517029.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41e5/9448550/4f9d94d1cba1/CIN2022-9517029.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41e5/9448550/c1c528384c96/CIN2022-9517029.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41e5/9448550/0267fcb31388/CIN2022-9517029.005.jpg

相似文献

1
Length of Stay Prediction Model of Indoor Patients Based on Light Gradient Boosting Machine.基于 Light Gradient Boosting Machine 的室内患者住院时间预测模型。
Comput Intell Neurosci. 2022 Aug 30;2022:9517029. doi: 10.1155/2022/9517029. eCollection 2022.
2
Machine learning-based prediction of hospital prolonged length of stay admission at emergency department: a Gradient Boosting algorithm analysis.基于机器学习的急诊科住院时间延长预测:梯度提升算法分析
Front Artif Intell. 2023 Jul 28;6:1179226. doi: 10.3389/frai.2023.1179226. eCollection 2023.
3
Risk score prediction model based on single nucleotide polymorphism for predicting malaria: a machine learning approach.基于单核苷酸多态性的疟疾风险评分预测模型:一种机器学习方法。
BMC Bioinformatics. 2022 Aug 7;23(1):325. doi: 10.1186/s12859-022-04870-0.
4
Predicting hospital length of stay using machine learning on a large open health dataset.利用大型开源健康数据集的机器学习预测住院时间。
BMC Health Serv Res. 2024 Jul 29;24(1):860. doi: 10.1186/s12913-024-11238-y.
5
Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models.使用机器学习模型预测肺癌患者术后肺功能
Tuberc Respir Dis (Seoul). 2023 Jul;86(3):203-215. doi: 10.4046/trd.2022.0048. Epub 2023 Apr 11.
6
Quantifying the impact of addressing data challenges in prediction of length of stay.量化解决数据挑战对住院时间预测的影响。
BMC Med Inform Decis Mak. 2021 Oct 30;21(1):298. doi: 10.1186/s12911-021-01660-1.
7
Prediction of Greenhouse Tomato Crop Evapotranspiration Using XGBoost Machine Learning Model.使用XGBoost机器学习模型预测温室番茄作物蒸散量
Plants (Basel). 2022 Jul 25;11(15):1923. doi: 10.3390/plants11151923.
8
Prediction of 3-year risk of diabetic kidney disease using machine learning based on electronic medical records.基于电子病历的机器学习预测糖尿病肾病 3 年风险。
J Transl Med. 2022 Mar 26;20(1):143. doi: 10.1186/s12967-022-03339-1.
9
Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure.考虑环境暴露因素的机器学习方法预测心血管病入院高峰日
BMC Med Inform Decis Mak. 2020 May 1;20(1):83. doi: 10.1186/s12911-020-1101-8.
10
Predicting length of stay and mortality among hospitalized patients with type 2 diabetes mellitus and hypertension.预测住院 2 型糖尿病伴高血压患者的住院时间和死亡率。
Int J Med Inform. 2021 Oct;154:104569. doi: 10.1016/j.ijmedinf.2021.104569. Epub 2021 Sep 4.

引用本文的文献

1
Development of a risk prediction model for sepsis-related delirium based on multiple machine learning approaches and an online calculator.基于多种机器学习方法和在线计算器开发脓毒症相关性谵妄风险预测模型。
PLoS One. 2025 Jul 16;20(7):e0323831. doi: 10.1371/journal.pone.0323831. eCollection 2025.
2
Development of an emergency department length-of-stay prediction model based on machine learning.基于机器学习的急诊科住院时间预测模型的开发。
World J Emerg Med. 2025 May 1;16(3):220-224. doi: 10.5847/wjem.j.1920-8642.2025.048.
3
Development of Multiservice Machine Learning Models to Predict Postsurgical Length of Stay and Discharge Disposition at the Time of Case Posting.

本文引用的文献

1
Machine learning to predict passenger mortality and hospital length of stay following motor vehicle collision.机器学习预测机动车碰撞后乘客死亡率和住院时间。
Neurosurg Focus. 2022 Apr;52(4):E12. doi: 10.3171/2022.1.FOCUS21739.
2
An explainable machine learning framework for lung cancer hospital length of stay prediction.用于肺癌住院时间预测的可解释机器学习框架。
Sci Rep. 2022 Jan 12;12(1):607. doi: 10.1038/s41598-021-04608-7.
3
Prospective and external validation of stroke discharge planning machine learning models.前瞻性和外部验证脑卒中出院计划机器学习模型。
开发多服务机器学习模型以预测病例发布时的术后住院时间和出院处置情况。
Ann Surg Open. 2025 Jan 31;6(1):e547. doi: 10.1097/AS9.0000000000000547. eCollection 2025 Mar.
4
Equitable hospital length of stay prediction for patients with learning disabilities and multiple long-term conditions using machine learning.使用机器学习对学习障碍和多种长期病症患者的住院时间进行公平预测。
Front Digit Health. 2025 Feb 14;7:1538793. doi: 10.3389/fdgth.2025.1538793. eCollection 2025.
5
Light gradient boosting-based prediction of quality of life among oral cancer-treated patients.基于 Light Gradient Boosting 的口腔癌治疗患者生活质量预测。
BMC Oral Health. 2024 Mar 19;24(1):349. doi: 10.1186/s12903-024-04050-x.
6
OEDL: an optimized ensemble deep learning method for the prediction of acute ischemic stroke prognoses using union features.OEDL:一种使用联合特征预测急性缺血性中风预后的优化集成深度学习方法。
Front Neurol. 2023 Jun 21;14:1158555. doi: 10.3389/fneur.2023.1158555. eCollection 2023.
J Clin Neurosci. 2022 Feb;96:80-84. doi: 10.1016/j.jocn.2021.12.031. Epub 2022 Jan 6.
4
Machine learning using preoperative patient factors can predict duration of surgery and length of stay for total knee arthroplasty.使用术前患者因素的机器学习可以预测全膝关节置换术的手术时间和住院时间。
Int J Med Inform. 2022 Feb;158:104670. doi: 10.1016/j.ijmedinf.2021.104670. Epub 2021 Dec 22.
5
Quantifying the impact of addressing data challenges in prediction of length of stay.量化解决数据挑战对住院时间预测的影响。
BMC Med Inform Decis Mak. 2021 Oct 30;21(1):298. doi: 10.1186/s12911-021-01660-1.
6
The application of machine learning algorithms in predicting the length of stay following femoral neck fracture.机器学习算法在预测股骨颈骨折后住院时间方面的应用。
Int J Med Inform. 2021 Nov;155:104572. doi: 10.1016/j.ijmedinf.2021.104572. Epub 2021 Sep 13.
7
Hospital Length of Stay Prediction Methods: A Systematic Review.住院时间预测方法:系统评价。
Med Care. 2021 Oct 1;59(10):929-938. doi: 10.1097/MLR.0000000000001596.
8
The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation.在回归分析评估中,决定系数R平方比对称平均绝对百分比误差(SMAPE)、平均绝对误差(MAE)、平均绝对百分比误差(MAPE)、均方误差(MSE)和均方根误差(RMSE)更具信息量。
PeerJ Comput Sci. 2021 Jul 5;7:e623. doi: 10.7717/peerj-cs.623. eCollection 2021.
9
Hospital capacity during the COVID-19 pandemic.新冠疫情期间的医院收治能力。
Glob Health Med. 2021 Apr 30;3(2):56-59. doi: 10.35772/ghm.2021.01031.
10
Medically Necessary, Time-Sensitive Procedures: Scoring System to Ethically and Efficiently Manage Resource Scarcity and Provider Risk During the COVID-19 Pandemic.医学必需、时间敏感的程序:在 COVID-19 大流行期间,用于在道德和高效地管理资源稀缺和提供者风险的评分系统。
J Am Coll Surg. 2020 Aug;231(2):281-288. doi: 10.1016/j.jamcollsurg.2020.04.011. Epub 2020 Apr 9.