Kwon Hi Jeong, Park Sholhui, Park Young Hoon, Baik Seung Min, Park Dong Jin
Department of Laboratory Medicine, Yeouido St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea.
Department of Laboratory Medicine, Ewha Womans University College of Medicine, Seoul, Korea.
Digit Health. 2024 Jan 17;10:20552076231224245. doi: 10.1177/20552076231224245. eCollection 2024 Jan-Dec.
Modern healthcare systems face challenges related to the stable and sufficient blood supply of blood due to shortages. This study aimed to predict the monthly blood transfusion requirements in medical institutions using an artificial intelligence model based on national open big data related to transfusion.
Data regarding blood types and components in Korea from January 2010 to December 2021 were obtained from the Health Insurance Review and Assessment Service and Statistics Korea. The data were collected from a single medical institution. Using the obtained information, predictive models were developed, including eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LGBM), and category boosting (CatBoost). An ensemble model was created using these three models.
The prediction performance of XGBoost, LGBM, and CatBoost demonstrated a mean absolute error ranging from 14.6657 for AB+ red blood cells (RBCs) to 84.0433 for A+ platelet concentrate (PC) and a root mean squared error ranging from 18.5374 for AB+ RBCs to 118.6245 for B+ PC. The error range was further improved by creating ensemble models, wherein the department requesting blood was the most influential parameter affecting transfusion prediction performance for different blood products and types. Except for the department, the features that affected the prediction performance varied for each product and blood type, including the number of RBC antibody screens, crossmatch, nationwide blood donations, and surgeries.
Based on blood-related open big data, the developed blood-demand prediction algorithm can efficiently provide medical facilities with an appropriate volume of blood ahead of time.
现代医疗系统因血液短缺面临与血液稳定充足供应相关的挑战。本研究旨在使用基于国家输血相关开放大数据的人工智能模型预测医疗机构的每月输血需求。
从韩国健康保险审查评估服务机构和韩国统计局获取2010年1月至2021年12月韩国血型和血液成分的数据。数据收集自单一医疗机构。利用所获信息开发预测模型,包括极端梯度提升(XGBoost)、轻量级梯度提升机(LGBM)和类别提升(CatBoost)。使用这三种模型创建了一个集成模型。
XGBoost、LGBM和CatBoost的预测性能显示,平均绝对误差范围从AB + 红细胞(RBC)的14.6657到A + 血小板浓缩液(PC)的84.0433,均方根误差范围从AB + RBC的18.5374到B + PC的118.6245。通过创建集成模型,误差范围进一步缩小,其中申请用血的科室是影响不同血液制品和血型输血预测性能的最具影响力参数。除科室外,影响预测性能的特征因每种制品和血型而异,包括RBC抗体筛查数量、交叉配血、全国献血量和手术量。
基于与血液相关的开放大数据,所开发的血液需求预测算法能够提前有效地为医疗机构提供适量的血液。