使用机器学习模型预测肺癌患者术后肺功能

Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models.

作者信息

Kwon Oh Beom, Han Solji, Lee Hwa Young, Kang Hye Seon, Kim Sung Kyoung, Kim Ju Sang, Park Chan Kwon, Lee Sang Haak, Kim Seung Joon, Kim Jin Woo, Yeo Chang Dong

机构信息

Division of Pulmonary, Critical Care and Sleep Medicine, Department of Internal Medicine, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea.

Department of Applied Statistics, Yonsei University, Seoul, Republic of Korea.

出版信息

Tuberc Respir Dis (Seoul). 2023 Jul;86(3):203-215. doi: 10.4046/trd.2022.0048. Epub 2023 Apr 11.

DOI:10.4046/trd.2022.0048

PMID:37038881

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10323210/

Abstract

BACKGROUND

Surgical resection is the standard treatment for early-stage lung cancer. Since postoperative lung function is related to mortality, predicted postoperative lung function is used to determine the treatment modality. The aim of this study was to evaluate the predictive performance of linear regression and machine learning models.

METHODS

We extracted data from the Clinical Data Warehouse and developed three sets: set I, the linear regression model; set II, machine learning models omitting the missing data: and set III, machine learning models imputing the missing data. Six machine learning models, the least absolute shrinkage and selection operator (LASSO), Ridge regression, ElasticNet, Random Forest, eXtreme gradient boosting (XGBoost), and the light gradient boosting machine (LightGBM) were implemented. The forced expiratory volume in 1 second measured 6 months after surgery was defined as the outcome. Five-fold cross-validation was performed for hyperparameter tuning of the machine learning models. The dataset was split into training and test datasets at a 70:30 ratio. Implementation was done after dataset splitting in set III. Predictive performance was evaluated by R2 and mean squared error (MSE) in the three sets.

RESULTS

A total of 1,487 patients were included in sets I and III and 896 patients were included in set II. In set I, the R2 value was 0.27 and in set II, LightGBM was the best model with the highest R2 value of 0.5 and the lowest MSE of 154.95. In set III, LightGBM was the best model with the highest R2 value of 0.56 and the lowest MSE of 174.07.

CONCLUSION

The LightGBM model showed the best performance in predicting postoperative lung function.

摘要

背景

手术切除是早期肺癌的标准治疗方法。由于术后肺功能与死亡率相关，因此使用预测的术后肺功能来确定治疗方式。本研究的目的是评估线性回归和机器学习模型的预测性能。

方法

我们从临床数据仓库中提取数据，并开发了三组：第一组，线性回归模型；第二组，省略缺失数据的机器学习模型；第三组，插补缺失数据的机器学习模型。实施了六种机器学习模型，即最小绝对收缩和选择算子（LASSO）、岭回归、弹性网络、随机森林、极端梯度提升（XGBoost）和轻梯度提升机（LightGBM）。将术后6个月测量的第1秒用力呼气量定义为结果。对机器学习模型进行五折交叉验证以进行超参数调整。数据集按70:30的比例分为训练集和测试集。第三组在数据集拆分后进行实施。通过三组中的R2和均方误差（MSE）评估预测性能。

结果

第一组和第三组共纳入1487例患者，第二组纳入896例患者。在第一组中，R2值为0.27，在第二组中，LightGBM是最佳模型，R2值最高为0.5，MSE最低为154.95。在第三组中，LightGBM是最佳模型，R2值最高为0.56，MSE最低为174.07。

结论

LightGBM模型在预测术后肺功能方面表现最佳。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9cce/10323210/c391c7502b24/trd-2022-0048f1.jpg

相似文献

Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models.

Tuberc Respir Dis (Seoul). 2023 Jul;86(3):203-215. doi: 10.4046/trd.2022.0048. Epub 2023 Apr 11.

Assessment and quantification of ovarian reserve on the basis of machine learning models.

Front Endocrinol (Lausanne). 2023 Mar 15;14:1087429. doi: 10.3389/fendo.2023.1087429. eCollection 2023.

Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?

Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.

Development and validation of a machine-learning model for prediction of hypoxemia after extubation in intensive care units.

Ann Transl Med. 2022 May;10(10):577. doi: 10.21037/atm-22-2118.

Prediction of postoperative cardiopulmonary complications after lung resection in a Chinese population: A machine learning-based study.

Front Oncol. 2022 Sep 23;12:1003722. doi: 10.3389/fonc.2022.1003722. eCollection 2022.

Development of a machine learning model to predict the risk of late cardiogenic shock in patients with ST-segment elevation myocardial infarction.

Ann Transl Med. 2021 Jul;9(14):1162. doi: 10.21037/atm-21-2905.

Preoperative prediction of vessel invasion in locally advanced gastric cancer based on computed tomography radiomics and machine learning.

Oncol Lett. 2023 May 22;26(1):293. doi: 10.3892/ol.2023.13879. eCollection 2023 Jul.

Prediction Model of Osteonecrosis of the Femoral Head After Femoral Neck Fracture: Machine Learning-Based Development and Validation Study.

JMIR Med Inform. 2021 Nov 19;9(11):e30079. doi: 10.2196/30079.

Predictive modeling of blood pressure during hemodialysis: a comparison of linear model, random forest, support vector regression, XGBoost, LASSO regression and ensemble method.

Comput Methods Programs Biomed. 2020 Oct;195:105536. doi: 10.1016/j.cmpb.2020.105536. Epub 2020 May 22.

Prediction of recurrence of ischemic stroke within 1 year of discharge based on machine learning MRI radiomics.

Front Neurosci. 2023 May 4;17:1110579. doi: 10.3389/fnins.2023.1110579. eCollection 2023.

引用本文的文献

Machine learning-based estimation of patient body weight from radiation dose metrics in computed tomography.

J Appl Clin Med Phys. 2024 Sep;25(9):e14467. doi: 10.1002/acm2.14467. Epub 2024 Jul 23.

The role of the diaphragm in prediction of respiratory function in the immediate postoperative period in lung cancer patients using a machine learning model.

World J Surg Oncol. 2023 Dec 22;21(1):393. doi: 10.1186/s12957-023-03278-1.

本文引用的文献

The Value of Residual Volume/Total Lung Capacity as an Indicator for Predicting Postoperative Lung Function in Non-Small Lung Cancer.

J Clin Med. 2021 Sep 15;10(18):4159. doi: 10.3390/jcm10184159.

Early outcome prediction for out-of-hospital cardiac arrest with initial shockable rhythm using machine learning models.

Resuscitation. 2021 Jan;158:49-56. doi: 10.1016/j.resuscitation.2020.11.020. Epub 2020 Nov 20.

Video-Assisted Thoracoscopic Surgery vs Thoracotomy for Non-Small Cell Lung Cancer Greater Than 5 cm: Is VATS a feasible approach for large tumors?

J Cardiothorac Surg. 2020 Sep 18;15(1):261. doi: 10.1186/s13019-020-01305-w.

SICE: an improved missing data imputation technique.

J Big Data. 2020;7(1):37. doi: 10.1186/s40537-020-00313-w. Epub 2020 Jun 12.

Predicting Postoperative Lung Function Following Lung Cancer Resection: A Systematic Review and Meta-analysis.

EClinicalMedicine. 2019 Sep 10;15:7-13. doi: 10.1016/j.eclinm.2019.08.015. eCollection 2019 Oct.

A machine learning based model for Out of Hospital cardiac arrest outcome classification and sensitivity analysis.

Resuscitation. 2019 May;138:134-140. doi: 10.1016/j.resuscitation.2019.03.012. Epub 2019 Mar 15.

Global Epidemiology of Lung Cancer.

Ann Glob Health. 2019 Jan 22;85(1):8. doi: 10.5334/aogh.2419.

Lung function in the late postoperative phase and influencing factors in patients undergoing pulmonary lobectomy.

J Thorac Dis. 2018 May;10(5):2916-2923. doi: 10.21037/jtd.2018.05.27.

Statistical data preparation: management of missing values and outliers.

Korean J Anesthesiol. 2017 Aug;70(4):407-411. doi: 10.4097/kjae.2017.70.4.407. Epub 2017 Jul 27.

Machine Learning and Prediction in Medicine - Beyond the Peak of Inflated Expectations.

N Engl J Med. 2017 Jun 29;376(26):2507-2509. doi: 10.1056/NEJMp1702071.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用机器学习模型预测肺癌患者术后肺功能

Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献