基于机器学习的肝癌风险预测模型的构建与评估

Construction and evaluation of a liver cancer risk prediction model based on machine learning.

作者信息

Wang Ying-Ying, Yang Wan-Xia, Du Qia-Jun, Liu Zhen-Hua, Lu Ming-Hua, You Chong-Ge

机构信息

Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China.

出版信息

World J Gastrointest Oncol. 2024 Sep 15;16(9):3839-3850. doi: 10.4251/wjgo.v16.i9.3839.

DOI:10.4251/wjgo.v16.i9.3839

PMID:39350987

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11438789/

Abstract

BACKGROUND

Liver cancer is one of the most prevalent malignant tumors worldwide, and its early detection and treatment are crucial for enhancing patient survival rates and quality of life. However, the early symptoms of liver cancer are often not obvious, resulting in a late-stage diagnosis in many patients, which significantly reduces the effectiveness of treatment. Developing a highly targeted, widely applicable, and practical risk prediction model for liver cancer is crucial for enhancing the early diagnosis and long-term survival rates among affected individuals.

AIM

To develop a liver cancer risk prediction model by employing machine learning techniques, and subsequently assess its performance.

METHODS

In this study, a total of 550 patients were enrolled, with 190 hepatocellular carcinoma (HCC) and 195 cirrhosis patients serving as the training cohort, and 83 HCC and 82 cirrhosis patients forming the validation cohort. Logistic regression (LR), support vector machine (SVM), random forest (RF), and least absolute shrinkage and selection operator (LASSO) regression models were developed in the training cohort. Model performance was assessed in the validation cohort. Additionally, this study conducted a comparative evaluation of the diagnostic efficacy between the ASAP model and the model developed in this study using receiver operating characteristic curve, calibration curve, and decision curve analysis (DCA) to determine the optimal predictive model for assessing liver cancer risk.

RESULTS

Six variables including age, white blood cell, red blood cell, platelet counts, alpha-fetoprotein and protein induced by vitamin K absence or antagonist II levels were used to develop LR, SVM, RF, and LASSO regression models. The RF model exhibited superior discrimination, and the area under curve of the training and validation sets was 0.969 and 0.858, respectively. These values significantly surpassed those of the LR (0.850 and 0.827), SVM (0.860 and 0.803), LASSO regression (0.845 and 0.831), and ASAP (0.866 and 0.813) models. Furthermore, calibration and DCA indicated that the RF model exhibited robust calibration and clinical validity.

CONCLUSION

The RF model demonstrated excellent prediction capabilities for HCC and can facilitate early diagnosis of HCC in clinical practice.

摘要

背景

肝癌是全球最常见的恶性肿瘤之一，其早期检测和治疗对于提高患者生存率和生活质量至关重要。然而，肝癌的早期症状往往不明显，导致许多患者在晚期才被诊断出来，这显著降低了治疗效果。开发一种针对肝癌的高度靶向、广泛适用且实用的风险预测模型对于提高受影响个体的早期诊断率和长期生存率至关重要。

目的

运用机器学习技术开发肝癌风险预测模型，并随后评估其性能。

方法

在本研究中，共纳入550例患者，其中190例肝细胞癌（HCC）患者和195例肝硬化患者作为训练队列，83例HCC患者和82例肝硬化患者组成验证队列。在训练队列中开发逻辑回归（LR）、支持向量机（SVM）、随机森林（RF）和最小绝对收缩和选择算子（LASSO）回归模型。在验证队列中评估模型性能。此外，本研究使用受试者工作特征曲线、校准曲线和决策曲线分析（DCA）对ASAP模型和本研究开发的模型之间的诊断效能进行了比较评估，以确定评估肝癌风险的最佳预测模型。

结果

使用年龄、白细胞、红细胞、血小板计数、甲胎蛋白和维生素K缺乏或拮抗剂II诱导蛋白水平这六个变量开发了LR、SVM、RF和LASSO回归模型。RF模型表现出卓越的区分能力，训练集和验证集的曲线下面积分别为0.969和0.858。这些值显著超过了LR（0.850和0.827）、SVM（0.860和0.803）、LASSO回归（0.845和0.831）以及ASAP（0.86和0.813）模型。此外，校准和DCA表明RF模型具有稳健的校准和临床有效性。

结论

RF模型对HCC显示出出色的预测能力，可在临床实践中促进HCC的早期诊断。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0371/11438789/408bdc063990/WJGO-16-3839-g001.jpg

相似文献

Construction and evaluation of a liver cancer risk prediction model based on machine learning.基于机器学习的肝癌风险预测模型的构建与评估

World J Gastrointest Oncol. 2024 Sep 15;16(9):3839-3850. doi: 10.4251/wjgo.v16.i9.3839.

Construction of the prediction model for multiple myeloma based on machine learning.基于机器学习的多发性骨髓瘤预测模型的构建。

Int J Lab Hematol. 2024 Oct;46(5):918-926. doi: 10.1111/ijlh.14324. Epub 2024 May 31.

Development of a machine learning-based model to predict prognosis of alpha-fetoprotein-positive hepatocellular carcinoma.基于机器学习的模型预测 AFP 阳性肝细胞癌预后的研究

J Transl Med. 2024 May 13;22(1):455. doi: 10.1186/s12967-024-05203-w.

[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型

Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.

[Construction of a predictive model for in-hospital mortality of sepsis patients in intensive care unit based on machine learning].基于机器学习构建重症监护病房脓毒症患者院内死亡率预测模型

Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2023 Jul;35(7):696-701. doi: 10.3760/cma.j.cn121430-20221219-01104.

Predicting Prostate Cancer Upgrading of Biopsy Gleason Grade Group at Radical Prostatectomy Using Machine Learning-Assisted Decision-Support Models.使用机器学习辅助决策支持模型预测前列腺癌根治术时活检Gleason分级组的升级情况。

Cancer Manag Res. 2020 Dec 22;12:13099-13110. doi: 10.2147/CMAR.S286167. eCollection 2020.

Construction and evaluation of a mortality prediction model for patients with acute kidney injury undergoing continuous renal replacement therapy based on machine learning algorithms.基于机器学习算法的行连续性肾脏替代治疗的急性肾损伤患者死亡率预测模型的构建与评估。

Ann Med. 2024 Dec;56(1):2388709. doi: 10.1080/07853890.2024.2388709. Epub 2024 Aug 19.

Derivation and validation of machine learning models for preoperative estimation of microvascular invasion risk in hepatocellular carcinoma.用于术前评估肝细胞癌微血管侵犯风险的机器学习模型的推导与验证

Ann Transl Med. 2023 Mar 31;11(6):249. doi: 10.21037/atm-22-2828. Epub 2023 Jan 6.

Application of machine learning model in predicting the likelihood of blood transfusion after hip fracture surgery.机器学习模型在预测髋部骨折手术后输血可能性中的应用。

Aging Clin Exp Res. 2023 Nov;35(11):2643-2656. doi: 10.1007/s40520-023-02550-4. Epub 2023 Sep 21.

Comparison of Machine Learning Algorithms and Nomogram Construction for Diabetic Retinopathy Prediction in Type 2 Diabetes Mellitus Patients.机器学习算法与列线图构建在 2 型糖尿病患者糖尿病视网膜病变预测中的比较。

Ophthalmic Res. 2024;67(1):537-548. doi: 10.1159/000541294. Epub 2024 Sep 4.

引用本文的文献

Explainable attention-enhanced heuristic paradigm for multi-view prognostic risk score development in hepatocellular carcinoma.用于肝细胞癌多视图预后风险评分开发的可解释注意力增强启发式范式

Hepatol Int. 2025 Mar 16. doi: 10.1007/s12072-025-10793-8.

本文引用的文献

From patterns to patients: Advances in clinical machine learning for cancer diagnosis, prognosis, and treatment.从模式到患者：癌症诊断、预后和治疗的临床机器学习进展。

Cell. 2023 Apr 13;186(8):1772-1791. doi: 10.1016/j.cell.2023.01.035. Epub 2023 Mar 10.

Utility of combining PIVKA-II and AFP in the surveillance and monitoring of hepatocellular carcinoma in the Asia-Pacific region.联合检测 PIVKA-II 和 AFP 在亚太地区肝细胞癌监测和随访中的应用。

Clin Mol Hepatol. 2023 Apr;29(2):277-292. doi: 10.3350/cmh.2022.0212. Epub 2023 Jan 30.

Clinical value of serum AFP and PIVKA-II for diagnosis, treatment and prognosis of hepatocellular carcinoma.血清 AFP 和 PIVKA-II 对肝细胞癌的诊断、治疗和预后的临床价值。

J Clin Lab Anal. 2023 Jan;37(1):e24823. doi: 10.1002/jcla.24823. Epub 2022 Dec 29.

The causal relationship between white blood cell counts and hepatocellular carcinoma: a Mendelian randomization study.白细胞计数与肝细胞癌之间的因果关系：一项孟德尔随机化研究。

Eur J Med Res. 2022 Dec 6;27(1):278. doi: 10.1186/s40001-022-00900-y.

Global burden of primary liver cancer in 2020 and predictions to 2040.2020 年全球原发性肝癌负担及 2040 年预测。

J Hepatol. 2022 Dec;77(6):1598-1606. doi: 10.1016/j.jhep.2022.08.021. Epub 2022 Oct 5.

Hepatocellular carcinoma.肝细胞癌

Lancet. 2022 Oct 15;400(10360):1345-1362. doi: 10.1016/S0140-6736(22)01200-4. Epub 2022 Sep 6.

Involvement of microRNAs and their potential diagnostic, therapeutic, and prognostic role in hepatocellular carcinoma.微小 RNA 的参与及其在肝细胞癌中的潜在诊断、治疗和预后作用。

J Clin Lab Anal. 2022 Oct;36(10):e24673. doi: 10.1002/jcla.24673. Epub 2022 Aug 29.

Gamma-Glutamyl Transpeptidase-to-Platelet ratio predicts liver fibrosis in patients with concomitant chronic hepatitis B and nonalcoholic fatty liver disease.γ-谷氨酰转肽酶/血小板比值预测合并慢性乙型肝炎和非酒精性脂肪性肝病患者的肝纤维化。

J Clin Lab Anal. 2022 Aug;36(8):e24596. doi: 10.1002/jcla.24596. Epub 2022 Jul 9.

Circulating biomarkers in the diagnosis and management of hepatocellular carcinoma.循环生物标志物在肝细胞癌的诊断和治疗中的应用。

Nat Rev Gastroenterol Hepatol. 2022 Oct;19(10):670-681. doi: 10.1038/s41575-022-00620-y. Epub 2022 Jun 8.

Increased platelet aggregation in patients with decompensated cirrhosis indicates higher risk of further decompensation and death.失代偿期肝硬化患者血小板聚集增加表明进一步恶化和死亡的风险更高。

J Hepatol. 2022 Sep;77(3):660-669. doi: 10.1016/j.jhep.2022.03.009. Epub 2022 Mar 29.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于机器学习的肝癌风险预测模型的构建与评估

Construction and evaluation of a liver cancer risk prediction model based on machine learning.

作者信息

机构信息

出版信息

BACKGROUND

AIM

METHODS

RESULTS

CONCLUSION

背景

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献