• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

原发性肝癌远处转移机器学习模型的开发与验证:一项基于人群的研究。

Development and validation of machine learning models for distant metastasis of primary hepatic carcinoma: a population-based study.

作者信息

Lu Cong, He Ying, Chen Chun-Ru, Wu Lun, Song Dan, Wang Chen-Hong, Zhang Le-Qing, Miao Jing-Yi, Zheng Yong-Bin, Wang Wei

机构信息

Department of Gastrointestinal Surgery, Renmin Hospital of Wuhan University, Wuhan, 430060, Hubei Province, China.

Department of Stomatology, Renmin Hospital of Wuhan University, Wuhan, 430060, China.

出版信息

Discov Oncol. 2025 Jun 16;16(1):1120. doi: 10.1007/s12672-025-02894-5.

DOI:10.1007/s12672-025-02894-5
PMID:40522547
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12170961/
Abstract

BACKGROUND

Primary liver cancer is the sixth most common cancer globally and ranks third in cancer-related mortality. Patients with distant metastasis (PLCDM) have particularly low survival rates and are more difficult to treat. This study aims to identify risk factors associated with distant metastasis and overall survival (OS) in primary liver cancer and to determine the optimal predictive models using machine learning.

METHODS

We extracted data from the SEER database (Incidence-SEER Research Data, 17 Registries, Nov 2022 Sub (2000-2020)) and identified risk factors for distant metastasis using logistic regression. Eight machine learning models were constructed using the "tidymodels" package in R and evaluated based on ROC curves, AUC, and accuracy. Cox regression was used to identify risk factors for OS, and Cox and Random Survival Forest (RSF) models were compared using time-dependent ROC curves. The best-performing model was interpreted using Shapley analysis. We also developed user-friendly web applications using the "shiny" package in R for clinical use.

RESULTS

Multivariate analysis identified grade, T stage, N stage, tumor size, and surgery as independent risk factors for PLCDM. The Random Forest (RF) model showed the best performance with AUC values of 0.836, 0.817, and 0.846 in the training, internal validation, and external validation cohorts, respectively, and favorable Brier scores and accuracy. Shapley analysis ranked the risk factors by contribution as surgery, T stage, tumor size, N stage, and grade. Cox regression identified grade, surgery, and T stage as independent prognostic factors for OS. The Cox model outperformed the RSF model in time-dependent ROC analysis. Calibration and decision curve analysis (DCA) further confirmed its strong predictive performance and clinical utility. Shapley analysis ranked the risk factors as grade, surgery, and T stage.

CONCLUSIONS

We successfully constructed and validated optimal models for predicting PLCDM and its prognosis. These models provide valuable tools to guide clinical decision-making for PLCDM.

摘要

背景

原发性肝癌是全球第六大常见癌症,在癌症相关死亡率中排名第三。发生远处转移的原发性肝癌患者(PLCDM)生存率特别低,治疗难度更大。本研究旨在确定原发性肝癌远处转移和总生存期(OS)的相关危险因素,并使用机器学习确定最佳预测模型。

方法

我们从SEER数据库(发病率-SEER研究数据,17个登记处,2022年11月更新版(2000 - 2020年))中提取数据,并使用逻辑回归确定远处转移的危险因素。在R语言中使用“tidymodels”包构建了八个机器学习模型,并基于ROC曲线、AUC和准确性进行评估。使用Cox回归确定OS的危险因素,并使用时间依赖性ROC曲线比较Cox模型和随机生存森林(RSF)模型。使用Shapley分析对表现最佳的模型进行解释。我们还使用R语言中的“shiny”包开发了便于临床使用的用户友好型网络应用程序。

结果

多变量分析确定分级、T分期、N分期、肿瘤大小和手术是PLCDM的独立危险因素。随机森林(RF)模型表现最佳,在训练队列、内部验证队列和外部验证队列中的AUC值分别为0.836、0.817和0.846,且具有良好的Brier评分和准确性。Shapley分析按贡献程度对危险因素进行排序,依次为手术、T分期、肿瘤大小、N分期和分级。Cox回归确定分级、手术和T分期是OS的独立预后因素。在时间依赖性ROC分析中,Cox模型优于RSF模型。校准和决策曲线分析(DCA)进一步证实了其强大的预测性能和临床实用性。Shapley分析将危险因素排序为分级、手术和T分期。

结论

我们成功构建并验证了预测PLCDM及其预后的最佳模型。这些模型为指导PLCDM的临床决策提供了有价值的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/8507cf81bf8f/12672_2025_2894_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/a9ea53f23543/12672_2025_2894_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/7d9fa0fc1dda/12672_2025_2894_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/5f01ce532b83/12672_2025_2894_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/dea58b478ff8/12672_2025_2894_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/1d33244170c6/12672_2025_2894_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/8a8118dd47de/12672_2025_2894_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/8507cf81bf8f/12672_2025_2894_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/a9ea53f23543/12672_2025_2894_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/7d9fa0fc1dda/12672_2025_2894_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/5f01ce532b83/12672_2025_2894_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/dea58b478ff8/12672_2025_2894_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/1d33244170c6/12672_2025_2894_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/8a8118dd47de/12672_2025_2894_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f18/12170961/8507cf81bf8f/12672_2025_2894_Fig7_HTML.jpg

相似文献

1
Development and validation of machine learning models for distant metastasis of primary hepatic carcinoma: a population-based study.原发性肝癌远处转移机器学习模型的开发与验证:一项基于人群的研究。
Discov Oncol. 2025 Jun 16;16(1):1120. doi: 10.1007/s12672-025-02894-5.
2
An explainable machine learning model for predicting the risk of distant metastasis in intrahepatic cholangiocarcinoma: a population-based cohort study.一种用于预测肝内胆管癌远处转移风险的可解释机器学习模型:一项基于人群的队列研究。
Discov Oncol. 2025 Jun 18;16(1):1140. doi: 10.1007/s12672-025-02952-y.
3
Dynamic nomogram for predicting the overall survival and cancer-specific survival of patients with gastrointestinal neuroendocrine tumor: a SEER-based retrospective cohort study and external validation.预测胃肠道神经内分泌肿瘤患者总生存期和癌症特异性生存期的动态列线图:一项基于监测、流行病学和最终结果(SEER)数据库的回顾性队列研究及外部验证
Front Oncol. 2025 Jun 4;15:1594591. doi: 10.3389/fonc.2025.1594591. eCollection 2025.
4
Development and interpretation of machine learning-based prognostic models for predicting high-risk prognostic pathological components in pulmonary nodules: integrating clinical features, serum tumor marker and imaging features.基于机器学习的预测肺结节高危预后病理成分的预后模型的开发与解读:整合临床特征、血清肿瘤标志物和影像特征
J Cancer Res Clin Oncol. 2025 Jun 17;151(6):190. doi: 10.1007/s00432-025-06241-7.
5
Molecular feature-based classification of retroperitoneal liposarcoma: a prospective cohort study.基于分子特征的腹膜后脂肪肉瘤分类:一项前瞻性队列研究。
Elife. 2025 May 23;14:RP100887. doi: 10.7554/eLife.100887.
6
Trends in incidence, mortality, and conditional survival of anaplastic thyroid cancer over the last two decades in the USA.美国过去二十年间间变性甲状腺癌的发病率、死亡率及条件生存率趋势
Front Endocrinol (Lausanne). 2025 Jun 4;16:1585679. doi: 10.3389/fendo.2025.1585679. eCollection 2025.
7
Interventions for fertility preservation in women with cancer undergoing chemotherapy.对接受化疗的癌症女性进行生育力保存的干预措施。
Cochrane Database Syst Rev. 2025 Jun 19;6:CD012891. doi: 10.1002/14651858.CD012891.pub2.
8
Development and validation of novel machine learning-based prognostic models and propensity score matching for comparison of surgical approaches in mucinous breast cancer.基于新型机器学习的预后模型的开发与验证以及倾向评分匹配用于黏液性乳腺癌手术方法比较
Front Endocrinol (Lausanne). 2025 Jun 3;16:1557858. doi: 10.3389/fendo.2025.1557858. eCollection 2025.
9
Risk factors and clinical risk stratification of distant metastasis in early-stage lung cancer in never smokers.从不吸烟者早期肺癌远处转移的危险因素及临床风险分层
World J Surg Oncol. 2025 Jun 18;23(1):238. doi: 10.1186/s12957-025-03892-1.
10
Differentiation of early-stage tumors from benign lesions manifesting as pure ground-glass nodule: a clinical prediction study based on AI-derived quantitative parameters.早期肿瘤与表现为纯磨玻璃结节的良性病变的鉴别:基于人工智能衍生定量参数的临床预测研究
Front Oncol. 2025 May 19;15:1573735. doi: 10.3389/fonc.2025.1573735. eCollection 2025.

本文引用的文献

1
The effectiveness and safety of therapies for hepatocellular carcinoma with tumor thrombus in the hepatic vein, inferior vena cave and/or right atrium: a systematic review and single-arm meta-analysis.肝静脉、下腔静脉和/或右心房存在肿瘤血栓的肝细胞癌治疗的有效性和安全性:一项系统评价和单臂荟萃分析。
Expert Rev Anticancer Ther. 2025 May;25(5):561-570. doi: 10.1080/14737140.2025.2489651. Epub 2025 Apr 6.
2
Tumor-associated lymphatic vessel density is a postoperative prognostic biomarker of hepatobiliary cancers: a systematic review and meta-analysis.肿瘤相关淋巴管密度是肝胆癌术后的预后生物标志物:一项系统评价和荟萃分析。
Front Immunol. 2025 Jan 7;15:1519999. doi: 10.3389/fimmu.2024.1519999. eCollection 2024.
3
Portal Venous and Hepatic Arterial Coefficients Predict Post-Hepatectomy Overall and Recurrence-Free Survival in Patients with Hepatocellular Carcinoma: A Retrospective Study.
门静脉和肝动脉系数预测肝细胞癌患者肝切除术后的总生存率和无复发生存率:一项回顾性研究
J Hepatocell Carcinoma. 2024 Jul 9;11:1389-1402. doi: 10.2147/JHC.S462168. eCollection 2024.
4
Local Ablation Therapy for Hepatocellular Carcinoma: Clinical Significance of Tumor Size, Location, and Biology.肝细胞癌的局部消融治疗:肿瘤大小、位置及生物学特性的临床意义
Invest Radiol. 2025 Jan 1;60(1):53-59. doi: 10.1097/RLI.0000000000001100. Epub 2024 Jul 8.
5
Development and validation of an early diagnosis model for bone metastasis in non-small cell lung cancer based on serological characteristics of the bone metastasis mechanism.基于骨转移机制血清学特征的非小细胞肺癌骨转移早期诊断模型的建立与验证
EClinicalMedicine. 2024 Apr 26;72:102617. doi: 10.1016/j.eclinm.2024.102617. eCollection 2024 Jun.
6
Development and validation of a random forest algorithm for source attribution of animal and human Typhimurium and monophasic variants of Typhimurium isolates in England and Wales utilising whole genome sequencing data.利用全基因组测序数据开发并验证一种随机森林算法,用于英格兰和威尔士动物及人类鼠伤寒沙门氏菌以及鼠伤寒沙门氏菌单相变体分离株的溯源分析。
Front Microbiol. 2024 Mar 12;14:1254860. doi: 10.3389/fmicb.2023.1254860. eCollection 2023.
7
Integrated multi-omics profiling to dissect the spatiotemporal evolution of metastatic hepatocellular carcinoma.整合多组学分析以剖析转移性肝细胞癌的时空演变。
Cancer Cell. 2024 Jan 8;42(1):135-156.e17. doi: 10.1016/j.ccell.2023.11.010. Epub 2023 Dec 14.
8
The predictive value of modified-DeepSurv in overall survivals of patients with lung cancer.改良版DeepSurv对肺癌患者总生存期的预测价值。
iScience. 2023 Oct 18;26(11):108200. doi: 10.1016/j.isci.2023.108200. eCollection 2023 Nov 17.
9
Random Forest Modeling of Acute Toxicity in Anal Cancer: Effects of Peritoneal Cavity Contouring Approaches on Model Performance.随机森林模型在分析癌症急性毒性中的应用:腹腔轮廓处理方法对模型性能的影响。
Int J Radiat Oncol Biol Phys. 2024 Feb 1;118(2):554-564. doi: 10.1016/j.ijrobp.2023.08.042. Epub 2023 Aug 22.
10
Development and validation of a diagnostic and prognostic model for lung metastasis of hepatocellular carcinoma: a study based on the SEER database.肝细胞癌肺转移诊断和预后模型的开发与验证:一项基于监测、流行病学和最终结果(SEER)数据库的研究
Front Med (Lausanne). 2023 Jul 19;10:1171023. doi: 10.3389/fmed.2023.1171023. eCollection 2023.