合成数据生成方法改进了胰腺癌患者术后早期肿瘤复发的风险预测模型。

Synthetic data generation method improves risk prediction model for early tumor recurrence after surgery in patients with pancreatic cancer.

作者信息

Jeong HyeJeong, Lee Jeong-Moo, Kim Hyeong Seok, Chae Hochang, Yoon So Jeong, Shin Sang Hyun, Han In Woong, Heo Jin Seok, Min Ji Hye, Hyun Seung Hyup, Kim Hongbeom

机构信息

Division of Hepatobiliary-Pancreatic Surgery, Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea.

Division of Hepatobiliary-Pancreatic Surgery, Department of Surgery, Daejeon Eulji University Medical Center, Eulji University School of Medicine, Daejeon, South Korea.

出版信息

Sci Rep. 2025 Aug 29;15(1):31885. doi: 10.1038/s41598-025-15800-4.

DOI:10.1038/s41598-025-15800-4

PMID:40883332

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12397232/

Abstract

Pancreatic cancer is aggressive with high recurrence rates, necessitating accurate prediction models for effective treatment planning, particularly for neoadjuvant chemotherapy or upfront surgery. This study explores the use of variational autoencoder (VAE)-generated synthetic data to predict early tumor recurrence (within six months) in pancreatic cancer patients who underwent upfront surgery. Preoperative data of 158 patients between January 2021 and December 2022 was analyzed, and machine learning models-including Logistic Regression, Random Forest (RF), Gradient Boosting Machine (GBM), and Deep Neural Networks (DNN)-were trained on both original and synthetic datasets. The VAE-generated dataset (n = 94) closely matched the original data (p > 0.05) and enhanced model performance, improving accuracy (GBM: 0.81 to 0.87; RF: 0.84 to 0.87) and sensitivity (GBM: 0.73 to 0.91; RF: 0.82 to 0.91). PET/CT-derived metabolic parameters were the strongest predictors, accounting for 54.7% of the model predictive power with maximum standardized uptake value (SUVmax) showing the highest importance (0.182, 95% CI: 0.165-0.199). This study demonstrates that synthetic data can significantly enhance predictive models for pancreatic cancer recurrence, especially in data-limited scenarios, offering a promising strategy for oncology prediction models.

摘要

胰腺癌侵袭性强，复发率高，因此需要准确的预测模型来制定有效的治疗方案，特别是对于新辅助化疗或直接手术。本研究探讨使用变分自编码器（VAE）生成的合成数据来预测接受直接手术的胰腺癌患者的早期肿瘤复发（六个月内）。分析了2021年1月至2022年12月期间158例患者的术前数据，并在原始数据集和合成数据集上训练了机器学习模型，包括逻辑回归、随机森林（RF）、梯度提升机（GBM）和深度神经网络（DNN）。VAE生成的数据集（n = 94）与原始数据紧密匹配（p > 0.05），并提高了模型性能，提高了准确率（GBM：从0.81提高到0.87；RF：从0.84提高到0.87）和灵敏度（GBM：从0.73提高到0.91；RF：从0.82提高到0.91）。PET/CT衍生的代谢参数是最强的预测因子，占模型预测能力的54.7%，最大标准化摄取值（SUVmax）显示出最高的重要性（0.182，95%CI：0.165 - 0.199）。本研究表明，合成数据可以显著增强胰腺癌复发的预测模型，特别是在数据有限的情况下，为肿瘤学预测模型提供了一种有前景的策略。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fbd/12397232/8fd0f02aa411/41598_2025_15800_Fig1_HTML.jpg

相似文献

Synthetic data generation method improves risk prediction model for early tumor recurrence after surgery in patients with pancreatic cancer.

Sci Rep. 2025 Aug 29;15(1):31885. doi: 10.1038/s41598-025-15800-4.

F-18 FDG PET/CT based Preoperative Machine Learning Prediction Models for Evaluating Regional Lymph Node Metastasis Status of Patients with Colon Cancer.

Asian Pac J Cancer Prev. 2025 Jan 1;26(1):85-90. doi: 10.31557/APJCP.2025.26.1.85.

Development of Machine Learning-based Algorithms to Predict the 2- and 5-year Risk of TKA After Tibial Plateau Fracture Treatment.

Clin Orthop Relat Res. 2025 Mar 12. doi: 10.1097/CORR.0000000000003442.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

18F-FDG PET/CT-based deep radiomic models for enhancing chemotherapy response prediction in breast cancer.

Med Oncol. 2025 Aug 11;42(9):425. doi: 10.1007/s12032-025-02982-0.

Preoperative Prediction of Perineural Invasion in Pancreatic Ductal Adenocarcinoma Using Machine Learning Radiomics Based on Contrast-Enhanced CT Imaging.

J Imaging Inform Med. 2024 Nov 11. doi: 10.1007/s10278-024-01325-1.

PET-CT for assessing mediastinal lymph node involvement in patients with suspected resectable non-small cell lung cancer.

Cochrane Database Syst Rev. 2014 Nov 13;2014(11):CD009519. doi: 10.1002/14651858.CD009519.pub2.

A systematic review of positron emission tomography (PET) and positron emission tomography/computed tomography (PET/CT) for the diagnosis of breast cancer recurrence.

Health Technol Assess. 2010 Oct;14(50):1-103. doi: 10.3310/hta14500.

Machine learning based screening of biomarkers associated with cell death and immunosuppression of multiple life stages sepsis populations.

Sci Rep. 2025 Aug 19;15(1):30302. doi: 10.1038/s41598-025-14600-0.

Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.

Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.

本文引用的文献

MRI Apparent Diffusion Coefficient (ADC) as a Biomarker of Tumour Response: Imaging-Pathology Correlation in Patients with Hepatic Metastases from Colorectal Cancer (EORTC 1423).

Cancers (Basel). 2023 Jul 12;15(14):3580. doi: 10.3390/cancers15143580.

Immediate surgery compared with short-course neoadjuvant gemcitabine plus capecitabine, FOLFIRINOX, or chemoradiotherapy in patients with borderline resectable pancreatic cancer (ESPAC5): a four-arm, multicentre, randomised, phase 2 trial.

Lancet Gastroenterol Hepatol. 2023 Feb;8(2):157-168. doi: 10.1016/S2468-1253(22)00348-X. Epub 2022 Dec 12.

Dynamic recurrence risk and adjuvant chemotherapy benefit prediction by ctDNA in resected NSCLC.

Nat Commun. 2021 Nov 19;12(1):6770. doi: 10.1038/s41467-021-27022-z.

New criteria of resectability for pancreatic cancer: A position paper by the Japanese Society of Hepato-Biliary-Pancreatic Surgery (JSHBPS).

J Hepatobiliary Pancreat Sci. 2022 Jul;29(7):725-731. doi: 10.1002/jhbp.1049. Epub 2021 Oct 20.

Metabolic activity by FDG-PET/CT after neoadjuvant chemotherapy in borderline resectable and locally advanced pancreatic cancer and association with survival.

Br J Surg. 2021 Dec 17;109(1):61-70. doi: 10.1093/bjs/znab229.

Dynamic Perfusion Computed Tomography and Apparent Diffusion Coefficient as Potential Markers for Poorly Differentiated Pancreatic Adenocarcinoma.

Dig Surg. 2021;38(2):128-135. doi: 10.1159/000511973. Epub 2021 Jan 27.

High CA19-9 level in resectable pancreatic cancer is a potential indication of neoadjuvant treatment.

Pancreatology. 2021 Jan;21(1):130-137. doi: 10.1016/j.pan.2020.11.026. Epub 2020 Dec 4.

Borderline Resectable and Locally Advanced Pancreatic Cancer: FDG PET/MRI and CT Tumor Metrics for Assessment of Pathologic Response to Neoadjuvant Therapy and Prediction of Survival.

AJR Am J Roentgenol. 2021 Sep;217(3):730-740. doi: 10.2214/AJR.20.24567. Epub 2020 Oct 21.

Preoperative carbohydrate antigen 19-9 and standard uptake value of positron emission tomography-computed tomography as prognostic markers in patients with pancreatic ductal adenocarcinoma.

J Hepatobiliary Pancreat Sci. 2022 Oct;29(10):1133-1141. doi: 10.1002/jhbp.845. Epub 2020 Nov 10.

Total Neoadjuvant Therapy for Operable Pancreatic Cancer.

Ann Surg Oncol. 2021 Apr;28(4):2246-2256. doi: 10.1245/s10434-020-09149-3. Epub 2020 Sep 30.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

合成数据生成方法改进了胰腺癌患者术后早期肿瘤复发的风险预测模型。

Synthetic data generation method improves risk prediction model for early tumor recurrence after surgery in patients with pancreatic cancer.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献