Suppr超能文献

基于临床数据,运用机器学习方法对卵巢癌进行早期预测和风险分层。

Early prediction and risk stratification of ovarian cancer based on clinical data using machine learning approaches.

作者信息

Gui Ting, Cao Dongyan, Yang Jiaxin, Wei Zhenhao, Xie Jiatong, Wang Wei, Xiang Yang, Peng Peng

机构信息

Department of Obstetrics and Gynecology, National Clinical Research Center for Obstetric and Gynecologic Diseases, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, People's Republic of China.

Goodwill Hessian Health Technology Co. Ltd, Beijing, People's Republic of China.

出版信息

J Gynecol Oncol. 2025 Jul;36(4):e53. doi: 10.3802/jgo.2025.36.e53. Epub 2024 Dec 17.

Abstract

OBJECTIVE

Our study was aimed to construct a predictive model to advance ovarian cancer diagnosis by machine learning.

METHODS

A retrospective analysis of patients with pelvic/adnexal/ovarian mass was performed. Potential features related to ovarian cancer were obtained as many as possible. The optimal machine learning algorithm was selected among six candidates through 5-fold cross validation. Top 20 features having the most powerful predictive significance were ranked by Shapley Additive Interpretation (Shap) method. Clinical validation was further performed to confirm whether our model could advance diagnosis of ovarian cancer.

RESULTS

A total of 9,799 patients were collected. The inclusion criteria included age >18 years old, the first diagnosis being pelvic/adnexal/ovarian mass of undetermined significance, and pathological report indispensable. Four hundred and thirty-eight dimensional features were obtained after filtration. LightGBM showed the best performance with accuracy 88%. Among the top 20 features, 55% belonged to laboratory test report, 35% came from imaging examination report, and 10% were attributed to basic demographics and main symptom. Age, CA125, and risk of ovarian malignancy algorithm were the top three. Our predictive model performed stably in testing and clinical validation datasets, and was found to advance the diagnosis of ovarian cancer about 17 days before clinical pathological examination.

CONCLUSION

LightGBM was the optimal algorithm for our predictive model with accuracy of 88%. Laboratory test and imaging examination played essential roles in diagnosing ovarian cancer. Our model could advance the diagnosis of ovarian cancer before clinical pathological examination.

摘要

目的

本研究旨在构建一种预测模型,通过机器学习来改进卵巢癌的诊断。

方法

对盆腔/附件/卵巢肿块患者进行回顾性分析。尽可能获取与卵巢癌相关的潜在特征。通过五折交叉验证从六个候选算法中选择最优的机器学习算法。采用Shapley值法(Shap)对具有最强预测意义的前20个特征进行排序。进一步进行临床验证,以确认我们的模型是否能改进卵巢癌的诊断。

结果

共收集9799例患者。纳入标准包括年龄>18岁、首次诊断为意义未明的盆腔/附件/卵巢肿块且有不可或缺的病理报告。过滤后获得438个维度的特征。LightGBM表现最佳,准确率达88%。在前20个特征中,55%属于实验室检查报告,35%来自影像学检查报告,10%归因于基本人口统计学和主要症状。年龄、CA125和卵巢恶性肿瘤风险算法位列前三。我们的预测模型在测试和临床验证数据集中表现稳定,发现在临床病理检查前约17天就能改进卵巢癌的诊断。

结论

LightGBM是我们预测模型的最优算法,准确率为88%。实验室检查和影像学检查在卵巢癌诊断中起重要作用。我们的模型能在临床病理检查前改进卵巢癌的诊断。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7711/12226325/406ed72448bf/jgo-36-e53-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验