Suppr超能文献

机器学习可预测新发糖尿病患者患胰腺癌的风险。

Machine Learning Predicts Patients With New-onset Diabetes at Risk of Pancreatic Cancer.

作者信息

Khan Salman, Bhushan Bharath

机构信息

Department of Medicine, West Virginia University School of Medicine, West Virginia University, Morgantown, WV.

Northeast Ohio Medical University, Rootstown, OH.

出版信息

J Clin Gastroenterol. 2024 Aug 1;58(7):681-691. doi: 10.1097/MCG.0000000000001897.

Abstract

BACKGROUND

New-onset diabetes represent a high-risk cohort to screen for pancreatic cancer.

GOALS

Develop a machine model to predict pancreatic cancer among patients with new-onset diabetes.

STUDY

A retrospective cohort of patients with new-onset diabetes was assembled from multiple health care networks in the United States. An XGBoost machine learning model was designed from a portion of this cohort (the training set) and tested on the remaining part of the cohort (the test set). Shapley values were used to explain the XGBoost's model features. Model performance was compared with 2 contemporary models designed to predict pancreatic cancer among patients with new-onset diabetes.

RESULTS

In the test set, the XGBoost model had an area under the curve of 0.80 (0.76 to 0.85) compared with 0.63 and 0.68 for other models. Using cutoffs based on the Youden index, the sensitivity of the XGBoost model was 75%, the specificity was 70%, the accuracy was 70%, the positive predictive value was 1.2%, and the negative predictive value was >99%. The XGBoost model obtained a positive predictive value of at least 2.5% with a sensitivity of 38%. The XGBoost model was the only model that detected at least 50% of patients with cancer one year after the onset of diabetes. All 3 models had similar features that predicted pancreatic cancer, including older age, weight loss, and the rapid destabilization of glucose homeostasis.

CONCLUSION

Machine learning models isolate a high-risk cohort from those with new-onset diabetes at risk for pancreatic cancer.

摘要

背景

新发糖尿病患者是筛查胰腺癌的高危人群。

目标

开发一种机器模型,用于预测新发糖尿病患者患胰腺癌的风险。

研究

从美国多个医疗保健网络中选取了一组新发糖尿病患者作为回顾性队列。基于该队列的一部分(训练集)设计了一个XGBoost机器学习模型,并在队列的其余部分(测试集)上进行测试。使用Shapley值来解释XGBoost模型的特征。将该模型的性能与另外两个旨在预测新发糖尿病患者患胰腺癌风险的当代模型进行比较。

结果

在测试集中,XGBoost模型的曲线下面积为0.80(0.76至0.85),而其他两个模型分别为0.63和0.68。根据约登指数确定临界值,XGBoost模型的灵敏度为75%,特异度为70%,准确度为70%,阳性预测值为1.2%,阴性预测值>99%(接近100%)。XGBoost模型在灵敏度为38%时获得了至少2.5%的阳性预测值。XGBoost模型是唯一一个在糖尿病发病一年后能检测出至少50%癌症患者的模型。所有三个模型都有类似的预测胰腺癌的特征,包括年龄较大、体重减轻和血糖稳态的快速不稳定。

结论

机器学习模型从有患胰腺癌风险的新发糖尿病患者中识别出了高危人群。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验