Zhang Wen-Hai, Tan Yang, Huang Zhen, Tan Qi-Xing, Zhang Yue-Mei, Chen Bin-Jie, Wei Chang-Yuan
Department of Breast Surgery, Guangxi Medical University Cancer Hospital, Nanning, Guangxi, China.
Department of Pathology, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China.
Front Oncol. 2024 Jun 14;14:1409273. doi: 10.3389/fonc.2024.1409273. eCollection 2024.
This study aims to develop an artificial intelligence model utilizing clinical blood markers, ultrasound data, and breast biopsy pathological information to predict the distant metastasis in breast cancer patients.
Data from two medical centers were utilized, Clinical blood markers, ultrasound data, and breast biopsy pathological information were separately extracted and selected. Feature dimensionality reduction was performed using Spearman correlation and LASSO regression. Predictive models were constructed using LR and LightGBM machine learning algorithms and validated on internal and external validation sets. Feature correlation analysis was conducted for both models.
The LR model achieved AUC values of 0.892, 0.816, and 0.817 for the training, internal validation, and external validation cohorts, respectively. The LightGBM model achieved AUC values of 0.971, 0.861, and 0.890 for the same cohorts, respectively. Clinical decision curve analysis showed a superior net benefit of the LightGBM model over the LR model in predicting distant metastasis in breast cancer. Key features identified included creatine kinase isoenzyme (CK-MB) and alpha-hydroxybutyrate dehydrogenase.
This study developed an artificial intelligence model using clinical blood markers, ultrasound data, and pathological information to identify distant metastasis in breast cancer patients. The LightGBM model demonstrated superior predictive accuracy and clinical applicability, suggesting it as a promising tool for early diagnosis of distant metastasis in breast cancer.
本研究旨在开发一种利用临床血液标志物、超声数据和乳腺活检病理信息来预测乳腺癌患者远处转移的人工智能模型。
利用来自两个医疗中心的数据,分别提取并选择临床血液标志物、超声数据和乳腺活检病理信息。使用斯皮尔曼相关性和LASSO回归进行特征降维。使用逻辑回归(LR)和LightGBM机器学习算法构建预测模型,并在内部和外部验证集上进行验证。对两个模型都进行了特征相关性分析。
LR模型在训练、内部验证和外部验证队列中的AUC值分别为0.892、0.816和0.817。LightGBM模型在相同队列中的AUC值分别为0.971、0.861和0.890。临床决策曲线分析表明,在预测乳腺癌远处转移方面,LightGBM模型的净效益优于LR模型。确定的关键特征包括肌酸激酶同工酶(CK-MB)和α-羟丁酸脱氢酶。
本研究开发了一种利用临床血液标志物、超声数据和病理信息来识别乳腺癌患者远处转移的人工智能模型。LightGBM模型表现出卓越的预测准确性和临床适用性,表明它是早期诊断乳腺癌远处转移的一种有前景的工具。