Department of Epidemiology and Biostatistics, School of Health, Health Sciences Research Center, Addiction Institute, Mazandaran University of Medical Sciences, Sari, Iran.
Gastrointestinal Cancer Research Center, Non-Communicable Diseases Research Institute, Mazandaran University of Medical Sciences, Sari, Iran.
BMC Med Inform Decis Mak. 2024 Oct 28;24(1):314. doi: 10.1186/s12911-024-02725-7.
Breast cancer is the most common cancer in women. Previous studies have investigated estimating and predicting the proportional hazard rates and survival in breast cancer. This study deals with predicting accelerated hazards (AH) rate based on age categories in breast cancer patients using deep learning methods. The AH has a time-dependent structure whose rate changes according to time and variable effects. We have collected data related to 1225 female patients with breast cancer at the Mandarin University of Medical Sciences. The patients' demographic and clinical characteristics including family history, age, history of tobacco use, hysterectomy, first menstruation age, gravida, number of breastfeeding, disease grade, marital status, and survival status have been recorded. Initially, we dealt with predicting three age groups of patients: ≤ 40, 41-60, and ≥ 61 years. Then, the prediction of accelerated risk value based on age categories for each breast cancer patient through deep learning and the importance of variables using LightGBM is discussed. Improving clinical management and treatment of breast cancer requires advanced methods such as time-dependent AH calculation. When the behavioral effect is assumed as a time scale change between hazard functions, the AH model is more appropriate for randomized clinical trials. The study results demonstrate the proper performance of the proposed model for predicting AH by age categories based on breast cancer patients' demographic and clinical characteristics.
乳腺癌是女性最常见的癌症。以前的研究已经探讨了估计和预测乳腺癌的比例风险率和生存率。本研究使用深度学习方法处理基于年龄类别预测乳腺癌患者的加速风险(AH)率。AH 具有时变结构,其速率根据时间和变量效应而变化。我们收集了与马赞达兰医科大学 1225 名女性乳腺癌患者相关的数据。患者的人口统计学和临床特征包括家族史、年龄、吸烟史、子宫切除术、初潮年龄、孕次、母乳喂养次数、疾病分级、婚姻状况和生存状况。最初,我们处理了预测三个年龄段的患者:≤40 岁、41-60 岁和≥61 岁。然后,通过深度学习和 LightGBM 讨论了基于年龄类别预测每位乳腺癌患者的加速风险值以及变量的重要性。改进乳腺癌的临床管理和治疗需要先进的方法,如计算时变 AH。当行为效应被假设为危险函数之间的时间尺度变化时,AH 模型更适合于随机临床试验。研究结果表明,基于乳腺癌患者的人口统计学和临床特征,该模型在预测年龄类别方面具有适当的 AH 预测性能。