Ding Yuanzhao
School of Geography and the Environment, University of Oxford, South Parks Road, Oxford OX1 3QY, UK.
Diseases. 2024 Jun 30;12(7):139. doi: 10.3390/diseases12070139.
In recent years, the escalating environmental challenges have contributed to a rising incidence of cancer. The precise anticipation of cancer incidence and mortality rates has emerged as a pivotal focus in scientific inquiry, exerting a profound impact on the formulation of public health policies. This investigation adopts a pioneering machine learning framework to address this critical issue, utilizing a dataset encompassing 72,591 comprehensive records that include essential variables such as age, case count, population size, race, gender, site, and year of diagnosis. Diverse machine learning algorithms, including decision trees, random forests, logistic regression, support vector machines, and neural networks, were employed in this study. The ensuing analysis revealed testing accuracies of 62.17%, 61.92%, 54.53%, 55.72%, and 62.30% for the respective models. This state-of-the-art model not only enhances our understanding of cancer dynamics but also equips researchers and policymakers with the capability of making meticulous projections concerning forthcoming cancer incidence and mortality rates. Considering sustainability, the application of this advanced machine learning framework emphasizes the importance of judiciously utilizing extensive and intricate databases. By doing so, it facilitates a more sustainable approach to healthcare planning, allowing for informed decision-making that takes into account the long-term ecological and societal impacts of cancer-related policies. This integrative perspective underscores the broader commitment to sustainable practices in both health research and public policy formulation.
近年来,不断升级的环境挑战导致癌症发病率上升。对癌症发病率和死亡率的精确预测已成为科学研究的关键焦点,对公共卫生政策的制定产生了深远影响。本研究采用了一种开创性的机器学习框架来解决这一关键问题,使用了一个包含72591条综合记录的数据集,这些记录包括年龄、病例数、人口规模、种族、性别、发病部位和诊断年份等重要变量。本研究采用了多种机器学习算法,包括决策树、随机森林、逻辑回归、支持向量机和神经网络。随后的分析显示,各模型的测试准确率分别为62.17%、61.92%、54.53%、55.72%和62.30%。这种先进的模型不仅增进了我们对癌症动态的理解,还使研究人员和政策制定者能够对未来的癌症发病率和死亡率做出精确预测。考虑到可持续性,这种先进的机器学习框架的应用强调了明智利用广泛而复杂的数据库的重要性。通过这样做,它促进了一种更具可持续性的医疗保健规划方法,允许在考虑癌症相关政策的长期生态和社会影响的情况下做出明智的决策。这种综合观点强调了在健康研究和公共政策制定中对可持续实践的更广泛承诺。