• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过整合环境、社会经济和卫生系统预测因素建立的用于伊朗甲状腺癌流行病学研究的先进机器学习框架。

Advanced machine learning framework for thyroid cancer epidemiology in Iran through integration of environmental socioeconomic and health system predictors.

作者信息

Soleimani Mohsen, Chiti Hossein

机构信息

Metabolic Diseases Research Center, Health and Metabolic Diseases Research Institute, Zanjan University of Medical Sciences, Zanjan, Iran.

出版信息

Sci Rep. 2025 Aug 14;15(1):29901. doi: 10.1038/s41598-025-15324-x.

DOI:10.1038/s41598-025-15324-x
PMID:40813883
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12354753/
Abstract

The global escalation of thyroid cancer (TC) incidence, coupled with pronounced provincial and gender-based disparities in Iran, underscores an urgent public health challenge that remains underexplored through integrative analyses of environmental, socioeconomic, and healthcare factors. This study addresses this critical gap by employing an advanced multi-model machine learning (ML) framework to elucidate the spatiotemporal determinants of TC incidence across Iran's 31 provinces, offering novel insights to inform evidence-based public health strategies. Leveraging data from the Iranian National Population-based Cancer Registry (INPCR) spanning 2014-2017, we synthesized a comprehensive dataset comprising 55 variables sourced from diverse public repositories. Age-standardized incidence rates (ASRs) were meticulously computed and stratified by sex and province, followed by the application of nine ML models for feature selection including Random Forest, XG-Boost, Cat-Boost, and various regression techniques. The significance of identified predictors was rigorously validated using SHAP (SHapley Additive exPlanations) analysis across Random Forest, XG-Boost, and Cat-Boost frameworks. The analysis disclosed considerable variation in TC incidence across the population. The overall four-year ASR was 11.13 per 100,000, with females exhibiting a markedly higher rate of 35.1 per 100,000, significantly exceeding that of males at 9.6 per 100,000. Prominent predictors included Sunshine Duration (SHAP values: - 0.046 overall, 0.015 in females, - 0.005 in males), Provincial-Education-rates, Elevation in Meter, Laboratory-availability, and community Marriage-rates. Significant provincial disparities were observed in the mean ASR of TC across the entire population, notably exemplified by Yazd's elevated mean ASR of 9.2 per 100,000 in contrast to Semnan's markedly lower rate of 1.2 per 100,000 over the period 2014-2017. The ML models demonstrated moderate to robust predictive accuracy (R²: 0.21-0.86), underscoring distinct sex-specific risk profiles. This pioneering study illuminates the pivotal roles of climatic, socioeconomic, healthcare access, and environmental factors in shaping TC incidence in Iran, revealing significant regional and gender-specific variations. These findings advocate for the development of targeted public health interventions aimed at mitigating environmental exposures and rectifying healthcare disparities, thereby enhancing the precision and efficacy of TC prevention strategies.

摘要

甲状腺癌(TC)发病率在全球范围内不断攀升,伊朗国内各省之间以及不同性别之间存在显著差异,这凸显了一个紧迫的公共卫生挑战,而通过对环境、社会经济和医疗因素进行综合分析,这一挑战仍未得到充分探索。本研究通过采用先进的多模型机器学习(ML)框架来解决这一关键差距,以阐明伊朗31个省份TC发病率的时空决定因素,为基于证据的公共卫生策略提供新的见解。利用伊朗国家人口癌症登记处(INPCR)2014 - 2017年的数据,我们合成了一个包含55个变量的综合数据集,这些变量来自不同的公共数据库。精心计算年龄标准化发病率(ASR),并按性别和省份进行分层,随后应用九种ML模型进行特征选择,包括随机森林、XG - Boost、Cat - Boost以及各种回归技术。使用SHAP(Shapley加性解释)分析在随机森林、XG - Boost和Cat - Boost框架下对已识别预测因子的重要性进行了严格验证。分析揭示了整个人口中TC发病率存在显著差异。四年总体ASR为每10万人11.13例,女性发病率明显更高,为每10万人35.1例,显著超过男性的每10万人9.6例。突出的预测因子包括日照时长(SHAP值:总体为 - 0.046,女性为0.015,男性为 - 0.005)、省级教育率、海拔高度(米)、实验室可及性以及社区结婚率。在整个人口中,观察到TC的平均ASR存在显著的省份差异,例如,在2014 - 2017年期间,亚兹德的平均ASR较高,为每10万人9.2例,而塞姆南的发病率则明显较低,为每10万人1.2例。ML模型显示出中等至较强的预测准确性(R²:0.21 - 0.86),突出了不同的性别特异性风险特征。这项开创性研究阐明了气候、社会经济、医疗可及性和环境因素在塑造伊朗TC发病率方面的关键作用,揭示了显著的区域和性别特异性差异。这些发现倡导制定有针对性的公共卫生干预措施,旨在减少环境暴露并纠正医疗差距,从而提高TC预防策略的精准性和有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e158/12354753/373bdb7cd478/41598_2025_15324_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e158/12354753/9c78d210efdf/41598_2025_15324_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e158/12354753/373bdb7cd478/41598_2025_15324_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e158/12354753/9c78d210efdf/41598_2025_15324_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e158/12354753/373bdb7cd478/41598_2025_15324_Fig2_HTML.jpg

相似文献

1
Advanced machine learning framework for thyroid cancer epidemiology in Iran through integration of environmental socioeconomic and health system predictors.通过整合环境、社会经济和卫生系统预测因素建立的用于伊朗甲状腺癌流行病学研究的先进机器学习框架。
Sci Rep. 2025 Aug 14;15(1):29901. doi: 10.1038/s41598-025-15324-x.
2
Racial and socioeconomic disparities in long term survival after surgery and radiation for spinal cord hemangioblastoma.脊髓血管母细胞瘤手术和放疗后长期生存中的种族和社会经济差异。
Sci Rep. 2025 Aug 21;15(1):30704. doi: 10.1038/s41598-025-13330-7.
3
Machine Learning Model for Predicting Coronary Heart Disease Risk: Development and Validation Using Insights From a Japanese Population-Based Study.预测冠心病风险的机器学习模型:基于日本人群研究的见解进行开发与验证
JMIR Cardio. 2025 May 12;9:e68066. doi: 10.2196/68066.
4
Spatial heterogeneity and temporal trends of thyroid cancer incidence in Iran from 2014 to 2017.2014年至2017年伊朗甲状腺癌发病率的空间异质性和时间趋势。
Sci Rep. 2025 Jul 16;15(1):25845. doi: 10.1038/s41598-025-11655-x.
5
Application of machine learning algorithms and SHAP explanations to predict fertility preference among reproductive women in Somalia.机器学习算法和SHAP解释在预测索马里育龄妇女生育偏好中的应用。
Sci Rep. 2025 Jul 20;15(1):26301. doi: 10.1038/s41598-025-04704-y.
6
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
7
Optimized feature selection and advanced machine learning for stroke risk prediction in revascularized coronary artery disease patients.优化特征选择与先进机器学习用于预测冠状动脉疾病血运重建患者的卒中风险
BMC Med Inform Decis Mak. 2025 Jul 24;25(1):276. doi: 10.1186/s12911-025-03116-2.
8
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
9
Development of a machine learning model and a web application for predicting neurological outcome at hospital discharge in spinal cord injury patients.开发用于预测脊髓损伤患者出院时神经功能结局的机器学习模型和网络应用程序。
Spine J. 2025 Jan 31. doi: 10.1016/j.spinee.2025.01.005.
10
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.

本文引用的文献

1
Investigating papillary thyroid cancer risk factors among women living at the central region of Iran: a case-control study.伊朗中部地区女性甲状腺乳头状癌危险因素调查:一项病例对照研究。
BMC Endocr Disord. 2025 Jan 16;25(1):12. doi: 10.1186/s12902-025-01833-3.
2
Association Between Environmental Air Pollution and Thyroid Cancer and Nodules: A Systematic Review.环境空气污染与甲状腺癌及结节之间的关联:一项系统综述。
Thyroid. 2024 Dec;34(12):1451-1464. doi: 10.1089/thy.2024.0317. Epub 2024 Nov 18.
3
Evolving epidemiological patterns of thyroid cancer and estimates of overdiagnosis in 2013-17 in 63 countries worldwide: a population-based study.
全球 63 个国家 2013-2017 年甲状腺癌的流行模式演变及过度诊断估计:一项基于人群的研究。
Lancet Diabetes Endocrinol. 2024 Nov;12(11):824-836. doi: 10.1016/S2213-8587(24)00223-7. Epub 2024 Oct 7.
4
Reproductive Factors and Thyroid Cancer Risk: The Multiethnic Cohort Study.生殖因素与甲状腺癌风险:多民族队列研究。
J Womens Health (Larchmt). 2024 Sep;33(9):1158-1165. doi: 10.1089/jwh.2023.0947. Epub 2024 May 22.
5
Impacts of socioeconomic and environmental factors on neoplasms incidence rates using machine learning and GIS: a cross-sectional study in Iran.利用机器学习和 GIS 分析社会经济和环境因素对肿瘤发病率的影响:伊朗的一项横断面研究。
Sci Rep. 2024 May 8;14(1):10604. doi: 10.1038/s41598-024-61397-5.
6
Correlation between socioeconomic indices and epidemiological indices of thyroid cancer from 1990 to 2019 year: a global ecologic study.1990 年至 2019 年甲状腺癌的社会经济指标与流行病学指标的相关性:一项全球生态学研究。
BMC Cancer. 2024 Apr 15;24(1):467. doi: 10.1186/s12885-024-12176-y.
7
Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries.2022 年全球癌症统计数据:全球 185 个国家和地区 36 种癌症的发病率和死亡率全球估计数。
CA Cancer J Clin. 2024 May-Jun;74(3):229-263. doi: 10.3322/caac.21834. Epub 2024 Apr 4.
8
A gender-specific geodatabase of five cancer types with the highest frequency of occurrence in Iran.伊朗高发五种癌症类型的性别特异性地理数据库。
BMC Res Notes. 2024 Mar 19;17(1):83. doi: 10.1186/s13104-024-06737-4.
9
A narrative review of deep learning in thyroid imaging: current progress and future prospects.甲状腺成像中深度学习的叙述性综述:当前进展与未来前景
Quant Imaging Med Surg. 2024 Feb 1;14(2):2069-2088. doi: 10.21037/qims-23-908. Epub 2024 Jan 19.
10
Exploring the geospatial epidemiology of breast cancer in Iran: identifying significant risk factors and spatial patterns for evidence-based prevention strategies.探讨伊朗乳腺癌的地理流行病学:确定基于证据的预防策略的重要风险因素和空间模式。
BMC Cancer. 2023 Dec 11;23(1):1219. doi: 10.1186/s12885-023-11555-1.