Suppr超能文献

一种用于降尺度癌症数据的多约束蒙特卡罗模拟方法。

A multi-constraint Monte Carlo Simulation approach to downscaling cancer data.

作者信息

Liu Lingbo, Cowan Lauren, Wang Fahui, Onega Tracy

机构信息

Center for Geographic Analysis, Harvard University, MA, 02138, USA.

Department of Population Health Sciences, University of Utah, Huntsman Cancer Institute, Salt Lake City, UT, 84112, USA.

出版信息

Health Place. 2025 Jan;91:103411. doi: 10.1016/j.healthplace.2024.103411. Epub 2025 Jan 6.

Abstract

This study employs an innovative multi-constraint Monte Carlo simulation method to estimate suppressed county-level cancer counts for population subgroups and extend the downscaling from county to ZIP Code Tabulation Areas (ZCTA) in the U.S. Given the known cancer counts at a higher geographic level and larger demographic groups at the same geographic level as constraints, this method uses the population structure as probability in the Monte Carlo simulation process to estimate suppressed data entries. It not only ensures consistency across various data levels but also accounts for demographic structure that drives varying cancer risks. The 2016-2020 cancer incidence data from the Utah Cancer Registry is used to validate our approach. The method yields results with high precision and consistency across the full urban-rural continuum, and significantly outperforms several machine-learning models such as Random Forest and Extreme Gradient Boosting.

摘要

本研究采用一种创新的多约束蒙特卡罗模拟方法,来估计人口亚组中被抑制的县级癌症病例数,并将降尺度分析从美国的县扩展到邮政编码分区(ZCTA)。鉴于在较高地理层面已知的癌症病例数以及同一地理层面较大人口群体作为约束条件,该方法在蒙特卡罗模拟过程中使用人口结构作为概率来估计被抑制的数据条目。它不仅确保了不同数据层面之间的一致性,还考虑了导致癌症风险各异的人口结构。利用犹他州癌症登记处2016 - 2020年的癌症发病率数据来验证我们的方法。该方法在整个城乡连续体中产生了高精度和一致性的结果,并且显著优于随机森林和极端梯度提升等几种机器学习模型。

相似文献

4
Random cancers as supported by registry data.
Stat Med. 2020 Sep 20;39(21):2767-2778. doi: 10.1002/sim.8573. Epub 2020 May 10.
5
Predicting county-level cancer incidence rates and counts in the USA.预测美国县级癌症发病率和发病数。
Stat Med. 2013 Sep 30;32(22):3911-25. doi: 10.1002/sim.5833. Epub 2013 May 13.
9
Spatial extreme learning machines: An application on prediction of disease counts.空间极限学习机:在疾病计数预测中的应用。
Stat Methods Med Res. 2019 Sep;28(9):2583-2594. doi: 10.1177/0962280218767985. Epub 2018 Apr 9.

本文引用的文献

2
Precision public health: is it all about the data?精准公共卫生:一切都关乎数据吗?
J Public Health Policy. 2022 Dec;43(4):481-486. doi: 10.1057/s41271-022-00367-5. Epub 2022 Sep 13.
3
Data aggregation hides Pacific Islander health disparities.数据汇总掩盖了太平洋岛民的健康差异。
Lancet. 2022 Jul 2;400(10345):2-3. doi: 10.1016/S0140-6736(22)01100-X. Epub 2022 Jun 16.
5
Methods for Small Area Population Forecasts: State-of-the-Art and Research Needs.小区域人口预测方法:现状与研究需求
Popul Res Policy Rev. 2022;41(3):865-898. doi: 10.1007/s11113-021-09671-6. Epub 2021 Aug 16.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验