• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

美国邮政编码分区层面的癌症发病率数据,通过具有多重约束的蒙特卡洛模拟进行插值。

Cancer incidence data at the ZIP Code Tabulation Area level in the United States interpolated by Monte Carlo simulation with multiple constraints.

作者信息

Liu Lingbo, Wang Fahui, Onega Tracy

机构信息

Center for Geographic Analysis, Harvard University, Cambridge, MA, USA.

Department of Geography and Anthropology, Louisiana State University, Baton Rouge, LA, USA.

出版信息

Sci Data. 2025 May 30;12(1):909. doi: 10.1038/s41597-025-05254-8.

DOI:10.1038/s41597-025-05254-8
PMID:40447591
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12125315/
Abstract

High-quality cancer data are fundamental for public health research and policy, but cancer data for small geographic units and population subgroups in the United States are rarely available due to small-sample suppression rules, spatial coarsening, and data incompleteness. These limitations hinder high-resolution spatial analyses and precision public health interventions. This study provides a high-resolution cancer incidence dataset for the U.S., generated through a multi-constraint Monte Carlo simulation framework that reconstructs suppressed county-level cancer data and systematically disaggregates them to ZIP Code Tabulation Areas (ZCTAs), guided by demographic constraints. This method integrates population subgroup structures and macro-level incidence rates as constraints, ensuring consistency and reliability across spatial scales. The resulting dataset spans multiple geographic units, from state and county levels to ZCTAs, enabling comprehensive analyses of cancer burden, in-depth spatial analyses, and precision public health interventions across multiple scales.

摘要

高质量的癌症数据是公共卫生研究和政策的基础,但由于小样本抑制规则、空间粗化和数据不完整,美国小地理区域和人口亚组的癌症数据很少可得。这些限制阻碍了高分辨率空间分析和精准公共卫生干预。本研究提供了一个美国的高分辨率癌症发病率数据集,该数据集通过多约束蒙特卡罗模拟框架生成,该框架重建了被抑制的县级癌症数据,并在人口统计学约束的指导下将其系统地分解为邮政编码分区(ZCTA)。该方法将人口亚组结构和宏观层面的发病率作为约束条件,确保跨空间尺度的一致性和可靠性。所得数据集跨越多个地理单元,从州和县级到ZCTA,能够对癌症负担进行全面分析、进行深入的空间分析以及跨多个尺度的精准公共卫生干预。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9124/12125315/a9c76587b349/41597_2025_5254_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9124/12125315/ae942936c6ee/41597_2025_5254_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9124/12125315/21d263d9b743/41597_2025_5254_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9124/12125315/7a2883d51f48/41597_2025_5254_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9124/12125315/a83a39d3f020/41597_2025_5254_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9124/12125315/a9c76587b349/41597_2025_5254_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9124/12125315/ae942936c6ee/41597_2025_5254_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9124/12125315/21d263d9b743/41597_2025_5254_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9124/12125315/7a2883d51f48/41597_2025_5254_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9124/12125315/a83a39d3f020/41597_2025_5254_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9124/12125315/a9c76587b349/41597_2025_5254_Fig5_HTML.jpg

相似文献

1
Cancer incidence data at the ZIP Code Tabulation Area level in the United States interpolated by Monte Carlo simulation with multiple constraints.美国邮政编码分区层面的癌症发病率数据,通过具有多重约束的蒙特卡洛模拟进行插值。
Sci Data. 2025 May 30;12(1):909. doi: 10.1038/s41597-025-05254-8.
2
A multi-constraint Monte Carlo Simulation approach to downscaling cancer data.一种用于降尺度癌症数据的多约束蒙特卡罗模拟方法。
Health Place. 2025 Jan;91:103411. doi: 10.1016/j.healthplace.2024.103411. Epub 2025 Jan 6.
3
On the use of ZIP codes and ZIP code tabulation areas (ZCTAs) for the spatial analysis of epidemiological data.关于使用邮政编码和邮政编码分区(ZCTA)进行流行病学数据的空间分析
Int J Health Geogr. 2006 Dec 13;5:58. doi: 10.1186/1476-072X-5-58.
4
Chronic Disease Prevalence in the US: Sociodemographic and Geographic Variations by Zip Code Tabulation Area.美国慢性疾病的患病率:按邮政编码区划分的社会人口统计学和地理差异。
Prev Chronic Dis. 2024 Feb 29;21:E14. doi: 10.5888/pcd21.230267.
5
Predicting county-level cancer incidence rates and counts in the USA.预测美国县级癌症发病率和发病数。
Stat Med. 2013 Sep 30;32(22):3911-25. doi: 10.1002/sim.5833. Epub 2013 May 13.
6
Investigation of geographic disparities of diabetes-related hospitalizations in Florida using flexible spatial scan statistics: An ecological study.利用灵活空间扫描统计调查佛罗里达州糖尿病住院相关的地理差异:一项生态研究。
PLoS One. 2024 Jun 4;19(6):e0298182. doi: 10.1371/journal.pone.0298182. eCollection 2024.
7
Integrating data at multiple spatial scales to estimate the local burden of the opioid syndemic.整合多个空间尺度的数据以估计阿片类药物共病的局部负担。
Spat Spatiotemporal Epidemiol. 2025 Jun;53:100720. doi: 10.1016/j.sste.2025.100720. Epub 2025 Apr 22.
8
Access to Federally Qualified Health Centers and HIV Outcomes in the U.S. South.美国南部获得合格联邦健康中心服务与 HIV 结局的关系。
Am J Prev Med. 2024 May;66(5):770-779. doi: 10.1016/j.amepre.2023.12.008. Epub 2023 Dec 13.
9
Analyzing spatial aggregation error in statistical models of late-stage cancer risk: a Monte Carlo simulation approach.分析晚期癌症风险统计模型中的空间聚集误差:蒙特卡罗模拟方法。
Int J Health Geogr. 2010 Oct 19;9:51. doi: 10.1186/1476-072X-9-51.
10
Small Area Forecasting of Opioid-Related Mortality: Bayesian Spatiotemporal Dynamic Modeling Approach.阿片类药物相关死亡率的小区域预测:贝叶斯时空动态建模方法。
JMIR Public Health Surveill. 2023 Feb 10;9:e41450. doi: 10.2196/41450.

本文引用的文献

1
A multi-constraint Monte Carlo Simulation approach to downscaling cancer data.一种用于降尺度癌症数据的多约束蒙特卡罗模拟方法。
Health Place. 2025 Jan;91:103411. doi: 10.1016/j.healthplace.2024.103411. Epub 2025 Jan 6.
2
A novel MissForest-based missing values imputation approach with recursive feature elimination in medical applications.一种基于 MissForest 的新的缺失值插补方法,在医学应用中采用递归特征消除。
BMC Med Res Methodol. 2024 Nov 8;24(1):269. doi: 10.1186/s12874-024-02392-2.
3
Identify the most appropriate imputation method for handling missing values in clinical structured datasets: a systematic review.
识别处理临床结构化数据集缺失值的最合适插补方法:系统评价。
BMC Med Res Methodol. 2024 Aug 28;24(1):188. doi: 10.1186/s12874-024-02310-6.
4
NCI Cancer Research Data Commons: Lessons Learned and Future State.NCI 癌症研究数据共享:经验教训和未来发展方向。
Cancer Res. 2024 May 2;84(9):1404-1409. doi: 10.1158/0008-5472.CAN-23-2730.
5
NCI Cancer Research Data Commons: Resources to Share Key Cancer Data.NCI 癌症研究数据共享社区:分享关键癌症数据的资源。
Cancer Res. 2024 May 2;84(9):1388-1395. doi: 10.1158/0008-5472.CAN-23-2468.
6
The impact of imputation quality on machine learning classifiers for datasets with missing values.插补质量对具有缺失值数据集的机器学习分类器的影响。
Commun Med (Lond). 2023 Oct 6;3(1):139. doi: 10.1038/s43856-023-00356-z.
7
Measuring The Enduring Imprint Of Structural Racism On American Neighborhoods.衡量结构性种族主义对美国社区的持久影响。
Health Aff (Millwood). 2023 Oct;42(10):1374-1382. doi: 10.1377/hlthaff.2023.00659.
8
Developing Geographic Areas for Cancer Reporting Using Automated Zone Design.使用自动区域设计来开发癌症报告的地理区域。
Am J Epidemiol. 2022 Nov 19;191(12):2109-2119. doi: 10.1093/aje/kwac155.
9
Uncertainty in geospatial health: challenges and opportunities ahead.地理空间卫生中的不确定性:未来的挑战与机遇。
Ann Epidemiol. 2022 Jan;65:15-30. doi: 10.1016/j.annepidem.2021.10.002. Epub 2021 Oct 14.
10
Machine learning in the prediction of cancer therapy.机器学习在癌症治疗预测中的应用
Comput Struct Biotechnol J. 2021 Jul 8;19:4003-4017. doi: 10.1016/j.csbj.2021.07.003. eCollection 2021.