Generalized additive models for cancer mapping with incomplete covariates.

Authors

French Jonathan L, Wand Matthew P

Affiliation

Biostatistics, Global Research and Development, Pfizer, Inc, 50 Pequot Avenue, New London, CT 06320, USA.

Publication information

Biostatistics. 2004 Apr;5(2):177-91. doi: 10.1093/biostatistics/5.2.177.

DOI: 10.1093/biostatistics/5.2.177
PMID: 15054024

Abstract

Maps depicting cancer incidence rates have become useful tools in public health research, giving valuable information about the spatial variation in rates of disease. Typically, these maps are generated using count data aggregated over areas such as counties or census blocks. However, with the proliferation of geographic information systems and related databases, it is becoming easier to obtain exact spatial locations for the cancer cases and suitable control subjects. The use of such point data allows us to adjust for individual-level covariates, such as age and smoking status, when estimating the spatial variation in disease risk. Unfortunately, such covariate information is often subject to missingness. We propose a method for mapping cancer risk when covariates are not completely observed. We model these data using a logistic generalized additive model. Estimates of the linear and non-linear effects are obtained using a mixed effects model representation. We develop an EM algorithm to account for missing data and the random effects. Since the expectation step involves an intractable integral, we estimate the E-step with a Laplace approximation. This framework provides a general method for handling missing covariate values when fitting generalized additive models. We illustrate our method through an analysis of cancer incidence data from Cape Cod, Massachusetts. These analyses demonstrate that standard complete-case methods can yield biased estimates of the spatial variation of cancer risk.
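
The abstract states that the E-step involves an intractable integral that is estimated with a Laplace approximation. The sketch below illustrates that general technique on a toy problem only; it is not the authors' implementation. It assumes a logistic model with a single scalar random intercept u ~ N(0, sigma^2), and the function names (`log_integrand`, `laplace_log_integral`) and the data are hypothetical. It approximates the integral of exp(h(u)) by expanding h to second order around its mode, then compares the result with a brute-force sum on a fine grid.

```python
import numpy as np


def log_integrand(u, y, eta, sigma):
    """h(u): Bernoulli log-likelihood with linear predictor eta + u,
    plus the N(0, sigma^2) log-density kernel for u (constants dropped)."""
    lin = eta + u
    loglik = np.sum(y * lin - np.logaddexp(0.0, lin))
    return loglik - 0.5 * u**2 / sigma**2


def laplace_log_integral(y, eta, sigma, n_iter=50, tol=1e-10):
    """Laplace approximation to log of the integral of exp(h(u)) du.

    Newton's method locates the mode u_hat of h; the integral is then
    approximated by exp(h(u_hat)) * sqrt(2 * pi / -h''(u_hat)).
    """
    u = 0.0
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(eta + u)))
        grad = np.sum(y - p) - u / sigma**2
        hess = -np.sum(p * (1.0 - p)) - 1.0 / sigma**2  # always negative
        step = grad / hess
        u -= step
        if abs(step) < tol:
            break
    p = 1.0 / (1.0 + np.exp(-(eta + u)))
    hess = -np.sum(p * (1.0 - p)) - 1.0 / sigma**2
    log_value = log_integrand(u, y, eta, sigma) + 0.5 * np.log(2.0 * np.pi / -hess)
    return log_value, u


# Toy data (hypothetical): eight binary outcomes, zero fixed-effect predictor.
y = np.array([1, 0, 1, 1, 0, 1, 0, 1], dtype=float)
eta = np.zeros_like(y)
sigma = 1.0

log_lap, mode = laplace_log_integral(y, eta, sigma)

# Brute-force reference: Riemann sum on a fine grid wide enough that the
# integrand is negligible at both ends.
grid = np.linspace(-8.0, 8.0, 4001)
values = np.exp([log_integrand(u, y, eta, sigma) for u in grid])
brute = np.sum(values) * (grid[1] - grid[0])

print("Laplace:", np.exp(log_lap), "grid sum:", brute, "mode:", mode)
```

In this one-dimensional toy case the grid sum is cheap, so the approximation is not needed; the Laplace form becomes useful when the integral is over many random effects or missing covariate values and direct quadrature is impractical.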

Similar articles

1
Generalized additive models for cancer mapping with incomplete covariates.
Biostatistics. 2004 Apr;5(2):177-91. doi: 10.1093/biostatistics/5.2.177.
2
Extended follow-up and spatial analysis of the American Cancer Society study linking particulate air pollution and mortality.
Res Rep Health Eff Inst. 2009 May(140):5-114; discussion 115-36.
3
Effects of long-term exposure to traffic-related air pollution on respiratory and cardiovascular mortality in the Netherlands: the NLCS-AIR study.
Res Rep Health Eff Inst. 2009 Mar(139):5-71; discussion 73-89.
4
Semiparametric models for missing covariate and response data in regression models.
Biometrics. 2006 Mar;62(1):177-84. doi: 10.1111/j.1541-0420.2005.00438.x.
5
Regression analysis with missing covariate data using estimating equations.
Biometrics. 1996 Dec;52(4):1165-82.
6
Adjusting for nonignorable missingness when estimating generalized additive models.
Biom J. 2010 Apr;52(2):186-200. doi: 10.1002/bimj.200900202.
7
Estimating the intensity of a spatial point process from locations coarsened by incomplete geocoding.
Biometrics. 2008 Mar;64(1):262-70. doi: 10.1111/j.1541-0420.2007.00870.x. Epub 2007 Aug 3.
8
Disease mapping and spatial regression with count data.
Biostatistics. 2007 Apr;8(2):158-83. doi: 10.1093/biostatistics/kxl008. Epub 2006 Jun 29.
9
Analysis of matched case-control data in presence of nonignorable missing exposure.
Biometrics. 2008 Mar;64(1):106-14. doi: 10.1111/j.1541-0420.2007.00828.x. Epub 2007 Jun 15.
10
A Bayesian hierarchical model for the estimation of two incomplete surveillance data sets.
Stat Med. 2008 Jul 30;27(17):3269-85. doi: 10.1002/sim.3190.

Cited by

1
Factors associated with early sexual debut among adolescents and youth in Mozambique: A geo-additive survival analysis of the Mozambique 2021 AIDS indicator survey.
PLoS One. 2025 Jun 25;20(6):e0326869. doi: 10.1371/journal.pone.0326869. eCollection 2025.
2
Knot selection for low-rank kriging models of spatial risk in case-control studies.
Spat Spatiotemporal Epidemiol. 2022 Jun;41:100483. doi: 10.1016/j.sste.2022.100483. Epub 2022 Jan 21.
3
Spatiotemporal high-resolution prediction and mapping: methodology and application to dengue disease.
J Geogr Syst. 2022;24(4):527-581. doi: 10.1007/s10109-021-00368-0. Epub 2022 Feb 19.
4
Daily activity locations k-anonymity for the evaluation of disclosure risk of individual GPS datasets.
Int J Health Geogr. 2020 Mar 5;19(1):7. doi: 10.1186/s12942-020-00201-9.
5
Spatial Distribution of HIV Prevalence among Young People in Mozambique.
Int J Environ Res Public Health. 2020 Jan 31;17(3):885. doi: 10.3390/ijerph17030885.
6
Ensuring Confidentiality of Geocoded Health Data: Assessing Geographic Masking Strategies for Individual-Level Data.
Adv Med. 2014;2014:567049. doi: 10.1155/2014/567049. Epub 2014 Apr 29.
7
A Bayesian semiparametric approach with change points for spatial ordinal data.
Stat Methods Med Res. 2016 Apr;25(2):644-58. doi: 10.1177/0962280212463415. Epub 2012 Oct 14.
8
Semiparametric regression during 2003-2007.
Electron J Stat. 2009 Jan 1;3:1193-1256. doi: 10.1214/09-EJS525.
9
Combining area-based and individual-level data in the geostatistical mapping of late-stage cancer incidence.
Spat Spatiotemporal Epidemiol. 2009 Oct-Dec;1(1):61-71. doi: 10.1016/j.sste.2009.07.001.
10
Bayesian spatial modeling of disease risk in relation to multivariate environmental risk fields.
Stat Med. 2010 Jan 15;29(1):142-57. doi: 10.1002/sim.3777.