基于人群的癌症流行病学中双稳健方法的数据自适应估计：急诊就诊的肺癌死亡率的风险差异。

Data-Adaptive Estimation for Double-Robust Methods in Population-Based Cancer Epidemiology: Risk Differences for Lung Cancer Mortality by Emergency Presentation.

机构信息

Faculty of Epidemiology and Population Health, Department of Non-Communicable Disease Epidemiology, Cancer Survival Group, London School of Hygiene and Tropical Medicine, London, United Kingdom.

Laboratory for Psychiatric Biostatistics, McLean Hospital, Belmont, Massachusetts.

出版信息

Am J Epidemiol. 2018 Apr 1;187(4):871-878. doi: 10.1093/aje/kwx317.

DOI:10.1093/aje/kwx317

PMID:29020131

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5888939/

Abstract

In this paper, we propose a structural framework for population-based cancer epidemiology and evaluate the performance of double-robust estimators for a binary exposure in cancer mortality. We conduct numerical analyses to study the bias and efficiency of these estimators. Furthermore, we compare 2 different model selection strategies based on 1) Akaike's Information Criterion and the Bayesian Information Criterion and 2) machine learning algorithms, and we illustrate double-robust estimators' performance in a real-world setting. In simulations with correctly specified models and near-positivity violations, all but the naive estimators had relatively good performance. However, the augmented inverse-probability-of-treatment weighting estimator showed the largest relative bias. Under dual model misspecification and near-positivity violations, all double-robust estimators were biased. Nevertheless, the targeted maximum likelihood estimator showed the best bias-variance trade-off, more precise estimates, and appropriate 95% confidence interval coverage, supporting the use of the data-adaptive model selection strategies based on machine learning algorithms. We applied these methods to estimate adjusted 1-year mortality risk differences in 183,426 lung cancer patients diagnosed after admittance to an emergency department versus persons with a nonemergency cancer diagnosis in England (2006-2013). The adjusted mortality risk (for patients diagnosed with lung cancer after admittance to an emergency department) was 16% higher in men and 18% higher in women, suggesting the importance of interventions targeting early detection of lung cancer signs and symptoms.

摘要

在本文中，我们提出了一种基于人群的癌症流行病学结构框架，并评估了用于癌症死亡率中二元暴露的双重稳健估计量的性能。我们进行了数值分析，以研究这些估计量的偏差和效率。此外，我们比较了基于 1）Akaike 信息准则和贝叶斯信息准则和 2）机器学习算法的两种不同的模型选择策略，并在实际环境中说明了双重稳健估计量的性能。在模型正确指定和接近正性违反的模拟中，除了天真的估计量外，所有估计量的性能都相对较好。然而，增强的逆处理权重估计量表现出最大的相对偏差。在双重模型误定和接近正性违反的情况下，所有双重稳健估计量都存在偏差。然而，靶向最大似然估计量表现出最佳的偏差方差权衡、更精确的估计值和适当的 95%置信区间覆盖，支持使用基于机器学习算法的数据自适应模型选择策略。我们将这些方法应用于估计 183426 名在英国急诊部门就诊后诊断为肺癌的患者与非急诊癌症诊断患者的 1 年调整死亡率风险差异（2006-2013 年）。调整后的死亡率风险（急诊部门就诊后诊断为肺癌的患者）在男性中高 16%，在女性中高 18%，这表明针对肺癌症状和体征的早期检测的干预措施的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87c0/5888939/5568cb7bc7ad/kwx317f01.jpg

相似文献

Data-Adaptive Estimation for Double-Robust Methods in Population-Based Cancer Epidemiology: Risk Differences for Lung Cancer Mortality by Emergency Presentation.基于人群的癌症流行病学中双稳健方法的数据自适应估计：急诊就诊的肺癌死亡率的风险差异。

Am J Epidemiol. 2018 Apr 1;187(4):871-878. doi: 10.1093/aje/kwx317.

Double Robust Efficient Estimators of Longitudinal Treatment Effects: Comparative Performance in Simulations and a Case Study.纵向治疗效果的双重稳健有效估计量：模拟中的比较性能及一个案例研究

Int J Biostat. 2019 Feb 26;15(2):/j/ijb.2019.15.issue-2/ijb-2017-0054/ijb-2017-0054.xml. doi: 10.1515/ijb-2017-0054.

Effect Estimation in Point-Exposure Studies with Binary Outcomes and High-Dimensional Covariate Data - A Comparison of Targeted Maximum Likelihood Estimation and Inverse Probability of Treatment Weighting.二元结局和高维协变量数据的点暴露研究中的效应估计——靶向最大似然估计与治疗权重逆概率的比较

Int J Biostat. 2016 Nov 1;12(2). doi: 10.1515/ijb-2015-0034.

Collaborative double robust targeted maximum likelihood estimation.协作双稳健靶向最大似然估计

Int J Biostat. 2010 May 17;6(1):Article 17. doi: 10.2202/1557-4679.1181.

Challenges in Obtaining Valid Causal Effect Estimates with Machine Learning Algorithms.使用机器学习算法获取有效因果效应估计值面临的挑战。

Am J Epidemiol. 2023 Sep 1;192(9). doi: 10.1093/aje/kwab201. Epub 2021 Jul 15.

Targeted Maximum Likelihood Estimation for Causal Inference in Observational Studies.观察性研究中因果推断的靶向最大似然估计

Am J Epidemiol. 2017 Jan 1;185(1):65-73. doi: 10.1093/aje/kww165. Epub 2016 Dec 9.

Machine Learning for Causal Inference: On the Use of Cross-fit Estimators.机器学习在因果推断中的应用：基于交叉拟合估计量的研究。

Epidemiology. 2021 May 1;32(3):393-401. doi: 10.1097/EDE.0000000000001332.

The relative performance of targeted maximum likelihood estimators.靶向最大似然估计量的相对性能。

Int J Biostat. 2011;7(1). doi: 10.2202/1557-4679.1308. Epub 2011 Aug 17.

Targeted maximum likelihood estimation in safety analysis.目标最大似然估计在安全性分析中的应用。

J Clin Epidemiol. 2013 Aug;66(8 Suppl):S91-8. doi: 10.1016/j.jclinepi.2013.02.017.

When does adjusting covariate under randomization help? A comparative study on current practices.随机化时调整协变量有何帮助？对现行实践的比较研究。

BMC Med Res Methodol. 2024 Oct 26;24(1):250. doi: 10.1186/s12874-024-02375-3.

引用本文的文献

Machine learning in causal inference for epidemiology.流行病学中的因果推理中的机器学习。

Eur J Epidemiol. 2024 Oct;39(10):1097-1108. doi: 10.1007/s10654-024-01173-x. Epub 2024 Nov 13.

SARS-CoV-2 infection by trimester of pregnancy and adverse perinatal outcomes: a Mexican retrospective cohort study.妊娠期 SARS-CoV-2 感染与不良围产结局：墨西哥回顾性队列研究。

BMJ Open. 2024 Apr 11;14(4):e075928. doi: 10.1136/bmjopen-2023-075928.

Deep Ensemble Machine Learning Framework for the Estimation of Concentrations.深度集成机器学习框架用于估算浓度。

Environ Health Perspect. 2022 Mar;130(3):37004. doi: 10.1289/EHP9752. Epub 2022 Mar 7.

Impact of androgen deprivation therapy on mortality of prostate cancer patients with COVID-19: a propensity score-based analysis.雄激素剥夺疗法对新冠肺炎前列腺癌患者死亡率的影响：一项基于倾向评分的分析。

Infect Agent Cancer. 2021 Nov 25;16(1):66. doi: 10.1186/s13027-021-00406-y.

Introduction to computational causal inference using reproducible Stata, R, and Python code: A tutorial.使用可重现的Stata、R和Python代码进行计算因果推断入门教程

Stat Med. 2022 Jan 30;41(2):407-432. doi: 10.1002/sim.9234. Epub 2021 Oct 28.

Association of medical male circumcision and sexually transmitted infections in a population-based study using targeted maximum likelihood estimation.基于人群的研究采用靶向极大似然估计法分析医疗男性割礼与性传播感染的相关性。

BMC Public Health. 2021 Sep 8;21(1):1642. doi: 10.1186/s12889-021-11705-9.

Metalworking Fluids and Colon Cancer Risk: Longitudinal Targeted Minimum Loss-based Estimation.金属加工液与结肠癌风险：基于纵向靶向最小损失的估计

Environ Epidemiol. 2019 Feb 12;3(1):e035. doi: 10.1097/EE9.0000000000000035. eCollection 2019 Feb.

Intersections of machine learning and epidemiological methods for health services research.机器学习与流行病学方法在卫生服务研究中的交汇。

Int J Epidemiol. 2021 Jan 23;49(6):1763-1770. doi: 10.1093/ije/dyaa035.

Using longitudinal targeted maximum likelihood estimation in complex settings with dynamic interventions.在具有动态干预的复杂环境中使用纵向靶向极大似然估计。

Stat Med. 2019 Oct 30;38(24):4888-4911. doi: 10.1002/sim.8340. Epub 2019 Aug 22.

Comparison of Parametric and Nonparametric Estimators for the Association Between Incident Prepregnancy Obesity and Stillbirth in a Population-Based Cohort Study.基于人群队列研究中，比较参数和非参数估计在偶发性孕前肥胖与死胎之间的关联。

Am J Epidemiol. 2019 Jul 1;188(7):1328-1336. doi: 10.1093/aje/kwz081.

本文引用的文献

Reproducibility, reliability and validity of population-based administrative health data for the assessment of cancer non-related comorbidities.基于人群的行政健康数据用于评估癌症非相关合并症的可重复性、可靠性和有效性。

PLoS One. 2017 Mar 6;12(3):e0172814. doi: 10.1371/journal.pone.0172814. eCollection 2017.

Targeted Maximum Likelihood Estimation for Causal Inference in Observational Studies.观察性研究中因果推断的靶向最大似然估计

Am J Epidemiol. 2017 Jan 1;185(1):65-73. doi: 10.1093/aje/kww165. Epub 2016 Dec 9.

The impact of comorbidity on cancer and its treatment.共病对癌症及其治疗的影响。

CA Cancer J Clin. 2016 Jul;66(4):337-50. doi: 10.3322/caac.21342. Epub 2016 Feb 17.

The impact of patient comorbidity on cancer stage at diagnosis.患者合并症对诊断时癌症分期的影响。

Br J Cancer. 2015 Nov 3;113(9):1375-80. doi: 10.1038/bjc.2015.355. Epub 2015 Oct 13.

Comparative effectiveness research in cancer with observational data.利用观察性数据开展癌症比较疗效研究。

Am Soc Clin Oncol Educ Book. 2015:e330-5. doi: 10.14694/EdBook_AM.2015.35.e330.

The effect of emergency presentation on surgery and survival in lung cancer patients in England, 2006-2008.2006 - 2008年英国急诊就诊对肺癌患者手术及生存的影响

Cancer Epidemiol. 2015 Aug;39(4):612-6. doi: 10.1016/j.canep.2015.04.008. Epub 2015 May 13.

Global surveillance of cancer survival 1995-2009: analysis of individual data for 25,676,887 patients from 279 population-based registries in 67 countries (CONCORD-2).1995 - 2009年全球癌症生存情况监测：对来自67个国家279个基于人群的登记处的25,676,887例患者的个体数据进行分析（CONCORD - 2）

Lancet. 2015 Mar 14;385(9972):977-1010. doi: 10.1016/S0140-6736(14)62038-9. Epub 2014 Nov 26.

Mortality prediction in intensive care units with the Super ICU Learner Algorithm (SICULA): a population-based study.重症监护病房死亡率预测的超级 ICU 学习者算法（SICULA）：一项基于人群的研究。

Lancet Respir Med. 2015 Jan;3(1):42-52. doi: 10.1016/S2213-2600(14)70239-5. Epub 2014 Nov 24.

Enhancing cancer registry data for comparative effectiveness research (CER) project: overview and methodology.增强用于比较效果研究（CER）项目的癌症登记数据：概述与方法

J Registry Manag. 2014 Fall;41(3):103-12.

The parametric g-formula for time-to-event data: intuition and a worked example.用于事件发生时间数据的参数化g公式：直观理解与实例分析。

Epidemiology. 2014 Nov;25(6):889-97. doi: 10.1097/EDE.0000000000000160.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于人群的癌症流行病学中双稳健方法的数据自适应估计：急诊就诊的肺癌死亡率的风险差异。

Data-Adaptive Estimation for Double-Robust Methods in Population-Based Cancer Epidemiology: Risk Differences for Lung Cancer Mortality by Emergency Presentation.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献