半连续变量和连续变量的双变量测量误差模型：在营养流行病学中的应用

A bivariate measurement error model for semicontinuous and continuous variables: Application to nutritional epidemiology.

作者信息

Kipnis Victor, Freedman Laurence S, Carroll Raymond J, Midthune Douglas

机构信息

Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, Bethesda, Maryland.

Information Management Services, Inc., Rockville, Maryland and Gertner Institute for Epidemiology and Health Policy Research, Sheba Medical Center, Tel Hashomer, Israel.

出版信息

Biometrics. 2016 Mar;72(1):106-15. doi: 10.1111/biom.12377. Epub 2015 Aug 31.

DOI:10.1111/biom.12377

PMID:26332011

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4775438/

Abstract

Semicontinuous data in the form of a mixture of a large portion of zero values and continuously distributed positive values frequently arise in many areas of biostatistics. This article is motivated by the analysis of relationships between disease outcomes and intakes of episodically consumed dietary components. An important aspect of studies in nutritional epidemiology is that true diet is unobservable and commonly evaluated by food frequency questionnaires with substantial measurement error. Following the regression calibration approach for measurement error correction, unknown individual intakes in the risk model are replaced by their conditional expectations given mismeasured intakes and other model covariates. Those regression calibration predictors are estimated using short-term unbiased reference measurements in a calibration substudy. Since dietary intakes are often "energy-adjusted," e.g., by using ratios of the intake of interest to total energy intake, the correct estimation of the regression calibration predictor for each energy-adjusted episodically consumed dietary component requires modeling short-term reference measurements of the component (a semicontinuous variable), and energy (a continuous variable) simultaneously in a bivariate model. In this article, we develop such a bivariate model, together with its application to regression calibration. We illustrate the new methodology using data from the NIH-AARP Diet and Health Study (Schatzkin et al., 2001, American Journal of Epidemiology 154, 1119-1125), and also evaluate its performance in a simulation study.

摘要

在生物统计学的许多领域中，经常会出现大量零值与连续分布的正值混合形式的半连续数据。本文的动机源于对疾病结局与偶尔摄入的膳食成分之间关系的分析。营养流行病学研究的一个重要方面是，真实饮食是不可观察的，通常通过存在大量测量误差的食物频率问卷来评估。遵循测量误差校正的回归校准方法，风险模型中未知的个体摄入量被其在测量错误的摄入量和其他模型协变量条件下的条件期望所取代。这些回归校准预测因子是在校准子研究中使用短期无偏参考测量来估计的。由于膳食摄入量通常是“能量调整的”，例如通过使用感兴趣的摄入量与总能量摄入量的比率，对于每个经能量调整的偶尔摄入的膳食成分，回归校准预测因子的正确估计需要在双变量模型中同时对该成分（一个半连续变量）和能量（一个连续变量）的短期参考测量进行建模。在本文中，我们开发了这样一个双变量模型及其在回归校准中的应用。我们使用来自美国国立卫生研究院 - 美国退休人员协会饮食与健康研究（Schatzkin等人，2001年，《美国流行病学杂志》154卷，第1119 - 1125页）的数据说明了新方法，并在模拟研究中评估了其性能。

相似文献

A bivariate measurement error model for semicontinuous and continuous variables: Application to nutritional epidemiology.半连续变量和连续变量的双变量测量误差模型：在营养流行病学中的应用

Biometrics. 2016 Mar;72(1):106-15. doi: 10.1111/biom.12377. Epub 2015 Aug 31.

Fitting a bivariate measurement error model for episodically consumed dietary components.为间歇性摄入的膳食成分拟合双变量测量误差模型。

Int J Biostat. 2011;7(1):1. doi: 10.2202/1557-4679.1267. Epub 2011 Jan 6.

Int J Epidemiol. 2004 Dec;33(6):1373-81. doi: 10.1093/ije/dyh138. Epub 2004 Aug 27.

A zero-augmented generalized gamma regression calibration to adjust for covariate measurement error: A case of an episodically consumed dietary intake.一种用于校正协变量测量误差的零增强广义伽马回归校准：以间歇性摄入的饮食摄入量为例。

Biom J. 2017 Jan;59(1):94-109. doi: 10.1002/bimj.201600043. Epub 2016 Oct 5.

Use of two-part regression calibration model to correct for measurement error in episodically consumed foods in a single-replicate study design: EPIC case study.在单重复研究设计中使用两部分回归校准模型校正偶尔食用食物的测量误差：欧洲癌症与营养前瞻性调查（EPIC）案例研究

PLoS One. 2014 Nov 17;9(11):e113160. doi: 10.1371/journal.pone.0113160. eCollection 2014.

Modeling data with excess zeros and measurement error: application to evaluating relationships between episodically consumed foods and health outcomes.对存在过多零值和测量误差的数据进行建模：应用于评估偶尔食用的食物与健康结果之间的关系。

Biometrics. 2009 Dec;65(4):1003-10. doi: 10.1111/j.1541-0420.2009.01223.x.

Validating an FFQ for intake of episodically consumed foods: application to the National Institutes of Health-AARP Diet and Health Study.验证一份用于评估间歇性摄入食物的 FFQ：在 NIH-AARP 饮食与健康研究中的应用。

Public Health Nutr. 2011 Jul;14(7):1212-21. doi: 10.1017/S1368980011000632. Epub 2011 Apr 13.

Evaluation of a two-part regression calibration to adjust for dietary exposure measurement error in the Cox proportional hazards model: A simulation study.在Cox比例风险模型中评估两部分回归校准以校正膳食暴露测量误差：一项模拟研究。

Biom J. 2016 Jul;58(4):766-82. doi: 10.1002/bimj.201500009. Epub 2016 Mar 22.

Systematic review of statistical approaches to quantify, or correct for, measurement error in a continuous exposure in nutritional epidemiology.系统评价统计方法在营养流行病学中量化或校正连续暴露测量误差的方法。

BMC Med Res Methodol. 2017 Sep 19;17(1):146. doi: 10.1186/s12874-017-0421-6.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

引用本文的文献

Assessing adult physical activity and compliance with 2008 CDC guidelines using a Bayesian two-part measurement error model.使用贝叶斯两部分测量误差模型评估成年人的身体活动及对2008年美国疾病控制与预防中心指南的依从性。

J Appl Stat. 2022 Jun 21;50(13):2777-2795. doi: 10.1080/02664763.2022.2088706. eCollection 2023.

The relationship between moderate to vigorous physical activity and metabolic syndrome: a Bayesian measurement error approach.中度至剧烈身体活动与代谢综合征之间的关系：一种贝叶斯测量误差方法。

J Appl Stat. 2022 May 12;50(10):2246-2266. doi: 10.1080/02664763.2022.2073336. eCollection 2023.

Nutritional Epidemiology and Dietary Assessment for Patients With Kidney Disease: A Primer.肾脏疾病患者的营养流行病学和膳食评估：入门篇。

Am J Kidney Dis. 2023 Jun;81(6):717-727. doi: 10.1053/j.ajkd.2022.11.014. Epub 2023 Jan 4.

A three-part regression calibration to handle excess zeroes, skewness and heteroscedasticity in adjusting for measurement error in dietary intake data.一种用于处理膳食摄入数据测量误差调整中过多零值、偏度和异方差性的三部分回归校准方法。

J Appl Stat. 2020 Nov 13;49(4):884-901. doi: 10.1080/02664763.2020.1845622. eCollection 2022.

Consumption of low nutritive value foods and cardiometabolic risk factors among French-speaking adults from Quebec, Canada: the PREDISE study.加拿大魁北克省讲法语成年人的低营养价值食物消费与心血管代谢危险因素：PREDISE 研究。

Nutr J. 2019 Aug 29;18(1):49. doi: 10.1186/s12937-019-0474-y.

Correcting for measurement error in fractional polynomial models using Bayesian modelling and regression calibration, with an application to alcohol and mortality.使用贝叶斯建模和回归校准校正分数多项式模型中的测量误差，并应用于酒精与死亡率研究。

Biom J. 2019 May;61(3):558-573. doi: 10.1002/bimj.201700279. Epub 2019 Mar 20.

Modeling energy balance while correcting for measurement error via free knot splines.通过自由结点样条拟合模型来校正测量误差的能量平衡。

PLoS One. 2018 Aug 30;13(8):e0201892. doi: 10.1371/journal.pone.0201892. eCollection 2018.

A method for sensitivity analysis to assess the effects of measurement error in multiple exposure variables using external validation data.一种使用外部验证数据进行敏感性分析以评估多个暴露变量测量误差影响的方法。

BMC Med Res Methodol. 2016 Oct 13;16(1):139. doi: 10.1186/s12874-016-0240-1.

本文引用的文献

Covariate measurement error correction methods in mediation analysis with failure time data.具有删失时间数据的中介分析中的协变量测量误差校正方法

Biometrics. 2014 Dec;70(4):835-44. doi: 10.1111/biom.12205. Epub 2014 Aug 19.

Fitting a bivariate measurement error model for episodically consumed dietary components.为间歇性摄入的膳食成分拟合双变量测量误差模型。

Int J Biostat. 2011;7(1):1. doi: 10.2202/1557-4679.1267. Epub 2011 Jan 6.

A NEW MULTIVARIATE MEASUREMENT ERROR MODEL WITH ZERO-INFLATED DIETARY DATA, AND ITS APPLICATION TO DIETARY ASSESSMENT.一种用于零膨胀饮食数据的新型多元测量误差模型及其在饮食评估中的应用。

Ann Appl Stat. 2011 Jun 1;5(2B):1456-1487. doi: 10.1214/10-AOAS446.

A prospective study of meat, cooking methods, meat mutagens, heme iron, and lung cancer risks.一项关于肉类、烹饪方法、肉类诱变剂、血红素铁与肺癌风险的前瞻性研究。

Am J Clin Nutr. 2009 Jun;89(6):1884-94. doi: 10.3945/ajcn.2008.27272. Epub 2009 Apr 15.

Biometrics. 2009 Dec;65(4):1003-10. doi: 10.1111/j.1541-0420.2009.01223.x.

Intakes of fruit, vegetables, and specific botanical groups in relation to lung cancer risk in the NIH-AARP Diet and Health Study.美国国立卫生研究院-美国退休人员协会饮食与健康研究中水果、蔬菜及特定植物类别摄入量与肺癌风险的关系

Am J Epidemiol. 2008 Nov 1;168(9):1024-34. doi: 10.1093/aje/kwn212. Epub 2008 Sep 12.

A new statistical method for estimating the usual intake of episodically consumed foods with application to their distribution.一种用于估计偶发性消费食品通常摄入量及其分布的新统计方法。

J Am Diet Assoc. 2006 Oct;106(10):1575-87. doi: 10.1016/j.jada.2006.07.003.

Analysis of repeated measures data with clumping at zero.对零值处有聚集的重复测量数据进行分析。

Stat Methods Med Res. 2002 Aug;11(4):341-55. doi: 10.1191/0962280202sm291ra.

Design and serendipity in establishing a large cohort with wide dietary intake distributions : the National Institutes of Health-American Association of Retired Persons Diet and Health Study.建立一个具有广泛饮食摄入分布的大型队列中的设计与意外发现：美国国立卫生研究院-美国退休人员协会饮食与健康研究

Am J Epidemiol. 2001 Dec 15;154(12):1119-25. doi: 10.1093/aje/154.12.1119.

The problem of profound mismeasurement and the power of epidemiological studies of diet and cancer.深度测量误差问题以及饮食与癌症的流行病学研究的影响力

Nutr Cancer. 1988;11(4):243-50. doi: 10.1080/01635588809513994.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验