Suppr超能文献

相关测量误差——对营养流行病学的影响

Correlated measurement error--implications for nutritional epidemiology.

作者信息

Day N E, Wong M Y, Bingham S, Khaw K T, Luben R, Michels K B, Welch A, Wareham N J

机构信息

Strangeways Research Laboratory, Institute of Public Health, University of Cambridge, Cambridge, CB1 8RN, UK.

出版信息

Int J Epidemiol. 2004 Dec;33(6):1373-81. doi: 10.1093/ije/dyh138. Epub 2004 Aug 27.

Abstract

BACKGROUND

In nutritional epidemiology, it is common to fit models in which several dietary variables are included. However, with standard instruments for dietary assessment, not only are the intakes of many nutrients often highly correlated, but the errors in the estimation of the intake of different nutrients are also correlated. The effect of this error correlation on the results of observational studies has been little investigated. This paper describes the effect on multivariate regression coefficients of different levels of correlation, both between the variables themselves and between the errors of estimation of these variables.

METHODS

Using a simple model for the multivariate error structure, we examine the effect on the estimates of bivariate linear regression coefficients of (1) differential precision of measurement of the two independent variables, (2) differing levels of correlation between the true values of the two variables, and (3) differing levels of correlation between the errors of measurement of the two variables. As an example, the prediction of plasma vitamin C levels by dietary intake variables is considered, using data from the European Prospective Investigation of Cancer (EPIC) Norfolk study in which dietary intake was estimated using both a food frequency questionnaire (FFQ) and a 7-day diary (7DD). The dietary variables considered are vitamin C, fat, and energy, with different approaches taken to energy adjustment.

RESULTS

When the error correlation is zero, the estimates of the bivariate regression coefficients reflect the precision of measurement of the two variables and mutual confounding. The sum of the observed regression coefficients is biased towards the null as in univariate regression. When the error correlation is non-zero but below about 0.7, the effect is minor. However, as the error correlation increases beyond 0.8 the effect becomes large and highly dependent on the relative precision with which the two variables are measured. At the extreme, the bivariate estimates can become indefinitely large. In the example, the error correlation between fat and energy using the FFQ appears to be over 0.9, the corresponding value for the 7DD being approximately 0.85. The error correlation between vitamin C and fat, and vitamin C and energy, appears to be below 0.5 and smaller for the 7DD than for the FFQ. The impact of these error correlations on bivariate regression coefficients is large. The effect of energy adjustment differs widely between vitamin C and fat.

CONCLUSION

High levels of error correlation can have a large effect on bivariate regression estimates, varying widely depending on which two variables are considered. In particular, the effect of energy adjustment will vary widely. For vitamin C, the effect of energy adjustment appears negligible, whereas for fat the effect is large indicating that error correlation close to one can partially remove regression dilution due to measurement error. If, for fat intake, energy adjustment is performed by using energy density, the partial removal of regression dilution is achieved at the expense of substantial reduction in the true variance.

摘要

背景

在营养流行病学中,拟合包含多个饮食变量的模型很常见。然而,使用标准的饮食评估工具时,不仅许多营养素的摄入量往往高度相关,而且不同营养素摄入量估计中的误差也相互关联。这种误差相关性对观察性研究结果的影响鲜有研究。本文描述了变量之间以及这些变量估计误差之间不同程度的相关性对多元回归系数的影响。

方法

使用一个简单的多元误差结构模型,我们研究了以下因素对双变量线性回归系数估计值的影响:(1)两个自变量测量精度的差异;(2)两个变量真实值之间不同程度的相关性;(3)两个变量测量误差之间不同程度的相关性。例如,利用欧洲癌症前瞻性调查(EPIC)诺福克研究的数据,考虑通过饮食摄入变量预测血浆维生素C水平,该研究中饮食摄入量通过食物频率问卷(FFQ)和7天饮食日记(7DD)进行估计。所考虑的饮食变量为维生素C、脂肪和能量,并采用不同方法进行能量调整。

结果

当误差相关性为零时,双变量回归系数的估计值反映了两个变量的测量精度和相互混杂情况。观察到的回归系数之和如在单变量回归中一样偏向于零假设。当误差相关性非零但低于约0.7时,影响较小。然而,当误差相关性增加到超过0.8时,影响变得很大,并且高度依赖于两个变量的测量相对精度。在极端情况下,双变量估计值可能变得无限大。在该示例中,使用FFQ时脂肪和能量之间的误差相关性似乎超过0.9,7DD的相应值约为0.85。维生素C与脂肪以及维生素C与能量之间的误差相关性似乎低于0.5,并且7DD的误差相关性比FFQ的更小。这些误差相关性对双变量回归系数的影响很大。能量调整对维生素C和脂肪的影响差异很大。

结论

高水平的误差相关性会对双变量回归估计产生很大影响,具体影响因所考虑的两个变量而异。特别是,能量调整的效果会有很大差异。对于维生素C,能量调整的影响似乎可以忽略不计,而对于脂肪,影响很大,这表明接近1的误差相关性可以部分消除测量误差导致的回归稀释。如果对于脂肪摄入量,通过使用能量密度进行能量调整,则在以大幅降低真实方差为代价的情况下实现回归稀释的部分消除。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验