Agogo George O, van der Voet Hilko, Van't Veer Pieter, van Eeuwijk Fred A, Boshuizen Hendriek C
Biometris, Wageningen University and Research Centre, Postbus 16, 6700 AA, Wageningen, The Netherlands.
National Institute for Public Health and the Environment, Postbus 1, 3720 BA Bilthoven, The Netherlands.
Biom J. 2016 Jul;58(4):766-82. doi: 10.1002/bimj.201500009. Epub 2016 Mar 22.
Dietary questionnaires are prone to measurement error, which bias the perceived association between dietary intake and risk of disease. Short-term measurements are required to adjust for the bias in the association. For foods that are not consumed daily, the short-term measurements are often characterized by excess zeroes. Via a simulation study, the performance of a two-part calibration model that was developed for a single-replicate study design was assessed by mimicking leafy vegetable intake reports from the multicenter European Prospective Investigation into Cancer and Nutrition (EPIC) study. In part I of the fitted two-part calibration model, a logistic distribution was assumed; in part II, a gamma distribution was assumed. The model was assessed with respect to the magnitude of the correlation between the consumption probability and the consumed amount (hereafter, cross-part correlation), the number and form of covariates in the calibration model, the percentage of zero response values, and the magnitude of the measurement error in the dietary intake. From the simulation study results, transforming the dietary variable in the regression calibration to an appropriate scale was found to be the most important factor for the model performance. Reducing the number of covariates in the model could be beneficial, but was not critical in large-sample studies. The performance was remarkably robust when fitting a one-part rather than a two-part model. The model performance was minimally affected by the cross-part correlation.
饮食调查问卷容易出现测量误差,这会使饮食摄入量与疾病风险之间的感知关联产生偏差。需要进行短期测量以调整这种关联中的偏差。对于并非每日都食用的食物,短期测量往往具有过多的零值特征。通过一项模拟研究,针对单重复研究设计开发的两部分校准模型的性能,通过模拟多中心欧洲癌症与营养前瞻性调查(EPIC)研究中的绿叶蔬菜摄入量报告进行了评估。在所拟合的两部分校准模型的第一部分中,假定为逻辑分布;在第二部分中,假定为伽马分布。该模型在消费概率与消费量之间的相关性大小(以下简称交叉部分相关性)、校准模型中协变量的数量和形式、零响应值的百分比以及饮食摄入量测量误差的大小方面进行了评估。从模拟研究结果来看,在校准回归中将饮食变量转换为适当的尺度被发现是模型性能的最重要因素。减少模型中协变量的数量可能有益,但在大样本研究中并非关键因素。拟合单部分模型而非两部分模型时,性能非常稳健。模型性能受交叉部分相关性的影响最小。