Rosner Bernard, Hendrickson Sara, Willett Walter
Channing Division of Network Medicine, Harvard Medical School, Boston, MA, U.S.A.; Department of Biostatistics, Harvard School of Public Health, Boston, MA, U.S.A.
Stat Med. 2015 Jan 30;34(2):297-306. doi: 10.1002/sim.6327. Epub 2014 Oct 24.
Nutrient intake is often measured with substantial error both in commonly used surrogate instruments such as a food frequency questionnaire (FFQ) and in gold standard-type instruments such as a diet record (DR). If there is a correlated error between the FFQ and DR, then standard measurement error correction methods based on regression calibration can produce biased estimates of the regression coefficient (λ) of true intake on surrogate intake. However, if a biomarker exists and the error in the biomarker is independent of the error in the FFQ and DR, then the method of triads can be used to obtain unbiased estimates of λ, provided that there are replicate biomarker data on at least a subsample of validation study subjects. Because biomarker measurements are expensive, for a fixed budget, one can use a either design where a large number of subjects have one biomarker measure and only a small subsample is replicated or a design that has a smaller number of subjects and has most or all subjects validated. The purpose of this paper is to optimize the proportion of subjects with replicated biomarker measures, where optimization is with respect to minimizing the variance of ln(λ̂). The methodology is illustrated using vitamin C intake data from the European Prospective Investigation into Cancer and Nutrition study where plasma vitamin C is the biomarker. In this example, the optimal validation study design is to have 21% of subjects with replicated biomarker measures.
在常用的替代工具(如食物频率问卷(FFQ))和金标准类型的工具(如饮食记录(DR))中,营养素摄入量的测量往往存在较大误差。如果FFQ和DR之间存在相关误差,那么基于回归校准的标准测量误差校正方法可能会对替代摄入量上的真实摄入量回归系数(λ)产生有偏差的估计。然而,如果存在生物标志物且生物标志物中的误差与FFQ和DR中的误差无关,那么只要在验证研究对象的至少一个子样本上有重复的生物标志物数据,就可以使用三联法来获得λ的无偏估计。由于生物标志物测量成本高昂,对于固定预算,人们可以采用两种设计之一:一种是大量受试者有一次生物标志物测量,只有一小部分子样本进行重复测量;另一种是受试者数量较少,且大多数或所有受试者都经过验证。本文的目的是优化有重复生物标志物测量的受试者比例,这里的优化是针对最小化ln(λ̂)的方差。使用来自欧洲癌症与营养前瞻性调查研究的维生素C摄入量数据(其中血浆维生素C为生物标志物)来说明该方法。在这个例子中,最佳验证研究设计是让21%的受试者有重复的生物标志物测量。