Kazman Josh B, Scott Jonathan M, Deuster Patricia A
Department of Military and Emergency Medicine, Uniformed Services University of the Health Sciences, Bethesda, MD 20814-4712, USA.
Br J Nutr. 2017 Sep;118(5):383-391. doi: 10.1017/S0007114517002215.
The limitations of self-reported dietary patterns are widely recognised as a major vulnerability of FFQ and of the dietary screeners/scales derived from them. Such instruments can yield inconsistent results that lead to questionable interpretations. The present article discusses the value of psychometric approaches and standards in addressing these drawbacks for instruments used to estimate dietary habits and nutrient intake. We argue that an FFQ or screener that treats diet as a 'latent construct' can be optimised for both internal consistency and the value of the research results. Latent constructs, a foundation for item response theory (IRT)-based scales (e.g. the Patient Reported Outcomes Measurement Information System), are typically introduced in the design stage of an instrument to elicit critical factors that cannot be observed or measured directly. We propose an iterative approach that uses such modelling to refine FFQ and similar instruments. To that end, we illustrate the benefits of psychometric modelling using items and data from a sample of 12 370 Soldiers who completed the 2012 US Army Global Assessment Tool (GAT). We used factor analysis to build a scale incorporating five of the eleven survey items. An IRT-driven assessment of response category properties indicated likely problems in the ordering or wording of several response categories. Group comparisons, examined with differential item functioning (DIF), provided evidence of scale validity across each Army sub-population (sex, service component and officer status). Such an approach holds promise for future FFQ.
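As a rough illustration of what an IRT-based check of response categories can look like, the sketch below computes category probabilities for a single item and reports which categories are ever the most likely response along the latent trait. The abstract does not say which IRT model the authors fitted, so the generalized partial credit model used here is an assumption, and the discrimination and step parameters are made up for illustration rather than taken from the GAT data.

```python
import math


def gpcm_probs(theta, a, steps):
    """Category probabilities for one item under a generalized partial
    credit model (assumed model, not necessarily the one in the paper).

    theta: latent trait level; a: discrimination; steps: step parameters b_1..b_m.
    Returns a list of m + 1 probabilities for categories 0..m.
    """
    # Category k has logit sum_{j<=k} a * (theta - b_j); category 0 has logit 0.
    logits = [0.0]
    for b in steps:
        logits.append(logits[-1] + a * (theta - b))
    top = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(z - top) for z in logits]
    total = sum(weights)
    return [w / total for w in weights]


def modal_categories(a, steps, grid=None):
    """Categories that are the most likely response at some grid point."""
    if grid is None:
        grid = [x / 10 for x in range(-40, 41)]  # theta from -4.0 to 4.0
    modal = set()
    for theta in grid:
        probs = gpcm_probs(theta, a, steps)
        modal.add(probs.index(max(probs)))
    return modal


# Hypothetical step parameters for a four-category frequency item.
ordered_steps = [-1.5, 0.0, 1.5]      # steps increase with the category
disordered_steps = [-1.5, 1.0, 0.2]   # third step lies below the second

print(modal_categories(1.2, ordered_steps))
print(modal_categories(1.2, disordered_steps))
```

With the ordered steps every category is the most likely response over some range of the trait. With the disordered steps category 2 never is, which is the kind of pattern that would prompt a second look at how that response option is worded or where it sits in the ordering.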