Department of Mathematical Sciences, US Military Academy, West Point, NY, USA.
Department of Data Science, COS and Associates Ltd., Hong Kong, China.
Am J Clin Nutr. 2020 Feb 1;111(2):256-265. doi: 10.1093/ajcn/nqz196.
Regression to the mean (RTM) is a statistical phenomenon where initial measurements of a variable in a nonrandom sample at the extreme ends of a distribution tend to be closer to the mean upon a second measurement. Unfortunately, failing to account for the effects of RTM can lead to incorrect conclusions on the observed mean difference between the 2 repeated measurements in a nonrandom sample that is preferentially selected for deviating from the population mean of the measured variable in a particular direction. Study designs that are susceptible to misattributing RTM as intervention effects have been prevalent in nutrition and obesity research. This field often conducts secondary analyses of existing intervention data or evaluates intervention effects in those most at risk (i.e., those with observations at the extreme ends of a distribution).
To provide best practices to avoid unsubstantiated conclusions as a result of ignoring RTM in nutrition and obesity research.
We outlined best practices for identifying whether RTM is likely to be leading to biased inferences, using a flowchart that is available as a web-based app at https://dustyturner.shinyapps.io/DecisionTreeMeanRegression/. We also provided multiple methods to quantify the degree of RTM.
Investigators can adjust analyses to include the RTM effect, thereby plausibly removing its biasing influence on estimating the true intervention effect.
The identification of RTM and implementation of proper statistical practices will help advance the field by improving scientific rigor and the accuracy of conclusions. This trial was registered at clinicaltrials.gov as NCT00427193.
回归均值(RTM)是一种统计学现象,即在非随机样本中,分布两端的初始变量测量值在第二次测量时往往更接近平均值。不幸的是,如果未能考虑 RTM 的影响,可能会导致对非随机样本中两次重复测量观察到的均值差异的错误结论,该非随机样本是优先选择的,以便在特定方向上偏离测量变量的总体均值。在营养和肥胖研究中,容易将 RTM 错误归因于干预效果的研究设计一直很流行。该领域经常对现有干预数据进行二次分析,或评估处于高风险人群(即分布两端观察值的人群)的干预效果。
提供最佳实践,以避免因忽略营养和肥胖研究中的 RTM 而得出无根据的结论。
我们使用一个流程图概述了最佳实践,以确定 RTM 是否可能导致有偏差的推断,该流程图可在 https://dustyturner.shinyapps.io/DecisionTreeMeanRegression/ 作为网络应用程序获得。我们还提供了多种量化 RTM 程度的方法。
研究人员可以调整分析以包含 RTM 效应,从而合理地消除其对估计真实干预效应的偏倚影响。
确定 RTM 并实施适当的统计实践将有助于通过提高科学严谨性和结论的准确性来推动该领域的发展。本试验在 clinicaltrials.gov 注册为 NCT00427193。