From the aDepartment of Epidemiology, Biostatistics, and Occupational Health, McGill University, Montreal, Canada; bGerald Bronfman Department of Oncology, McGill University, Montreal, Canada; and cInstitute for Health and Social Policy, McGill University, Montreal, Canada.
Epidemiology. 2017 Nov;28(6):817-826. doi: 10.1097/EDE.0000000000000727.
Panel study designs are common in environmental epidemiology, whereby repeated measurements are collected from a panel of subjects to evaluate short-term within-subject changes in response variables over time. In planning such studies, questions of how many subjects to include and how many different exposure conditions to measure are commonly asked at the design stage. In practice, these choices are constrained by budget, logistics, and participant burden and must be carefully balanced against statistical considerations of precision and power. In this article, we provide intuitive sample size formulae for the precision of regression coefficients derived from panel studies and show how they can be applied in planning such studies. We show that there are five determinants of the precision with which regression coefficients can be estimated: (1) the residual variance of the responses; (2) the variance of the slopes; (3) the number of subjects; (4) the number of measurements/subject; and (5) the within-subject range of the exposure values "X" at which the responses are measured. The planning of such studies would be greatly improved if investigators regularly reported all of the variance components in fitted random-effects models: currently, literature values for the relevant variance parameters are often not readily available and must be estimated through pilot studies or subjective estimates of "reasonable values."
面板研究设计在环境流行病学中很常见,通过对一组研究对象进行重复测量,以评估随时间推移的响应变量的短期个体内变化。在规划此类研究时,通常会在设计阶段提出要纳入多少研究对象和测量多少种不同的暴露条件等问题。实际上,这些选择受到预算、物流和参与者负担的限制,并且必须与统计考虑的精度和功效进行仔细平衡。在本文中,我们提供了从面板研究中得出的回归系数的精度的直观样本量公式,并展示了如何在规划此类研究中应用它们。我们表明,回归系数可以估计的精度有五个决定因素:(1)响应的残差方差;(2)斜率的方差;(3)研究对象的数量;(4)每个研究对象的测量次数;以及(5)响应测量时暴露值“X”的个体内范围。如果研究人员定期报告拟合随机效应模型中的所有方差分量,那么此类研究的规划将得到极大改善:目前,相关方差参数的文献值通常不易获得,必须通过初步研究或对“合理值”的主观估计来进行估计。