Shepherd Bryan E, Blevins Peratikos Meridith, Rebeiro Peter F, Duda Stephany N, McGowan Catherine C
Am J Epidemiol. 2017 Aug 15;186(4):387-392. doi: 10.1093/aje/kwx066.
Reproducible research is important for assessing the integrity of findings and disseminating methods, but it requires making original study data sets publicly available. This requirement is difficult to meet in settings with sensitive data, which can mean that resulting studies are not reproducible. For studies in which data cannot be shared, we propose a pragmatic approach to make research quasi-reproducible. On a publicly available website without restriction, researchers should post 1) analysis code used in the published study, 2) simulated data, and 3) results obtained by applying the analysis code used in the published study to the simulated data. Although it is not a perfect solution, such an approach makes analyses transparent for critical evaluation and dissemination and is therefore a significant improvement over current practice.
可重复性研究对于评估研究结果的完整性和传播研究方法很重要,但它要求公开原始研究数据集。在处理敏感数据的情况下,这一要求很难满足,这可能意味着最终的研究无法被重复。对于数据无法共享的研究,我们提出一种务实的方法,使研究具有准可重复性。在一个无限制的公开网站上,研究人员应发布:1)已发表研究中使用的分析代码;2)模拟数据;3)将已发表研究中使用的分析代码应用于模拟数据所获得的结果。虽然这不是一个完美的解决方案,但这种方法能使分析过程透明,便于进行批判性评估和传播,因此相对于当前的做法有显著改进。