Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA.
Department of Statistics, University of Washington, Seattle, WA, USA.
BMC Med. 2020 Mar 26;18(1):69. doi: 10.1186/s12916-020-01518-9.
A verbal autopsy (VA) is an interview conducted with the caregivers of someone who has recently died to describe the circumstances of the death. In recent years, several algorithmic methods have been developed to classify cause of death using VA data. The performance of one method-InSilicoVA-was evaluated in a study by Flaxman et al., published in BMC Medicine in 2018. The results of that study are different from those previously published by our group.
Based on the description of methods in the Flaxman et al. study, we attempt to replicate the analysis to understand why the published results differ from those of our previous work.
We failed to reproduce the results published in Flaxman et al. Most of the discrepancies we find likely result from undocumented differences in data pre-processing, and/or values assigned to key parameters governing the behavior of the algorithm.
This finding highlights the importance of making replication code available along with published results. All code necessary to replicate the work described here is freely available on GitHub.
死因推断(VA)是对最近死亡者的照料者进行的访谈,以描述死亡情况。近年来,已经开发出几种算法方法,使用 VA 数据来对死因进行分类。一种名为“InSilicoVA”的方法的性能在 Flaxman 等人于 2018 年发表在《BMC 医学》上的研究中进行了评估。该研究的结果与我们小组之前发表的结果不同。
根据 Flaxman 等人研究中的方法描述,我们尝试复制分析,以了解为什么发表的结果与我们之前的工作不同。
我们未能复制 Flaxman 等人发表的结果。我们发现的大多数差异很可能是由于数据预处理和/或分配给控制算法行为的关键参数的值方面未记录的差异所致。
这一发现强调了随已发表结果一起提供复制代码的重要性。此处描述的工作所需的所有代码都可在 GitHub 上免费获得。