Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands.
Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands.
Eur J Epidemiol. 2021 Sep;36(9):889-898. doi: 10.1007/s10654-021-00794-w. Epub 2021 Aug 15.
Etiological research aims to uncover causal effects, whilst prediction research aims to forecast an outcome with the best accuracy. Causal and prediction research usually require different methods, and yet their findings may get conflated when reported and interpreted. The aim of the current study is to quantify the frequency of conflation between etiological and prediction research, to discuss common underlying mistakes and provide recommendations on how to avoid these. Observational cohort studies published in January 2018 in the top-ranked journals of six distinct medical fields (Cardiology, Clinical Epidemiology, Clinical Neurology, General and Internal Medicine, Nephrology and Surgery) were included for the current scoping review. Data on conflation was extracted through signaling questions. In total, 180 studies were included. Overall, 26% (n = 46) contained conflation between etiology and prediction. The frequency of conflation varied across medical field and journal impact factor. From the causal studies 22% was conflated, mainly due to the selection of covariates based on their ability to predict without taking the causal structure into account. Within prediction studies 38% was conflated, the most frequent reason was a causal interpretation of covariates included in a prediction model. Conflation of etiology and prediction is a common methodological error in observational medical research and more frequent in prediction studies. As this may lead to biased estimations and erroneous conclusions, researchers must be careful when designing, interpreting and disseminating their research to ensure this conflation is avoided.
病因研究旨在揭示因果效应,而预测研究旨在以最佳精度预测结果。因果研究和预测研究通常需要不同的方法,但当报告和解释时,它们的发现可能会混淆。本研究的目的是量化病因学和预测研究之间混淆的频率,讨论常见的潜在错误,并就如何避免这些错误提供建议。本范围综述纳入了 2018 年 1 月在六个不同医学领域(心脏病学、临床流行病学、临床神经病学、普通和内科、肾脏病学和外科学)排名最高的期刊上发表的观察性队列研究。通过信号问题提取混淆数据。共纳入 180 项研究。总体而言,有 26%(n=46)的研究存在病因学和预测之间的混淆。混淆的频率因医学领域和期刊影响因子而异。从因果研究中,有 22%是混淆的,主要是由于选择协变量是基于其预测能力,而没有考虑因果结构。在预测研究中,有 38%是混淆的,最常见的原因是对预测模型中包含的协变量进行因果解释。病因学和预测的混淆是观察性医学研究中常见的方法学错误,在预测研究中更为常见。由于这可能导致估计值偏倚和结论错误,因此研究人员在设计、解释和传播研究结果时必须小心,以确保避免这种混淆。