Breitling Lutz P, Dragomir Anca D, Duan Chongyang, Luta George
Medical Faculty, University of Heidelberg, Heidelberg, Germany.
Department of Oncology, Georgetown University, Washington, DC, USA.
Glob Epidemiol. 2025 Jan 22;9:100186. doi: 10.1016/j.gloepi.2025.100186. eCollection 2025 Jun.
Real-world data are playing an increasingly important role in regulatory decision making. Adequately addressing bias is of paramount importance in this context. Structural representations of bias using directed acyclic graphs (DAGs) provide a unified approach to conceptualize bias, distinguish between different types of bias, and identify ways to address bias. DAG-based data simulation further enhances the scope of this approach. Recently, DAGs have been used to demonstrate how missing eligibility information can compromise emulated target trial analysis, a cutting edge approach to estimate treatment effects using real-world data. The importance of simulation for methodological research has received substantial recognition in the past few years, and others have argued that simulating data based on DAGs can be especially helpful for understanding various epidemiological concepts. In the present work, we present two concrete examples of how simulations based on DAGs can be used to gain insights into issues commonly encountered in real-world analytics, i.e., regression modelling to address confounding bias, and the potential extent of selection bias. Increasing accessibility and extending the simulation algorithms of existing software to include longitudinal and time-to-event data are identified as priorities for further development. With such extensions, simulations based on DAGs would be an even more powerful tool to advance our understanding of the rapidly growing toolbox of real-world analytics.
真实世界数据在监管决策中发挥着越来越重要的作用。在这种情况下,充分解决偏差至关重要。使用有向无环图(DAG)对偏差进行结构化表示,为概念化偏差、区分不同类型的偏差以及确定解决偏差的方法提供了一种统一的方法。基于DAG的数据模拟进一步扩展了这种方法的范围。最近,DAG已被用于证明缺失的资格信息如何影响模拟目标试验分析,这是一种使用真实世界数据估计治疗效果的前沿方法。在过去几年中,模拟对方法学研究的重要性已得到广泛认可,其他人认为基于DAG模拟数据对理解各种流行病学概念特别有帮助。在本工作中,我们给出两个具体例子,说明基于DAG的模拟如何用于深入了解真实世界分析中常见的问题,即用于解决混杂偏差的回归建模以及选择偏差的潜在程度。提高现有软件的可访问性并扩展其模拟算法以纳入纵向数据和事件发生时间数据被确定为进一步发展的优先事项。通过这些扩展,基于DAG的模拟将成为一个更强大的工具,以促进我们对快速增长的真实世界分析工具箱的理解。