Mondol Momenul Haque, Karim Mohammad Ehsanul
School of Population and Public Health, University of British Columbia, Vancouver, Canada.
Department of Statistics, University of Barishal, Barishal, Bangladesh.
Am J Epidemiol. 2024 Dec 10. doi: 10.1093/aje/kwae447.
Within epidemiological research, estimating treatment effects from observational data presents notable challenges. Targeted Maximum Likelihood Estimation (TMLE) emerges as a robust method, addressing these challenges by accurately modeling treatment effects. This approach uniquely combines the precision of correctly specified models with the versatility of data-adaptive, flexible machine learning algorithms. Despite its effectiveness, TMLE's integration of complex algorithms can introduce bias and under-coverage. This issue is addressed through the Double Cross-fit TMLE (DC-TMLE) approach, enhancing accuracy and reducing biases inherent in observational studies. However, DC-TMLE's potential remains underexplored in epidemiological research, primarily due to the lack of comprehensive methodological guidance and the complexity of its computational implementation. Recognizing this gap, our paper contributes a detailed, reproducible guide for implementing DC-TMLE in R, aimed specifically at epidemiological applications. We demonstrate the utility of this method using an openly available clinical dataset, underscoring its relevance and adaptability for robust epidemiological analysis. This guide aims to facilitate broader adoption of DC-TMLE in epidemiological studies, promoting more accurate and reliable treatment effect estimations in observational research.
在流行病学研究中,从观察性数据估计治疗效果面临着显著挑战。靶向最大似然估计(TMLE)作为一种稳健的方法应运而生,它通过准确地对治疗效果进行建模来应对这些挑战。这种方法独特地将正确设定模型的精度与数据自适应、灵活的机器学习算法的通用性结合起来。尽管TMLE很有效,但它对复杂算法的整合可能会引入偏差和覆盖不足的问题。双交叉拟合TMLE(DC-TMLE)方法解决了这个问题,提高了准确性并减少了观察性研究中固有的偏差。然而,DC-TMLE在流行病学研究中的潜力仍未得到充分探索,主要是由于缺乏全面的方法学指导以及其计算实现的复杂性。认识到这一差距,我们的论文提供了一份在R中实现DC-TMLE的详细、可重现的指南,特别针对流行病学应用。我们使用一个公开可用的临床数据集展示了这种方法的实用性,强调了其在稳健的流行病学分析中的相关性和适应性。本指南旨在促进DC-TMLE在流行病学研究中的更广泛应用,推动观察性研究中更准确、可靠的治疗效果估计。