Am J Epidemiol. 2022 Jun 27;191(7):1283-1289. doi: 10.1093/aje/kwab264.
In this paper, we consider methods for generating draws of a binary random variable whose expectation conditional on covariates follows a logistic regression model with known covariate coefficients. We examine approximations for finding a "balancing intercept," that is, a value for the intercept of the logistic model that leads to a desired marginal expectation for the binary random variable. We show that a recently proposed analytical approximation can produce inaccurate results, especially when targeting more extreme marginal expectations or when the linear predictor of the regression model has high variance. We then formulate the balancing intercept as a solution to an integral equation, implement a numerical approximation for solving the equation based on Monte Carlo methods, and show that the approximation works well in practice. Our approach to the basic problem of the balancing intercept provides an example of a broadly applicable strategy for formulating and solving problems that arise in the design of simulation studies used to evaluate or teach epidemiologic methods.
在本文中,我们考虑了生成二项随机变量的样本的方法,其条件期望基于已知协变量系数的逻辑回归模型。我们研究了寻找“平衡截距”的近似方法,即逻辑模型的截距值,使得二项随机变量的边际期望达到所需的值。我们表明,最近提出的解析近似方法可能会产生不准确的结果,特别是当目标是更极端的边际期望或回归模型的线性预测器具有较高方差时。然后,我们将平衡截距表示为积分方程的解,基于蒙特卡罗方法实现了求解该方程的数值逼近,并表明该逼近在实践中效果良好。我们对平衡截距的基本问题的处理为制定和解决在用于评估或教授流行病学方法的模拟研究设计中出现的问题提供了一种广泛适用的策略。