Kolassa J E, Tanner M A
Department of Biostatistics, University of Rochester, New York 14642, USA.
Biometrics. 1999 Mar;55(1):246-51. doi: 10.1111/j.0006-341x.1999.00246.x.
This article presents an algorithm for approximate frequentist conditional inference on two or more parameters for any regression model in the Generalized Linear Model (GLIM) family. We thereby extend highly accurate inference beyond the cases of logistic regression and contingency tables implimented in commercially available software. The method makes use of the double saddlepoint approximations of Skovgaard (1987, Journal of Applied Probability 24, 875-887) and Jensen (1992, Biometrika 79, 693-703) to the conditional cumulative distribution function of a sufficient statistic given the remaining sufficient statistics. This approximation is then used in conjunction with noniterative Monte Carlo methods to generate a sample from a distribution that approximates the joint distribution of the sufficient statistics associated with the parameters of interest conditional on the observed values of the sufficient statistics associated with the nuisance parameters. This algorithm is an alternate approach to that presented by Kolassa and Tanner (1994, Journal of the American Statistical Association 89, 697-702), in which a Markov chain is generated whose equilibrium distribution under certain regularity conditions approximates the joint distribution of interest. In Kolassa and Tanner (1994), the Gibbs sampler was used in conjunction with these univariate conditional distribution function approximations. The method of this paper does not require the construction and simulation of a Markov chain, thus avoiding the need to develop regularity conditions under which the algorithm converges and the need for the data analyst to check convergence of the particular chain. Examples involving logistic and truncated Poisson regression are presented.
本文提出了一种算法,用于对广义线性模型(GLIM)族中任何回归模型的两个或多个参数进行近似频率主义条件推断。由此,我们将高精度推断扩展到了商业软件中实现的逻辑回归和列联表之外的情况。该方法利用了斯科夫加德(1987年,《应用概率杂志》24卷,875 - 887页)和延森(1992年,《生物统计学》79卷,693 - 703页)对给定其余充分统计量时充分统计量的条件累积分布函数的双鞍点近似。然后,将这种近似与非迭代蒙特卡罗方法结合使用,从一个分布中生成样本,该分布近似于在与讨厌参数相关的充分统计量的观测值条件下,与感兴趣参数相关的充分统计量的联合分布。该算法是对科拉萨和坦纳(1994年,《美国统计协会杂志》89卷,697 - 702页)所提出方法的一种替代方法,在他们的方法中生成了一个马尔可夫链,在某些正则条件下其平衡分布近似于感兴趣的联合分布。在科拉萨和坦纳(1994年)的方法中,吉布斯采样器与这些单变量条件分布函数近似结合使用。本文的方法不需要构建和模拟马尔可夫链,从而避免了开发算法收敛所需的正则条件以及数据分析师检查特定链收敛性的需求。文中给出了涉及逻辑回归和截断泊松回归的示例。