a Department of Educational Psychology , The University of Georgia.
b Department of Psychology , The Ohio State University.
Multivariate Behav Res. 2017 Sep-Oct;52(5):533-550. doi: 10.1080/00273171.2017.1329082. Epub 2017 Jun 8.
Confidence intervals (CIs) are fundamental inferential devices which quantify the sampling variability of parameter estimates. In item response theory, CIs have been primarily obtained from large-sample Wald-type approaches based on standard error estimates, derived from the observed or expected information matrix, after parameters have been estimated via maximum likelihood. An alternative approach to constructing CIs is to quantify sampling variability directly from the likelihood function with a technique known as profile-likelihood confidence intervals (PL CIs). In this article, we introduce PL CIs for item response theory models, compare PL CIs to classical large-sample Wald-type CIs, and demonstrate important distinctions among these CIs. CIs are then constructed for parameters directly estimated in the specified model and for transformed parameters which are often obtained post-estimation. Monte Carlo simulation results suggest that PL CIs perform consistently better than Wald-type CIs for both non-transformed and transformed parameters.
置信区间(CIs)是一种基本的推理工具,用于量化参数估计的抽样变异性。在项目反应理论中,CIs 主要是通过基于标准误差估计的大样本 Wald 型方法获得的,这些估计是在通过最大似然法估计参数后,从观测或预期信息矩阵中得出的。构造 CIs 的另一种方法是通过一种称为似然函数轮廓置信区间(PL CIs)的技术,直接从似然函数中量化抽样变异性。在本文中,我们为项目反应理论模型引入了 PL CIs,比较了 PL CIs 与经典的大样本 Wald 型 CIs,并展示了这些 CIs 之间的重要区别。然后,我们为直接在指定模型中估计的参数以及在估计后通常获得的转换参数构建了 CIs。蒙特卡罗模拟结果表明,PL CIs 在非转换和转换参数方面的表现均优于 Wald 型 CIs。