Hirji KF, Elashoff RM, Moore DH, Bennett DE
Department of Biomathematics, UCLA School of Medicine 90024-1766.
Stat Med. 1988 Jul;7(7):765-72. doi: 10.1002/sim.4780070706.
We compare exact and asymptotic methods for variable selection in matched case-control studies. Data from a study of melanoma among the employees of the Lawrence Livermore National Laboratory illustrate the comparisons. Relative to large sample methods, the exact method almost always yielded larger p-values. The differences in p-values became more pronounced with inclusion of more variables in the logistic model. Thus, when the sample size is not large, and there are many covariates under study, use of the exact method tends to select more parsimonious models and avoids overfit of the data.
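The gap between exact and asymptotic p-values described in the abstract can be seen in the simplest special case: 1:1 matched pairs with a single binary exposure. There the exact conditional test reduces to a binomial test on the discordant pairs, and the large-sample score test reduces to McNemar's chi-square statistic (shown here without continuity correction). The sketch below is an illustration under those assumptions, not the multivariable procedure used in the paper; the function names and the pair counts are hypothetical and are not taken from the Lawrence Livermore data.

```python
from math import comb, erfc, sqrt


def exact_p(b: int, c: int) -> float:
    """Two-sided exact p-value for 1:1 matched pairs.

    b = discordant pairs in which only the case is exposed,
    c = discordant pairs in which only the control is exposed.
    Under the null hypothesis, b ~ Binomial(b + c, 0.5).
    """
    n = b + c
    if n == 0:
        return 1.0  # no discordant pairs, no evidence either way
    k = min(b, c)
    # Lower-tail probability P(X <= k) for X ~ Binomial(n, 0.5).
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    # Double the smaller tail, capped at 1.
    return min(1.0, 2.0 * tail)


def asymptotic_p(b: int, c: int) -> float:
    """Large-sample p-value from McNemar's chi-square (1 df, uncorrected)."""
    n = b + c
    if n == 0:
        return 1.0
    chi2 = (b - c) ** 2 / n
    # Survival function of a chi-square with 1 df: P(X > x) = erfc(sqrt(x / 2)).
    return erfc(sqrt(chi2 / 2.0))


if __name__ == "__main__":
    b, c = 8, 2  # hypothetical discordant-pair counts
    print(f"exact p      = {exact_p(b, c):.4f}")
    print(f"asymptotic p = {asymptotic_p(b, c):.4f}")
```

With these made-up counts the exact p-value is about 0.109 and the asymptotic one about 0.058, so a selection rule with a fixed entry threshold between the two would admit the variable under the large-sample test and reject it under the exact test.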