D'Angelo Gina, Ran Di
Oncology Statistical Innovation, AstraZeneca, Gaithersburg, Maryland, USA.
Pharm Stat. 2025 Jan-Feb;24(1):e2422. doi: 10.1002/pst.2422. Epub 2024 Aug 6.
Preclinical studies are broad and can encompass cellular research, animal trials, and small human trials. Preclinical studies tend to be exploratory and have smaller datasets that often consist of biomarker data. Logistic regression is typically the model of choice for modeling a binary outcome with explanatory variables such as genetic, imaging, and clinical data. Small preclinical studies can have challenging data that may include a complete separation or quasi-complete separation issue that will result in logistic regression inflated coefficient estimates and standard errors. Penalized regression approaches such as Firth's logistic regression are a solution to reduce the bias in the estimates. In this tutorial, a number of examples with separation (complete or quasi-complete) are illustrated and the results from both logistic regression and Firth's logistic regression are compared to demonstrate the inflated estimates from the standard logistic regression model and bias-reduction of the estimates from the penalized Firth's approach. R code and datasets are provided in the supplement.
临床前研究范围广泛,可涵盖细胞研究、动物试验和小型人体试验。临床前研究往往具有探索性,数据集较小,通常包含生物标志物数据。逻辑回归通常是用于对二元结局与遗传、影像和临床数据等解释变量进行建模的首选模型。小型临床前研究可能会遇到具有挑战性的数据,其中可能包括完全分离或准完全分离问题,这将导致逻辑回归系数估计值和标准误膨胀。诸如费思逻辑回归等惩罚回归方法是减少估计偏差的一种解决方案。在本教程中,展示了一些存在分离(完全或准完全)情况的示例,并比较了逻辑回归和费思逻辑回归的结果,以证明标准逻辑回归模型的估计值膨胀以及惩罚性费思方法估计值的偏差减少。补充材料中提供了R代码和数据集。