Suissa S, Blais L
Department of Epidemiology and Biostatistics, McGill University, Montreal, Quebec, Canada.
Stat Med. 1995 Feb 15;14(3):247-55. doi: 10.1002/sim.4780140303.
Clinical research often involves continuous outcome measures, such as blood cholesterol, that are amenable to statistical techniques of analysis based on the mean, such as the t-test or multiple linear regression. Clinical interest, however, frequently focuses on the proportion of subjects who fall below or above a clinically relevant cut-off value, as a measure of the risk of disease. The customary approach to analyse such data is to dichotomize the continuous outcome measure and use statistical techniques based on binary data and the binomial distribution. In this paper, we use a parametric approach and the framework of generalized linear models to fit various regression models, including the logistic, on the basis of the original continuous outcome. We consider the Gaussian and the three-parameter log-normal distributions for the continuous outcome, assessing both precision and bias under various conditions. In simulation analyses, we find that we are unable to fit some of the samples with the 'dichotomous' approach, but we can with the 'continuous' approach, and that the latter yields estimates between 25 and 85 per cent more efficient than the former. We illustrate the method, programmed using GLIM macros, with data from clinical studies of the risk of hypoxaemia during open thoracic surgery and the risk of nocturnal hypoglycaemia among diabetic children.
临床研究常常涉及连续的结局指标,如血液胆固醇水平,这些指标适用于基于均值的统计分析技术,如t检验或多元线性回归。然而,临床关注的焦点通常是低于或高于临床相关临界值的受试者比例,以此作为疾病风险的一种度量。分析此类数据的传统方法是将连续结局指标进行二分,并使用基于二元数据和二项分布的统计技术。在本文中,我们采用参数方法和广义线性模型框架,基于原始的连续结局来拟合各种回归模型,包括逻辑回归模型。我们考虑连续结局的高斯分布和三参数对数正态分布,评估在各种条件下的精度和偏差。在模拟分析中,我们发现对于某些样本,“二分法”无法进行拟合,但“连续法”可以,并且后者产生的估计值比前者效率高25%至85%。我们使用GLIM宏编程展示了该方法,并结合了开胸手术期间低氧血症风险和糖尿病儿童夜间低血糖风险的临床研究数据。