Kowalski J, Tu X M, Jia G, Perlis M, Frank E, Crits-Christoph P, Kupfer D J
Division of Oncology Biostatistics, Johns Hopkins University, U.S.A.
Stat Med. 2003 Feb 28;22(4):595-610. doi: 10.1002/sim.1332.
The lack of control over covariates in practice motivates the need for their adjustment when measuring the degree of association between two sets of variables, for which canonical correlation is traditionally used. In most studies however, there is also a lack of control over the attributes of responses for the sets of variables of interest. In particular, a portion of the response variable may be continuous and the other discrete. For such settings, the traditional partial canonical correlation approach is restrictive, since a covariate-adjustment for a set of continuous variables is assumed. By ignoring the assumption of continuous variates and proceeding with a partial canonical correlation analysis in the presence of continuous and discrete variates, results in canonical correlation estimates that are not consistent. In this paper we generalize the traditional partial canonical correlation approach to covariate-adjustment by allowing the response variables to contain continuous, as well as discrete, variates. The methodology is illustrated with a psychiatric application for examining which sleep variables relate to which depressive symptoms, as measured by commonly used constructs that presents with both continuous and discrete outcomes.
在实际操作中,由于缺乏对协变量的控制,因此在测量两组变量之间的关联程度时需要对其进行调整,传统上使用典型相关分析来实现这一点。然而,在大多数研究中,对于感兴趣的变量集的响应属性也缺乏控制。特别是,响应变量的一部分可能是连续的,而另一部分是离散的。对于这种情况,传统的偏典型相关方法具有局限性,因为它假定对一组连续变量进行协变量调整。通过忽略连续变量的假设,并在存在连续和离散变量的情况下进行偏典型相关分析,会导致典型相关估计不一致。在本文中,我们通过允许响应变量包含连续变量和离散变量,将传统的偏典型相关方法推广到协变量调整。我们通过一个精神病学应用实例来说明该方法,该实例用于研究哪些睡眠变量与哪些抑郁症状相关,这些症状通过同时呈现连续和离散结果的常用指标来衡量。