Clinical Epidemiology and Biostatistics, McMaster University, Hamilton, Canada.
BMC Med Res Methodol. 2011 Aug 22;11:120. doi: 10.1186/1471-2288-11-120.
Infection of the CNS is considered to be the major cause of encephalitis and more than 100 different pathogens have been recognized as causative agents. Despite being identified worldwide as an important public health concern, studies on encephalitis are very few and often focus on particular types (with respect to causative agents) of encephalitis (e.g. West Nile, Japanese, etc.). Moreover, a number of other infectious and non-infectious conditions present with similar symptoms, and distinguishing encephalitis from other disguising conditions continues to a challenging task.
We used canonical correlation analysis (CCA) to assess associations between set of exposure variable and set of symptom and diagnostic variables in human encephalitis. Data consists of 208 confirmed cases of encephalitis from a prospective multicenter study conducted in the United Kingdom. We used a covariance matrix based on Gini's measure of similarity and used permutation based approaches to test significance of canonical variates.
Results show that weak pair-wise correlation exists between the risk factor (exposure and demographic) and symptom/laboratory variables. However, the first canonical variate from CCA revealed strong multivariate correlation (ρ = 0.71, se = 0.03, p = 0.013) between the two sets. We found a moderate correlation (ρ = 0.54, se = 0.02) between the variables in the second canonical variate, however, the value is not statistically significant (p = 0.68). Our results also show that a very small amount of the variation in the symptom sets is explained by the exposure variables. This indicates that host factors, rather than environmental factors might be important towards understanding the etiology of encephalitis and facilitate early diagnosis and treatment of encephalitis patients.
There is no standard laboratory diagnostic strategy for investigation of encephalitis and even experienced physicians are often uncertain about the cause, appropriate therapy and prognosis of encephalitis. Exploration of human encephalitis data using advanced multivariate statistical modelling approaches that can capture the inherent complexity in the data is, therefore, crucial in understanding the causes of human encephalitis. Moreover, application of multivariate exploratory techniques will generate clinically important hypotheses and offer useful insight into the number and nature of variables worthy of further consideration in a confirmatory statistical analysis.
中枢神经系统感染被认为是脑炎的主要原因,已经确认有 100 多种不同的病原体是致病因子。尽管它在世界范围内被认为是一个重要的公共卫生问题,但关于脑炎的研究非常少,而且通常集中在特定类型(针对病原体)的脑炎(例如,西尼罗河、日本乙型脑炎等)。此外,还有许多其他传染性和非传染性疾病也有类似的症状,因此将脑炎与其他伪装疾病区分开来仍然是一项具有挑战性的任务。
我们使用典型相关分析(CCA)来评估一组暴露变量与一组症状和诊断变量之间在人类脑炎中的关联。该数据由在英国进行的一项前瞻性多中心研究中的 208 例确诊脑炎病例组成。我们使用基于基尼相似性度量的协方差矩阵,并使用基于置换的方法来检验典型变量的显著性。
结果表明,危险因素(暴露和人口统计学)与症状/实验室变量之间存在微弱的两两相关性。然而,CCA 的第一个典型变量揭示了两组之间的强多变量相关性(ρ=0.71,se=0.03,p=0.013)。我们发现第二典型变量中的变量之间存在中度相关性(ρ=0.54,se=0.02),但该值在统计学上不显著(p=0.68)。我们的结果还表明,症状组的变化只有很小一部分可以用暴露变量来解释。这表明宿主因素,而不是环境因素,对于理解脑炎的病因以及促进脑炎患者的早期诊断和治疗可能很重要。
目前还没有用于脑炎调查的标准实验室诊断策略,即使是经验丰富的医生也常常不确定脑炎的病因、适当的治疗方法和预后。因此,使用能够捕捉数据内在复杂性的先进多元统计建模方法探索人类脑炎数据对于理解人类脑炎的病因至关重要。此外,应用多元探索性技术将生成具有临床重要性的假设,并为进一步在验证性统计分析中考虑有价值的变量数量和性质提供有用的见解。