Wang Jiuzhou, Safo Sandra E
Division of Biostatistics, University of Minnesota, MN.
ArXiv. 2021 Nov 18:arXiv:2111.09964v2.
COVID-19 severity is due to complications from SARS-Cov-2 but the clinical course of the infection varies for individuals, emphasizing the need to better understand the disease at the molecular level. We use clinical and multiple molecular data (or views) obtained from patients with and without COVID-19 who were (or not) admitted to the intensive care unit to shed light on COVID-19 severity. Methods for jointly associating the views and separating the COVID-19 groups (i.e., one-step methods) have focused on linear relationships. The relationships between the views and COVID-19 patient groups, however, are too complex to be understood solely by linear methods. Existing nonlinear one-step methods cannot be used to identify signatures to aid in our understanding of the complexity of the disease. We propose Deep IDA (Integrative Discriminant Analysis) to address analytical challenges in our problem of interest. Deep IDA learns nonlinear projections of two or more views that maximally associate the views and separate the classes in each view, and permits feature ranking for interpretable findings. Our applications demonstrate that Deep IDA has competitive classification rates compared to other state-of-the-art methods and is able to identify molecular signatures that facilitate an understanding of COVID-19 severity.
COVID-19的严重程度归因于SARS-CoV-2引发的并发症,但感染的临床病程因个体而异,这凸显了在分子水平上更好地了解该疾病的必要性。我们使用从入住或未入住重症监护病房的COVID-19患者和非COVID-19患者那里获得的临床和多种分子数据(或视图),以阐明COVID-19的严重程度。将这些视图联合起来并区分COVID-19组的方法(即一步法)一直侧重于线性关系。然而,这些视图与COVID-19患者组之间的关系过于复杂,无法仅通过线性方法来理解。现有的非线性一步法无法用于识别有助于我们理解该疾病复杂性的特征。我们提出深度集成判别分析(Deep IDA)来解决我们感兴趣问题中的分析挑战。深度集成判别分析学习两个或更多视图的非线性投影,以使这些视图最大程度地关联起来,并在每个视图中区分不同类别,同时允许进行特征排序以获得可解释的结果。我们的应用表明,与其他现有最先进方法相比,深度集成判别分析具有具有竞争力的分类率,并且能够识别有助于理解COVID-19严重程度的分子特征。