Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA.
Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA; Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka, India; Center for Molecular Medicine, National Institute of Mental Health and Neurosciences, Bangalore, Karnataka, India; Manipal Academy of Higher Education, Manipal, Karnataka, India.
Lancet Digit Health. 2022 Sep;4(9):e632-e645. doi: 10.1016/S2589-7500(22)00112-1. Epub 2022 Jul 11.
COVID-19 is a multi-system disorder with high variability in clinical outcomes among patients who are admitted to hospital. Although some cytokines such as interleukin (IL)-6 are believed to be associated with severity, there are no early biomarkers that can reliably predict patients who are more likely to have adverse outcomes. Thus, it is crucial to discover predictive markers of serious complications.
In this retrospective cohort study, we analysed samples from 455 participants with COVID-19 who had had a positive SARS-CoV-2 RT-PCR result between April 14, 2020, and Dec 1, 2020 and who had visited one of three Mayo Clinic sites in the USA (Minnesota, Arizona, or Florida) in the same period. These participants were assigned to three subgroups depending on disease severity as defined by the WHO ordinal scale of clinical improvement (outpatient, severe, or critical). Our control cohort comprised of 182 anonymised age-matched and sex-matched plasma samples that were available from the Mayo Clinic Biorepository and banked before the COVID-19 pandemic. We did a deep profiling of circulatory cytokines and other proteins, lipids, and metabolites from both cohorts. Most patient samples were collected before, or around the time of, hospital admission, representing ideal samples for predictive biomarker discovery. We used proximity extension assays to quantify cytokines and circulatory proteins and tandem mass spectrometry to measure lipids and metabolites. Biomarker discovery was done by applying an AutoGluon-tabular classifier to a multiomics dataset, producing a stacked ensemble of cutting-edge machine learning algorithms. Global proteomics and glycoproteomics on a subset of patient samples with matched pre-COVID-19 plasma samples was also done.
We quantified 1463 cytokines and circulatory proteins, along with 902 lipids and 1018 metabolites. By developing a machine-learning-based prediction model, a set of 102 biomarkers, which predicted severe and clinical COVID-19 outcomes better than the traditional set of cytokines, were discovered. These predictive biomarkers included several novel cytokines and other proteins, lipids, and metabolites. For example, altered amounts of C-type lectin domain family 6 member A (CLEC6A), ether phosphatidylethanolamine (P-18:1/18:1), and 2-hydroxydecanoate, as reported here, have not previously been associated with severity in COVID-19. Patient samples with matched pre-COVID-19 plasma samples showed similar trends in muti-omics signatures along with differences in glycoproteomics profile.
A multiomic molecular signature in the plasma of patients with COVID-19 before being admitted to hospital can be exploited to predict a more severe course of disease. Machine learning approaches can be applied to highly complex and multidimensional profiling data to reveal novel signatures of clinical use. The absence of validation in an independent cohort remains a major limitation of the study.
Eric and Wendy Schmidt.
COVID-19 是一种多系统疾病,住院患者的临床结局差异很大。虽然某些细胞因子(如白细胞介素[IL]-6)被认为与严重程度有关,但目前尚无可靠预测预后不良风险的早期生物标志物。因此,发现严重并发症的预测标志物至关重要。
在这项回顾性队列研究中,我们分析了 2020 年 4 月 14 日至 12 月 1 日期间 SARS-CoV-2 RT-PCR 结果呈阳性且在同一时期曾访问过美国梅奥诊所(明尼苏达州、亚利桑那州或佛罗里达州)三个地点之一的 455 例 COVID-19 患者的样本。根据世界卫生组织临床改善等级量表(门诊、严重或危急)将这些患者分为三个亚组。我们的对照队列由来自梅奥诊所生物库的 182 份匿名年龄和性别匹配的血浆样本组成,这些样本在 COVID-19 大流行之前就已经存在。我们对两个队列的循环细胞因子和其他蛋白质、脂质和代谢物进行了深度分析。大多数患者样本在入院前或入院时采集,代表了用于预测生物标志物发现的理想样本。我们使用邻近延伸分析法定量检测细胞因子和循环蛋白,并使用串联质谱法测量脂质和代谢物。通过将 AutoGluon-tabular 分类器应用于多组学数据集,我们发现了 102 种生物标志物,这些标志物在预测严重和临床 COVID-19 结局方面优于传统的细胞因子集。这些预测生物标志物包括几种新型细胞因子和其他蛋白质、脂质和代谢物。例如,C 型凝集素结构域家族 6 成员 A(CLEC6A)、醚磷脂酰乙醇胺(P-18:1/18:1)和 2-羟基癸酸的含量变化,如本文所述,以前与 COVID-19 中的严重程度无关。具有匹配的 COVID-19 前血浆样本的患者样本显示出相似的多组学特征趋势,同时在糖蛋白组学特征上存在差异。
住院前 COVID-19 患者血浆中的多组学分子特征可用于预测更严重的疾病过程。机器学习方法可以应用于高度复杂和多维的分析数据,以揭示具有临床应用价值的新特征。在独立队列中未进行验证仍然是该研究的主要局限性。
Eric 和 Wendy Schmidt。