Pickering Chad, Zhou Bo, Xu Gege, Rice Rachel, Ramachandran Prasanna, Huang Hector, Pham Tho D, Schapiro Jeffrey M, Cong Xin, Chakraborty Saborni, Edwards Karlie, Reddy Srinivasa T, Guirgis Faheem, Wang Taia T, Serie Daniel, Lindpaintner Klaus
InterVenn Biosciences, South San Francisco, CA 94080, USA.
Blood Center, Palo Alto, CA 94304, USA.
Viruses. 2022 Mar 7;14(3):553. doi: 10.3390/v14030553.
Glycosylation is the most common form of post-translational modification of proteins, critically affecting their structure and function. Using liquid chromatography and mass spectrometry for high-resolution site-specific quantification of glycopeptides coupled with high-throughput artificial intelligence-powered data processing, we analyzed differential protein glycoisoform distributions of 597 abundant serum glycopeptides and nonglycosylated peptides in 50 individuals who had been seriously ill with COVID-19 and in 22 individuals who had recovered after an asymptomatic course of COVID-19. As additional comparison reference phenotypes, we included 12 individuals with a history of infection with a common cold coronavirus, 16 patients with bacterial sepsis, and 15 healthy subjects without history of coronavirus exposure. We found statistically significant differences, at FDR < 0.05, for normalized abundances of 374 of the 597 peptides and glycopeptides interrogated between symptomatic and asymptomatic COVID-19 patients. Similar statistically significant differences were seen when comparing symptomatic COVID-19 patients to healthy controls (350 differentially abundant peptides and glycopeptides) and common cold coronavirus seropositive subjects (353 differentially abundant peptides and glycopeptides). Among healthy controls and sepsis patients, 326 peptides and glycopeptides were found to be differentially abundant, of which 277 overlapped with biomarkers that showed differential expression between symptomatic COVID-19 cases and healthy controls. Among symptomatic COVID-19 cases and sepsis patients, 101 glycopeptide and peptide biomarkers were found to be statistically significantly abundant. Using both supervised and unsupervised machine learning techniques, we found specific glycoprotein profiles to be strongly predictive of symptomatic COVID-19 infection. LASSO-regularized multivariable logistic regression and K-means clustering yielded accuracies of 100% in an independent test set and of 96% overall, respectively. Our findings are consistent with the interpretation that a majority of glycoprotein modifications observed which are shared among symptomatic COVID-19 and sepsis patients likely represent a generic consequence of a severe systemic immune and inflammatory state. However, there are glycoisoform changes that are specific and particular to severe COVID-19 infection. These may be representative of either COVID-19-specific consequences or susceptibility to or predisposition for a severe course of the disease. Our findings support the potential value of glycoproteomic biomarkers in the biomedical understanding and, potentially, the clinical management of serious acute infectious conditions.
糖基化是蛋白质翻译后修饰最常见的形式,对其结构和功能有至关重要的影响。我们运用液相色谱和质谱技术对糖肽进行高分辨率位点特异性定量,并结合高通量人工智能驱动的数据处理,分析了50例新冠肺炎重症患者和22例新冠肺炎无症状感染康复者体内597种丰富血清糖肽和非糖基化肽的差异蛋白糖异构体分布。作为额外的比较参考表型,我们纳入了12例曾感染普通感冒冠状病毒的个体、16例细菌性败血症患者以及15例无冠状病毒暴露史的健康受试者。我们发现,在FDR<0.05时,597种肽和糖肽的标准化丰度在有症状和无症状新冠肺炎患者之间存在统计学显著差异。将有症状新冠肺炎患者与健康对照(350种差异丰度的肽和糖肽)以及普通感冒冠状病毒血清阳性个体(353种差异丰度的肽和糖肽)进行比较时,也观察到了类似的统计学显著差异。在健康对照和败血症患者中,发现326种肽和糖肽丰度存在差异,其中277种与在有症状新冠肺炎病例和健康对照之间表现出差异表达的生物标志物重叠。在有症状新冠肺炎病例和败血症患者中,发现101种糖肽和肽生物标志物在统计学上显著丰富。使用监督和无监督机器学习技术,我们发现特定的糖蛋白谱对有症状新冠肺炎感染具有很强的预测性。LASSO正则化多变量逻辑回归和K均值聚类在独立测试集中的准确率分别为100%,总体准确率为96%。我们的研究结果支持这样的解释:在有症状新冠肺炎患者和败血症患者中观察到的大多数糖蛋白修饰可能代表严重全身免疫和炎症状态的一般后果。然而,存在特定于严重新冠肺炎感染的糖异构体变化。这些变化可能代表新冠肺炎的特定后果,或者是对该疾病严重病程的易感性或倾向。我们的研究结果支持糖蛋白质组学生物标志物在生物医学理解以及潜在的严重急性感染性疾病临床管理中的潜在价值。