蛋白质背景信息的整合改善了基于蛋白质的新冠肺炎患者分层。

Integration of protein context improves protein-based COVID-19 patient stratification.

作者信息

Gao Jinlong, He Jiale, Zhang Fangfei, Xiao Qi, Cai Xue, Yi Xiao, Zheng Siqi, Zhang Ying, Wang Donglian, Zhu Guangjun, Wang Jing, Shen Bo, Ralser Markus, Guo Tiannan, Zhu Yi

机构信息

Westlake Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.

Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, Hangzhou, Zhejiang, China.

出版信息

Clin Proteomics. 2022 Aug 11;19(1):31. doi: 10.1186/s12014-022-09370-0.

DOI:10.1186/s12014-022-09370-0

PMID:35953823

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9366758/

Abstract

BACKGROUND

Classification of disease severity is crucial for the management of COVID-19. Several studies have shown that individual proteins can be used to classify the severity of COVID-19. Here, we aimed to investigate whether integrating four types of protein context data, namely, protein complexes, stoichiometric ratios, pathways and network degrees will improve the severity classification of COVID-19.

METHODS

We performed machine learning based on three previously published datasets. The first was a SWATH (sequential window acquisition of all theoretical fragment ion spectra) MS (mass spectrometry) based proteomic dataset. The second was a TMTpro 16plex labeled shotgun proteomics dataset. The third was a SWATH dataset of an independent patient cohort.

RESULTS

Besides twelve proteins, machine learning also prioritized two complexes, one stoichiometric ratio, five pathways, and five network degrees, resulting a 25-feature panel. As a result, a model based on the 25 features led to effective classification of severe cases with an AUC of 0.965, outperforming the models with proteins only. Complement component C9, transthyretin (TTR) and TTR-RBP (transthyretin-retinol binding protein) complex, the stoichiometric ratio of SAA2 (serum amyloid A proteins 2)/YLPM1 (YLP Motif Containing 1), and the network degree of SIRT7 (Sirtuin 7) and A2M (alpha-2-macroglobulin) were highlighted as potential markers by this classifier. This classifier was further validated with a TMT-based proteomic data set from the same cohort (test dataset 1) and an independent SWATH-based proteomic data set from Germany (test dataset 2), reaching an AUC of 0.900 and 0.908, respectively. Machine learning models integrating protein context information achieved higher AUCs than models with only one feature type.

CONCLUSION

Our results show that the integration of protein context including protein complexes, stoichiometric ratios, pathways, network degrees, and proteins improves phenotype prediction.

摘要

背景

疾病严重程度分类对于新冠病毒病（COVID-19）的管理至关重要。多项研究表明，单个蛋白质可用于对COVID-19的严重程度进行分类。在此，我们旨在研究整合四种类型的蛋白质背景数据，即蛋白质复合物、化学计量比、信号通路和网络度，是否会改善COVID-19的严重程度分类。

方法

我们基于三个先前发表的数据集进行机器学习。第一个是基于SWATH（所有理论碎片离子光谱的顺序窗口采集）质谱（MS）的蛋白质组学数据集。第二个是TMTpro 16plex标记的鸟枪法蛋白质组学数据集。第三个是一个独立患者队列的SWATH数据集。

结果

除了12种蛋白质外，机器学习还对两种复合物、一种化学计量比、五条信号通路和五个网络度进行了优先级排序，从而形成了一个包含25个特征的组合。结果，基于这25个特征的模型能够有效分类重症病例，曲线下面积（AUC）为0.965，优于仅使用蛋白质的模型。补体成分C9、转甲状腺素蛋白（TTR）和TTR-RBP（转甲状腺素蛋白-视黄醇结合蛋白）复合物、血清淀粉样蛋白A2（SAA2）/含YLP基序1（YLPM1）的化学计量比，以及沉默调节蛋白7（SIRT7）和α-2-巨球蛋白（A2M）的网络度被该分类器突出显示为潜在标志物。该分类器在来自同一队列的基于TMT的蛋白质组学数据集（测试数据集1）和来自德国的独立的基于SWATH的蛋白质组学数据集（测试数据集2）中得到进一步验证，AUC分别达到0.900和0.908。整合蛋白质背景信息的机器学习模型比仅使用一种特征类型的模型具有更高的AUC。

结论

我们的结果表明，整合包括蛋白质复合物、化学计量比、信号通路、网络度和蛋白质在内的蛋白质背景信息可改善表型预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c394/9367071/c1960d4b2f41/12014_2022_9370_Fig2_HTML.jpg

相似文献

Integration of protein context improves protein-based COVID-19 patient stratification.

Clin Proteomics. 2022 Aug 11;19(1):31. doi: 10.1186/s12014-022-09370-0.

Optimization of Acquisition and Data-Processing Parameters for Improved Proteomic Quantification by Sequential Window Acquisition of All Theoretical Fragment Ion Mass Spectrometry.

J Proteome Res. 2017 Feb 3;16(2):738-747. doi: 10.1021/acs.jproteome.6b00767. Epub 2017 Jan 3.

Application of information theoretic feature selection and machine learning methods for the development of genetic risk prediction models.

Sci Rep. 2021 Dec 2;11(1):23335. doi: 10.1038/s41598-021-00854-x.

Identification of plasma proteins associated with oesophageal cancer chemotherapeutic treatment outcomes using SWATH-MS.

J Proteomics. 2022 Aug 30;266:104684. doi: 10.1016/j.jprot.2022.104684. Epub 2022 Jul 14.

Machine Learning of Plasma Proteomics Classifies Diagnosis of Interstitial Lung Disease.

Am J Respir Crit Care Med. 2024 Aug 15;210(4):444-454. doi: 10.1164/rccm.202309-1692OC.

COVIDpro: Database for mining protein dysregulation in patients with COVID-19.

bioRxiv. 2022 Sep 30:2022.09.27.509819. doi: 10.1101/2022.09.27.509819.

SWATH mass spectrometry as a tool for quantitative profiling of the matrisome.

J Proteomics. 2018 Oct 30;189:11-22. doi: 10.1016/j.jprot.2018.02.026. Epub 2018 Mar 1.

Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.

Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.

A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.

Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

引用本文的文献

Research Trends in Proteomic Studies Using Serum from COVID-19 Patients: A Bibliometric Analysis.

Curr Med Chem. 2025;32(12):2275-2290. doi: 10.2174/0109298673286915240329063441.

A pathway activity-based proteomic classifier stratifies prostate tumors into two subtypes.

Clin Proteomics. 2023 Nov 11;20(1):50. doi: 10.1186/s12014-023-09441-w.

Immunology of SARS-CoV-2 infection and vaccination.

Clin Chim Acta. 2023 May 1;545:117390. doi: 10.1016/j.cca.2023.117390. Epub 2023 May 13.

本文引用的文献

A proteomic survival predictor for COVID-19 patients in intensive care.

PLOS Digit Health. 2022 Jan 18;1(1):e0000007. doi: 10.1371/journal.pdig.0000007. eCollection 2022 Jan.

Complement activation induces excessive T cell cytotoxicity in severe COVID-19.

Cell. 2022 Feb 3;185(3):493-512.e25. doi: 10.1016/j.cell.2021.12.040. Epub 2021 Dec 28.

Next generation plasma proteome profiling of COVID-19 patients with mild to moderate symptoms.

EBioMedicine. 2021 Dec;74:103723. doi: 10.1016/j.ebiom.2021.103723. Epub 2021 Nov 27.

Potential Use of Serum Proteomics for Monitoring COVID-19 Progression to Complement RT-PCR Detection.

J Proteome Res. 2022 Jan 7;21(1):90-100. doi: 10.1021/acs.jproteome.1c00525. Epub 2021 Nov 16.

High-resolution serum proteome trajectories in COVID-19 reveal patient-specific seroconversion.

EMBO Mol Med. 2021 Aug 9;13(8):e14167. doi: 10.15252/emmm.202114167. Epub 2021 Jul 7.

A serum proteome signature to predict mortality in severe COVID-19 patients.

Life Sci Alliance. 2021 Jul 5;4(9). doi: 10.26508/lsa.202101099. Print 2021 Sep.

Combined Metabolic Activators Accelerates Recovery in Mild-to-Moderate COVID-19.

Adv Sci (Weinh). 2021 Sep;8(17):e2101222. doi: 10.1002/advs.202101222. Epub 2021 Jun 28.

Unbiased Analysis of Temporal Changes in Immune Serum Markers in Acute COVID-19 Infection With Emphasis on Organ Failure, Anti-Viral Treatment, and Demographic Characteristics.

Front Immunol. 2021 Jun 11;12:650465. doi: 10.3389/fimmu.2021.650465. eCollection 2021.

A time-resolved proteomic and prognostic map of COVID-19.

Cell Syst. 2021 Aug 18;12(8):780-794.e7. doi: 10.1016/j.cels.2021.05.005. Epub 2021 Jun 14.

SARS-CoV-2 RNAemia and proteomic trajectories inform prognostication in COVID-19 patients admitted to intensive care.

Nat Commun. 2021 Jun 7;12(1):3406. doi: 10.1038/s41467-021-23494-1.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

蛋白质背景信息的整合改善了基于蛋白质的新冠肺炎患者分层。

Integration of protein context improves protein-based COVID-19 patient stratification.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献