Suppr超能文献

机器学习方法预测呼吸道病毒感染和症状严重程度的比较分析。

Comparative analysis of machine learning approaches for predicting respiratory virus infection and symptom severity.

机构信息

Department of Management Information Systems, Sivas Cumhuriyet University, Sivas, Turkey.

Department of Computer Engineering, Abdullah Gül University, Kayseri, Turkey.

出版信息

PeerJ. 2023 Jun 30;11:e15552. doi: 10.7717/peerj.15552. eCollection 2023.

Abstract

Respiratory diseases are among the major health problems causing a burden on hospitals. Diagnosis of infection and rapid prediction of severity without time-consuming clinical tests could be beneficial in preventing the spread and progression of the disease, especially in countries where health systems remain incapable. Personalized medicine studies involving statistics and computer technologies could help to address this need. In addition to individual studies, competitions are also held such as Dialogue for Reverse Engineering Assessment and Methods (DREAM) challenge which is a community-driven organization with a mission to research biology, bioinformatics, and biomedicine. One of these competitions was the Respiratory Viral DREAM Challenge, which aimed to develop early predictive biomarkers for respiratory virus infections. These efforts are promising, however, the prediction performance of the computational methods developed for detecting respiratory diseases still has room for improvement. In this study, we focused on improving the performance of predicting the infection and symptom severity of individuals infected with various respiratory viruses using gene expression data collected before and after exposure. The publicly available gene expression dataset in the Gene Expression Omnibus, named GSE73072, containing samples exposed to four respiratory viruses (H1N1, H3N2, human rhinovirus (HRV), and respiratory syncytial virus (RSV)) was used as input data. Various preprocessing methods and machine learning algorithms were implemented and compared to achieve the best prediction performance. The experimental results showed that the proposed approaches obtained a prediction performance of 0.9746 area under the precision-recall curve (AUPRC) for infection (., shedding) prediction (SC-1), 0.9182 AUPRC for symptom class prediction (SC-2), and 0.6733 Pearson correlation for symptom score prediction (SC-3) by outperforming the best leaderboard scores of Respiratory Viral DREAM Challenge (a 4.48% improvement for SC-1, a 13.68% improvement for SC-2, and a 13.98% improvement for SC-3). Additionally, over-representation analysis (ORA), which is a statistical method for objectively determining whether certain genes are more prevalent in pre-defined sets such as pathways, was applied using the most significant genes selected by feature selection methods. The results show that pathways associated with the 'adaptive immune system' and 'immune disease' are strongly linked to pre-infection and symptom development. These findings contribute to our knowledge about predicting respiratory infections and are expected to facilitate the development of future studies that concentrate on predicting not only infections but also the associated symptoms.

摘要

呼吸系统疾病是导致医院负担的主要健康问题之一。无需耗时的临床检测即可对感染进行诊断并快速预测严重程度,这可能有助于预防疾病的传播和恶化,尤其是在卫生系统仍不完善的国家。涉及统计学和计算机技术的个性化医学研究可以帮助满足这一需求。除了个体研究外,还会举办竞赛,例如“反向工程评估与方法对话”(DREAM)挑战赛,这是一个由社区驱动的组织,其使命是研究生物学、生物信息学和生物医学。其中一项竞赛是呼吸道病毒 DREAM 挑战赛,旨在开发针对呼吸道病毒感染的早期预测生物标志物。这些努力是有希望的,但是,用于检测呼吸道疾病的计算方法的预测性能仍有改进的空间。在这项研究中,我们专注于使用暴露前后收集的基因表达数据来提高对各种呼吸道病毒感染个体的感染和症状严重程度的预测性能。使用来自基因表达综合数据库(GEO)的公开基因表达数据集 GSE73072 作为输入数据,该数据集包含了暴露于四种呼吸道病毒(H1N1、H3N2、人类鼻病毒(HRV)和呼吸道合胞病毒(RSV))的样本。实施了各种预处理方法和机器学习算法,并进行了比较,以达到最佳的预测性能。实验结果表明,所提出的方法在感染(脱落)预测(SC-1)方面获得了 0.9746 的精度-召回曲线下面积(AUPRC),在症状类别预测(SC-2)方面获得了 0.9182 的 AUPRC,在症状评分预测(SC-3)方面获得了 0.6733 的 Pearson 相关系数,优于呼吸道病毒 DREAM 挑战赛的最佳排行榜得分(SC-1 提高了 4.48%,SC-2 提高了 13.68%,SC-3 提高了 13.98%)。此外,还应用了过表达分析(ORA),这是一种用于客观确定某些基因是否在途径等预定义集合中更为普遍的统计方法。结果表明,与“适应性免疫系统”和“免疫性疾病”相关的途径与感染前和症状发展密切相关。这些发现有助于我们了解预测呼吸道感染的知识,并有望促进未来不仅关注感染,还关注相关症状的预测的研究的发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba3/10317018/4ef4f8b54984/peerj-11-15552-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验