Liu Zhiyang, Meng Mei, Ding ShiJian, Zhou XiaoChao, Feng KaiYan, Huang Tao, Cai Yu-Dong
School of Life Sciences, Changchun Sci-Tech University, Changchun, China.
State Key Laboratory of Oncogenes and Related Genes, Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Front Microbiol. 2022 Sep 23;13:1007295. doi: 10.3389/fmicb.2022.1007295. eCollection 2022.
Patients infected with SARS-CoV-2 at various severities have different clinical manifestations and treatments. Mild or moderate patients usually recover with conventional medical treatment, but severe patients require prompt professional treatment. Thus, stratifying infected patients for targeted treatment is meaningful. A computational workflow was designed in this study to identify key blood methylation features and rules that can distinguish the severity of SARS-CoV-2 infection. First, the methylation features in the expression profile were deeply analyzed by a Monte Carlo feature selection method. A feature list was generated. Next, this ranked feature list was fed into the incremental feature selection method to determine the optimal features for different classification algorithms, thereby further building optimal classifiers. These selected key features were analyzed by functional enrichment to detect their biofunctional information. Furthermore, a set of rules were set up by a white-box algorithm, decision tree, to uncover different methylation patterns on various severity of SARS-CoV-2 infection. Some genes (PARP9, MX1, IRF7), corresponding to essential methylation sites, and rules were validated by published academic literature. Overall, this study contributes to revealing potential expression features and provides a reference for patient stratification. The physicians can prioritize and allocate health and medical resources for COVID-19 patients based on their predicted severe clinical outcomes.
感染不同严重程度新冠病毒的患者有不同的临床表现和治疗方法。轻症或中症患者通常通过常规治疗康复,但重症患者需要及时的专业治疗。因此,对感染患者进行分层以进行针对性治疗是有意义的。本研究设计了一种计算工作流程,以识别可区分新冠病毒感染严重程度的关键血液甲基化特征和规则。首先,通过蒙特卡洛特征选择方法深入分析表达谱中的甲基化特征,生成一个特征列表。接下来,将这个排名后的特征列表输入到增量特征选择方法中,以确定不同分类算法的最优特征,从而进一步构建最优分类器。通过功能富集分析这些选定的关键特征,以检测它们的生物功能信息。此外,通过白盒算法决策树建立了一组规则,以揭示新冠病毒感染不同严重程度下的不同甲基化模式。一些与关键甲基化位点相对应的基因(PARP9、MX1、IRF7)和规则通过已发表的学术文献得到验证。总体而言,本研究有助于揭示潜在的表达特征,并为患者分层提供参考。医生可以根据预测的严重临床结果,为新冠患者优先安排和分配健康和医疗资源。