Department of Information Systems, University of Haifa, Haifa, Israel.
PLoS One. 2022 Mar 10;17(3):e0264919. doi: 10.1371/journal.pone.0264919. eCollection 2022.
Mechanical ventilation (MV) is a lifesaving therapy used for patients with respiratory failure. Nevertheless, MV is associated with numerous complications and increased mortality. The aim of this study is to define the effects of MV on gene expression of direct and peripheral human tissues.
Classification models were applied to Genotype-Tissue Expression Project (GTEx) gene expression data of six representative tissues-liver, adipose, skin, nerve-tibial, muscle and lung, for performance comparison and feature analysis. We utilized 18 prediction models using the Random Forest (RF), XGBoost (eXtreme Gradient Boosting) decision tree and ANN (Artificial Neural Network) methods to classify ventilation and non-ventilation samples and to compare their prediction performance for the six tissues. In the model comparison, the AUC (area under receiver operating curve), accuracy, precision, recall, and F1 score were used to evaluate the predictive performance of each model. We then conducted feature analysis per each tissue to detect MV marker genes followed by pathway enrichment analysis for these genes.
XGBoost outperformed the other methods and predicted samples had undergone MV with an average accuracy for the six tissues of 0.951 and average AUC of 0.945. The feature analysis detected a combination of MV marker genes per each tested tissue, some common across several tissues. MV marker genes were mainly related to inflammation and fibrosis as well as cell development and movement regulation. The MV marker genes were significantly enriched in inflammatory and viral pathways.
The XGBoost method demonstrated clear enhanced performance and feature analysis compared to the other models. XGBoost was helpful in detecting the tissue-specific marker genes for identifying transcriptomic changes related to MV. Our results show that MV is associated with reduced development and movement in the tissues and higher inflammation and injury not only in direct tissues such as the lungs but also in peripheral tissues and thus should be carefully considered before being implemented.
机械通气(MV)是一种用于呼吸衰竭患者的救生治疗方法。然而,MV 与许多并发症和死亡率增加有关。本研究旨在定义 MV 对直接和外周人体组织基因表达的影响。
我们应用分类模型对六个代表性组织(肝脏、脂肪、皮肤、神经-胫骨、肌肉和肺)的基因型组织表达项目(GTEx)基因表达数据进行分类,以进行性能比较和特征分析。我们使用 18 种预测模型,包括随机森林(RF)、极端梯度提升(XGBoost)决策树和人工神经网络(ANN)方法,对通气和非通气样本进行分类,并比较它们在六种组织中的预测性能。在模型比较中,我们使用 AUC(接收者操作曲线下的面积)、准确性、精度、召回率和 F1 分数来评估每个模型的预测性能。然后,我们对每种组织进行特征分析,以检测 MV 标记基因,然后对这些基因进行通路富集分析。
XGBoost 的表现优于其他方法,对六种组织的平均预测准确率为 0.951,平均 AUC 为 0.945。特征分析检测到每种测试组织的 MV 标记基因组合,其中一些在几个组织中都存在。MV 标记基因主要与炎症和纤维化以及细胞发育和运动调节有关。MV 标记基因在炎症和病毒途径中显著富集。
与其他模型相比,XGBoost 方法表现出明显增强的性能和特征分析。XGBoost 有助于检测组织特异性标记基因,以识别与 MV 相关的转录组变化。我们的结果表明,MV 与组织中发育和运动减少以及炎症和损伤增加有关,不仅在直接组织(如肺)中,而且在周围组织中也是如此,因此在实施之前应仔细考虑。