Department of Anesthesia, Zhejiang Cancer Hospital, Hangzhou, Zhejiang, China (mainland).
Med Sci Monit. 2020 Feb 7;26:e919035. doi: 10.12659/MSM.919035.
BACKGROUND This study aimed to use three modeling methods, logistic regression analysis, random forest analysis, and fully-connected neural network analysis, to develop a diagnostic gene signature for the diagnosis of ventilator-associated pneumonia (VAP). MATERIAL AND METHODS GSE30385 from the Gene Expression Omnibus (GEO) database identified differentially expressed genes (DEGs) associated with patients with VAP. Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment identified the molecular functions of the DEGs. The least absolute shrinkage and selection operator (LASSO) regression analysis algorithm was used to select key genes. Three modeling methods, including logistic regression analysis, random forest analysis, and fully-connected neural network analysis, also known as also known as the feed-forward multi-layer perceptron (MLP), were used to identify the diagnostic gene signature for patients with VAP. RESULTS Sixty-six DEGs were identified for patients who had VAP (VAP+) and who did not have VAP (VAP-). Ten essential or feature genes were identified. Upregulated genes included matrix metallopeptidase 8 (MMP8), arginase 1 (ARG1), haptoglobin (HP), interleukin 18 receptor 1 (IL18R1), and NLR family apoptosis inhibitory protein (NAIP). Down-regulated genes included complement factor D (CFD), pleckstrin homology-like domain family A member 2 (PHLDA2), plasminogen activator, urokinase (PLAU), laminin subunit beta 3 (LAMB3), and dual-specificity phosphatase 2 (DUSP2). Logistic regression, random forest, and MLP analysis showed receiver operating characteristic (ROC) curve area under the curve (AUC) values of 0.85, 0.86, and 0.87, respectively. CONCLUSIONS Logistic regression analysis, random forest analysis, and MLP analysis identified a ten-gene signature for the diagnosis of VAP.
本研究旨在使用逻辑回归分析、随机森林分析和全连接神经网络分析三种建模方法,为呼吸机相关性肺炎(VAP)的诊断开发一个诊断基因特征。
从基因表达综合数据库(GEO)中鉴定出与 VAP 患者相关的差异表达基因(DEGs)。基因本体论(GO)和京都基因与基因组百科全书(KEGG)通路富集分析鉴定了 DEGs 的分子功能。最小绝对收缩和选择算子(LASSO)回归分析算法用于选择关键基因。逻辑回归分析、随机森林分析和全连接神经网络分析(也称为前馈多层感知器(MLP))这三种建模方法用于识别 VAP 患者的诊断基因特征。
共鉴定出 66 个与患有 VAP(VAP+)和未患有 VAP(VAP-)的患者相关的 DEGs。鉴定出 10 个关键或特征基因。上调基因包括基质金属蛋白酶 8(MMP8)、精氨酸酶 1(ARG1)、触珠蛋白(HP)、白细胞介素 18 受体 1(IL18R1)和 NOD 样受体家族凋亡抑制蛋白(NAIP)。下调基因包括补体因子 D(CFD)、pleckstrin 同源样结构域家族 A 成员 2(PHLDA2)、尿激酶型纤溶酶原激活物(PLAU)、层粘连蛋白亚基β 3(LAMB3)和双特异性磷酸酶 2(DUSP2)。逻辑回归、随机森林和 MLP 分析显示,受试者工作特征(ROC)曲线的曲线下面积(AUC)值分别为 0.85、0.86 和 0.87。
逻辑回归分析、随机森林分析和 MLP 分析鉴定出一个用于诊断 VAP 的十基因特征。