Department of Thoracic Surgery, Shanghai Pulmonary Hospital, Tongji University School of Medicine, Shanghai, China.
BMC Pulm Med. 2021 Nov 25;21(1):383. doi: 10.1186/s12890-021-01758-2.
Pulmonary arterial hypertension (PH) secondary to pulmonary fibrosis (PF) is one of the most common complications in PF patients, it causes severe disease and usually have a poor prognosis. Whether the combination of PH and PF is a unique disease phenotype is unclear. We aimed to screen the key modules associated with PH-PF immune infiltration based on WGCNA and identify the hub genes for molecular typing.
Using the gene expression profile GSE24988 of PF patients with or without PH from the Gene Expression Omnibus (GEO) database, we evaluated immune cell infiltration using Cibersortx and immune cell gene signature files. Different immune cell types were screened using the Wilcoxon test; differentially expressed genes were screened using samr. The molecular pathways implicated in these differential responses were identified using Gene Ontology and Kyoto Encyclopedia of Genes and Genomes functional enrichment analyses. A weighted co-expression network of the differential genes was constructed, relevant co-expression modules were identified, and relationships between modules and differential immune cell infiltration were calculated. The modules most relevant to this disease were identified using weighted correlation network analysis. From these, we constructed a co-expression network; using the STRING database, we integrated the values into the human protein-protein interaction network before constructing a co-expression interaction subnet, screening genes associated with immunity and unsupervised molecular typing, and analyzing the immune cell infiltration and expression of key genes in each disease type.
Of the 22 immune cell types from the PF GEO data, 20 different immune cell types were identified. There were 1622 differentially expressed genes (295 upregulated and 1327 downregulated). The resulting weighted co-expression network identified six co-expression modules. These were screened to identify the modules most relevant to the disease phenotype (the green module). By calculating the correlations between modules and the differentially infiltrated immune cells, extracting the green module co-expression network (46 genes), extracting 25 key genes using gene significance and module-membership thresholds, and combining these with the 10 key genes in the human protein-protein interaction network, we identified five immune cell-related marker genes that might be applied as biomarkers. Using these marker genes, we evaluated these disease samples using unsupervised clustering molecular typing.
Our results demonstrated that all PF combined with PH samples belonged to four categories. Studies on the five key genes are required to validate their diagnostic and prognostic value.
肺纤维化(PF)继发肺动脉高压(PH)是 PF 患者最常见的并发症之一,它导致严重的疾病,通常预后不良。PH 与 PF 的联合是否是一种独特的疾病表型尚不清楚。我们旨在基于 WGCNA 筛选与 PH-PF 免疫浸润相关的关键模块,并鉴定用于分子分型的枢纽基因。
使用基因表达谱 GSE24988 来自基因表达综合数据库(GEO)数据库中 PF 患者是否伴有 PH,我们使用 Cibersortx 和免疫细胞基因特征文件评估免疫细胞浸润。使用 Wilcoxon 检验筛选不同的免疫细胞类型;使用 samr 筛选差异表达基因。使用基因本体论和京都基因与基因组百科全书功能富集分析鉴定涉及这些差异反应的分子途径。构建差异基因的加权共表达网络,鉴定相关的共表达模块,并计算模块与差异免疫细胞浸润之间的关系。使用加权相关网络分析确定与该疾病最相关的模块。从这些模块中,我们构建了一个共表达网络;使用 STRING 数据库,我们将值整合到人类蛋白质-蛋白质相互作用网络中,然后构建共表达交互子网,筛选与免疫相关的基因和无监督分子分型,并分析每种疾病类型的关键基因的免疫细胞浸润和表达。
从 PF GEO 数据中的 22 种免疫细胞类型中,鉴定出 20 种不同的免疫细胞类型。有 1622 个差异表达基因(295 个上调和 1327 个下调)。所得加权共表达网络鉴定出六个共表达模块。筛选这些模块以确定与疾病表型最相关的模块(绿色模块)。通过计算模块与差异浸润免疫细胞之间的相关性,提取绿色模块共表达网络(46 个基因),使用基因显著性和模块成员阈值提取 25 个关键基因,并将这些基因与人类蛋白质-蛋白质相互作用网络中的 10 个关键基因结合起来,我们确定了五个可能作为生物标志物应用的免疫细胞相关标记基因。使用这些标记基因,我们使用无监督聚类分子分型评估这些疾病样本。
我们的结果表明,所有 PF 合并 PH 样本均属于四类。需要对这五个关键基因进行研究,以验证其诊断和预后价值。