Department of Chemical Engineering, Norwegian University of Science and Technology (NTNU), Trondheim, Norway.
Institute for Lung Research, Universities of Giessen and Marburg Lung Centre, Philipps University Marburg, Marburg, Germany.
Sci Rep. 2021 May 13;11(1):10258. doi: 10.1038/s41598-021-89762-8.
Chronic obstructive pulmonary disease (COPD) kills over three million people worldwide every year. Despite its high global impact, the knowledge about the underlying molecular mechanisms is still limited. In this study, we aimed to extend the available knowledge by identifying a small set of COPD-associated genes. We analysed different publicly available gene expression datasets containing whole lung tissue (WLT) and airway epithelium (AE) samples from over 400 human subjects for differentially expressed genes (DEGs). We reduced the resulting sets of 436 and 663 DEGs using a novel computational approach that utilises a random depth-first search to identify genes which improve the distinction between COPD patients and controls along the first principle component of the data. Our method identified small sets of 10 and 15 genes in the WLT and AE, respectively. These sets of genes significantly (p < 10) distinguish COPD patients from controls with high fidelity. The final sets revealed novel genes like cysteine rich protein 1 (CRIP1) or secretoglobin family 3A member 2 (SCGB3A2) that may underlie fundamental molecular mechanisms of COPD in these tissues.
慢性阻塞性肺疾病(COPD)每年在全球导致超过 300 万人死亡。尽管它在全球的影响很大,但对其潜在的分子机制的了解仍然有限。在这项研究中,我们旨在通过鉴定一小部分与 COPD 相关的基因来扩展现有知识。我们分析了来自 400 多个人类受试者的不同公开的基因表达数据集,这些数据集包含全肺组织(WLT)和气道上皮(AE)样本,以寻找差异表达基因(DEGs)。我们使用一种新的计算方法来减少这两组分别为 436 和 663 个 DEGs,该方法利用随机深度优先搜索来识别那些可以沿着数据的第一主成分改善 COPD 患者和对照组之间区分的基因。我们的方法在 WLT 和 AE 中分别确定了 10 个和 15 个基因的小集合。这些基因集合可以非常准确(p < 10)地将 COPD 患者与对照组区分开来。最终的基因集合揭示了一些新的基因,如富含半胱氨酸蛋白 1(CRIP1)或分泌球蛋白家族 3A 成员 2(SCGB3A2),它们可能是这些组织中 COPD 基本分子机制的基础。