Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA.
Joint CMU-Pitt PhD Program in Computational Biology, Pittsburgh, PA, USA.
Respir Res. 2023 Jan 25;24(1):30. doi: 10.1186/s12931-023-02316-6.
Chronic obstructive pulmonary disease (COPD) varies significantly in symptomatic and physiologic presentation. Identifying disease subtypes from molecular data, collected from easily accessible blood samples, can help stratify patients and guide disease management and treatment.
Blood gene expression measured by RNA-sequencing in the COPDGene Study was analyzed using a network perturbation analysis method. Each COPD sample was compared against a learned reference gene network to determine the part that is deregulated. Gene deregulation values were used to cluster the disease samples.
The discovery set included 617 former smokers from COPDGene. Four distinct gene network subtypes are identified with significant differences in symptoms, exercise capacity and mortality. These clusters do not necessarily correspond with the levels of lung function impairment and are independently validated in two external cohorts: 769 former smokers from COPDGene and 431 former smokers in the Multi-Ethnic Study of Atherosclerosis (MESA). Additionally, we identify several genes that are significantly deregulated across these subtypes, including DSP and GSTM1, which have been previously associated with COPD through genome-wide association study (GWAS).
The identified subtypes differ in mortality and in their clinical and functional characteristics, underlining the need for multi-dimensional assessment potentially supplemented by selected markers of gene expression. The subtypes were consistent across cohorts and could be used for new patient stratification and disease prognosis.
慢性阻塞性肺疾病(COPD)在症状和生理表现上有很大的差异。从易于获得的血液样本中提取分子数据,识别疾病亚型有助于对患者进行分层,并指导疾病管理和治疗。
COPDGene 研究中通过 RNA 测序测量的血液基因表达,采用网络干扰分析方法进行分析。将每个 COPD 样本与学习到的参考基因网络进行比较,以确定失调的部分。使用基因失调值对疾病样本进行聚类。
发现集包括 COPDGene 中的 617 名前吸烟者。通过比较,确定了 4 种不同的基因网络亚型,它们在症状、运动能力和死亡率方面存在显著差异。这些聚类不一定与肺功能损伤程度相对应,并在 COPDGene 中的另外两个外部队列(769 名前吸烟者)和多民族动脉粥样硬化研究(MESA)中的 431 名前吸烟者中得到了独立验证。此外,我们还发现了一些在这些亚型中显著失调的基因,包括 DSP 和 GSTM1,它们之前通过全基因组关联研究(GWAS)与 COPD 相关。
所确定的亚型在死亡率以及临床和功能特征方面存在差异,这强调了需要进行多维评估,可能需要补充选择基因表达标志物。这些亚型在各队列中具有一致性,可以用于新的患者分层和疾病预后。