Department of Health Sciences, University of Leicester, Leicester, UK.
Research & Development, GlaxoSmithKline, Stevenage, UK.
Thorax. 2023 Jun;78(6):551-558. doi: 10.1136/thoraxjnl-2021-218563. Epub 2022 May 9.
Considerable clinical heterogeneity in idiopathic pulmonary fibrosis (IPF) suggests the existence of multiple disease endotypes. Identifying these endotypes would improve our understanding of the pathogenesis of IPF and could allow for a biomarker-driven personalised medicine approach. We aimed to identify clinically distinct groups of patients with IPF that could represent distinct disease endotypes.
We co-normalised, pooled and clustered three publicly available blood transcriptomic datasets (total 220 IPF cases). We compared clinical traits across clusters and used gene enrichment analysis to identify biological pathways and processes that were over-represented among the genes that were differentially expressed across clusters. A gene-based classifier was developed and validated using three additional independent datasets (total 194 IPF cases).
We identified three clusters of patients with IPF with statistically significant differences in lung function (p=0.009) and mortality (p=0.009) between groups. Gene enrichment analysis implicated mitochondrial homeostasis, apoptosis, cell cycle and innate and adaptive immunity in the pathogenesis underlying these groups. We developed and validated a 13-gene cluster classifier that predicted mortality in IPF (high-risk clusters vs low-risk cluster: HR 4.25, 95% CI 2.14 to 8.46, p=3.7×10).
We have identified blood gene expression signatures capable of discerning groups of patients with IPF with significant differences in survival. These clusters could be representative of distinct pathophysiological states, which would support the theory of multiple endotypes of IPF. Although more work must be done to confirm the existence of these endotypes, our classifier could be a useful tool in patient stratification and outcome prediction in IPF.
特发性肺纤维化(IPF)存在显著的临床异质性,提示其存在多种疾病表型。明确这些表型将有助于我们深入了解 IPF 的发病机制,并可能实现基于生物标志物的个体化医疗方法。本研究旨在确定具有不同临床特征的 IPF 患者亚群,这些亚群可能代表不同的疾病表型。
我们对三个公开的血液转录组数据集(共 220 例 IPF 病例)进行共归一化、合并和聚类。我们比较了聚类间的临床特征,并通过基因富集分析确定了在聚类间差异表达基因中过度表达的生物学途径和过程。使用另外三个独立数据集(共 194 例 IPF 病例)开发和验证了基于基因的分类器。
我们确定了三个具有不同临床特征的 IPF 患者亚群,这些亚群的肺功能(p=0.009)和死亡率(p=0.009)存在统计学差异。基因富集分析提示线粒体稳态、细胞凋亡、细胞周期以及固有和适应性免疫与这些亚群的发病机制有关。我们开发并验证了一个 13 基因聚类分类器,该分类器可预测 IPF 的死亡率(高危聚类与低危聚类:HR 4.25,95%CI 2.14 至 8.46,p=3.7×10)。
我们已确定了能够区分具有显著生存差异的 IPF 患者亚群的血液基因表达特征。这些聚类可能代表不同的病理生理状态,支持 IPF 存在多种表型的理论。虽然还需要进一步的工作来证实这些表型的存在,但我们的分类器可能是 IPF 患者分层和预后预测的有用工具。