Division of Pulmonary, Allergy and Critical Care Medicine.
Department of Cellular and Molecular Pathology, and.
JCI Insight. 2024 Nov 8;9(21):e180239. doi: 10.1172/jci.insight.180239.
Transcriptomic analyses have advanced the understanding of complex disease pathophysiology including chronic obstructive pulmonary disease (COPD). However, identifying relevant biologic causative factors has been limited by the integration of high dimensionality data. COPD is characterized by lung destruction and inflammation, with smoke exposure being a major risk factor. To define previously unknown biological mechanisms in COPD, we utilized unsupervised and supervised interpretable machine learning analyses of single-cell RNA-Seq data from the mouse smoke-exposure model to identify significant latent factors (context-specific coexpression modules) impacting pathophysiology. The machine learning transcriptomic signatures coupled to protein networks uncovered a reduction in network complexity and new biological alterations in actin-associated gelsolin (GSN), which was transcriptionally linked to disease state. GSN was altered in airway epithelial cells in the mouse model and in human COPD. GSN was increased in plasma from patients with COPD, and smoke exposure resulted in enhanced GSN release from airway cells from patients with COPD. This method provides insights into rewiring of transcriptional networks that are associated with COPD pathogenesis and provides a translational analytical platform for other diseases.
转录组分析提高了对复杂疾病病理生理学的认识,包括慢性阻塞性肺疾病(COPD)。然而,由于高维度数据的整合,确定相关的生物学因果因素受到了限制。COPD 的特征是肺破坏和炎症,吸烟是主要的危险因素。为了确定 COPD 中以前未知的生物学机制,我们利用无监督和有监督的可解释机器学习分析来自小鼠吸烟暴露模型的单细胞 RNA-Seq 数据,以识别影响病理生理学的显著潜在因素(特定于上下文的共表达模块)。与蛋白质网络耦合的机器学习转录组特征揭示了网络复杂性的降低和与疾病状态相关的肌动蛋白相关凝胶蛋白(GSN)的新生物学改变。在小鼠模型中的气道上皮细胞和人类 COPD 中改变了 GSN。COPD 患者的血浆中增加了 GSN,吸烟暴露导致 COPD 患者气道细胞中 GSN 的释放增强。这种方法提供了与 COPD 发病机制相关的转录网络重布线的见解,并为其他疾病提供了转化分析平台。