Ghosh Debajyoti, Ding Lili, Bernstein Jonathan A, Mersha Tesfaye B
Immunology and Allergy, Department of Internal Medicine, University of Cincinnati, OH.
Division of Biostatistics and Epidemiology, Cincinnati Children's Hospital Medical Center, Department of Pediatrics, University of Cincinnati, OH.
G3 (Bethesda). 2020 Nov 5;10(11):4049-4062. doi: 10.1534/g3.120.401718.
An integrative analysis focused on multi-tissue transcriptomics has not been done for asthma. Tissue-specific DEGs remain undetected in many multi-tissue analyses, which influences identification of disease-relevant pathways and potential drug candidates. Transcriptome data from 609 cases and 196 controls, generated using airway epithelium, bronchial, nasal, airway macrophages, distal lung fibroblasts, proximal lung fibroblasts, CD4+ lymphocytes, CD8+ lymphocytes from whole blood and induced sputum samples, were retrieved from Gene Expression Omnibus (GEO). Differentially regulated asthma-relevant genes identified from each sample type were used to identify (a) tissue-specific and tissue-shared asthma pathways, (b) their connection to GWAS-identified disease genes to identify candidate tissue for functional studies, (c) to select surrogate sample for invasive tissues, and finally (d) to identify potential drug candidates connectivity map analysis. We found that inter-tissue similarity in gene expression was more pronounced at pathway/functional level than at gene level with highest similarity between bronchial epithelial cells and lung fibroblasts, and lowest between airway epithelium and whole blood samples. Although public-domain gene expression data are limited by inadequately annotated per-sample demographic and clinical information which limited the analysis, our tissue-resolved analysis clearly demonstrated relative importance of unique and shared asthma pathways, At the pathway level, IL-1b signaling and ERK signaling were significant in many tissue types, while Insulin-like growth factor and TGF-beta signaling were relevant in only airway epithelial tissue. IL-12 (in macrophages) and Immunoglobulin signaling (in lymphocytes) and chemokines (in nasal epithelium) were the highest expressed pathways. Overall, the IL-1 signaling genes (inflammatory) were relevant in the airway compartment, while pro-Th2 genes including IL-13 and STAT6 were more relevant in fibroblasts, lymphocytes, macrophages and bronchial biopsies. These genes were also associated with asthma in the GWAS catalog. Support Vector Machine showed that DEGs based on macrophages and epithelial cells have the highest and lowest discriminatory accuracy, respectively. Drug (entinostat, BMS-345541) and genetic perturbagens (KLF6, BCL10, INFB1 and BAMBI) negatively connected to disease at multi-tissue level could potentially repurposed for treating asthma. Collectively, our study indicates that the DEGs, perturbagens and disease are connected differentially depending on tissue/cell types. While most of the existing literature describes asthma transcriptome data from individual sample types, the present work demonstrates the utility of multi-tissue transcriptome data. Future studies should focus on collecting transcriptomic data from multiple tissues, age and race groups, genetic background, disease subtypes and on the availability of better-annotated data in the public domain.
尚未针对哮喘进行聚焦于多组织转录组学的综合分析。在许多多组织分析中,组织特异性差异表达基因(DEGs)仍未被检测到,这影响了疾病相关通路和潜在药物候选物的识别。从基因表达综合数据库(GEO)中检索了来自609例病例和196例对照的转录组数据,这些数据使用气道上皮、支气管、鼻腔、气道巨噬细胞、远端肺成纤维细胞、近端肺成纤维细胞、全血中的CD4 +淋巴细胞、CD8 +淋巴细胞以及诱导痰样本生成。从每种样本类型中鉴定出的差异调节的哮喘相关基因用于识别:(a)组织特异性和组织共享的哮喘通路;(b)它们与全基因组关联研究(GWAS)鉴定的疾病基因的联系,以识别功能研究的候选组织;(c)选择侵入性组织的替代样本;最后(d)通过连通性图谱分析识别潜在的药物候选物。我们发现,基因表达的组织间相似性在通路/功能水平比在基因水平更明显,支气管上皮细胞和肺成纤维细胞之间的相似性最高,气道上皮和全血样本之间的相似性最低。尽管公共领域的基因表达数据受到每个样本人口统计学和临床信息注释不足的限制,这限制了分析,但我们的组织解析分析清楚地证明了独特和共享的哮喘通路的相对重要性。在通路水平,IL-1β信号传导和ERK信号传导在许多组织类型中很重要,而胰岛素样生长因子和TGF-β信号传导仅在气道上皮组织中相关。IL-12(在巨噬细胞中)、免疫球蛋白信号传导(在淋巴细胞中)和趋化因子(在鼻上皮中)是表达最高的通路。总体而言,IL-1信号传导基因(炎症相关)在气道区室中相关,而包括IL-13和STAT6在内的促Th2基因在成纤维细胞、淋巴细胞、巨噬细胞和支气管活检中更相关。这些基因在GWAS目录中也与哮喘相关。支持向量机显示,基于巨噬细胞和上皮细胞的DEGs分别具有最高和最低的鉴别准确性。在多组织水平与疾病呈负相关的药物(恩替诺特, BMS-345541)和基因干扰物(KLF6、BCL10、INFB1和BAMBI)可能有治疗哮喘的新用途。总体而言,我们的研究表明,DEGs、干扰物和疾病根据组织/细胞类型的不同而有差异地联系。虽然大多数现有文献描述了来自单个样本类型的哮喘转录组数据,但本研究证明了多组织转录组数据的实用性。未来的研究应集中于从多个组织、年龄和种族群体、遗传背景、疾病亚型收集转录组数据,以及公共领域中注释更好的数据的可用性。