Department of Drug Discovery Medicine, Kyoto University Graduate School of Medicine, 53 Shogoin Kawahara-Cho, Sakyo-Ku, Kyoto, 606-8507, Japan.
Department of Complex Systems Science, Graduate School of Informatics, Nagoya University, Furo-Cho, Chikusa-Ku, Nagoya, 464-8602, Japan.
Epigenetics Chromatin. 2023 Sep 25;16(1):34. doi: 10.1186/s13072-023-00510-w.
Despite well-documented effects on human health, the action modes of environmental pollutants are incompletely understood. Although transcriptome-based approaches are widely used to predict associations between chemicals and disorders, the molecular cues regulating pollutant-derived gene expression changes remain unclear. Therefore, we developed a data-mining approach, termed "DAR-ChIPEA," to identify transcription factors (TFs) playing pivotal roles in the action modes of pollutants.
Large-scale public ChIP-Seq data (human, n = 15,155; mouse, n = 13,156) were used to predict TFs that are enriched in the pollutant-induced differentially accessible genomic regions (DARs) obtained from epigenome analyses (ATAC-Seq). The resultant pollutant-TF matrices were then cross-referenced to a repository of TF-disorder associations to account for pollutant modes of action. We subsequently evaluated the performance of the proposed method using a chemical perturbation data set to compare the outputs of the DAR-ChIPEA and our previously developed differentially expressed gene (DEG)-ChIPEA methods using pollutant-induced DEGs as input. We then adopted the proposed method to predict disease-associated mechanisms triggered by pollutants.
The proposed approach outperformed other methods using the area under the receiver operating characteristic curve score. The mean score of the proposed DAR-ChIPEA was significantly higher than that of our previously described DEG-ChIPEA (0.7287 vs. 0.7060; Q = 5.278 × 10; two-tailed Wilcoxon rank-sum test). The proposed approach further predicted TF-driven modes of action upon pollutant exposure, indicating that (1) TFs regulating Th1/2 cell homeostasis are integral in the pathophysiology of tributyltin-induced allergic disorders; (2) fine particulates (PM) inhibit the binding of C/EBPs, Rela, and Spi1 to the genome, thereby perturbing normal blood cell differentiation and leading to immune dysfunction; and (3) lead induces fatty liver by disrupting the normal regulation of lipid metabolism by altering hepatic circadian rhythms.
Highlighting genome-wide chromatin change upon pollutant exposure to elucidate the epigenetic landscape of pollutant responses outperformed our previously described method that focuses on gene-adjacent domains only. Our approach has the potential to reveal pivotal TFs that mediate deleterious effects of pollutants, thereby facilitating the development of strategies to mitigate damage from environmental pollution.
尽管环境污染物对人类健康的影响已得到充分证实,但人们对其作用模式仍了解甚少。尽管基于转录组的方法被广泛用于预测化学物质与疾病之间的关联,但调节污染物诱导的基因表达变化的分子线索仍不清楚。因此,我们开发了一种数据挖掘方法,称为“DAR-ChIPEA”,用于识别在污染物作用模式中起关键作用的转录因子(TFs)。
利用大规模公共 ChIP-Seq 数据(人类,n=15155;小鼠,n=13156),预测富集在表观基因组分析(ATAC-Seq)获得的污染物诱导的差异可及基因组区域(DARs)中的 TF。然后,将所得的污染物-TF 矩阵与 TF-疾病关联库交叉引用,以解释污染物的作用模式。我们随后使用化学物质扰动数据集评估了所提出方法的性能,将 DAR-ChIPEA 和我们之前开发的基于污染物诱导差异表达基因(DEG)的 DEG-ChIPEA 方法的输出进行比较,输入为污染物诱导的 DEG。然后,我们采用了所提出的方法来预测污染物引发的疾病相关机制。
使用接收器工作特征曲线下面积(AUC)评分,所提出的方法优于其他方法。所提出的 DAR-ChIPEA 的平均评分显著高于我们之前描述的 DEG-ChIPEA(0.7287 对 0.7060;Q=5.278×10;双侧 Wilcoxon 秩和检验)。该方法进一步预测了污染物暴露后 TF 驱动的作用模式,表明:(1)调节 Th1/2 细胞稳态的 TF 在三丁基锡诱导的过敏疾病的病理生理学中至关重要;(2)细颗粒物(PM)抑制 C/EBPs、Rela 和 Spi1 与基因组的结合,从而扰乱正常的血细胞分化并导致免疫功能障碍;(3)铅通过改变肝的昼夜节律扰乱正常的脂质代谢调节,导致脂肪肝。
强调污染物暴露后全基因组染色质变化,以阐明污染物反应的表观遗传景观,优于我们之前仅关注基因邻近区域的方法。我们的方法有可能揭示介导污染物有害影响的关键 TF,从而有助于制定减轻环境污染损害的策略。