Icahn Institute for Genomics and Multiscale Biology and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
Division of Allergy & Immunology, Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
Sci Rep. 2019 Sep 10;9(1):12970. doi: 10.1038/s41598-019-49498-y.
Biological and regulatory mechanisms underlying many multi-gene expression-based disease biomarkers are often not readily evident. We describe an innovative framework, NeTFactor, that combines network analyses with gene expression data to identify transcription factors (TFs) that significantly and maximally regulate such a biomarker. NeTFactor uses a computationally-inferred context-specific gene regulatory network and applies topological, statistical, and optimization methods to identify regulator TFs. Application of NeTFactor to a multi-gene expression-based asthma biomarker identified ETS translocation variant 4 (ETV4) and peroxisome proliferator-activated receptor gamma (PPARG) as the biomarker's most significant TF regulators. siRNA-based knock down of these TFs in an airway epithelial cell line model demonstrated significant reduction of cytokine expression relevant to asthma, validating NeTFactor's top-scoring findings. While PPARG has been associated with airway inflammation, ETV4 has not yet been implicated in asthma, thus indicating the possibility of novel, disease-relevant discovery by NeTFactor. We also show that NeTFactor's results are robust when the gene regulatory network and biomarker are derived from independent data. Additionally, our application of NeTFactor to a different disease biomarker identified TF regulators of interest. These results illustrate that the application of NeTFactor to multi-gene expression-based biomarkers could yield valuable insights into regulatory mechanisms and biological processes underlying disease.
许多基于多基因表达的疾病生物标志物的生物学和调控机制并不明显。我们描述了一种创新的框架 NeTFactor,它将网络分析与基因表达数据相结合,以识别显著且最大程度调节此类生物标志物的转录因子(TF)。NeTFactor 使用计算推断的上下文特定基因调控网络,并应用拓扑、统计和优化方法来识别调节剂 TF。将 NeTFactor 应用于基于多基因表达的哮喘生物标志物的研究,确定 ETS 易位变体 4(ETV4)和过氧化物酶体增殖物激活受体γ(PPARG)是该生物标志物最重要的 TF 调节剂。在气道上皮细胞系模型中,基于 siRNA 的这些 TF 敲低显著降低了与哮喘相关的细胞因子表达,验证了 NeTFactor 的顶级发现。虽然 PPARG 与气道炎症有关,但 ETV4 尚未与哮喘有关联,这表明 NeTFactor 有可能发现新的与疾病相关的发现。我们还表明,当基因调控网络和生物标志物来自独立的数据时,NeTFactor 的结果是稳健的。此外,我们将 NeTFactor 应用于不同的疾病生物标志物,确定了感兴趣的 TF 调节剂。这些结果表明,将 NeTFactor 应用于基于多基因表达的生物标志物可以深入了解疾病的调控机制和生物学过程。