Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15206, USA.
Department of Pharmaceutical Sciences, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA, 15206, USA.
NPJ Syst Biol Appl. 2020 Nov 6;6(1):35. doi: 10.1038/s41540-020-00158-2.
Cellular signaling systems play a vital role in maintaining homeostasis when a cell is exposed to different perturbations. Components of the systems are organized as hierarchical networks, and perturbing different components often leads to transcriptomic profiles that exhibit compositional statistical patterns. Mining such patterns to investigate how cellular signals are encoded is an important problem in systems biology, where artificial intelligence techniques can be of great assistance. Here, we investigated the capability of deep generative models (DGMs) to modeling signaling systems and learn representations of cellular states underlying transcriptomic responses to diverse perturbations. Specifically, we show that the variational autoencoder and the supervised vector-quantized variational autoencoder can accurately regenerate gene expression data in response to perturbagen treatments. The models can learn representations that reveal the relationships between different classes of perturbagens and enable mappings between drugs and their target genes. In summary, DGMs can adequately learn and depict how cellular signals are encoded. The resulting representations have broad applications, demonstrating the power of artificial intelligence in systems biology and precision medicine.
细胞信号系统在细胞受到不同干扰时对于维持内稳态起着至关重要的作用。该系统的组成部分被组织成层次网络,干扰不同的组件通常会导致表现出组成统计模式的转录组谱。挖掘这些模式以研究细胞信号是如何编码的,是系统生物学中的一个重要问题,人工智能技术在此可以提供很大的帮助。在这里,我们研究了深度生成模型(DGM)对信号系统建模和学习转录组对各种干扰反应的细胞状态表示的能力。具体来说,我们表明变分自动编码器和有监督的向量量化变分自动编码器可以准确地再生基因表达数据,以响应扰动剂处理。这些模型可以学习表示,揭示不同类别的扰动剂之间的关系,并能够在药物与其靶基因之间进行映射。总之,DGM 可以充分学习和描绘细胞信号是如何编码的。由此产生的表示形式具有广泛的应用,展示了人工智能在系统生物学和精准医学中的强大功能。