Suppr超能文献

利用变分因果推断学习基因扰动效应

Learning Genetic Perturbation Effects with Variational Causal Inference.

作者信息

Liu Emily, Zhang Jiaqi, Uhler Caroline

机构信息

Department of Electrical Engineering and Computer Science, MIT.

Eric and Wendy Schmidt Center, Broad Institute.

出版信息

bioRxiv. 2025 Jun 5:2025.06.05.657988. doi: 10.1101/2025.06.05.657988.

Abstract

Advances in sequencing technologies have enhanced the understanding of gene regulation in cells. In particular, Perturb-seq has enabled high-resolution profiling of the transcriptomic response to genetic perturbations at the single-cell level. This understanding has implications in functional genomics and potentially for identifying therapeutic targets. Various computational models have been developed to predict perturbational effects. While deep learning models excel at interpolating observed perturbational data, they tend to overfit and may not generalize well to unseen perturbations. In contrast, mechanistic models, such as linear causal models based on gene regulatory networks, hold greater potential for extrapolation, as they encapsulate regulatory information that can predict responses to unseen perturbations. However, their application has been limited to small studies due to overly simplistic assumptions, making them less effective in handling noisy, large-scale single-cell data. We propose a hybrid approach that combines a mechanistic causal model with variational deep learning, termed Single Cell Causal Variational Autoencoder (SCCVAE). The mechanistic model employs a learned regulatory network to represent perturbational changes as shift interventions that propagate through the learned network. SCCVAE integrates this mechanistic causal model into a variational autoencoder, generating rich, comprehensive transcriptomic responses. Our results indicate that SCCVAE exhibits superior performance over current state-of-the-art baselines for extrapolating to predict unseen perturbational responses. Additionally, for the observed perturbations, the latent space learned by SCCVAE allows for the identification of functional perturbation modules and simulation of single-gene knockdown experiments of varying penetrance, presenting a robust tool for interpreting and interpolating perturbational responses at the single-cell level.

摘要

测序技术的进步加深了我们对细胞基因调控的理解。特别是,Perturb-seq能够在单细胞水平上对基因扰动的转录组反应进行高分辨率分析。这种理解对功能基因组学具有重要意义,并可能有助于确定治疗靶点。已经开发了各种计算模型来预测扰动效应。虽然深度学习模型在插值观察到的扰动数据方面表现出色,但它们往往会过拟合,并且可能无法很好地推广到未见过的扰动。相比之下,机制模型,如基于基因调控网络的线性因果模型,具有更大的外推潜力,因为它们封装了可以预测对未见过的扰动反应的调控信息。然而,由于假设过于简单,它们的应用仅限于小型研究,这使得它们在处理有噪声的大规模单细胞数据时效果较差。我们提出了一种将机制因果模型与变分深度学习相结合的混合方法,称为单细胞因果变分自动编码器(SCCVAE)。机制模型采用学习到的调控网络,将扰动变化表示为通过学习到的网络传播的移位干预。SCCVAE将这种机制因果模型集成到变分自动编码器中,生成丰富、全面的转录组反应。我们的结果表明,SCCVAE在推断预测未见过的扰动反应方面表现优于当前最先进的基线。此外,对于观察到的扰动,SCCVAE学习到的潜在空间允许识别功能扰动模块,并模拟不同外显率的单基因敲低实验,为在单细胞水平上解释和插值扰动反应提供了一个强大的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46ab/12157634/93eff31f98a1/nihpp-2025.06.05.657988v1-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验