Learning Genetic Perturbation Effects with Variational Causal Inference.

Authors

Liu Emily, Zhang Jiaqi, Uhler Caroline

Affiliations

Department of Electrical Engineering and Computer Science, MIT.

Eric and Wendy Schmidt Center, Broad Institute.

Publication information

bioRxiv. 2025 Jun 5:2025.06.05.657988. doi: 10.1101/2025.06.05.657988.

DOI:10.1101/2025.06.05.657988

PMID:40501829

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12157634/

Abstract

Advances in sequencing technologies have enhanced the understanding of gene regulation in cells. In particular, Perturb-seq has enabled high-resolution profiling of the transcriptomic response to genetic perturbations at the single-cell level. This understanding has implications in functional genomics and potentially for identifying therapeutic targets. Various computational models have been developed to predict perturbational effects. While deep learning models excel at interpolating observed perturbational data, they tend to overfit and may not generalize well to unseen perturbations. In contrast, mechanistic models, such as linear causal models based on gene regulatory networks, hold greater potential for extrapolation, as they encapsulate regulatory information that can predict responses to unseen perturbations. However, their application has been limited to small studies due to overly simplistic assumptions, making them less effective in handling noisy, large-scale single-cell data. We propose a hybrid approach that combines a mechanistic causal model with variational deep learning, termed Single Cell Causal Variational Autoencoder (SCCVAE). The mechanistic model employs a learned regulatory network to represent perturbational changes as shift interventions that propagate through the learned network. SCCVAE integrates this mechanistic causal model into a variational autoencoder, generating rich, comprehensive transcriptomic responses. Our results indicate that SCCVAE exhibits superior performance over current state-of-the-art baselines for extrapolating to predict unseen perturbational responses. Additionally, for the observed perturbations, the latent space learned by SCCVAE allows for the identification of functional perturbation modules and simulation of single-gene knockdown experiments of varying penetrance, presenting a robust tool for interpreting and interpolating perturbational responses at the single-cell level.
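The idea of a shift intervention propagating through a linear causal model can be illustrated with a toy example. The sketch below is not the SCCVAE implementation; it only assumes a linear structural causal model of the form x = Wx + noise + shift, where the weight matrix `W`, the three-gene chain, and the helper name `propagate_shift` are all hypothetical choices made for illustration. Scaling the shift vector is used here as a simple stand-in for a knockdown of lower strength.

```python
import numpy as np


def propagate_shift(W, noise, shift):
    """Solve x = W @ x + noise + shift for x in a linear causal model.

    W[i, j] is the (hypothetical) direct effect of gene j on gene i.
    The shift vector adds a constant offset to the perturbed gene(s),
    and the solve step carries that offset to all downstream genes.
    """
    n = W.shape[0]
    return np.linalg.solve(np.eye(n) - W, noise + shift)


# Toy regulatory chain (an assumption): gene 0 -> gene 1 -> gene 2
W = np.array([
    [0.0, 0.0, 0.0],
    [0.8, 0.0, 0.0],
    [0.0, 0.5, 0.0],
])
noise = np.zeros(3)                  # noise-free for clarity
shift = np.array([-1.0, 0.0, 0.0])   # knockdown-like shift on gene 0

control = propagate_shift(W, noise, np.zeros(3))
perturbed = propagate_shift(W, noise, shift)
effect = perturbed - control         # downstream effect of the shift

# Halving the shift mimics a weaker knockdown in this toy model
partial_effect = propagate_shift(W, noise, 0.5 * shift) - control

print(effect)
print(partial_effect)
```

Because the model is linear, the effect on each downstream gene is the shift multiplied by the product of edge weights along the path, which is why a regulatory network of this kind can in principle extrapolate to perturbations of genes that were never perturbed in training.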

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46ab/12157634/93eff31f98a1/nihpp-2025.06.05.657988v1-f0001.jpg

Similar articles

Learning Genetic Perturbation Effects with Variational Causal Inference.

bioRxiv. 2025 Jun 5:2025.06.05.657988. doi: 10.1101/2025.06.05.657988.

Short-Term Memory Impairment

Management of urinary stones by experts in stone disease (ESD 2025).

Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.

The Black Book of Psychotropic Dosing and Monitoring.

Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.

The use of Open Dialogue in Trauma Informed Care services for mental health consumers and their family networks: A scoping review.

J Psychiatr Ment Health Nurs. 2024 Aug;31(4):681-698. doi: 10.1111/jpm.13023. Epub 2024 Jan 17.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Sexual Harassment and Prevention Training

Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.

Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.

Autistic Students' Experiences of Employment and Employability Support while Studying at a UK University.

Autism Adulthood. 2025 Apr 3;7(2):212-222. doi: 10.1089/aut.2024.0112. eCollection 2025 Apr.

MORPH Predicts the Single-Cell Outcome of Genetic Perturbations Across Conditions and Data Modalities.

bioRxiv. 2025 Jul 2:2025.06.27.661992. doi: 10.1101/2025.06.27.661992.

References cited in this article

Deep-learning-based gene perturbation effect prediction does not yet outperform simple linear baselines.

Nat Methods. 2025 Aug;22(8):1657-1661. doi: 10.1038/s41592-025-02772-6. Epub 2025 Aug 4.

PerturbNet predicts single-cell responses to unseen chemical and genetic perturbations.

Mol Syst Biol. 2025 Jul 10. doi: 10.1038/s44320-025-00131-3.

Toward a foundation model of causal cell and tissue biology with a Perturbation Cell and Tissue Atlas.

Cell. 2024 Aug 22;187(17):4520-4545. doi: 10.1016/j.cell.2024.07.035.

scGPT: toward building a foundation model for single-cell multi-omics using generative AI.

Nat Methods. 2024 Aug;21(8):1470-1480. doi: 10.1038/s41592-024-02201-0. Epub 2024 Feb 26.

Year in review 2023.

Nat Methods. 2024 Jan;21(1):1-2. doi: 10.1038/s41592-023-02158-6.

Predicting transcriptional outcomes of novel multigene perturbations with GEARS.

Nat Biotechnol. 2024 Jun;42(6):927-935. doi: 10.1038/s41587-023-01905-6. Epub 2023 Aug 17.

Transfer learning enables predictions in network biology.

Nature. 2023 Jun;618(7965):616-624. doi: 10.1038/s41586-023-06139-9. Epub 2023 May 31.

High-content CRISPR screening.

Nat Rev Methods Primers. 2022;2(1). doi: 10.1038/s43586-022-00098-7. Epub 2022 Feb 10.

Predicting cellular responses to complex perturbations in high-throughput screens.

Mol Syst Biol. 2023 Jun 12;19(6):e11517. doi: 10.15252/msb.202211517. Epub 2023 May 8.

Dissecting cell identity via network inference and in silico gene perturbation.

Nature. 2023 Feb;614(7949):742-751. doi: 10.1038/s41586-022-05688-9. Epub 2023 Feb 8.
