• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scOTM:一种使用大语言模型预测单细胞扰动反应的深度学习框架。

scOTM: A Deep Learning Framework for Predicting Single-Cell Perturbation Responses with Large Language Models.

作者信息

Wang Yuchen, Lu Tianchi, Chen Xingjian, Yao Zhongyu, Wong Ka-Chun

机构信息

Department of Computer Science, City University of Hong Kong, Kowloon Tong, Hong Kong SAR 999077, China.

Cutaneous Biology Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02148, USA.

出版信息

Bioengineering (Basel). 2025 Aug 20;12(8):884. doi: 10.3390/bioengineering12080884.

DOI:10.3390/bioengineering12080884
PMID:40868397
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12383350/
Abstract

Modeling drug-induced transcriptional responses at the single-cell level is essential for advancing human healthcare, particularly in understanding disease mechanisms, assessing therapeutic efficacy, and anticipating adverse effects. However, existing approaches often impose a rigid constraint by enforcing pointwise alignment of latent representations to a standard normal prior, which limits expressiveness and results in biologically uninformative embeddings, especially in complex biological systems. Additionally, many methods inadequately address the challenges of unpaired data, typically relying on naive averaging strategies that ignore cell-type specificity and intercellular heterogeneity. To overcome these limitations, we propose scOTM, a deep learning framework designed to predict single-cell perturbation responses from unpaired data, focusing on generalization to unseen cell types. scOTM integrates prior biological knowledge of perturbations and cellular states, derived from large language models specialized for molecular and single-cell corpora. These informative representations are incorporated into a variational autoencoder with maximum mean discrepancy regularization, allowing flexible modeling of transcriptional shifts without imposing a strict constraint of alignment to a standard normal prior. scOTM further employs optimal transport to establish an efficient and interpretable mapping between control and perturbed distributions, effectively capturing the transcriptional shifts underlying response variation. Extensive experiments demonstrate that scOTM outperforms existing methods in predicting whole-transcriptome responses and identifying top differentially expressed genes. Furthermore, scOTM exhibits superior robustness in data-limited settings and strong generalization capabilities across cell types.

摘要

在单细胞水平上对药物诱导的转录反应进行建模对于推动人类医疗保健至关重要,特别是在理解疾病机制、评估治疗效果和预测不良反应方面。然而,现有方法通常通过强制将潜在表示逐点对齐到标准正态先验来施加严格约束,这限制了表达能力并导致生物学上无信息的嵌入,尤其是在复杂的生物系统中。此外,许多方法不能充分应对未配对数据的挑战,通常依赖于忽略细胞类型特异性和细胞间异质性的简单平均策略。为了克服这些限制,我们提出了scOTM,这是一个深度学习框架,旨在从未配对数据中预测单细胞扰动反应,重点是对未见过的细胞类型进行泛化。scOTM整合了来自专门针对分子和单细胞语料库的大语言模型的扰动和细胞状态的先验生物学知识。这些信息丰富的表示被纳入具有最大均值差异正则化的变分自编码器中,允许对转录变化进行灵活建模,而无需对与标准正态先验的对齐施加严格约束。scOTM进一步采用最优传输来在对照分布和扰动分布之间建立高效且可解释的映射,有效地捕获响应变化背后的转录变化。大量实验表明,scOTM在预测全转录组反应和识别顶级差异表达基因方面优于现有方法。此外,scOTM在数据有限的环境中表现出卓越的稳健性,并且在不同细胞类型之间具有强大的泛化能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/f35cf41d549d/bioengineering-12-00884-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/53666751e1da/bioengineering-12-00884-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/11737bfd0c96/bioengineering-12-00884-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/4daff67aa2df/bioengineering-12-00884-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/a3c1bcdfd6c0/bioengineering-12-00884-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/26b52650d8c1/bioengineering-12-00884-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/393bf2411aaa/bioengineering-12-00884-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/3ab7f62d58ea/bioengineering-12-00884-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/9a3566831a19/bioengineering-12-00884-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/f35cf41d549d/bioengineering-12-00884-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/53666751e1da/bioengineering-12-00884-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/11737bfd0c96/bioengineering-12-00884-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/4daff67aa2df/bioengineering-12-00884-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/a3c1bcdfd6c0/bioengineering-12-00884-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/26b52650d8c1/bioengineering-12-00884-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/393bf2411aaa/bioengineering-12-00884-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/3ab7f62d58ea/bioengineering-12-00884-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/9a3566831a19/bioengineering-12-00884-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c35b/12383350/f35cf41d549d/bioengineering-12-00884-g009.jpg

相似文献

1
scOTM: A Deep Learning Framework for Predicting Single-Cell Perturbation Responses with Large Language Models.scOTM:一种使用大语言模型预测单细胞扰动反应的深度学习框架。
Bioengineering (Basel). 2025 Aug 20;12(8):884. doi: 10.3390/bioengineering12080884.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Short-Term Memory Impairment短期记忆障碍
4
CXR-MultiTaskNet a unified deep learning framework for joint disease localization and classification in chest radiographs.CXR-MultiTaskNet:一种用于胸部X光片中疾病联合定位与分类的统一深度学习框架。
Sci Rep. 2025 Aug 31;15(1):32022. doi: 10.1038/s41598-025-16669-z.
5
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
7
Anterior Approach Total Ankle Arthroplasty with Patient-Specific Cut Guides.使用患者特异性截骨导向器的前路全踝关节置换术。
JBJS Essent Surg Tech. 2025 Aug 15;15(3). doi: 10.2106/JBJS.ST.23.00027. eCollection 2025 Jul-Sep.
8
Plug-and-play use of tree-based methods: consequences for clinical prediction modeling.基于树的方法的即插即用:对临床预测模型的影响。
J Clin Epidemiol. 2025 Aug;184:111834. doi: 10.1016/j.jclinepi.2025.111834. Epub 2025 May 19.
9
Cognitive decline assessment using semantic linguistic content and transformer deep learning architecture.使用语义语言内容和变压器深度学习架构评估认知能力下降。
Int J Lang Commun Disord. 2024 May-Jun;59(3):1110-1127. doi: 10.1111/1460-6984.12973. Epub 2023 Nov 16.
10
Sexual Harassment and Prevention Training性骚扰与预防培训

本文引用的文献

1
Progress and opportunities of foundation models in bioinformatics.生物信息学中基础模型的进展与机遇。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae548.
2
scPRAM accurately predicts single-cell gene expression perturbation response based on attention mechanism.scPRAM 基于注意力机制准确预测单细胞基因表达扰动响应。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae265.
3
Advancing Precision Medicine: A Review of Innovative In Silico Approaches for Drug Development, Clinical Pharmacology and Personalized Healthcare.
推进精准医学:药物研发、临床药理学和个性化医疗中创新的计算机模拟方法综述。
Pharmaceutics. 2024 Feb 27;16(3):332. doi: 10.3390/pharmaceutics16030332.
4
scGPT: toward building a foundation model for single-cell multi-omics using generative AI.scGPT:迈向使用生成式人工智能构建单细胞多组学基础模型
Nat Methods. 2024 Aug;21(8):1470-1480. doi: 10.1038/s41592-024-02201-0. Epub 2024 Feb 26.
5
scPerturb: harmonized single-cell perturbation data.scPerturb:协调的单细胞扰动数据。
Nat Methods. 2024 Mar;21(3):531-540. doi: 10.1038/s41592-023-02144-y. Epub 2024 Jan 26.
6
The Reactome Pathway Knowledgebase 2024.Reactome 通路知识库 2024.
Nucleic Acids Res. 2024 Jan 5;52(D1):D672-D678. doi: 10.1093/nar/gkad1025.
7
Learning single-cell perturbation responses using neural optimal transport.利用神经最优传输学习单细胞扰动响应。
Nat Methods. 2023 Nov;20(11):1759-1768. doi: 10.1038/s41592-023-01969-x. Epub 2023 Sep 28.
8
Generative modeling of single-cell gene expression for dose-dependent chemical perturbations.用于剂量依赖性化学扰动的单细胞基因表达生成建模。
Patterns (N Y). 2023 Aug 11;4(8):100817. doi: 10.1016/j.patter.2023.100817.
9
Analysis and modeling of cancer drug responses using cell cycle phase-specific rate effects.利用细胞周期时相特异性速率效应分析和建模癌症药物反应
Nat Commun. 2023 Jun 10;14(1):3450. doi: 10.1038/s41467-023-39122-z.
10
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.