• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用CONCORD揭示单细胞数据集中连贯的细胞状态图谱。

Revealing a coherent cell state landscape across single cell datasets with CONCORD.

作者信息

Zhu Qin, Jiang Zuzhi, Thomson Matt, Gartner Zev

机构信息

Department of Pharmaceutical Chemistry, University of California San Francisco; San Francisco, CA 94158, USA.

Tetrad Graduate Program, University of California San Francisco; San Francisco, CA 94158, USA.

出版信息

bioRxiv. 2025 Apr 11:2025.03.13.643146. doi: 10.1101/2025.03.13.643146.

DOI:10.1101/2025.03.13.643146
PMID:40161827
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11952503/
Abstract

Batch integration, denoising, and dimensionality reduction remain fundamental challenges in single-cell data analysis. While many machine learning tools aim to overcome these challenges by engineering model architectures, we use a different strategy, building on the insight that optimized mini-batch sampling during training can profoundly influence learning outcomes. We present CONCORD, a self-supervised learning approach that implements a unified, probabilistic data sampling scheme combining neighborhood-aware and dataset-aware sampling: the former enhancing resolution while the latter removing batch effects. Using only a minimalist one-hidden-layer neural network and contrastive learning, CONCORD achieves state-of-the-art performance without relying on deep architectures, auxiliary losses, or supervision. It generates high-resolution cell atlases that seamlessly integrate data across batches, technologies, and species, without relying on prior assumptions about data structure. The resulting latent representations are denoised, interpretable, and biologically meaningful-capturing gene co-expression programs, resolving subtle cellular states, and preserving both local geometric relationships and global topological organization. We demonstrate CONCORD's broad applicability across diverse datasets, establishing it as a general-purpose framework for learning unified, high-fidelity representations of cellular identity and dynamics.

摘要

批量整合、去噪和降维仍然是单细胞数据分析中的基本挑战。虽然许多机器学习工具旨在通过设计模型架构来克服这些挑战,但我们采用了一种不同的策略,基于这样一种见解:训练期间优化的小批量采样会深刻影响学习结果。我们提出了CONCORD,这是一种自监督学习方法,它实现了一种统一的概率数据采样方案,结合了邻域感知采样和数据集感知采样:前者提高分辨率,而后者消除批量效应。仅使用一个极简的单隐藏层神经网络和对比学习,CONCORD在不依赖深度架构、辅助损失或监督的情况下实现了领先的性能。它生成高分辨率的细胞图谱,可无缝整合跨批次、技术和物种的数据,而无需依赖关于数据结构的先验假设。由此产生的潜在表示经过去噪、可解释且具有生物学意义——捕捉基因共表达程序,解析微妙的细胞状态,并保留局部几何关系和全局拓扑组织。我们展示了CONCORD在各种数据集上的广泛适用性,将其确立为学习细胞身份和动态的统一、高保真表示的通用框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2199/11995960/3c8c35f6f900/nihpp-2025.03.13.643146v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2199/11995960/6deef1dec1f2/nihpp-2025.03.13.643146v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2199/11995960/8bdd05eba073/nihpp-2025.03.13.643146v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2199/11995960/b4ddc50b1e17/nihpp-2025.03.13.643146v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2199/11995960/4c1dd62a31f9/nihpp-2025.03.13.643146v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2199/11995960/3c8c35f6f900/nihpp-2025.03.13.643146v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2199/11995960/6deef1dec1f2/nihpp-2025.03.13.643146v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2199/11995960/8bdd05eba073/nihpp-2025.03.13.643146v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2199/11995960/b4ddc50b1e17/nihpp-2025.03.13.643146v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2199/11995960/4c1dd62a31f9/nihpp-2025.03.13.643146v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2199/11995960/3c8c35f6f900/nihpp-2025.03.13.643146v2-f0005.jpg

相似文献

1
Revealing a coherent cell state landscape across single cell datasets with CONCORD.利用CONCORD揭示单细胞数据集中连贯的细胞状态图谱。
bioRxiv. 2025 Apr 11:2025.03.13.643146. doi: 10.1101/2025.03.13.643146.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
4
GatorST: A Versatile Contrastive Meta-Learning Framework for Spatial Transcriptomic Data Analysis.GatorST:用于空间转录组数据分析的通用对比元学习框架。
bioRxiv. 2025 Jul 19:2025.07.01.662625. doi: 10.1101/2025.07.01.662625.
5
Preserving noise texture through training data curation for deep learning denoising of high-resolution cardiac EID-CT.通过训练数据精选来保留噪声纹理,用于高分辨率心脏EID-CT的深度学习去噪
Med Phys. 2025 Jul;52(7):e17938. doi: 10.1002/mp.17938.
6
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
7
Short-Term Memory Impairment短期记忆障碍
8
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
9
Comparison of self-administered survey questionnaire responses collected using mobile apps versus other methods.使用移动应用程序与其他方法收集的自我管理调查问卷回复的比较。
Cochrane Database Syst Rev. 2015 Jul 27;2015(7):MR000042. doi: 10.1002/14651858.MR000042.pub2.
10
sCIN: a contrastive learning framework for single-cell multi-omics data integration.sCIN:用于单细胞多组学数据整合的对比学习框架。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf411.

本文引用的文献

1
Lineage-resolved analysis of embryonic gene expression evolution in and .XX和XX中胚胎基因表达进化的谱系解析分析 。(你提供的原文不完整,缺少关键物种信息,这里只能给出大概的翻译框架,你可补充完整后再让我准确翻译)
Science. 2025 Jun 19;388(6753):eadu8249. doi: 10.1126/science.adu8249.
2
scCobra allows contrastive cell embedding learning with domain adaptation for single cell data integration and harmonization.scCobra支持通过域适应进行对比细胞嵌入学习,以实现单细胞数据整合与归一化。
Commun Biol. 2025 Feb 13;8(1):233. doi: 10.1038/s42003-025-07692-x.
3
Recovery of biological signals lost in single-cell batch integration with CellANOVA.
通过CellANOVA在单细胞批量整合中丢失的生物信号的恢复。
Nat Biotechnol. 2024 Nov 26. doi: 10.1038/s41587-024-02463-1.
4
A cell atlas foundation model for scalable search of similar human cells.一种用于可扩展搜索相似人类细胞的细胞图谱基础模型。
Nature. 2025 Feb;638(8052):1085-1094. doi: 10.1038/s41586-024-08411-y. Epub 2024 Nov 20.
5
Patterning and folding of intestinal villi by active mesenchymal dewetting.肠道微绒毛的形态发生和折叠由活跃的间质去湿作用驱动。
Cell. 2024 Jun 6;187(12):3072-3089.e20. doi: 10.1016/j.cell.2024.04.039. Epub 2024 May 22.
6
Epithelial zonation along the mouse and human small intestine defines five discrete metabolic domains.沿小鼠和人小肠的上皮分区定义了五个不同的代谢域。
Nat Cell Biol. 2024 Feb;26(2):250-262. doi: 10.1038/s41556-023-01337-z. Epub 2024 Feb 6.
7
Predicting cell types with supervised contrastive learning on cells and their types.基于细胞及其类型的监督对比学习来预测细胞类型。
Sci Rep. 2024 Jan 3;14(1):430. doi: 10.1038/s41598-023-50185-2.
8
Inferring cellular and molecular processes in single-cell data with non-negative matrix factorization using Python, R and GenePattern Notebook implementations of CoGAPS.使用 Python、R 和 GenePattern Notebook 实现的 CoGAPS 中的非负矩阵分解,推断单细胞数据中的细胞和分子过程。
Nat Protoc. 2023 Dec;18(12):3690-3731. doi: 10.1038/s41596-023-00892-x. Epub 2023 Nov 21.
9
Supervised discovery of interpretable gene programs from single-cell data.基于监督学习的单细胞数据基因程序可解释性发现
Nat Biotechnol. 2024 Jul;42(7):1084-1095. doi: 10.1038/s41587-023-01940-3. Epub 2023 Sep 21.
10
scDCCA: deep contrastive clustering for single-cell RNA-seq data based on auto-encoder network.scDCCA:基于自动编码器网络的单细胞RNA测序数据深度对比聚类
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac625.