• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

sCIN:用于单细胞多组学数据整合的对比学习框架。

sCIN: a contrastive learning framework for single-cell multi-omics data integration.

作者信息

Ebrahimi Amir, Siahpirani Alireza Fotuhi, Montazeri Hesam

机构信息

Department of Biotechnology, College of Science, University of Tehran, Ghods 37, Tehran, 1417763135, Iran.

Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Ghods 37, Tehran, 1417763135, Iran.

出版信息

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf411.

DOI:10.1093/bib/bbaf411
PMID:40856525
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12378934/
Abstract

The rapid advancement of single-cell omics technologies such as single-cell RNA sequencing and single-cell assay for transposase-accessible chromatin with high throughput sequencing has transformed our understanding of cellular heterogeneity and regulatory mechanisms. However, integrating these data types remains challenging due to distributional discrepancies and distinct feature spaces. To address this, we present a novel single-cell Contrastive INtegration framework (sCIN) that integrates different omics modalities into a shared low-dimensional latent space. sCIN uses modality-specific encoders and contrastive learning to generate latent representations for each modality, aligning cells across modalities and removing technology-specific biases. The framework was designed to rigorously prevent data leakage between training and testing, and was extensively evaluated on three real-world paired datasets namely simultaneous high-throughput ATAC and RNA expression with sequencing, 10X PBMC (10k version), and cellular indexing of transcriptomes and epitopes, and one unpaired dataset of gene expression and chromatin accessibility. Paired datasets refer to multi-omics data generated using technologies capable of capturing different omics features from the same cell population while unpaired datasets are measured from different cell populations from a tissue. Results on paired and unpaired datasets show that sCIN outperforms state-of-the-art models, including scGLUE, scBridge, sciCAN, Con-AAE, Harmony, and MOFA+, across multiple metrics: average silhouette width for clustering quality, Recall@k, cell type@k, cell type accuracy, and median rank for integration quality. Moreover, sCIN was evaluated on simulated unpaired datasets derived from paired data, demonstrating its ability to leverage available biological information for effective multimodal integration. In summary, sCIN reliably integrates omics modalities while preserving biological meaning in both paired and unpaired settings.

摘要

单细胞转录组测序和单细胞转座酶可及染色质高通量测序等单细胞组学技术的快速发展,改变了我们对细胞异质性和调控机制的理解。然而,由于分布差异和不同的特征空间,整合这些数据类型仍然具有挑战性。为了解决这个问题,我们提出了一种新颖的单细胞对比整合框架(sCIN),该框架将不同的组学模态整合到一个共享的低维潜在空间中。sCIN使用特定模态的编码器和对比学习为每个模态生成潜在表示,对齐跨模态的细胞并消除技术特定的偏差。该框架旨在严格防止训练和测试之间的数据泄漏,并在三个真实世界的配对数据集上进行了广泛评估,即同时进行高通量ATAC和RNA测序表达、10X PBMC(10k版本)以及转录组和表位的细胞索引,以及一个基因表达和染色质可及性的非配对数据集。配对数据集是指使用能够从同一细胞群体中捕获不同组学特征的技术生成的多组学数据,而非配对数据集是从组织的不同细胞群体中测量得到的。配对和非配对数据集的结果表明,sCIN在多个指标上优于现有模型,包括scGLUE、scBridge、sciCAN、Con-AAE、Harmony和MOFA+:聚类质量的平均轮廓宽度、召回率@k、细胞类型@k、细胞类型准确率以及整合质量的中位数排名。此外,sCIN在从配对数据派生的模拟非配对数据集上进行了评估,证明了其利用可用生物信息进行有效多模态整合的能力。总之,sCIN在配对和非配对设置中都能可靠地整合组学模态,同时保留生物学意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6600/12378934/edd55e4bbcaa/bbaf411f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6600/12378934/d68213b336a5/bbaf411f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6600/12378934/a4df43611483/bbaf411f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6600/12378934/b7fd4f85645f/bbaf411f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6600/12378934/64b999760cc9/bbaf411f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6600/12378934/edd55e4bbcaa/bbaf411f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6600/12378934/d68213b336a5/bbaf411f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6600/12378934/a4df43611483/bbaf411f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6600/12378934/b7fd4f85645f/bbaf411f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6600/12378934/64b999760cc9/bbaf411f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6600/12378934/edd55e4bbcaa/bbaf411f5.jpg

相似文献

1
sCIN: a contrastive learning framework for single-cell multi-omics data integration.sCIN:用于单细胞多组学数据整合的对比学习框架。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf411.
2
Multi-omics single-cell data alignment and integration with enhanced contrastive learning and differential attention mechanism.基于增强对比学习和差分注意力机制的多组学单细胞数据对齐与整合
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf443.
3
UnCOT-AD: Unpaired Cross-Omics Translation Enables Multi-Omics Integration for Alzheimer's Disease Prediction.UnCOT-AD:非配对交叉组学翻译实现用于阿尔茨海默病预测的多组学整合
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf438.
4
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
5
Effective Integration of Single-Cell Multi-Omics Data Using Improved Network-Based Integrative Clustering with Multigraph Regularization.使用改进的基于网络的多图正则化集成聚类实现单细胞多组学数据的有效整合。
J Comput Biol. 2025 Jun;32(6):601-614. doi: 10.1089/cmb.2023.0460. Epub 2025 May 22.
6
MO-GCAN: multi-omics integration based on graph convolutional and attention networks.MO-GCAN:基于图卷积和注意力网络的多组学整合
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf405.
7
scMGCL: accurate and efficient integration representation of single-cell multi-omics data.scMGCL:单细胞多组学数据的准确高效整合表示
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf392.
8
MultiSC: a deep learning pipeline for analyzing multiomics single-cell data.MultiSC:用于分析多组学单细胞数据的深度学习管道。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae492.
9
A review on multi-omics integration for aiding study design of large scale TCGA cancer datasets.关于多组学整合以辅助大规模TCGA癌症数据集研究设计的综述。
BMC Genomics. 2025 Aug 22;26(1):769. doi: 10.1186/s12864-025-11925-y.
10
Integration of single cell multiomics data by deep transfer hypergraph neural network.基于深度迁移超图神经网络的单细胞多组学数据整合
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elaf009.

本文引用的文献

1
Learning Consistency and Specificity of Cells From Single-Cell Multi-Omic Data.从单细胞多组学数据中学习细胞的一致性和特异性。
IEEE J Biomed Health Inform. 2024 May;28(5):3134-3145. doi: 10.1109/JBHI.2024.3370868.
2
scBridge embraces cell heterogeneity in single-cell RNA-seq and ATAC-seq data integration.scBridge 能够整合单细胞 RNA-seq 和 ATAC-seq 数据中的细胞异质性。
Nat Commun. 2023 Sep 28;14(1):6045. doi: 10.1038/s41467-023-41795-5.
3
MultiVI: deep generative model for the integration of multimodal data.MultiVI:用于多模态数据集成的深度生成模型。
Nat Methods. 2023 Aug;20(8):1222-1231. doi: 10.1038/s41592-023-01909-9. Epub 2023 Jun 29.
4
The technological landscape and applications of single-cell multi-omics.单细胞多组学的技术领域和应用。
Nat Rev Mol Cell Biol. 2023 Oct;24(10):695-713. doi: 10.1038/s41580-023-00615-w. Epub 2023 Jun 6.
5
Con-AAE: contrastive cycle adversarial autoencoders for single-cell multi-omics alignment and integration.Con-AAE:用于单细胞多组学比对和整合的对比循环对抗自动编码器。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad162.
6
A universal framework for single-cell multi-omics data integration with graph convolutional networks.基于图卷积网络的单细胞多组学数据集成通用框架
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad081.
7
scGMAAE: Gaussian mixture adversarial autoencoders for diversification analysis of scRNA-seq data.scGMAAE:用于单细胞RNA测序数据多样化分析的高斯混合对抗自编码器
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac585.
8
sciCAN: single-cell chromatin accessibility and gene expression data integration via cycle-consistent adversarial network.sciCAN:基于循环一致对抗网络的单细胞染色质可及性和基因表达数据整合。
NPJ Syst Biol Appl. 2022 Sep 12;8(1):33. doi: 10.1038/s41540-022-00245-6.
9
Learning discriminative and structural samples for rare cell types with deep generative model.利用深度生成模型学习罕见细胞类型的判别和结构样本。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac317.
10
scDART: integrating unmatched scRNA-seq and scATAC-seq data and learning cross-modality relationship simultaneously.scDART:同时整合未配对的 scRNA-seq 和 scATAC-seq 数据并学习跨模态关系。
Genome Biol. 2022 Jun 27;23(1):139. doi: 10.1186/s13059-022-02706-x.