• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scNCL:利用邻域对比正则化将标签从 scRNA-seq 转移到 scATAC-seq 数据。

scNCL: transferring labels from scRNA-seq to scATAC-seq data with neighborhood contrastive regularization.

机构信息

School of Computer Science and Engineering, Central South University, Changsha 410083, China.

Singapore Immunology Network (SIgN), Agency for Science, Technology and Research (A*STAR), Singapore 138648, Singapore.

出版信息

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad505.

DOI:10.1093/bioinformatics/btad505
PMID:37584660
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10457667/
Abstract

MOTIVATION

scATAC-seq has enabled chromatin accessibility landscape profiling at the single-cell level, providing opportunities for determining cell-type-specific regulation codes. However, high dimension, extreme sparsity, and large scale of scATAC-seq data have posed great challenges to cell-type identification. Thus, there has been a growing interest in leveraging the well-annotated scRNA-seq data to help annotate scATAC-seq data. However, substantial computational obstacles remain to transfer information from scRNA-seq to scATAC-seq, especially for their heterogeneous features.

RESULTS

We propose a new transfer learning method, scNCL, which utilizes prior knowledge and contrastive learning to tackle the problem of heterogeneous features. Briefly, scNCL transforms scATAC-seq features into gene activity matrix based on prior knowledge. Since feature transformation can cause information loss, scNCL introduces neighborhood contrastive learning to preserve the neighborhood structure of scATAC-seq cells in raw feature space. To learn transferable latent features, scNCL uses a feature projection loss and an alignment loss to harmonize embeddings between scRNA-seq and scATAC-seq. Experiments on various datasets demonstrated that scNCL not only realizes accurate and robust label transfer for common types, but also achieves reliable detection of novel types. scNCL is also computationally efficient and scalable to million-scale datasets. Moreover, we prove scNCL can help refine cell-type annotations in existing scATAC-seq atlases.

AVAILABILITY AND IMPLEMENTATION

The source code and data used in this paper can be found in https://github.com/CSUBioGroup/scNCL-release.

摘要

动机

scATAC-seq 使单细胞水平的染色质可及性图谱分析成为可能,为确定细胞类型特异性调控代码提供了机会。然而,scATAC-seq 数据的高维性、极度稀疏性和大规模给细胞类型识别带来了巨大挑战。因此,人们越来越有兴趣利用注释良好的 scRNA-seq 数据来帮助注释 scATAC-seq 数据。然而,从 scRNA-seq 到 scATAC-seq 转移信息仍然存在大量计算障碍,特别是对于它们的异质特征。

结果

我们提出了一种新的迁移学习方法 scNCL,它利用先验知识和对比学习来解决异质特征的问题。简而言之,scNCL 根据先验知识将 scATAC-seq 特征转化为基因活性矩阵。由于特征转换可能会导致信息丢失,scNCL 引入了邻域对比学习来保持 scATAC-seq 细胞在原始特征空间中的邻域结构。为了学习可转移的潜在特征,scNCL 使用特征投影损失和对齐损失来协调 scRNA-seq 和 scATAC-seq 之间的嵌入。在各种数据集上的实验表明,scNCL 不仅可以实现常见类型的准确稳健标签转移,还可以可靠地检测新类型。scNCL 计算效率高,可扩展到百万规模的数据集。此外,我们证明 scNCL 可以帮助完善现有 scATAC-seq 图谱中的细胞类型注释。

可用性和实现

本文中使用的源代码和数据可以在 https://github.com/CSUBioGroup/scNCL-release 找到。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/929d/10457667/7c2b8e129bf5/btad505f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/929d/10457667/6be8cc5a9cf4/btad505f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/929d/10457667/db1f8f72c5dc/btad505f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/929d/10457667/ed29addf021b/btad505f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/929d/10457667/7c2b8e129bf5/btad505f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/929d/10457667/6be8cc5a9cf4/btad505f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/929d/10457667/db1f8f72c5dc/btad505f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/929d/10457667/ed29addf021b/btad505f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/929d/10457667/7c2b8e129bf5/btad505f4.jpg

相似文献

1
scNCL: transferring labels from scRNA-seq to scATAC-seq data with neighborhood contrastive regularization.scNCL:利用邻域对比正则化将标签从 scRNA-seq 转移到 scATAC-seq 数据。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad505.
2
scNAME: neighborhood contrastive clustering with ancillary mask estimation for scRNA-seq data.scNAME:基于辅助掩模估计的 scRNA-seq 数据邻域对比聚类。
Bioinformatics. 2022 Mar 4;38(6):1575-1583. doi: 10.1093/bioinformatics/btac011.
3
scAWMV: an adaptively weighted multi-view learning framework for the integrative analysis of parallel scRNA-seq and scATAC-seq data.scAWMV:一种自适应加权多视图学习框架,用于平行 scRNA-seq 和 scATAC-seq 数据的综合分析。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac739.
4
scGCL: an imputation method for scRNA-seq data based on graph contrastive learning.scGCL:一种基于图对比学习的 scRNA-seq 数据插补方法。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad098.
5
CL-Impute: A contrastive learning-based imputation for dropout single-cell RNA-seq data.CL-Impute:基于对比学习的 dropout 单细胞 RNA-seq 数据插补方法。
Comput Biol Med. 2023 Sep;164:107263. doi: 10.1016/j.compbiomed.2023.107263. Epub 2023 Jul 23.
6
scDCCA: deep contrastive clustering for single-cell RNA-seq data based on auto-encoder network.scDCCA:基于自动编码器网络的单细胞RNA测序数据深度对比聚类
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac625.
7
Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data.单细胞 ATAC-seq 数据基因集评分算法的基准测试。
Genomics Proteomics Bioinformatics. 2024 Jul 3;22(2). doi: 10.1093/gpbjnl/qzae014.
8
Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data.基于对比学习的深度增强约束聚类算法在单细胞 RNA-seq 数据分析中的应用。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad222.
9
SAILER: scalable and accurate invariant representation learning for single-cell ATAC-seq processing and integration.SAILER:用于单细胞 ATAC-seq 处理和整合的可扩展且准确的不变表示学习。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i317-i326. doi: 10.1093/bioinformatics/btab303.
10
scZAG: Integrating ZINB-Based Autoencoder with Adaptive Data Augmentation Graph Contrastive Learning for scRNA-seq Clustering.scZAG:基于 ZINB 的自动编码器与自适应数据增强图对比学习在 scRNA-seq 聚类中的整合。
Int J Mol Sci. 2024 May 29;25(11):5976. doi: 10.3390/ijms25115976.

引用本文的文献

1
Cell-Type Annotation for scATAC-Seq Data by Integrating Chromatin Accessibility and Genome Sequence.通过整合染色质可及性和基因组序列对单细胞ATAC测序数据进行细胞类型注释
Biomolecules. 2025 Jun 27;15(7):938. doi: 10.3390/biom15070938.
2
Application of computational algorithms for single-cell RNA-seq and ATAC-seq in neurodegenerative diseases.计算算法在神经退行性疾病单细胞RNA测序和染色质转座酶可接近性测序中的应用
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elae044.

本文引用的文献

1
Adversarial domain translation networks for integrating large-scale atlas-level single-cell datasets.用于整合大规模图谱级单细胞数据集的对抗域翻译网络。
Nat Comput Sci. 2022 May;2(5):317-330. doi: 10.1038/s43588-022-00251-y. Epub 2022 May 30.
2
CLAIRE: contrastive learning-based batch correction framework for better balance between batch mixing and preservation of cellular heterogeneity.CLAIRE:基于对比学习的批次校正框架,更好地平衡批次混合和保留细胞异质性。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad099.
3
GLOBE: a contrastive learning-based framework for integrating single-cell transcriptome datasets.
GLOBE:基于对比学习的整合单细胞转录组数据集的框架。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac311.
4
scDART: integrating unmatched scRNA-seq and scATAC-seq data and learning cross-modality relationship simultaneously.scDART:同时整合未配对的 scRNA-seq 和 scATAC-seq 数据并学习跨模态关系。
Genome Biol. 2022 Jun 27;23(1):139. doi: 10.1186/s13059-022-02706-x.
5
Diagonal integration of multimodal single-cell data: potential pitfalls and paths forward.多模态单细胞数据的对角线整合:潜在的陷阱和前进的道路。
Nat Commun. 2022 Jun 18;13(1):3505. doi: 10.1038/s41467-022-31104-x.
6
Multi-omics single-cell data integration and regulatory inference with graph-linked embedding.基于图链接嵌入的多组学单细胞数据整合与调控推断。
Nat Biotechnol. 2022 Oct;40(10):1458-1466. doi: 10.1038/s41587-022-01284-4. Epub 2022 May 2.
7
scJoint integrates atlas-scale single-cell RNA-seq and ATAC-seq data with transfer learning.scJoint 集成了图谱尺度单细胞 RNA-seq 和 ATAC-seq 数据,并结合了迁移学习。
Nat Biotechnol. 2022 May;40(5):703-710. doi: 10.1038/s41587-021-01161-6. Epub 2022 Jan 20.
8
Single-cell chromatin state analysis with Signac.使用 Signac 进行单细胞染色质状态分析。
Nat Methods. 2021 Nov;18(11):1333-1341. doi: 10.1038/s41592-021-01282-5. Epub 2021 Nov 1.
9
Jointly Embedding Multiple Single-Cell Omics Measurements.联合嵌入多个单细胞组学测量值
Algorithms Bioinform. 2019 Sep 3;143. doi: 10.4230/LIPIcs.WABI.2019.10.
10
scGCN is a graph convolutional networks algorithm for knowledge transfer in single cell omics.scGCN 是一种用于单细胞组学中知识迁移的图卷积网络算法。
Nat Commun. 2021 Jun 22;12(1):3826. doi: 10.1038/s41467-021-24172-y.