• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞转录组学中数据偏移的分布外检测方法评估

Evaluation of out-of-distribution detection methods for data shifts in single-cell transcriptomics.

作者信息

Theunissen Lauren, Mortier Thomas, Saeys Yvan, Waegeman Willem

机构信息

Data Mining and Modeling for Biomedicine, VIB Center for Inflammation Research and VIB Center for AI and Computational Biology (VIB.AI), 9000 Ghent, Belgium.

Department of Data-analysis and Mathematical Modeling, Ghent University Faculty of Bioscience Engineering, 9000 Ghent, Belgium.

出版信息

Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf239.

DOI:10.1093/bib/bbaf239
PMID:40439669
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12121363/
Abstract

Automatic cell-type annotation methods assign cell-type labels to new, unlabeled datasets by leveraging relationships from a reference RNA-seq atlas. However, new datasets may include labels absent from the reference dataset or exhibit feature distributions that diverge from it. These scenarios can significantly affect the reliability of cell type predictions, a factor often overlooked in current automatic annotation methods. The field of out-of-distribution detection (OOD), primarily focused on computer vision, addresses the identification of instances that differ from the training distribution. Therefore, the implementation of OOD methods in the context of novel cell type annotation and data shift detection for single-cell transcriptomics may enhance annotation accuracy and trustworthiness. We evaluate six OOD detection methods: LogitNorm, MC dropout, Deep Ensembles, Energy-based OOD, Deep NN, and Posterior networks, for their annotation and OOD detection performance in both synthetical and real-life application settings. We show that OOD detection methods can accurately identify novel cell types and demonstrate potential to detect significant data shifts in non-integrated datasets. Moreover, we find that integration of the OOD datasets does not interfere with OOD detection of novel cell types.

摘要

自动细胞类型注释方法通过利用来自参考RNA测序图谱的关系,将细胞类型标签分配给新的未标记数据集。然而,新数据集可能包含参考数据集中不存在的标签,或者呈现出与参考数据集不同的特征分布。这些情况会显著影响细胞类型预测的可靠性,而这一因素在当前的自动注释方法中常常被忽视。分布外检测(OOD)领域主要专注于计算机视觉,致力于识别与训练分布不同的实例。因此,在单细胞转录组学的新型细胞类型注释和数据偏移检测中实施OOD方法,可能会提高注释的准确性和可信度。我们评估了六种OOD检测方法:LogitNorm、MC dropout、深度集成、基于能量的OOD、深度神经网络和后验网络,考察它们在合成和实际应用场景中的注释和OOD检测性能。我们表明,OOD检测方法能够准确识别新型细胞类型,并展示出在非整合数据集中检测显著数据偏移的潜力。此外,我们发现OOD数据集的整合不会干扰新型细胞类型的OOD检测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17a4/12121363/5230c8f88a92/bbaf239f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17a4/12121363/017d291271f9/bbaf239f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17a4/12121363/5230c8f88a92/bbaf239f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17a4/12121363/017d291271f9/bbaf239f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17a4/12121363/5230c8f88a92/bbaf239f2.jpg

相似文献

1
Evaluation of out-of-distribution detection methods for data shifts in single-cell transcriptomics.单细胞转录组学中数据偏移的分布外检测方法评估
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf239.
2
scPLAN: a hierarchical computational framework for single transcriptomics data annotation, integration and cell-type label refinement.scPLAN:一种用于单细胞转录组学数据注释、整合和细胞类型标签细化的分层计算框架。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae305.
3
scBOL: a universal cell type identification framework for single-cell and spatial transcriptomics data.scBOL:单细胞和空间转录组学数据的通用细胞类型识别框架。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae188.
4
Post-hoc out-of-distribution detection for cardiac MRI segmentation.心脏磁共振成像分割的事后分布外检测
Comput Med Imaging Graph. 2025 Jan;119:102476. doi: 10.1016/j.compmedimag.2024.102476. Epub 2024 Dec 12.
5
Investigation of out-of-distribution detection across various models and training methodologies.跨多种模型和训练方法的分布外检测研究。
Neural Netw. 2024 Jul;175:106288. doi: 10.1016/j.neunet.2024.106288. Epub 2024 Apr 4.
6
PredGCN: a Pruning-enabled Gene-Cell Net for automatic cell annotation of single cell transcriptome data.PredGCN:一种支持剪枝的基因-细胞网络,用于单细胞转录组数据的自动细胞注释。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae421.
7
scTab: Scaling cross-tissue single-cell annotation models.scTab:缩放跨组织单细胞注释模型。
Nat Commun. 2024 Aug 4;15(1):6611. doi: 10.1038/s41467-024-51059-5.
8
Probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative models.使用深度生成模型对单细胞转录组学数据进行概率协调和注释。
Mol Syst Biol. 2021 Jan;17(1):e9620. doi: 10.15252/msb.20209620.
9
Unmasking the chameleons: A benchmark for out-of-distribution detection in medical tabular data.揭开变色龙的面纱:医学表格数据中分布外检测的基准。
Int J Med Inform. 2025 Mar;195:105762. doi: 10.1016/j.ijmedinf.2024.105762. Epub 2024 Dec 17.
10
scRCA: A Siamese network-based pipeline for annotating cell types using noisy single-cell RNA-seq reference data.scRCA:一种基于连体网络的流程,用于使用有噪声的单细胞RNA测序参考数据注释细胞类型。
Comput Biol Med. 2025 May;190:110068. doi: 10.1016/j.compbiomed.2025.110068. Epub 2025 Mar 29.

本文引用的文献

1
Considerations for building and using integrated single-cell atlases.构建和使用整合单细胞图谱的注意事项。
Nat Methods. 2025 Jan;22(1):41-57. doi: 10.1038/s41592-024-02532-y. Epub 2024 Dec 13.
2
scTab: Scaling cross-tissue single-cell annotation models.scTab:缩放跨组织单细胞注释模型。
Nat Commun. 2024 Aug 4;15(1):6611. doi: 10.1038/s41467-024-51059-5.
3
Assessing GPT-4 for cell type annotation in single-cell RNA-seq analysis.在单细胞RNA测序分析中评估GPT-4用于细胞类型注释
Nat Methods. 2024 Aug;21(8):1462-1465. doi: 10.1038/s41592-024-02235-4. Epub 2024 Mar 25.
4
Single-cell Mayo Map (scMayoMap): an easy-to-use tool for cell type annotation in single-cell RNA-sequencing data analysis.单细胞 Mayo 图谱 (scMayoMap):单细胞 RNA 测序数据分析中用于细胞类型注释的易用工具。
BMC Biol. 2023 Oct 20;21(1):223. doi: 10.1186/s12915-023-01728-6.
5
An integrated cell atlas of the lung in health and disease.肺部健康与疾病的细胞整合图谱
Nat Med. 2023 Jun;29(6):1563-1577. doi: 10.1038/s41591-023-02327-2. Epub 2023 Jun 8.
6
EasyCellType: marker-based cell-type annotation by automatically querying multiple databases.EasyCellType:通过自动查询多个数据库进行基于标记的细胞类型注释。
Bioinform Adv. 2023 Mar 24;3(1):vbad029. doi: 10.1093/bioadv/vbad029. eCollection 2023.
7
A comprehensive comparison of supervised and unsupervised methods for cell type identification in single-cell RNA-seq.单细胞 RNA-seq 中细胞类型识别的有监督与无监督方法的全面比较。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab567.
8
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
9
scDeepSort: a pre-trained cell-type annotation method for single-cell transcriptomics using deep learning with a weighted graph neural network.scDeepSort:一种使用深度学习和加权图神经网络进行单细胞转录组学的预训练细胞类型注释方法。
Nucleic Acids Res. 2021 Dec 2;49(21):e122. doi: 10.1093/nar/gkab775.
10
Evaluation of machine learning approaches for cell-type identification from single-cell transcriptomics data.基于单细胞转录组学数据的细胞类型识别的机器学习方法评估。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab035.