• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

真实场景下半监督单细胞RNA测序整合方法的基准测试

A Benchmark of Semi-Supervised scRNA-seq Integration Methods in Real-World Scenarios.

作者信息

Shen Xiaoyu, He Chuan, Guan Leying

出版信息

bioRxiv. 2025 Aug 27:2025.08.23.671952. doi: 10.1101/2025.08.23.671952.

DOI:10.1101/2025.08.23.671952
PMID:40909636
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12407688/
Abstract

Semi-supervised methods for single-cell RNA-seq integration promise to improve batch correction and biological signal preservation by leveraging cell-type labels. However, their reported benefits often rely on overly idealized settings. Here, we present the first systematic benchmark of five leading semi-supervised methods (scANVI, scGEN, ssSTACAS, scDREAMER, ItClust) against five widely used unsupervised baselines across six diverse datasets. We evaluate performance under five realistic annotation scenarios, including missing, erroneous, boundary-missing and mixed, batch-specific, and auto-generated labels, using nine established integration metrics. While semi-supervised methods show gains with perfect annotations, their robustness declines sharply under practical imperfections. Only scANVI and ssSTACAS maintain stable but modest improvements relative to their unsupervised counterparts, while none consistently outperform the strongest unsupervised method, scCRAFT. Our results highlight that current semi-supervised strategies offer limited practical advantage and that careful choice of integration method remains critical when label quality is uncertain.

摘要

用于单细胞RNA测序整合的半监督方法有望通过利用细胞类型标签来改善批次校正和生物信号保留。然而,它们所报告的优势往往依赖于过于理想化的设置。在这里,我们针对六个不同数据集上的五个广泛使用的无监督基线,对五种领先的半监督方法(scANVI、scGEN、ssSTACAS、scDREAMER、ItClust)进行了首次系统基准测试。我们使用九个既定的整合指标,在五种现实的注释场景下评估性能,包括缺失、错误、边界缺失和混合、批次特定以及自动生成的标签。虽然半监督方法在完美注释下显示出优势,但在实际存在缺陷的情况下,它们的稳健性会急剧下降。只有scANVI和ssSTACAS相对于无监督对应方法保持稳定但适度的改进,而没有一种方法始终优于最强的无监督方法scCRAFT。我们的结果突出表明,当前的半监督策略提供的实际优势有限,并且当标签质量不确定时,谨慎选择整合方法仍然至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/1a3e6a0f57a0/nihpp-2025.08.23.671952v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/f47bf54b3072/nihpp-2025.08.23.671952v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/dae292b3558b/nihpp-2025.08.23.671952v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/a5c2f9e8b786/nihpp-2025.08.23.671952v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/f6a06ca455aa/nihpp-2025.08.23.671952v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/24a9c162ea1a/nihpp-2025.08.23.671952v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/1a3e6a0f57a0/nihpp-2025.08.23.671952v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/f47bf54b3072/nihpp-2025.08.23.671952v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/dae292b3558b/nihpp-2025.08.23.671952v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/a5c2f9e8b786/nihpp-2025.08.23.671952v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/f6a06ca455aa/nihpp-2025.08.23.671952v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/24a9c162ea1a/nihpp-2025.08.23.671952v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aa4/12407688/1a3e6a0f57a0/nihpp-2025.08.23.671952v1-f0006.jpg

相似文献

1
A Benchmark of Semi-Supervised scRNA-seq Integration Methods in Real-World Scenarios.真实场景下半监督单细胞RNA测序整合方法的基准测试
bioRxiv. 2025 Aug 27:2025.08.23.671952. doi: 10.1101/2025.08.23.671952.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
4
A segment anything model-guided and match-based semi-supervised segmentation framework for medical imaging.一种用于医学成像的基于段式分割模型引导和匹配的半监督分割框架。
Med Phys. 2025 Mar 29. doi: 10.1002/mp.17785.
5
An Unsupervised Learning Algorithm for the Automatic Classification of Coronary Artery Lesions.一种用于冠状动脉病变自动分类的无监督学习算法。
Cureus. 2025 Jul 24;17(7):e88638. doi: 10.7759/cureus.88638. eCollection 2025 Jul.
6
HiCat: a semi-supervised approach for cell type annotation.HiCat:一种用于细胞类型注释的半监督方法。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf428.
7
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
8
Supervised dosing with a long-acting opioid medication in the management of opioid dependence.在阿片类药物依赖管理中使用长效阿片类药物进行监督给药。
Cochrane Database Syst Rev. 2017 Apr 27;4(4):CD011983. doi: 10.1002/14651858.CD011983.pub2.
9
Semi-supervised semantic segmentation of cell nuclei with diffusion model and collaborative learning.基于扩散模型和协同学习的细胞核半监督语义分割
J Med Imaging (Bellingham). 2025 Nov;12(6):061403. doi: 10.1117/1.JMI.12.6.061403. Epub 2025 Mar 20.
10
Semi-supervised detection of natural selection with positive-unlabeled learning.基于正例未标注学习的自然选择半监督检测
bioRxiv. 2025 Aug 18:2025.08.15.670602. doi: 10.1101/2025.08.15.670602.

本文引用的文献

1
Partially characterized topology guides reliable anchor-free scRNA-integration.部分特征化的拓扑结构指导可靠的无锚单细胞RNA整合。
Commun Biol. 2025 Apr 4;8(1):561. doi: 10.1038/s42003-025-07988-y.
2
Semi-supervised integration of single-cell transcriptomics data.单细胞转录组学数据的半监督整合。
Nat Commun. 2024 Jan 29;15(1):872. doi: 10.1038/s41467-024-45240-z.
3
scDREAMER for atlas-level integration of single-cell datasets using deep generative model paired with adversarial classifier.scDREAMER:基于深度生成模型与对抗分类器的单细胞数据集图谱级整合方法。
Nat Commun. 2023 Nov 27;14(1):7781. doi: 10.1038/s41467-023-43590-8.
4
A Review of Single-Cell RNA-Seq Annotation, Integration, and Cell-Cell Communication.单细胞 RNA-Seq 注释、整合和细胞间通讯综述。
Cells. 2023 Jul 30;12(15):1970. doi: 10.3390/cells12151970.
5
An integrated cell atlas of the lung in health and disease.肺部健康与疾病的细胞整合图谱
Nat Med. 2023 Jun;29(6):1563-1577. doi: 10.1038/s41591-023-02327-2. Epub 2023 Jun 8.
6
A comprehensive mouse kidney atlas enables rare cell population characterization and robust marker discovery.一份全面的小鼠肾脏图谱有助于对稀有细胞群体进行表征并发现可靠的标志物。
iScience. 2023 May 18;26(6):106877. doi: 10.1016/j.isci.2023.106877. eCollection 2023 Jun 16.
7
Dictionary learning for integrative, multimodal and scalable single-cell analysis.基于字典学习的综合、多模态和可扩展的单细胞分析。
Nat Biotechnol. 2024 Feb;42(2):293-304. doi: 10.1038/s41587-023-01767-y. Epub 2023 May 25.
8
Batch alignment of single-cell transcriptomics data using deep metric learning.基于深度度量学习的单细胞转录组学数据批量对齐。
Nat Commun. 2023 Feb 21;14(1):960. doi: 10.1038/s41467-023-36635-5.
9
A transcriptional cross species map of pancreatic islet cells.胰岛细胞的转录跨物种图谱。
Mol Metab. 2022 Dec;66:101595. doi: 10.1016/j.molmet.2022.101595. Epub 2022 Sep 13.
10
ResPAN: a powerful batch correction model for scRNA-seq data through residual adversarial networks.ResPAN:通过残差对抗网络对 scRNA-seq 数据进行强大的批量校正模型。
Bioinformatics. 2022 Aug 10;38(16):3942-3949. doi: 10.1093/bioinformatics/btac427.