• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对参考数据集设计的调整可改善细胞类型标签转移。

Adjustments to the reference dataset design improve cell type label transfer.

作者信息

Mölbert Carla, Haghverdi Laleh

机构信息

Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin Institute for Medical Systems Biology, Berlin, Germany.

Department of Biology, Humboldt-Universität zu Berlin, Berlin, Germany.

出版信息

Front Bioinform. 2023 Apr 5;3:1150099. doi: 10.3389/fbinf.2023.1150099. eCollection 2023.

DOI:10.3389/fbinf.2023.1150099
PMID:37091908
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10114588/
Abstract

The transfer of cell type labels from pre-annotated (reference) to newly collected data is an important task in single-cell data analysis. As the number of publicly available annotated datasets which can be used as reference, as well as the number of computational methods for cell type label transfer are constantly growing, rationals to understand and decide which reference design and which method to use for a particular query dataset are needed. Using detailed data visualisations and interpretable statistical assessments, we benchmark a set of popular cell type annotation methods, test their performance on different cell types and study the effects of the design of reference data (e.g., cell sampling criteria, inclusion of multiple datasets in one reference, gene set selection) on the reliability of predictions. Our results highlight the need for further improvements in label transfer methods, as well as preparation of high-quality pre-annotated reference data of adequate sampling from all cell types of interest, for more reliable annotation of new datasets.

摘要

将细胞类型标签从预先注释(参考)数据转移到新收集的数据是单细胞数据分析中的一项重要任务。随着可作为参考的公开可用注释数据集数量以及用于细胞类型标签转移的计算方法数量不断增加,需要合理的方法来理解并决定针对特定查询数据集使用哪种参考设计和哪种方法。通过详细的数据可视化和可解释的统计评估,我们对一组流行的细胞类型注释方法进行了基准测试,测试它们在不同细胞类型上的性能,并研究参考数据设计(例如,细胞采样标准、在一个参考中包含多个数据集、基因集选择)对预测可靠性的影响。我们的结果强调了进一步改进标签转移方法的必要性,以及准备高质量的预先注释参考数据,该数据要从所有感兴趣的细胞类型中进行充分采样,以便更可靠地注释新数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/9edcb0c6fe82/fbinf-03-1150099-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/7d7e35e72d54/fbinf-03-1150099-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/ef6368161a62/fbinf-03-1150099-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/6166aea48f94/fbinf-03-1150099-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/d66dc79141d3/fbinf-03-1150099-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/c8ee47a24c0a/fbinf-03-1150099-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/bc8983ac1c52/fbinf-03-1150099-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/9edcb0c6fe82/fbinf-03-1150099-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/7d7e35e72d54/fbinf-03-1150099-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/ef6368161a62/fbinf-03-1150099-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/6166aea48f94/fbinf-03-1150099-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/d66dc79141d3/fbinf-03-1150099-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/c8ee47a24c0a/fbinf-03-1150099-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/bc8983ac1c52/fbinf-03-1150099-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2955/10114588/9edcb0c6fe82/fbinf-03-1150099-g007.jpg

相似文献

1
Adjustments to the reference dataset design improve cell type label transfer.对参考数据集设计的调整可改善细胞类型标签转移。
Front Bioinform. 2023 Apr 5;3:1150099. doi: 10.3389/fbinf.2023.1150099. eCollection 2023.
2
scGAD: a new task and end-to-end framework for generalized cell type annotation and discovery.scGAD:用于广义细胞类型注释和发现的新任务和端到端框架。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad045.
3
Learning Cell Annotation under Multiple Reference Datasets by Multisource Domain Adaptation.通过多源域适应在多个参考数据集下学习细胞注释
J Chem Inf Model. 2023 Jan 9;63(1):397-405. doi: 10.1021/acs.jcim.2c01277. Epub 2022 Dec 29.
4
scDOT: enhancing single-cell RNA-Seq data annotation and uncovering novel cell types through multi-reference integration.scDOT:通过多参考整合增强单细胞RNA测序数据注释并揭示新型细胞类型
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae072.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
scMRA: a robust deep learning method to annotate scRNA-seq data with multiple reference datasets.scMRA:一种用于用多个参考数据集注释单细胞RNA测序数据的强大深度学习方法。
Bioinformatics. 2022 Jan 12;38(3):738-745. doi: 10.1093/bioinformatics/btab700.
7
Transformer for one stop interpretable cell type annotation.用于一站式可解释细胞类型注释的 Transformer。
Nat Commun. 2023 Jan 14;14(1):223. doi: 10.1038/s41467-023-35923-4.
8
Polyphony: an Interactive Transfer Learning Framework for Single-Cell Data Analysis.多音性:单细胞数据分析的交互式迁移学习框架。
IEEE Trans Vis Comput Graph. 2023 Jan;29(1):591-601. doi: 10.1109/TVCG.2022.3209408. Epub 2022 Dec 20.
9
scSemiGAN: a single-cell semi-supervised annotation and dimensionality reduction framework based on generative adversarial network.scSemiGAN:基于生成对抗网络的单细胞半监督注释和降维框架。
Bioinformatics. 2022 Nov 15;38(22):5042-5048. doi: 10.1093/bioinformatics/btac652.
10
Merging nucleus datasets by correlation-based cross-training.基于相关的交叉训练合并核数据集。
Med Image Anal. 2023 Feb;84:102705. doi: 10.1016/j.media.2022.102705. Epub 2022 Dec 7.

引用本文的文献

1
STCC enhances spatial domain detection through consensus clustering of spatial transcriptomics data.STCC通过空间转录组学数据的一致性聚类增强空间域检测。
Genome Res. 2025 Jun 2;35(6):1415-1428. doi: 10.1101/gr.280031.124.
2
All the single cells: Single-cell transcriptomics/epigenomics experimental design and analysis considerations for glial biologists.所有单细胞:神经胶质生物学家的单细胞转录组学/表观基因组学实验设计与分析考量
Glia. 2025 Mar;73(3):451-473. doi: 10.1002/glia.24633. Epub 2024 Nov 19.
3
Compound-SNE: Comparative alignment of t-SNEs for multiple single-cell omics data visualisation.

本文引用的文献

1
Resolution of the curse of dimensionality in single-cell RNA sequencing data analysis.单细胞 RNA 测序数据分析中维度诅咒的解决。
Life Sci Alliance. 2022 Aug 9;5(12):e202201591. doi: 10.26508/lsa.202201591.
2
Single-cell proteo-genomic reference maps of the hematopoietic system enable the purification and massive profiling of precisely defined cell states.单细胞蛋白质基因组参考图谱可对造血系统进行精确定义的细胞状态的纯化和大规模分析。
Nat Immunol. 2021 Dec;22(12):1577-1589. doi: 10.1038/s41590-021-01059-0. Epub 2021 Nov 22.
3
Analytic Pearson residuals for normalization of single-cell RNA-seq UMI data.
Compound-SNE:用于多单细胞组学数据可视化的t-SNE比较比对
Bioinformatics. 2024 Jul 25;40(7). doi: 10.1093/bioinformatics/btae471.
用于单细胞 RNA-seq UMI 数据归一化的解析 Pearson 残差。
Genome Biol. 2021 Sep 6;22(1):258. doi: 10.1186/s13059-021-02451-7.
4
Gene signature extraction and cell identity recognition at the single-cell level with Cell-ID.利用 Cell-ID 在单细胞水平上进行基因特征提取和细胞身份识别。
Nat Biotechnol. 2021 Sep;39(9):1095-1102. doi: 10.1038/s41587-021-00896-6. Epub 2021 Apr 29.
5
Iterative transfer learning with neural network for clustering and cell type classification in single-cell RNA-seq analysis.用于单细胞RNA测序分析中聚类和细胞类型分类的神经网络迭代迁移学习
Nat Mach Intell. 2020 Oct;2(10):607-618. doi: 10.1038/s42256-020-00233-7. Epub 2020 Oct 5.
6
Targeted Perturb-seq enables genome-scale genetic screens in single cells.靶向扰动测序可在单细胞中进行全基因组规模的遗传筛选。
Nat Methods. 2020 Jun;17(6):629-635. doi: 10.1038/s41592-020-0837-5. Epub 2020 Jun 1.
7
Systematic comparison of single-cell and single-nucleus RNA-sequencing methods.单细胞和单细胞核 RNA 测序方法的系统比较。
Nat Biotechnol. 2020 Jun;38(6):737-746. doi: 10.1038/s41587-020-0465-8. Epub 2020 Apr 6.
8
SingleCellNet: A Computational Tool to Classify Single Cell RNA-Seq Data Across Platforms and Across Species.SingleCellNet:一种跨平台和跨物种对单细胞 RNA-Seq 数据进行分类的计算工具。
Cell Syst. 2019 Aug 28;9(2):207-213.e2. doi: 10.1016/j.cels.2019.06.004. Epub 2019 Jul 31.
9
Comprehensive Integration of Single-Cell Data.单细胞数据的综合整合。
Cell. 2019 Jun 13;177(7):1888-1902.e21. doi: 10.1016/j.cell.2019.05.031. Epub 2019 Jun 6.
10
Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage.基于参考的肺单细胞测序分析揭示了一种过渡性成纤维细胞样巨噬细胞。
Nat Immunol. 2019 Feb;20(2):163-172. doi: 10.1038/s41590-018-0276-y. Epub 2019 Jan 14.