• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞转录组学数据的快速无模型标准化与整合

Fast model-free standardization and integration of single-cell transcriptomics data.

作者信息

Xu Yang, Kramann Rafael, McCord Rachel Patton, Hayat Sikander

机构信息

UT-ORNL Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37996, USA.

Institute of Experimental Medicine and Systems Biology, RWTH Aachen University, Aachen, Germany.

出版信息

Res Sq. 2023 Jan 23:rs.3.rs-2485985. doi: 10.21203/rs.3.rs-2485985/v1.

DOI:10.21203/rs.3.rs-2485985/v1
PMID:36747625
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9901035/
Abstract

Single-cell transcriptomics datasets from the same anatomical sites generated by different research labs are becoming increasingly common. However, fast and computationally inexpensive tools for standardization of cell-type annotation and data integration are still needed in order to increase research inclusivity. To standardize cell-type annotation and integrate single-cell transcriptomics datasets, we have built a fast model-free integration method, named MASI (Marker-Assisted Standardization and Integration). MASI first identifies putative cell-type markers from reference data through an ensemble approach. Then, it converts gene expression matrix to cell-type score matrix with the identified putative cell-type markers for the purpose of cell-type annotation and data integration. Because of integration through cell-type markers instead of model inference, MASI can annotate approximately one million cells on a personal laptop, which provides a cheap computational alternative for the single-cell community. We benchmark MASI with other well-established methods and demonstrate that MASI outperforms other methods based on speed. Its performance for both tasks of data integration and cell-type annotation are comparable or even superior to these existing methods. To harness knowledge from single-cell atlases, we demonstrate three case studies that cover integration across biological conditions, surveyed participants, and research groups, respectively.

摘要

不同研究实验室生成的来自相同解剖部位的单细胞转录组学数据集正变得越来越普遍。然而,为了提高研究的包容性,仍然需要快速且计算成本低廉的工具来实现细胞类型注释的标准化和数据整合。为了标准化细胞类型注释并整合单细胞转录组学数据集,我们构建了一种快速的无模型整合方法,名为MASI(标记辅助标准化和整合)。MASI首先通过一种集成方法从参考数据中识别推定的细胞类型标记。然后,为了进行细胞类型注释和数据整合,它使用所识别的推定细胞类型标记将基因表达矩阵转换为细胞类型评分矩阵。由于是通过细胞类型标记而非模型推理进行整合,MASI可以在个人笔记本电脑上注释大约一百万个细胞,这为单细胞研究群体提供了一种成本低廉的计算替代方案。我们用其他成熟的方法对MASI进行了基准测试,并证明MASI在速度方面优于其他方法。它在数据整合和细胞类型注释这两项任务上的性能与现有方法相当,甚至更优。为了利用单细胞图谱中的知识,我们展示了三个案例研究,分别涵盖了跨生物条件、被调查参与者和研究组的整合。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/b961a32216ce/nihpp-rs2485985v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/cc849fc57a34/nihpp-rs2485985v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/ddfc257414ed/nihpp-rs2485985v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/4dc80f35c213/nihpp-rs2485985v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/6762f43592b3/nihpp-rs2485985v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/1a6e47a56925/nihpp-rs2485985v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/b961a32216ce/nihpp-rs2485985v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/cc849fc57a34/nihpp-rs2485985v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/ddfc257414ed/nihpp-rs2485985v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/4dc80f35c213/nihpp-rs2485985v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/6762f43592b3/nihpp-rs2485985v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/1a6e47a56925/nihpp-rs2485985v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feb0/9901035/b961a32216ce/nihpp-rs2485985v1-f0006.jpg

相似文献

1
Fast model-free standardization and integration of single-cell transcriptomics data.单细胞转录组学数据的快速无模型标准化与整合
Res Sq. 2023 Jan 23:rs.3.rs-2485985. doi: 10.21203/rs.3.rs-2485985/v1.
2
MASI enables fast model-free standardization and integration of single-cell transcriptomics data.MASI 能够快速进行无模型标准化和单细胞转录组学数据的整合。
Commun Biol. 2023 Apr 28;6(1):465. doi: 10.1038/s42003-023-04820-3.
3
MACA: marker-based automatic cell-type annotation for single-cell expression data.MACA:用于单细胞表达数据的基于标记的自动细胞类型注释
Bioinformatics. 2022 Mar 4;38(6):1756-1760. doi: 10.1093/bioinformatics/btab840.
4
scPLAN: a hierarchical computational framework for single transcriptomics data annotation, integration and cell-type label refinement.scPLAN:一种用于单细胞转录组学数据注释、整合和细胞类型标签细化的分层计算框架。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae305.
5
scBOL: a universal cell type identification framework for single-cell and spatial transcriptomics data.scBOL:单细胞和空间转录组学数据的通用细胞类型识别框架。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae188.
6
scMoMaT jointly performs single cell mosaic integration and multi-modal bio-marker detection.scMoMaT 联合执行单细胞马赛克整合和多模式生物标志物检测。
Nat Commun. 2023 Jan 24;14(1):384. doi: 10.1038/s41467-023-36066-2.
7
scATAnno: Automated Cell Type Annotation for single-cell ATAC Sequencing Data.scATAnno:单细胞ATAC测序数据的自动细胞类型注释
bioRxiv. 2024 Mar 25:2023.06.01.543296. doi: 10.1101/2023.06.01.543296.
8
Principled and interpretable alignability testing and integration of single-cell data.有原则且可解释的可对齐性测试和单细胞数据的整合。
Proc Natl Acad Sci U S A. 2024 Mar 5;121(10):e2313719121. doi: 10.1073/pnas.2313719121. Epub 2024 Feb 28.
9
A Literature-Derived Knowledge Graph Augments the Interpretation of Single Cell RNA-seq Datasets.文献衍生知识图谱增强单细胞 RNA-seq 数据集的解读。
Genes (Basel). 2021 Jun 10;12(6):898. doi: 10.3390/genes12060898.
10
Computational solutions for spatial transcriptomics.空间转录组学的计算解决方案。
Comput Struct Biotechnol J. 2022 Sep 1;20:4870-4884. doi: 10.1016/j.csbj.2022.08.043. eCollection 2022.

本文引用的文献

1
Uncovering cell identity through differential stability with Cepo.利用Cepo通过差异稳定性揭示细胞身份。
Nat Comput Sci. 2021 Dec;1(12):784-790. doi: 10.1038/s43588-021-00172-2. Epub 2021 Dec 20.
2
Single-cell transcriptomics reveals cell-type-specific diversification in human heart failure.单细胞转录组学揭示了人类心力衰竭中细胞类型特异性的多样化。
Nat Cardiovasc Res. 2022 Mar;1(3):263-280. doi: 10.1038/s44161-022-00028-6. Epub 2022 Mar 16.
3
Spatial multi-omic map of human myocardial infarction.人类心肌梗死的空间多组学图谱。
Nature. 2022 Aug;608(7924):766-777. doi: 10.1038/s41586-022-05060-x. Epub 2022 Aug 10.
4
Fully-automated and ultra-fast cell-type identification using specific marker combinations from single-cell transcriptomic data.利用单细胞转录组数据中的特定标记组合进行全自动超快速细胞类型识别。
Nat Commun. 2022 Mar 10;13(1):1246. doi: 10.1038/s41467-022-28803-w.
5
A decade of molecular cell atlases.十年的分子细胞图谱。
Trends Genet. 2022 Aug;38(8):805-810. doi: 10.1016/j.tig.2022.01.004. Epub 2022 Jan 31.
6
Accurate and fast cell marker gene identification with COSG.使用COSG准确快速地鉴定细胞标记基因。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab579.
7
MACA: marker-based automatic cell-type annotation for single-cell expression data.MACA:用于单细胞表达数据的基于标记的自动细胞类型注释
Bioinformatics. 2022 Mar 4;38(6):1756-1760. doi: 10.1093/bioinformatics/btab840.
8
Single-cell delineation of lineage and genetic identity in the mouse brain.单细胞描绘小鼠大脑中的谱系和遗传身份。
Nature. 2022 Jan;601(7893):404-409. doi: 10.1038/s41586-021-04237-0. Epub 2021 Dec 15.
9
HDMC: a novel deep learning-based framework for removing batch effects in single-cell RNA-seq data.HDMC:一种用于去除单细胞 RNA-seq 数据中批次效应的新型深度学习框架。
Bioinformatics. 2022 Feb 7;38(5):1295-1303. doi: 10.1093/bioinformatics/btab821.
10
SMILE: mutual information learning for integration of single-cell omics data.SMILE:单细胞组学数据集成的互信息学习。
Bioinformatics. 2022 Jan 3;38(2):476-486. doi: 10.1093/bioinformatics/btab706.