• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估基于图像的细胞分析中的批量校正方法。

Evaluating batch correction methods for image-based cell profiling.

机构信息

Imaging Platform, Broad Institute of MIT and Harvard, 02142, Cambridge, MA, USA.

出版信息

Nat Commun. 2024 Aug 2;15(1):6516. doi: 10.1038/s41467-024-50613-5.

DOI:10.1038/s41467-024-50613-5
PMID:39095341
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11297288/
Abstract

High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands of perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, such as predicting drug mechanism of action or gene function. However, batch effects severely limit community-wide efforts to integrate and interpret image-based profiling data collected across different laboratories and equipment. To address this problem, we benchmark ten high-performing single-cell RNA sequencing (scRNA-seq) batch correction techniques, representing diverse approaches, using a newly released Cell Painting dataset, JUMP. We focus on five scenarios with varying complexity, ranging from batches prepared in a single lab over time to batches imaged using different microscopes in multiple labs. We find that Harmony and Seurat RPCA are noteworthy, consistently ranking among the top three methods for all tested scenarios while maintaining computational efficiency. Our proposed framework, benchmark, and metrics can be used to assess new batch correction methods in the future. This work paves the way for improvements that enable the community to make the best use of public Cell Painting data for scientific discovery.

摘要

高通量基于图像的分析平台是一种强大的技术,能够以经济高效的方式从数十亿个暴露于数千种干扰因素的细胞中收集数据。因此,基于图像的分析数据越来越多地被用于各种生物应用,如预测药物作用机制或基因功能。然而,批次效应严重限制了社区在整合和解释不同实验室和设备采集的基于图像的分析数据方面的努力。为了解决这个问题,我们使用新发布的 Cell Painting 数据集 JUMP,对十种性能优异的单细胞 RNA 测序 (scRNA-seq) 批次校正技术进行了基准测试,这些技术代表了不同的方法。我们关注了五个具有不同复杂度的场景,范围从一个实验室随时间制备的批次到在多个实验室使用不同显微镜成像的批次。我们发现 Harmony 和 Seurat RPCA 非常出色,在所有测试场景中始终排名前三,同时保持计算效率。我们提出的框架、基准和指标可用于未来评估新的批次校正方法。这项工作为改进铺平了道路,使社区能够充分利用公共 Cell Painting 数据进行科学发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/f9b57acbea35/41467_2024_50613_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/d6a1faf39e3c/41467_2024_50613_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/4c4fa8771bae/41467_2024_50613_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/cf98b4c48761/41467_2024_50613_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/bc4004575128/41467_2024_50613_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/0ca6b0c87045/41467_2024_50613_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/f9b57acbea35/41467_2024_50613_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/d6a1faf39e3c/41467_2024_50613_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/4c4fa8771bae/41467_2024_50613_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/cf98b4c48761/41467_2024_50613_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/bc4004575128/41467_2024_50613_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/0ca6b0c87045/41467_2024_50613_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c94/11297288/f9b57acbea35/41467_2024_50613_Fig6_HTML.jpg

相似文献

1
Evaluating batch correction methods for image-based cell profiling.评估基于图像的细胞分析中的批量校正方法。
Nat Commun. 2024 Aug 2;15(1):6516. doi: 10.1038/s41467-024-50613-5.
2
Evaluating batch correction methods for image-based cell profiling.评估基于图像的细胞分析中的批次校正方法。
bioRxiv. 2024 Feb 28:2023.09.15.558001. doi: 10.1101/2023.09.15.558001.
3
A benchmark of batch-effect correction methods for single-cell RNA sequencing data.单细胞 RNA 测序数据批次效应校正方法的基准测试。
Genome Biol. 2020 Jan 16;21(1):12. doi: 10.1186/s13059-019-1850-9.
4
A novel batch-effect correction method for scRNA-seq data based on Adversarial Information Factorization.基于对抗信息分解的 scRNA-seq 数据新型批量效应校正方法。
PLoS Comput Biol. 2024 Feb 22;20(2):e1011880. doi: 10.1371/journal.pcbi.1011880. eCollection 2024 Feb.
5
iSMNN: batch effect correction for single-cell RNA-seq data via iterative supervised mutual nearest neighbor refinement.iSMNN:通过迭代监督的互近邻修正对单细胞 RNA-seq 数据进行批次效应校正。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab122.
6
A multicenter study benchmarking single-cell RNA sequencing technologies using reference samples.一项使用参考试样对单细胞 RNA 测序技术进行基准测试的多中心研究。
Nat Biotechnol. 2021 Sep;39(9):1103-1114. doi: 10.1038/s41587-020-00748-9. Epub 2020 Dec 21.
7
ResPAN: a powerful batch correction model for scRNA-seq data through residual adversarial networks.ResPAN:通过残差对抗网络对 scRNA-seq 数据进行强大的批量校正模型。
Bioinformatics. 2022 Aug 10;38(16):3942-3949. doi: 10.1093/bioinformatics/btac427.
8
CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data.CellMixS:量化和可视化单细胞 RNA-seq 数据中的批次效应。
Life Sci Alliance. 2021 Mar 23;4(6). doi: 10.26508/lsa.202001004. Print 2021 Jun.
9
Transfer learning for clustering single-cell RNA-seq data crossing-species and batch, case on uterine fibroids.跨物种和批次的单细胞 RNA-seq 数据聚类的迁移学习:以子宫肌瘤为例。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad426.
10
Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors.通过匹配相互最近邻,纠正单细胞 RNA 测序数据中的批次效应。
Nat Biotechnol. 2018 Jun;36(5):421-427. doi: 10.1038/nbt.4091. Epub 2018 Apr 2.

引用本文的文献

1
The evolution and application of multi-omic analysis for pituitary neuroendocrine tumors.垂体神经内分泌肿瘤多组学分析的进展与应用
Front Med (Lausanne). 2025 Sep 1;12:1629621. doi: 10.3389/fmed.2025.1629621. eCollection 2025.
2
Single-cell multi-omics in cancer immunotherapy: from tumor heterogeneity to personalized precision treatment.癌症免疫治疗中的单细胞多组学:从肿瘤异质性到个性化精准治疗
Mol Cancer. 2025 Aug 25;24(1):221. doi: 10.1186/s12943-025-02426-3.
3
Progress and new challenges in image-based profiling.基于图像的分析技术的进展与新挑战。

本文引用的文献

1
Artificial intelligence for high content imaging in drug discovery.人工智能在药物研发中的高内涵成像应用。
Curr Opin Struct Biol. 2024 Aug;87:102842. doi: 10.1016/j.sbi.2024.102842. Epub 2024 May 25.
2
Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations.三百万张经过化学和遗传扰动处理的细胞的图像和形态特征图谱。
Nat Methods. 2024 Jun;21(6):1114-1121. doi: 10.1038/s41592-024-02241-6. Epub 2024 Apr 9.
3
Learning representations for image-based profiling of perturbations.
ArXiv. 2025 Aug 7:arXiv:2508.05800v1.
4
Morphological map of under- and overexpression of genes in human cells.人类细胞中基因表达不足和过度表达的形态学图谱。
Nat Methods. 2025 Aug;22(8):1742-1752. doi: 10.1038/s41592-025-02753-9. Epub 2025 Aug 7.
5
Triple-effect correction for Cell Painting data with contrastive and domain-adversarial learning.基于对比学习和域对抗学习的细胞绘画数据三重效应校正
Nat Commun. 2025 Jul 25;16(1):6886. doi: 10.1038/s41467-025-62193-z.
6
Combining phenomics with transcriptomics reveals cell-type-specific morphological and molecular signatures of the 22q11.2 deletion.将表型组学与转录组学相结合,揭示了22q11.2缺失的细胞类型特异性形态和分子特征。
Nat Commun. 2025 Jul 9;16(1):6332. doi: 10.1038/s41467-025-61547-x.
7
A versatile information retrieval framework for evaluating profile strength and similarity.一种用于评估简档强度和相似度的通用信息检索框架。
Nat Commun. 2025 Jun 4;16(1):5181. doi: 10.1038/s41467-025-60306-2.
8
Morphological profiling data resource enables prediction of chemical compound properties.形态学分析数据资源可实现对化合物性质的预测。
iScience. 2025 Apr 16;28(5):112445. doi: 10.1016/j.isci.2025.112445. eCollection 2025 May 16.
9
sc-MULTI-omics approach in nano-rare diseases: understanding the pathophysiological mechanism of Mulvihill-Smith Syndrome.纳米罕见病中的单细胞多组学方法:了解穆尔维希尔-史密斯综合征的病理生理机制
Funct Integr Genomics. 2025 May 9;25(1):101. doi: 10.1007/s10142-025-01608-y.
10
Decoding immune cell interactions during cardiac allograft vasculopathy: insights derived from bioinformatic strategies.解析心脏移植血管病变过程中的免疫细胞相互作用:来自生物信息学策略的见解
Front Cardiovasc Med. 2025 Apr 24;12:1568528. doi: 10.3389/fcvm.2025.1568528. eCollection 2025.
基于图像的扰动分析的表示学习。
Nat Commun. 2024 Feb 21;15(1):1594. doi: 10.1038/s41467-024-45999-1.
4
The specious art of single-cell genomics.单细胞基因组学的似是而非的艺术。
PLoS Comput Biol. 2023 Aug 17;19(8):e1011288. doi: 10.1371/journal.pcbi.1011288. eCollection 2023 Aug.
5
: a unifying computational framework for modular single-cell RNA-seq data integration.用于模块化单细胞RNA测序数据整合的统一计算框架。
NAR Genom Bioinform. 2023 Jul 12;5(3):lqad069. doi: 10.1093/nargab/lqad069. eCollection 2023 Sep.
6
Optimizing the Cell Painting assay for image-based profiling.优化细胞染色法进行基于图像的分析。
Nat Protoc. 2023 Jul;18(7):1981-2013. doi: 10.1038/s41596-023-00840-9. Epub 2023 Jun 21.
7
Dictionary learning for integrative, multimodal and scalable single-cell analysis.基于字典学习的综合、多模态和可扩展的单细胞分析。
Nat Biotechnol. 2024 Feb;42(2):293-304. doi: 10.1038/s41587-023-01767-y. Epub 2023 May 25.
8
Deep learning in image-based phenotypic drug discovery.基于图像的表型药物发现中的深度学习。
Trends Cell Biol. 2023 Jul;33(7):538-554. doi: 10.1016/j.tcb.2022.11.011. Epub 2023 Jan 7.
9
Morphology and gene expression profiling provide complementary information for mapping cell state.形态学和基因表达谱分析为细胞状态的描绘提供了互补信息。
Cell Syst. 2022 Nov 16;13(11):911-923.e9. doi: 10.1016/j.cels.2022.10.001. Epub 2022 Oct 28.
10
High-dimensional gene expression and morphology profiles of cells across 28,000 genetic and chemical perturbations.细胞在 28000 个遗传和化学干扰下的高维基因表达和形态特征。
Nat Methods. 2022 Dec;19(12):1550-1557. doi: 10.1038/s41592-022-01667-0. Epub 2022 Nov 7.