Learning representations for image-based profiling of perturbations.

Affiliations

HUN-REN Biological Research Centre, 62 Temesvári krt, Szeged, 6726, Hungary.

Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA.

Publication information

Nat Commun. 2024 Feb 21;15(1):1594. doi: 10.1038/s41467-024-45999-1.

DOI:10.1038/s41467-024-45999-1

PMID:38383513

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10881515/

Abstract

Measuring the phenotypic effect of treatments on cells through imaging assays is an efficient and powerful way of studying cell biology, and requires computational methods for transforming images into quantitative data. Here, we present an improved strategy for learning representations of treatment effects from high-throughput imaging, following a causal interpretation. We use weakly supervised learning for modeling associations between images and treatments, and show that it encodes both confounding factors and phenotypic features in the learned representation. To facilitate their separation, we constructed a large training dataset with images from five different studies to maximize experimental diversity, following insights from our causal analysis. Training a model with this dataset successfully improves downstream performance, and produces a reusable convolutional network for image-based profiling, which we call Cell Painting CNN. We evaluated our strategy on three publicly available Cell Painting datasets, and observed that the Cell Painting CNN improves performance in downstream analysis up to 30% with respect to classical features, while also being more computationally efficient.
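The weakly supervised idea described in the abstract can be illustrated with a toy sketch: a classifier is trained to predict which treatment each image received, and its hidden layer is then reused as the representation and averaged per treatment into a profile. This is not the Cell Painting CNN itself. The one-hidden-layer network, the synthetic 16-dimensional "images", and the names `train_weakly_supervised`, `embed`, and `treatment_profiles` are all assumptions made for illustration only.

```python
import numpy as np

def train_weakly_supervised(images, treatments, n_hidden=8, lr=0.1, epochs=300, seed=0):
    """Train a tiny classifier that predicts the treatment label of each image.

    The treatment label is the only (weak) supervision signal.
    Returns both weight matrices and the cross-entropy loss per epoch.
    """
    rng = np.random.default_rng(seed)
    n, d = images.shape
    k = int(treatments.max()) + 1
    W1 = rng.normal(0, 0.1, (d, n_hidden))   # input -> hidden (the representation)
    W2 = rng.normal(0, 0.1, (n_hidden, k))   # hidden -> treatment logits
    onehot = np.eye(k)[treatments]
    losses = []
    for _ in range(epochs):
        h = np.tanh(images @ W1)
        logits = h @ W2
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        p = np.exp(logits)
        p /= p.sum(axis=1, keepdims=True)
        losses.append(-np.mean(np.log(p[np.arange(n), treatments] + 1e-12)))
        # Backpropagate the softmax cross-entropy loss (full-batch gradient descent).
        g_logits = (p - onehot) / n
        g_W2 = h.T @ g_logits
        g_h = g_logits @ W2.T
        g_W1 = images.T @ (g_h * (1 - h ** 2))
        W1 -= lr * g_W1
        W2 -= lr * g_W2
    return W1, W2, losses

def embed(images, W1):
    """Hidden-layer activations, used as the learned representation."""
    return np.tanh(images @ W1)

def treatment_profiles(features, treatments):
    """Average the features of all images sharing a treatment into one profile."""
    k = int(treatments.max()) + 1
    return np.stack([features[treatments == t].mean(axis=0) for t in range(k)])

# Synthetic stand-in data: 3 hypothetical treatments, 60 flattened "images" each.
rng = np.random.default_rng(1)
treatments = np.repeat(np.arange(3), 60)
centers = rng.normal(0, 1, (3, 16))
images = centers[treatments] + rng.normal(0, 0.5, (180, 16))

W1, W2, losses = train_weakly_supervised(images, treatments)
profiles = treatment_profiles(embed(images, W1), treatments)
print(profiles.shape, losses[0], losses[-1])
```

In this sketch the classifier output is discarded after training and only the hidden features are kept. As the abstract notes, such features may encode confounding factors alongside phenotypic ones, which is the motivation the authors give for training on data from several studies.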

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d5/10881515/dc9912fb69c6/41467_2024_45999_Fig1_HTML.jpg
