Transfer learning for versatile and training free high content screening analyses.

Affiliations

Computational Bioimaging and Bioinformatics, Institut de Biologie de l'Ecole Normale Supérieure, PSL University, 46 Rue d'Ulm, 75005, Paris, France.

Biophenics Laboratory, Department of Translational Research, Cell and Tissue Imaging Facility (PICT-IBiSA), Institut Curie, PSL Research University, 26 Rue d'Ulm, 75005, Paris, France.

Publication information

Sci Rep. 2023 Dec 18;13(1):22599. doi: 10.1038/s41598-023-49554-8.

DOI:10.1038/s41598-023-49554-8

PMID:38114550

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10730630/

Abstract

High content screening (HCS) is a technology that automates cell biology experiments at large scale. A high content screen produces a large number of microscopy images of cells under many conditions and requires a dedicated image and data analysis workflow to be designed for each assay to select hits. This heavy data analysis step remains challenging and has been recognized as one of the burdens hindering the adoption of HCS. In this work we propose a solution to hit selection that uses transfer learning without additional training. A pretrained residual network is employed to encode each image of a screen into a discriminant representation. The deep features obtained are then corrected to account for well plate bias and misalignment. We then propose two training-free pipelines dedicated to the two main categories of HCS for compound selection: with or without a positive control. When a positive control is available, it is used alongside the negative control to compute a linear discriminant axis, thus building a classifier without training. Once all samples are projected onto this axis, the conditions that best reproduce the positive control can be selected. When no positive control is available, the Mahalanobis distance is computed from each sample to the negative control distribution. This distance provides a metric to identify the conditions that alter the negative control's cell phenotype. The metric is subsequently used to categorize hits through a clustering step. Given the lack of available ground truth in HCS, we provide a qualitative comparison of the results obtained using this approach with results obtained with handcrafted image analysis features for compound and siRNA screens with or without a control. Our results suggest that the fully automated and generic pipeline we propose offers a good alternative to handcrafted dedicated image analysis approaches. Furthermore, we demonstrate that this solution selects conditions of interest that had not been identified using the primary dedicated analysis. Altogether, this approach provides a fully automated, reproducible, versatile and comprehensive alternative analysis solution for HCS encompassing compound-based or downregulation screens, with or without positive controls, without the need for training or cell detection, or the development of a dedicated image analysis workflow.
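
The two scoring steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the deep features have already been extracted and plate-corrected into NumPy arrays with one row per sample, and the function names, the small ridge term added to the covariance matrices, and the synthetic data are all hypothetical choices made for illustration. The clustering step that follows the Mahalanobis scoring is omitted.

```python
import numpy as np


def lda_axis(neg, pos, ridge=1e-6):
    """Unit-length Fisher discriminant axis separating the two control groups."""
    mu_n, mu_p = neg.mean(axis=0), pos.mean(axis=0)
    # Pooled within-class covariance of the two controls.
    pooled = (np.cov(neg, rowvar=False) * (len(neg) - 1)
              + np.cov(pos, rowvar=False) * (len(pos) - 1))
    pooled /= len(neg) + len(pos) - 2
    # Ridge term (an assumption of this sketch) keeps the matrix invertible
    # when there are more features than control samples.
    pooled += ridge * np.eye(neg.shape[1])
    w = np.linalg.solve(pooled, mu_p - mu_n)
    return w / np.linalg.norm(w)


def project(samples, w, neg, pos):
    """Score samples on the axis: about 0 at the negative control mean,
    about 1 at the positive control mean."""
    lo, hi = (neg @ w).mean(), (pos @ w).mean()
    return (samples @ w - lo) / (hi - lo)


def mahalanobis_to_negative(samples, neg, ridge=1e-6):
    """Mahalanobis distance of each sample to the negative control distribution."""
    mu = neg.mean(axis=0)
    cov = np.cov(neg, rowvar=False) + ridge * np.eye(neg.shape[1])
    diff = samples - mu
    solved = np.linalg.solve(cov, diff.T).T  # rows are cov^{-1} @ diff_i
    return np.sqrt(np.einsum("ij,ij->i", diff, solved))


# Synthetic stand-ins for corrected deep features (hypothetical data).
rng = np.random.default_rng(0)
shift = np.array([3.0, 0.0, 0.0, 0.0, 0.0])
neg_ctrl = rng.normal(size=(200, 5))
pos_ctrl = rng.normal(size=(200, 5)) + shift
conditions = np.vstack([np.zeros(5), shift])  # one inactive-like, one active-like

# Pipeline with a positive control: rank conditions along the discriminant axis.
axis = lda_axis(neg_ctrl, pos_ctrl)
scores = project(conditions, axis, neg_ctrl, pos_ctrl)

# Pipeline without a positive control: distance from the negative control.
dists = mahalanobis_to_negative(conditions, neg_ctrl)

print("LDA scores:", scores)
print("Mahalanobis distances:", dists)
```

In this sketch, a condition scoring close to 1 on the discriminant axis resembles the positive control, while a large Mahalanobis distance flags a condition whose phenotype departs from the negative control, whatever the direction of the change.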


Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3750/10730630/fbc85f6b0e21/41598_2023_49554_Fig1_HTML.jpg
