从表型谱和化学结构预测化合物活性。

Predicting compound activity from phenotypic profiles and chemical structures.

机构信息

Broad Institute of MIT and Harvard, Cambridge, USA.

Biological Research Centre, Szeged, Hungary.

出版信息

Nat Commun. 2023 Apr 8;14(1):1967. doi: 10.1038/s41467-023-37570-1.

DOI:10.1038/s41467-023-37570-1

PMID:37031208

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10082762/

Abstract

Predicting assay results for compounds virtually using chemical structures and phenotypic profiles has the potential to reduce the time and resources of screens for drug discovery. Here, we evaluate the relative strength of three high-throughput data sources-chemical structures, imaging (Cell Painting), and gene-expression profiles (L1000)-to predict compound bioactivity using a historical collection of 16,170 compounds tested in 270 assays for a total of 585,439 readouts. All three data modalities can predict compound activity for 6-10% of assays, and in combination they predict 21% of assays with high accuracy, which is a 2 to 3 times higher success rate than using a single modality alone. In practice, the accuracy of predictors could be lower and still be useful, increasing the assays that can be predicted from 37% with chemical structures alone up to 64% when combined with phenotypic data. Our study shows that unbiased phenotypic profiling can be leveraged to enhance compound bioactivity prediction to accelerate the early stages of the drug-discovery process.

摘要

利用化学结构和表型谱虚拟预测化合物的检测结果，有望减少药物发现筛选的时间和资源。在这里，我们评估了三种高通量数据源（化学结构、成像（细胞染色）和基因表达谱（L1000））的相对强度，以使用历史上的 16170 种化合物进行 270 种检测的集合来预测化合物的生物活性，总共产生了 585439 个读数。所有三种数据模式都可以预测 6-10%的检测结果，而组合使用可以以高精度预测 21%的检测结果，成功率比单独使用单一模式高 2 到 3 倍。在实践中，预测器的准确性可能较低，但仍然有用，将可以预测的检测结果从单独使用化学结构的 37%增加到与表型数据结合使用的 64%。我们的研究表明，可以利用无偏表型分析来增强化合物生物活性预测，从而加速药物发现过程的早期阶段。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6c7/10082762/4cbfa586a91c/41467_2023_37570_Fig1_HTML.jpg

相似文献

Predicting compound activity from phenotypic profiles and chemical structures.

Nat Commun. 2023 Apr 8;14(1):1967. doi: 10.1038/s41467-023-37570-1.

Cell Painting-based bioactivity prediction boosts high-throughput screening hit-rates and compound diversity.

Nat Commun. 2024 Apr 24;15(1):3470. doi: 10.1038/s41467-024-47171-1.

Comparison of Approaches for Determining Bioactivity Hits from High-Dimensional Profiling Data.

SLAS Discov. 2021 Feb;26(2):292-308. doi: 10.1177/2472555220950245. Epub 2020 Aug 29.

Enhancing the Small-Scale Screenable Biological Space beyond Known Chemogenomics Libraries with Gray Chemical Matter─Compounds with Novel Mechanisms from High-Throughput Screening Profiles.

ACS Chem Biol. 2024 Apr 19;19(4):938-952. doi: 10.1021/acschembio.3c00737. Epub 2024 Apr 2.

Application of Cell Painting for chemical hazard evaluation in support of screening-level chemical assessments.

Toxicol Appl Pharmacol. 2023 Jun 1;468:116513. doi: 10.1016/j.taap.2023.116513. Epub 2023 Apr 11.

DRUG-seq for miniaturized high-throughput transcriptome profiling in drug discovery.

Nat Commun. 2018 Oct 17;9(1):4307. doi: 10.1038/s41467-018-06500-x.

Compound Activity Prediction with Dose-Dependent Transcriptomic Profiles and Deep Learning.

J Chem Inf Model. 2024 Apr 8;64(7):2695-2704. doi: 10.1021/acs.jcim.3c01855. Epub 2024 Jan 31.

Public Domain HTS Fingerprints: Design and Evaluation of Compound Bioactivity Profiles from PubChem's Bioassay Repository.

J Chem Inf Model. 2016 Feb 22;56(2):390-8. doi: 10.1021/acs.jcim.5b00498. Epub 2016 Jan 14.

Bioactivity screening of environmental chemicals using imaging-based high-throughput phenotypic profiling.

Toxicol Appl Pharmacol. 2020 Jan 15;389:114876. doi: 10.1016/j.taap.2019.114876. Epub 2019 Dec 30.

Quantitative Prioritization of Tool Compounds for Phenotypic Screening.

Methods Mol Biol. 2018;1787:195-206. doi: 10.1007/978-1-4939-7847-2_15.

引用本文的文献

Prediction of cellular morphology changes under perturbations with a transcriptome-guided diffusion model.

Nat Commun. 2025 Sep 2;16(1):8210. doi: 10.1038/s41467-025-63478-z.

High-throughput profiling of chemical-induced gene expression across 93,644 perturbations.

Nat Methods. 2025 Aug 18. doi: 10.1038/s41592-025-02781-5.

Progress and new challenges in image-based profiling.

ArXiv. 2025 Aug 7:arXiv:2508.05800v1.

Morphological map of under- and overexpression of genes in human cells.

Nat Methods. 2025 Aug;22(8):1742-1752. doi: 10.1038/s41592-025-02753-9. Epub 2025 Aug 7.

Machine Learning for Toxicity Prediction Using Chemical Structures: Pillars for Success in the Real World.

Chem Res Toxicol. 2025 May 19;38(5):759-807. doi: 10.1021/acs.chemrestox.5c00033. Epub 2025 May 2.

Cell Painting PLUS: expanding the multiplexing capacity of Cell Painting-based phenotypic profiling using iterative staining-elution cycles.

Nat Commun. 2025 Apr 24;16(1):3857. doi: 10.1038/s41467-025-58765-8.

CPHNet: a novel pipeline for anti-HAPE drug screening via deep learning-based Cell Painting scoring.

Respir Res. 2025 Mar 8;26(1):91. doi: 10.1186/s12931-025-03173-1.

Applications of Artificial Intelligence in Drug Repurposing.

Adv Sci (Weinh). 2025 Apr;12(14):e2411325. doi: 10.1002/advs.202411325. Epub 2025 Mar 6.

Evaluating feature extraction in ovarian cancer cell line co-cultures using deep neural networks.

Commun Biol. 2025 Feb 25;8(1):303. doi: 10.1038/s42003-025-07766-w.

State of the ART: Drug Screening Reveals Artesunate as a Promising Anti-Fibrosis Therapy.

J Respir Biol Transl Med. 2025 Mar;2(1). doi: 10.70322/jrbtm.2024.10023. Epub 2024 Dec 16.

本文引用的文献

Learning representations for image-based profiling of perturbations.

Nat Commun. 2024 Feb 21;15(1):1594. doi: 10.1038/s41467-024-45999-1.

Exposing the Limitations of Molecular Machine Learning with Activity Cliffs.

J Chem Inf Model. 2022 Dec 12;62(23):5938-5951. doi: 10.1021/acs.jcim.2c01073. Epub 2022 Dec 1.

Integrating cell morphology with gene expression and chemical structure to aid mitochondrial toxicity detection.

Commun Biol. 2022 Aug 23;5(1):858. doi: 10.1038/s42003-022-03763-5.

Connecting chemistry and biology through molecular descriptors.

Curr Opin Chem Biol. 2022 Feb;66:102090. doi: 10.1016/j.cbpa.2021.09.001. Epub 2021 Oct 6.

Image-based cell phenotyping with deep learning.

Curr Opin Chem Biol. 2021 Dec;65:9-17. doi: 10.1016/j.cbpa.2021.04.001. Epub 2021 May 21.

Comparison of Chemical Structure and Cell Morphology Information for Multitask Bioactivity Predictions.

J Chem Inf Model. 2021 Mar 22;61(3):1444-1456. doi: 10.1021/acs.jcim.0c00864. Epub 2021 Mar 4.

Predicting cell health phenotypes using image-based morphology profiling.

Mol Biol Cell. 2021 Apr 19;32(9):995-1005. doi: 10.1091/mbc.E20-12-0784. Epub 2021 Feb 3.

Image-based profiling for drug discovery: due for a machine-learning upgrade?

Nat Rev Drug Discov. 2021 Feb;20(2):145-159. doi: 10.1038/s41573-020-00117-w. Epub 2020 Dec 22.

Compressing gene expression data using multiple latent space dimensionalities learns complementary biological representations.

Genome Biol. 2020 May 11;21(1):109. doi: 10.1186/s13059-020-02021-3.

A Deep Learning Approach to Antibiotic Discovery.

Cell. 2020 Feb 20;180(4):688-702.e13. doi: 10.1016/j.cell.2020.01.021.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

从表型谱和化学结构预测化合物活性。

Predicting compound activity from phenotypic profiles and chemical structures.

机构信息

Broad Institute of MIT and Harvard, Cambridge, USA.

Biological Research Centre, Szeged, Hungary.

出版信息

Nat Commun. 2023 Apr 8;14(1):1967. doi: 10.1038/s41467-023-37570-1.

DOI:10.1038/s41467-023-37570-1

PMID:37031208

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10082762/

Abstract

摘要

从表型谱和化学结构预测化合物活性。

Predicting compound activity from phenotypic profiles and chemical structures.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

从表型谱和化学结构预测化合物活性。

Predicting compound activity from phenotypic profiles and chemical structures.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献