探索利用细胞系产生的化合物诱导转录组数据预测化合物对分子靶点的活性。

Exploring the Use of Compound-Induced Transcriptomic Data Generated From Cell Lines to Predict Compound Activity Toward Molecular Targets.

作者信息

Baillif Benoît, Wichard Joerg, Méndez-Lucio Oscar, Rouquié David

机构信息

Bayer SAS, Bayer CropScience, Sophia Antipolis, France.

Department of Genetic Toxicology, Bayer AG, Berlin, Germany.

出版信息

Front Chem. 2020 Apr 23;8:296. doi: 10.3389/fchem.2020.00296. eCollection 2020.

DOI:10.3389/fchem.2020.00296

PMID:32391323

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7191531/

Abstract

Pharmaceutical or phytopharmaceutical molecules rely on the interaction with one or more specific molecular targets to induce their anticipated biological responses. Nonetheless, these compounds are also prone to interact with many other non-intended biological targets, also known as off-targets. Unfortunately, off-target identification is difficult and expensive. Consequently, QSAR models predicting the activity on a target have gained importance in drug discovery or in the de-risking of chemicals. However, a restricted number of targets are well characterized and hold enough data to build such models. A good alternative to individual target evaluations is to use integrative evaluations such as transcriptomics obtained from compound-induced gene expression measurements derived from cell cultures. The advantage of these particular experiments is to capture the consequences of the interaction of compounds on many possible molecular targets and biological pathways, without having any constraints concerning the chemical space. In this work, we assessed the value of a large public dataset of compound-induced transcriptomic data, to predict compound activity on a selection of 69 molecular targets. We compared such descriptors with other QSAR descriptors, namely the Morgan fingerprints (similar to extended-connectivity fingerprints). Depending on the target, active compounds could show similar signatures in one or multiple cell lines, whether these active compounds shared similar or different chemical structures. Random forest models using gene expression signatures were able to perform similarly or better than counterpart models built with Morgan fingerprints for 25% of the target prediction tasks. These performances occurred mostly using signatures produced in cell lines showing similar signatures for active compounds toward the considered target. We show that compound-induced transcriptomic data could represent a great opportunity for target prediction, allowing to overcome the chemical space limitation of QSAR models.

摘要

药物或植物药物分子依靠与一个或多个特定分子靶点相互作用来诱导预期的生物学反应。然而，这些化合物也容易与许多其他非预期的生物学靶点相互作用，这些靶点也被称为脱靶。不幸的是，脱靶鉴定既困难又昂贵。因此，预测靶点活性的定量构效关系（QSAR）模型在药物发现或化学品风险降低方面变得越来越重要。然而，只有有限数量的靶点得到了充分表征并拥有足够的数据来构建此类模型。个体靶点评估的一个很好的替代方法是使用综合评估，例如从细胞培养物中化合物诱导的基因表达测量获得的转录组学。这些特定实验的优势在于能够捕捉化合物与许多可能的分子靶点和生物学途径相互作用的后果，而不受化学空间的任何限制。在这项工作中，我们评估了一个大型化合物诱导转录组数据公共数据集的价值，以预测化合物对69个分子靶点的活性。我们将这些描述符与其他QSAR描述符进行了比较，即摩根指纹（类似于扩展连接性指纹）。根据靶点的不同，活性化合物在一种或多种细胞系中可能表现出相似的特征，无论这些活性化合物具有相似或不同的化学结构。使用基因表达特征的随机森林模型在25%的靶点预测任务中能够表现得与使用摩根指纹构建的对应模型相似或更好。这些性能大多出现在使用对所考虑靶点的活性化合物显示相似特征的细胞系中产生的特征时。我们表明，化合物诱导的转录组数据可能为靶点预测提供一个很好的机会，从而克服QSAR模型的化学空间限制。

相似文献

Exploring the Use of Compound-Induced Transcriptomic Data Generated From Cell Lines to Predict Compound Activity Toward Molecular Targets.探索利用细胞系产生的化合物诱导转录组数据预测化合物对分子靶点的活性。

Front Chem. 2020 Apr 23;8:296. doi: 10.3389/fchem.2020.00296. eCollection 2020.

Toxicity prediction using target, interactome, and pathway profiles as descriptors.基于靶标、互作组和通路谱特征进行毒性预测。

Toxicol Lett. 2023 May 15;381:20-26. doi: 10.1016/j.toxlet.2023.04.005. Epub 2023 Apr 13.

Targeting HIV/HCV Coinfection Using a Machine Learning-Based Multiple Quantitative Structure-Activity Relationships (Multiple QSAR) Method.基于机器学习的多重定量构效关系（多重 QSAR）方法靶向 HIV/HCV 共感染。

Int J Mol Sci. 2019 Jul 22;20(14):3572. doi: 10.3390/ijms20143572.

A new ChEMBL dataset for the similarity-based target fishing engine FastTargetPred: Annotation of an exhaustive list of linear tetrapeptides.用于基于相似性的靶点筛选引擎FastTargetPred的新ChEMBL数据集：线性四肽详尽列表的注释

Data Brief. 2022 Apr 11;42:108159. doi: 10.1016/j.dib.2022.108159. eCollection 2022 Jun.

Exploring QSAR models for activity-cliff prediction.探索用于活性悬崖预测的定量构效关系模型。

J Cheminform. 2023 Apr 17;15(1):47. doi: 10.1186/s13321-023-00708-w.

Machine Learning Uses Chemo-Transcriptomic Profiles to Stratify Antimalarial Compounds With Similar Mode of Action.机器学习利用化转录组特征对作用模式相似的抗疟化合物进行分层。

Front Cell Infect Microbiol. 2021 Jun 29;11:688256. doi: 10.3389/fcimb.2021.688256. eCollection 2021.

On the correspondence between the transcriptomic response of a compound and its effects on its targets.关于化合物的转录组反应与其对靶标影响之间的对应关系。

BMC Bioinformatics. 2023 May 19;24(1):207. doi: 10.1186/s12859-023-05337-6.

Editorial: Current status and perspective on drug targets in tubercle bacilli and drug design of antituberculous agents based on structure-activity relationship.社论：结核杆菌药物靶点的现状与展望以及基于构效关系的抗结核药物设计

Curr Pharm Des. 2014;20(27):4305-6. doi: 10.2174/1381612819666131118203915.

QSAR-derived affinity fingerprints (part 2): modeling performance for potency prediction.基于定量构效关系的亲和力指纹图谱（第2部分）：效能预测的建模性能

J Cheminform. 2020 Jun 5;12(1):41. doi: 10.1186/s13321-020-00444-5.

How diverse are diversity assessment methods? A comparative analysis and benchmarking of molecular descriptor space.多样性评估方法有哪些差异？分子描述符空间的比较分析和基准测试。

J Chem Inf Model. 2014 Jan 27;54(1):230-42. doi: 10.1021/ci400469u. Epub 2013 Dec 13.

引用本文的文献

Application of perturbation gene expression profiles in drug discovery-From mechanism of action to quantitative modelling.扰动基因表达谱在药物发现中的应用——从作用机制到定量建模

Front Syst Biol. 2023 Feb 9;3:1126044. doi: 10.3389/fsysb.2023.1126044. eCollection 2023.

Targeting Prostate Cancer Metabolism Through Transcriptional and Epigenetic Modulation: A Multi-Target Approach to Therapeutic Innovation.通过转录和表观遗传调控靶向前列腺癌代谢：治疗创新的多靶点方法

Int J Mol Sci. 2025 Jun 23;26(13):6013. doi: 10.3390/ijms26136013.

Protocol for predicting suppressors of cell-death pathways based on transcriptomic and vulnerability data.基于转录组学和易感性数据预测细胞死亡途径抑制因子的方案。

STAR Protoc. 2025 Jun 20;6(2):103855. doi: 10.1016/j.xpro.2025.103855. Epub 2025 May 29.

Optimizing Cancer Treatment: Exploring the Role of AI in Radioimmunotherapy.优化癌症治疗：探索人工智能在放射免疫治疗中的作用。

Diagnostics (Basel). 2025 Feb 6;15(3):397. doi: 10.3390/diagnostics15030397.

Computational pipeline predicting cell death suppressors as targets for cancer therapy.预测细胞死亡抑制因子作为癌症治疗靶点的计算流程。

iScience. 2024 Aug 30;27(9):110859. doi: 10.1016/j.isci.2024.110859. eCollection 2024 Sep 20.

Signature analysis of high-throughput transcriptomics screening data for mechanistic inference and chemical grouping.高通量转录组筛选数据的特征分析用于机制推断和化学分组。

Toxicol Sci. 2024 Nov 1;202(1):103-122. doi: 10.1093/toxsci/kfae108.

Cell Painting-based bioactivity prediction boosts high-throughput screening hit-rates and compound diversity.基于细胞绘画的生物活性预测提高了高通量筛选的命中率和化合物多样性。

Nat Commun. 2024 Apr 24;15(1):3470. doi: 10.1038/s41467-024-47171-1.

Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis.基于基因表达的化合物作用机制分析的因果推理算法的基准测试。

BMC Bioinformatics. 2023 Apr 18;24(1):154. doi: 10.1186/s12859-023-05277-1.

Computational analyses of mechanism of action (MoA): data, methods and integration.作用机制的计算分析：数据、方法与整合

RSC Chem Biol. 2021 Dec 22;3(2):170-200. doi: 10.1039/d1cb00069a. eCollection 2022 Feb 9.

Exploration of the DARTable Genome- a Resource Enabling Data-Driven NAMs for Developmental and Reproductive Toxicity Prediction.DARTable基因组探索——一种助力基于数据驱动的发育和生殖毒性预测的NAMs的资源

Front Toxicol. 2022 Jan 19;3:806311. doi: 10.3389/ftox.2021.806311. eCollection 2021.

本文引用的文献

De novo generation of hit-like molecules from gene expression signatures using artificial intelligence.利用人工智能从基因表达特征生成类似命中的新分子。

Nat Commun. 2020 Jan 3;11(1):10. doi: 10.1038/s41467-019-13807-w.

Advancing computational biology and bioinformatics research through open innovation competitions.通过开放式创新竞赛推动计算生物学和生物信息学研究。

PLoS One. 2019 Sep 27;14(9):e0222165. doi: 10.1371/journal.pone.0222165. eCollection 2019.

Combining structural and bioactivity-based fingerprints improves prediction performance and scaffold hopping capability.结合基于结构和生物活性的指纹图谱可提高预测性能和骨架跳跃能力。

J Cheminform. 2019 Aug 8;11(1):54. doi: 10.1186/s13321-019-0376-1.

Comprehensive transcriptomic analysis of cell lines as models of primary tumors across 22 tumor types.22 种肿瘤类型中细胞系作为原发性肿瘤模型的综合转录组分析。

Nat Commun. 2019 Aug 8;10(1):3574. doi: 10.1038/s41467-019-11415-2.

Leveraging Image-Derived Phenotypic Measurements for Drug-Target Interaction Predictions.利用图像衍生的表型测量进行药物-靶点相互作用预测。

Cancer Inform. 2019 Jun 12;18:1176935119856595. doi: 10.1177/1176935119856595. eCollection 2019.

Applications of machine learning in drug discovery and development.机器学习在药物发现和开发中的应用。

Nat Rev Drug Discov. 2019 Jun;18(6):463-477. doi: 10.1038/s41573-019-0024-5.

The Carcinogenome Project: In Vitro Gene Expression Profiling of Chemical Perturbations to Predict Long-Term Carcinogenicity.致癌基因组计划：化学干扰的体外基因表达谱分析预测长期致癌性。

Environ Health Perspect. 2019 Apr;127(4):47002. doi: 10.1289/EHP3986.

Accurate Prediction of Biological Assays with High-Throughput Microscopy Images and Convolutional Networks.高通量显微镜图像和卷积网络在生物测定中的精确预测。

J Chem Inf Model. 2019 Mar 25;59(3):1163-1171. doi: 10.1021/acs.jcim.8b00670. Epub 2019 Mar 6.

Predicting protein targets for drug-like compounds using transcriptomics.基于转录组学预测类药化合物的蛋白靶标。

PLoS Comput Biol. 2018 Dec 7;14(12):e1006651. doi: 10.1371/journal.pcbi.1006651. eCollection 2018 Dec.

Analysis of Time-Series Gene Expression Data to Explore Mechanisms of Chemical-Induced Hepatic Steatosis Toxicity.分析时间序列基因表达数据以探索化学诱导的肝脂肪变性毒性机制。

Front Genet. 2018 Sep 18;9:396. doi: 10.3389/fgene.2018.00396. eCollection 2018.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

探索利用细胞系产生的化合物诱导转录组数据预测化合物对分子靶点的活性。

Exploring the Use of Compound-Induced Transcriptomic Data Generated From Cell Lines to Predict Compound Activity Toward Molecular Targets.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献