• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

图神经网络在癌细胞系的表型虚拟筛选方面很有前景。

Graph neural networks are promising for phenotypic virtual screening on cancer cell lines.

作者信息

Vishwakarma Sachin, Hernandez-Hernandez Saiveth, Ballester Pedro J

机构信息

Evotec SAS (France), Toulouse, France.

Centre de Recherche en Cancérologie de Marseille, Marseille 13009, France.

出版信息

Biol Methods Protoc. 2024 Sep 3;9(1):bpae065. doi: 10.1093/biomethods/bpae065. eCollection 2024.

DOI:10.1093/biomethods/bpae065
PMID:39502795
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11537795/
Abstract

Artificial intelligence is increasingly driving early drug design, offering novel approaches to virtual screening. Phenotypic virtual screening (PVS) aims to predict how cancer cell lines respond to different compounds by focusing on observable characteristics rather than specific molecular targets. Some studies have suggested that deep learning may not be the best approach for PVS. However, these studies are limited by the small number of tested molecules as well as not employing suitable performance metrics and dissimilar-molecules splits better mimicking the challenging chemical diversity of real-world screening libraries. Here we prepared 60 datasets, each containing approximately 30 000-50 000 molecules tested for their growth inhibitory activities on one of the NCI-60 cancer cell lines. We conducted multiple performance evaluations of each of the five machine learning algorithms for PVS on these 60 problem instances. To provide even a more comprehensive evaluation, we used two model validation types: the random split and the dissimilar-molecules split. Overall, about 14 440 training runs aczross datasets were carried out per algorithm. The models were primarily evaluated using hit rate, a more suitable metric in VS contexts. The results show that all models are more challenged by test molecules that are substantially different from those in the training data. In both validation types, the D-MPNN algorithm, a graph-based deep neural network, was found to be the most suitable for building predictive models for this PVS problem.

摘要

人工智能在早期药物设计中发挥着越来越重要的作用,为虚拟筛选提供了新方法。表型虚拟筛选(PVS)旨在通过关注可观察到的特征而非特定分子靶点来预测癌细胞系对不同化合物的反应。一些研究表明,深度学习可能不是PVS的最佳方法。然而,这些研究受到测试分子数量少以及未采用合适的性能指标和更好模拟现实世界筛选文库具有挑战性的化学多样性的不同分子划分的限制。在这里,我们准备了60个数据集,每个数据集包含大约30000-50000个针对NCI-60癌细胞系之一测试其生长抑制活性的分子。我们对这60个问题实例上用于PVS的五种机器学习算法中的每一种进行了多次性能评估。为了提供更全面的评估,我们使用了两种模型验证类型:随机划分和不同分子划分。总体而言,每种算法在各个数据集上大约进行了14440次训练运行。这些模型主要使用命中率进行评估,命中率在虚拟筛选环境中是更合适的指标。结果表明,所有模型都受到与训练数据中分子差异很大的测试分子的更大挑战。在两种验证类型中,基于图的深度神经网络D-MPNN算法被发现最适合为这个PVS问题构建预测模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edf1/11537795/b4f2ce44ff32/bpae065f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edf1/11537795/5c2cb69efb07/bpae065f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edf1/11537795/bcb4dab7353a/bpae065f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edf1/11537795/05ee574865f0/bpae065f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edf1/11537795/272d6c5fc094/bpae065f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edf1/11537795/b4f2ce44ff32/bpae065f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edf1/11537795/5c2cb69efb07/bpae065f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edf1/11537795/bcb4dab7353a/bpae065f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edf1/11537795/05ee574865f0/bpae065f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edf1/11537795/272d6c5fc094/bpae065f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/edf1/11537795/b4f2ce44ff32/bpae065f5.jpg

相似文献

1
Graph neural networks are promising for phenotypic virtual screening on cancer cell lines.图神经网络在癌细胞系的表型虚拟筛选方面很有前景。
Biol Methods Protoc. 2024 Sep 3;9(1):bpae065. doi: 10.1093/biomethods/bpae065. eCollection 2024.
2
Evaluation of QSAR Equations for Virtual Screening.QSAR 方程在虚拟筛选中的评估。
Int J Mol Sci. 2020 Oct 22;21(21):7828. doi: 10.3390/ijms21217828.
3
Employing Molecular Conformations for Ligand-Based Virtual Screening with Equivariant Graph Neural Network and Deep Multiple Instance Learning.利用基于分子构象的等价图神经网络和深度多重实例学习进行配体虚拟筛选。
Molecules. 2023 Aug 9;28(16):5982. doi: 10.3390/molecules28165982.
4
Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。
Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.
5
Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models.图神经网络能否为药物发现学习更好的分子表示?基于描述符和基于图的模型的比较研究。
J Cheminform. 2021 Feb 17;13(1):12. doi: 10.1186/s13321-020-00479-8.
6
DENVIS: Scalable and High-Throughput Virtual Screening Using Graph Neural Networks with Atomic and Surface Protein Pocket Features.DENVIS:使用具有原子和表面蛋白口袋特征的图神经网络进行可扩展的高通量虚拟筛选。
J Chem Inf Model. 2022 Oct 10;62(19):4642-4659. doi: 10.1021/acs.jcim.2c01057. Epub 2022 Sep 26.
7
Artificial intelligence to deep learning: machine intelligence approach for drug discovery.人工智能到深度学习:药物发现的机器智能方法。
Mol Divers. 2021 Aug;25(3):1315-1360. doi: 10.1007/s11030-021-10217-3. Epub 2021 Apr 12.
8
FP-GNN: a versatile deep learning architecture for enhanced molecular property prediction.FP-GNN:一种用于增强分子性质预测的多功能深度学习架构。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac408.
9
Machine learning random forest for predicting oncosomatic variant NGS analysis.机器学习随机森林预测肿瘤体细胞变异 NGS 分析。
Sci Rep. 2021 Nov 8;11(1):21820. doi: 10.1038/s41598-021-01253-y.
10
Discovering the Active Ingredients of Medicine and Food Homologous Substances for Inhibiting the Cyclooxygenase-2 Metabolic Pathway by Machine Learning Algorithms.利用机器学习算法发现抑制环氧化酶-2代谢途径的药食同源物质的有效成分。
Molecules. 2023 Sep 23;28(19):6782. doi: 10.3390/molecules28196782.

引用本文的文献

1
UMAP-based clustering split for rigorous evaluation of AI models for virtual screening on cancer cell lines.基于UMAP的聚类划分,用于对癌细胞系虚拟筛选的人工智能模型进行严格评估。
J Cheminform. 2025 Jun 10;17(1):94. doi: 10.1186/s13321-025-01039-8.
2
Cross-disciplinary perspectives on the potential for artificial intelligence across chemistry.关于人工智能在化学领域潜力的跨学科观点。
Chem Soc Rev. 2025 Apr 25. doi: 10.1039/d5cs00146c.

本文引用的文献

1
Deep representation learning of chemical-induced transcriptional profile for phenotype-based drug discovery.基于表型的药物发现中化学诱导转录谱的深度表示学习。
Nat Commun. 2024 Jun 25;15(1):5378. doi: 10.1038/s41467-024-49620-3.
2
AI is a viable alternative to high throughput screening: a 318-target study.人工智能是高通量筛选的可行替代方案:一项 318 靶点研究。
Sci Rep. 2024 Apr 2;14(1):7526. doi: 10.1038/s41598-024-54655-z.
3
A small-molecule TNIK inhibitor targets fibrosis in preclinical and clinical models.一种小分子TNIK抑制剂在临床前和临床模型中靶向纤维化。
Nat Biotechnol. 2025 Jan;43(1):63-75. doi: 10.1038/s41587-024-02143-0. Epub 2024 Mar 8.
4
Chemprop: A Machine Learning Package for Chemical Property Prediction.Chemprop:一个用于化学性质预测的机器学习工具包。
J Chem Inf Model. 2024 Jan 8;64(1):9-17. doi: 10.1021/acs.jcim.3c01250. Epub 2023 Dec 26.
5
The AI revolution in chemistry is not that far away.化学领域的人工智能革命并不遥远。
Nature. 2023 Dec;624(7991):252. doi: 10.1038/d41586-023-03948-w.
6
A practical guide to machine-learning scoring for structure-based virtual screening.基于结构的虚拟筛选的机器学习评分实用指南。
Nat Protoc. 2023 Nov;18(11):3460-3511. doi: 10.1038/s41596-023-00885-w. Epub 2023 Oct 16.
7
Antiproliferative Activity Predictor: A New Reliable In Silico Tool for Drug Response Prediction against NCI60 Panel.抗增殖活性预测器:一种新的可靠的用于预测 NCI60 面板药物反应的计算工具。
Int J Mol Sci. 2022 Nov 19;23(22):14374. doi: 10.3390/ijms232214374.
8
Phenotypic drug discovery: recent successes, lessons learned and new directions.表型药物发现:近期的成功、经验教训和新方向。
Nat Rev Drug Discov. 2022 Dec;21(12):899-914. doi: 10.1038/s41573-022-00472-w. Epub 2022 May 30.
9
When are two hydrogen bonds better than one? Accurate first-principles models explain the balance of hydrogen bond donors and acceptors found in proteins.两个氢键何时优于一个?精确的第一性原理模型解释了蛋白质中氢键供体和受体的平衡。
Chem Sci. 2020 Nov 19;12(3):1147-1162. doi: 10.1039/d0sc05084a. eCollection 2021 Jan 21.
10
Machine Learning Enables Accurate and Rapid Prediction of Active Molecules Against Breast Cancer Cells.机器学习能够准确快速地预测抗乳腺癌细胞的活性分子。
Front Pharmacol. 2021 Dec 17;12:796534. doi: 10.3389/fphar.2021.796534. eCollection 2021.