基于高结构可区分性的深度主动学习在分子突变预测中的应用。

Deep active learning with high structural discriminability for molecular mutagenicity prediction.

机构信息

Shanghai Key Laboratory of Power Station Automation Technology, School of Mechatronics Engineering and Automation, Shanghai University, Shanghai, China.

Academy of Military Medical Sciences, Beijing, China.

出版信息

Commun Biol. 2024 Aug 31;7(1):1071. doi: 10.1038/s42003-024-06758-6.

DOI:10.1038/s42003-024-06758-6

PMID:39217273

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11366013/

Abstract

The assessment of mutagenicity is essential in drug discovery, as it may lead to cancer and germ cells damage. Although in silico methods have been proposed for mutagenicity prediction, their performance is hindered by the scarcity of labeled molecules. However, experimental mutagenicity testing can be time-consuming and costly. One solution to reduce the annotation cost is active learning, where the algorithm actively selects the most valuable molecules from a vast chemical space and presents them to the oracle (e.g., a human expert) for annotation, thereby rapidly improving the model's predictive performance with a smaller annotation cost. In this paper, we propose muTOX-AL, a deep active learning framework, which can actively explore the chemical space and identify the most valuable molecules, resulting in competitive performance with a small number of labeled samples. The experimental results show that, compared to the random sampling strategy, muTOX-AL can reduce the number of training molecules by about 57%. Additionally, muTOX-AL exhibits outstanding molecular structural discriminability, allowing it to pick molecules with high structural similarity but opposite properties.

摘要

致突变性评估在药物发现中至关重要，因为它可能导致癌症和生殖细胞损伤。尽管已经提出了用于致突变性预测的计算方法，但由于标记分子的稀缺性，它们的性能受到限制。然而，实验性致突变性测试可能既耗时又昂贵。减少注释成本的一种解决方案是主动学习，其中算法从广阔的化学空间中主动选择最有价值的分子，并将其呈现给（例如，人类专家）进行注释，从而以较小的注释成本快速提高模型的预测性能。在本文中，我们提出了 muTOX-AL，这是一个深度主动学习框架，它可以主动探索化学空间并识别最有价值的分子，从而以少量标记样本实现有竞争力的性能。实验结果表明，与随机抽样策略相比，muTOX-AL 可以将训练分子的数量减少约 57%。此外，muTOX-AL 表现出出色的分子结构辨别能力，能够挑选具有高结构相似性但性质相反的分子。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6619/11366013/e16c7a8a0f3d/42003_2024_6758_Fig1_HTML.jpg

相似文献

Deep active learning with high structural discriminability for molecular mutagenicity prediction.基于高结构可区分性的深度主动学习在分子突变预测中的应用。

Commun Biol. 2024 Aug 31;7(1):1071. doi: 10.1038/s42003-024-06758-6.

QSAR modeling without descriptors using graph convolutional neural networks: the case of mutagenicity prediction.使用图卷积神经网络的无描述符定量构效关系建模：以致突变性预测为例

Mol Divers. 2021 Aug;25(3):1283-1299. doi: 10.1007/s11030-021-10250-2. Epub 2021 Jun 19.

In silico prediction of chemical Ames mutagenicity.计算机预测化学物质的致突变性。

J Chem Inf Model. 2012 Nov 26;52(11):2840-7. doi: 10.1021/ci300400a. Epub 2012 Oct 17.

Comparative evaluation of in silico systems for ames test mutagenicity prediction: scope and limitations.计算机系统预测 Ames 试验致突变性的比较评估：范围和局限性。

Chem Res Toxicol. 2011 Jun 20;24(6):843-54. doi: 10.1021/tx2000398. Epub 2011 May 2.

Multiple Instance Learning Improves Ames Mutagenicity Prediction for Problematic Molecular Species.多实例学习提高对问题分子物种的 Ames 致突变性预测。

Chem Res Toxicol. 2023 Aug 21;36(8):1227-1237. doi: 10.1021/acs.chemrestox.2c00372. Epub 2023 Jul 21.

Comparative evaluation of 11 in silico models for the prediction of small molecule mutagenicity: role of steric hindrance and electron-withdrawing groups.用于预测小分子致突变性的11种计算机模拟模型的比较评估：空间位阻和吸电子基团的作用

Toxicol Mech Methods. 2017 Jan;27(1):24-35. doi: 10.1080/15376516.2016.1174761. Epub 2016 Nov 4.

Optimizing machine-learning models for mutagenicity prediction through better feature selection.通过更好的特征选择来优化用于致突变性预测的机器学习模型。

Mutagenesis. 2022 Oct 26;37(3-4):191-202. doi: 10.1093/mutage/geac010.

In Silico Prediction of Chemical Toxicity Profile Using Local Lazy Learning.使用局部懒惰学习法对化学毒性特征进行计算机模拟预测

Comb Chem High Throughput Screen. 2017;20(4):346-353. doi: 10.2174/1386207320666170217151826.

Machine learning - Predicting Ames mutagenicity of small molecules.机器学习——预测小分子的艾姆斯致突变性。

J Mol Graph Model. 2021 Dec;109:108011. doi: 10.1016/j.jmgm.2021.108011. Epub 2021 Sep 5.

In Silico Approaches in Predictive Genetic Toxicology.预测性遗传毒理学中的计算机模拟方法。

Methods Mol Biol. 2019;2031:351-373. doi: 10.1007/978-1-4939-9646-9_20.

引用本文的文献

Evidential deep learning-based drug-target interaction prediction.基于证据深度学习的药物-靶点相互作用预测

Nat Commun. 2025 Jul 26;16(1):6915. doi: 10.1038/s41467-025-62235-6.

Recent advances in AI-based toxicity prediction for drug discovery.基于人工智能的药物发现毒性预测的最新进展。

Front Chem. 2025 Jul 8;13:1632046. doi: 10.3389/fchem.2025.1632046. eCollection 2025.

AMPred-MFG: Investigating the Mutagenicity of Compounds Using Motif-Based Graph Combined with Molecular Fingerprints and Graph Attention Mechanism.AMPred-MFG：利用基于基序的图结合分子指纹和图注意力机制研究化合物的致突变性。

Interdiscip Sci. 2025 Jul 16. doi: 10.1007/s12539-025-00742-2.

Advancing genetic engineering with active learning: theory, implementations and potential opportunities.通过主动学习推进基因工程：理论、实现与潜在机遇

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf286.

IECata: interpretable bilinear attention network and evidential deep learning improve the catalytic efficiency prediction of enzymes.IECata：可解释的双线性注意力网络和证据深度学习改进了酶的催化效率预测

Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf283.

A guide for active learning in synergistic drug discovery.协同药物发现中的主动学习指南。

Sci Rep. 2025 Jan 28;15(1):3484. doi: 10.1038/s41598-025-85600-3.

本文引用的文献

Uncertainty-driven dynamics for active learning of interatomic potentials.基于不确定性的原子间相互作用势主动学习动力学。

Nat Comput Sci. 2023 Mar;3(3):230-239. doi: 10.1038/s43588-023-00406-5. Epub 2023 Mar 6.

Scientific discovery in the age of artificial intelligence.人工智能时代的科学发现。

Nature. 2023 Aug;620(7972):47-60. doi: 10.1038/s41586-023-06221-2. Epub 2023 Aug 2.

Multiple Instance Learning Improves Ames Mutagenicity Prediction for Problematic Molecular Species.多实例学习提高对问题分子物种的 Ames 致突变性预测。

Chem Res Toxicol. 2023 Aug 21;36(8):1227-1237. doi: 10.1021/acs.chemrestox.2c00372. Epub 2023 Jul 21.

TOXRIC: a comprehensive database of toxicological data and benchmarks.TOXRIC：一个全面的毒理学数据和基准数据库。

Nucleic Acids Res. 2023 Jan 6;51(D1):D1432-D1445. doi: 10.1093/nar/gkac1074.

Multitask Deep Neural Networks for Ames Mutagenicity Prediction.多任务深度神经网络在 Ames 致突变性预测中的应用。

J Chem Inf Model. 2022 Dec 26;62(24):6342-6351. doi: 10.1021/acs.jcim.2c00532. Epub 2022 Sep 6.

A graph neural network approach for molecule carcinogenicity prediction.基于图神经网络的分子致癌性预测方法。

Bioinformatics. 2022 Jun 24;38(Suppl 1):i84-i91. doi: 10.1093/bioinformatics/btac266.

Optimizing machine-learning models for mutagenicity prediction through better feature selection.通过更好的特征选择来优化用于致突变性预测的机器学习模型。

Mutagenesis. 2022 Oct 26;37(3-4):191-202. doi: 10.1093/mutage/geac010.

VenomPred: A Machine Learning Based Platform for Molecular Toxicity Predictions.毒液预测：一个基于机器学习的分子毒性预测平台。

Int J Mol Sci. 2022 Feb 14;23(4):2105. doi: 10.3390/ijms23042105.

DeepReac+: deep active learning for quantitative modeling of organic chemical reactions.DeepReac+：用于有机化学反应定量建模的深度主动学习

Chem Sci. 2021 Oct 9;12(43):14459-14472. doi: 10.1039/d1sc02087k. eCollection 2021 Nov 10.

Active Learning for Drug Design: A Case Study on the Plasma Exposure of Orally Administered Drugs.药物设计中的主动学习：以口服药物的血浆暴露为例的案例研究。

J Med Chem. 2021 Nov 25;64(22):16838-16853. doi: 10.1021/acs.jmedchem.1c01683. Epub 2021 Nov 15.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于高结构可区分性的深度主动学习在分子突变预测中的应用。

Deep active learning with high structural discriminability for molecular mutagenicity prediction.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献