• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过更好的特征选择来优化用于致突变性预测的机器学习模型。

Optimizing machine-learning models for mutagenicity prediction through better feature selection.

机构信息

SBX Corporation, Tokyo, Japan.

Global Drug Safety, Eisai Co., Ltd., Tokyo, Japan.

出版信息

Mutagenesis. 2022 Oct 26;37(3-4):191-202. doi: 10.1093/mutage/geac010.

DOI:10.1093/mutage/geac010
PMID:35554560
Abstract

Assessing a compound's mutagenicity using machine learning is an important activity in the drug discovery and development process. Traditional methods of mutagenicity detection, such as Ames test, are expensive and time and labor intensive. In this context, in silico methods that predict a compound mutagenicity with high accuracy are important. Recently, machine-learning (ML) models are increasingly being proposed to improve the accuracy of mutagenicity prediction. While these models are used in practice, there is further scope to improve the accuracy of these models. We hypothesize that choosing the right features to train the model can further lead to better accuracy. We systematically consider and evaluate a combination of novel structural and molecular features which have the maximal impact on the accuracy of models. We rigorously evaluate these features against multiple classification models (from classical ML models to deep neural network models). The performance of the models was assessed using 5- and 10-fold cross-validation and we show that our approach using the molecule structure, molecular properties, and structural alerts as feature sets successfully outperform the state-of-the-art methods for mutagenicity prediction for the Hansen et al. benchmark dataset with an area under the receiver operating characteristic curve of 0.93. More importantly, our framework shows how combining features could benefit model accuracy improvements.

摘要

使用机器学习评估化合物的致突变性是药物发现和开发过程中的一项重要活动。传统的致突变性检测方法,如艾姆斯试验,既昂贵又费时费力。在这种情况下,能够准确预测化合物致突变性的计算方法就显得尤为重要。最近,越来越多的机器学习 (ML) 模型被提出,以提高致突变性预测的准确性。虽然这些模型在实践中得到了应用,但进一步提高这些模型的准确性仍有很大的空间。我们假设选择正确的特征来训练模型可以进一步提高准确性。我们系统地考虑和评估了一系列对模型准确性有最大影响的新型结构和分子特征。我们严格地将这些特征与多种分类模型(从经典的机器学习模型到深度神经网络模型)进行了比较。我们使用 5 折和 10 折交叉验证来评估模型的性能,结果表明,我们使用分子结构、分子性质和结构警报作为特征集的方法在 Hansen 等人的基准数据集上的预测性能明显优于最先进的方法,接收器操作特征曲线下的面积为 0.93。更重要的是,我们的框架展示了如何结合特征可以提高模型的准确性。

相似文献

1
Optimizing machine-learning models for mutagenicity prediction through better feature selection.通过更好的特征选择来优化用于致突变性预测的机器学习模型。
Mutagenesis. 2022 Oct 26;37(3-4):191-202. doi: 10.1093/mutage/geac010.
2
Machine learning - Predicting Ames mutagenicity of small molecules.机器学习——预测小分子的艾姆斯致突变性。
J Mol Graph Model. 2021 Dec;109:108011. doi: 10.1016/j.jmgm.2021.108011. Epub 2021 Sep 5.
3
A deep neural network-based approach for prediction of mutagenicity of compounds.基于深度神经网络的化合物突变预测方法。
Environ Sci Pollut Res Int. 2021 Sep;28(34):47641-47650. doi: 10.1007/s11356-021-14028-9. Epub 2021 Apr 24.
4
AMPred-CNN: Ames mutagenicity prediction model based on convolutional neural networks.AMPred-CNN:基于卷积神经网络的 Ames 诱变预测模型。
Comput Biol Med. 2024 Jun;176:108560. doi: 10.1016/j.compbiomed.2024.108560. Epub 2024 May 8.
5
Could deep learning in neural networks improve the QSAR models?神经网络中的深度学习能否改进 QSAR 模型?
SAR QSAR Environ Res. 2019 Sep;30(9):617-642. doi: 10.1080/1062936X.2019.1650827. Epub 2019 Aug 28.
6
QSAR modeling without descriptors using graph convolutional neural networks: the case of mutagenicity prediction.使用图卷积神经网络的无描述符定量构效关系建模:以致突变性预测为例
Mol Divers. 2021 Aug;25(3):1283-1299. doi: 10.1007/s11030-021-10250-2. Epub 2021 Jun 19.
7
MutagenPred-GCNNs: A Graph Convolutional Neural Network-Based Classification Model for Mutagenicity Prediction with Data-Driven Molecular Fingerprints.MutagenPred-GCNNs:一种基于图卷积神经网络的分类模型,用于使用数据驱动的分子指纹进行致突变性预测。
Interdiscip Sci. 2021 Mar;13(1):25-33. doi: 10.1007/s12539-020-00407-2. Epub 2021 Jan 27.
8
Multiple Instance Learning Improves Ames Mutagenicity Prediction for Problematic Molecular Species.多实例学习提高对问题分子物种的 Ames 致突变性预测。
Chem Res Toxicol. 2023 Aug 21;36(8):1227-1237. doi: 10.1021/acs.chemrestox.2c00372. Epub 2023 Jul 21.
9
Multitask Deep Neural Networks for Ames Mutagenicity Prediction.多任务深度神经网络在 Ames 致突变性预测中的应用。
J Chem Inf Model. 2022 Dec 26;62(24):6342-6351. doi: 10.1021/acs.jcim.2c00532. Epub 2022 Sep 6.
10
Automated detection of toxicophores and prediction of mutagenicity using PMCSFG algorithm.使用PMCSFG算法自动检测毒效基团并预测致突变性。
Mol Inform. 2023 Mar;42(3):e2200232. doi: 10.1002/minf.202200232. Epub 2023 Feb 2.

引用本文的文献

1
Recent advances in AI-based toxicity prediction for drug discovery.基于人工智能的药物发现毒性预测的最新进展。
Front Chem. 2025 Jul 8;13:1632046. doi: 10.3389/fchem.2025.1632046. eCollection 2025.
2
Leveraging machine learning models in evaluating ADMET properties for drug discovery and development.利用机器学习模型评估药物发现与开发中的ADMET性质。
ADMET DMPK. 2025 Jun 7;13(3):2772. doi: 10.5599/admet.2772. eCollection 2025.
3
Stacked ensemble-based mutagenicity prediction model using multiple modalities with graph attention network.
基于堆叠集成的致突变性预测模型,采用多种模态结合图注意力网络。
Med Biol Eng Comput. 2025 Jun 5. doi: 10.1007/s11517-025-03392-0.
4
Deep active learning with high structural discriminability for molecular mutagenicity prediction.基于高结构可区分性的深度主动学习在分子突变预测中的应用。
Commun Biol. 2024 Aug 31;7(1):1071. doi: 10.1038/s42003-024-06758-6.
5
Comparative in silico analysis of CNS-active molecules targeting the blood-brain barrier choline transporter for Alzheimer's disease therapy.针对阿尔茨海默病治疗,对靶向血脑屏障胆碱转运体的中枢神经系统活性分子进行计算机模拟比较分析。
In Silico Pharmacol. 2024 Jul 31;12(2):71. doi: 10.1007/s40203-024-00245-w. eCollection 2024.
6
Asking the right questions for mutagenicity prediction from BioMedical text.从生物医学文本中预测致突变性应提出的正确问题。
NPJ Syst Biol Appl. 2023 Dec 18;9(1):63. doi: 10.1038/s41540-023-00324-2.
7
Analytical technologies to revolutionize the environmental mutagenesis-and genome- research -from the basics to the cutting-edge research-: the Open Symposium of the Japanese Environmental Mutagen and Genome Society (JEMS), 2022.变革环境诱变与基因组研究的分析技术——从基础到前沿研究——:2022年日本环境诱变与基因组学会(JEMS)公开研讨会
Genes Environ. 2023 Mar 9;45(1):9. doi: 10.1186/s41021-023-00265-6.