• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用二元蚁群优化(BACO)特征选择方法选择有效描述符来预测药物分子的毒性

Predicting the Toxicity of Drug Molecules with Selecting Effective Descriptors Using a Binary Ant Colony Optimization (BACO) Feature Selection Approach.

作者信息

Dan Yuanyuan, Ruan Junhao, Zhu Zhenghua, Yu Hualong

机构信息

School of Environmental and Chemical Engineering, Jiangsu University of Science and Technology, Zhenjiang 212100, China.

School of Computer, Jiangsu University of Science and Technology, Zhenjiang 212100, China.

出版信息

Molecules. 2025 Mar 31;30(7):1548. doi: 10.3390/molecules30071548.

DOI:10.3390/molecules30071548
PMID:40286190
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11990530/
Abstract

Predicting the toxicity of drug molecules using in silico quantitative structure-activity relationship (QSAR) approaches is very helpful for guiding safe drug development and accelerating the drug development procedure. The ongoing development of machine learning techniques has made this task easier and more accurate, but it still suffers negative effects from both the severely skewed distribution of active/inactive chemicals and relatively high-dimensional feature distribution. To simultaneously address both of these issues, a binary ant colony optimization feature selection algorithm, called BACO, is proposed in this study. Specifically, it divides the labeled drug molecules into a training set and a validation set multiple times; with each division, the ant colony seeks an optimal feature group that aims to maximize the weighted combination of three specific class imbalance performance metrics (F-measure, G-mean, and MCC) on the validation set. Then, after running all divisions, the frequency of each feature (descriptor) that emerges in the optimal feature groups is calculated and ranked in descending order. Only those high-frequency features are used to train a support vector machine (SVM) and construct the structure-activity relationship (SAR) prediction model. The experimental results for the 12 datasets in the Tox21 challenge, represented by the Modred descriptor calculator, show that the proposed BACO method significantly outperforms several traditional feature selection approaches that have been widely used in QSAR analysis. It only requires a few to a few dozen descriptors for most datasets to exhibit its best performance, which shows its effectiveness and potential application value in cheminformatics.

摘要

使用计算机辅助定量构效关系(QSAR)方法预测药物分子的毒性,对于指导安全药物研发和加速药物研发进程非常有帮助。机器学习技术的不断发展使这项任务变得更加轻松和准确,但它仍然受到活性/非活性化学物质严重偏态分布和相对高维特征分布的负面影响。为了同时解决这两个问题,本研究提出了一种二元蚁群优化特征选择算法,称为BACO。具体来说,它将标记的药物分子多次划分为训练集和验证集;每次划分时,蚁群寻找一个最优特征组,旨在使验证集上三个特定的类不平衡性能指标(F值、G均值和马修斯相关系数)的加权组合最大化。然后,在运行完所有划分后,计算每个特征(描述符)在最优特征组中出现的频率,并按降序排列。仅使用那些高频特征来训练支持向量机(SVM)并构建构效关系(SAR)预测模型。由Modred描述符计算器表示的Tox21挑战中12个数据集的实验结果表明,所提出的BACO方法明显优于几种在QSAR分析中广泛使用的传统特征选择方法。对于大多数数据集,它只需要几个到几十个描述符就能展现出最佳性能,这表明了其在化学信息学中的有效性和潜在应用价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddab/11990530/e6704547ed4e/molecules-30-01548-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddab/11990530/f7c48c8c1c3e/molecules-30-01548-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddab/11990530/455a3e4f5675/molecules-30-01548-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddab/11990530/e6704547ed4e/molecules-30-01548-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddab/11990530/f7c48c8c1c3e/molecules-30-01548-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddab/11990530/455a3e4f5675/molecules-30-01548-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddab/11990530/e6704547ed4e/molecules-30-01548-g003.jpg

相似文献

1
Predicting the Toxicity of Drug Molecules with Selecting Effective Descriptors Using a Binary Ant Colony Optimization (BACO) Feature Selection Approach.使用二元蚁群优化(BACO)特征选择方法选择有效描述符来预测药物分子的毒性
Molecules. 2025 Mar 31;30(7):1548. doi: 10.3390/molecules30071548.
2
Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets.基于 SVM 和最优描述符集预测药物性肝毒性。
Int J Mol Sci. 2021 Jul 28;22(15):8073. doi: 10.3390/ijms22158073.
3
Binary classification of a large collection of environmental chemicals from estrogen receptor assays by quantitative structure-activity relationship and machine learning methods.通过定量构效关系和机器学习方法对来自雌激素受体测定的大量环境化学物质进行二元分类。
J Chem Inf Model. 2013 Dec 23;53(12):3244-61. doi: 10.1021/ci400527b. Epub 2013 Dec 11.
4
An Innovative Excited-ACS-IDGWO Algorithm for Optimal Biomedical Data Feature Selection.一种创新的基于激发 ACS-IDGWO 算法的最优生物医学数据特征选择方法。
Biomed Res Int. 2020 Aug 17;2020:8506365. doi: 10.1155/2020/8506365. eCollection 2020.
5
In silico prediction of drug-induced developmental toxicity by using machine learning approaches.基于机器学习方法的药物诱导发育毒性的计算机预测。
Mol Divers. 2020 Nov;24(4):1281-1290. doi: 10.1007/s11030-019-09991-y. Epub 2019 Sep 5.
6
Enhanced leukemia prediction using hybrid ant colony and ant lion optimization for gene selection and classification.使用混合蚁群和蚁狮优化进行基因选择与分类以增强白血病预测
MethodsX. 2025 Feb 20;14:103239. doi: 10.1016/j.mex.2025.103239. eCollection 2025 Jun.
7
ADMET Evaluation in Drug Discovery. 18. Reliable Prediction of Chemical-Induced Urinary Tract Toxicity by Boosting Machine Learning Approaches.药物发现中的 ADMET 评估。18. 通过机器学习方法的提升实现可靠的化学诱导的泌尿道毒性预测。
Mol Pharm. 2017 Nov 6;14(11):3935-3953. doi: 10.1021/acs.molpharmaceut.7b00631. Epub 2017 Oct 27.
8
An Explainable Supervised Machine Learning Model for Predicting Respiratory Toxicity of Chemicals Using Optimal Molecular Descriptors.一种使用最优分子描述符预测化学物质呼吸毒性的可解释监督机器学习模型。
Pharmaceutics. 2022 Apr 11;14(4):832. doi: 10.3390/pharmaceutics14040832.
9
A Machine Learning-Based QSAR Model for Benzimidazole Derivatives as Corrosion Inhibitors by Incorporating Comprehensive Feature Selection.基于机器学习的苯并咪唑衍生物作为缓蚀剂的 QSAR 模型,综合特征选择。
Interdiscip Sci. 2019 Dec;11(4):738-747. doi: 10.1007/s12539-019-00346-7. Epub 2019 Sep 4.
10
Informing the Human Plasma Protein Binding of Environmental Chemicals by Machine Learning in the Pharmaceutical Space: Applicability Domain and Limits of Predictability.机器学习在制药领域中对环境化学物质与人血浆蛋白结合的预测:适用范围与预测局限性
J Chem Inf Model. 2016 Nov 28;56(11):2243-2252. doi: 10.1021/acs.jcim.6b00291. Epub 2016 Nov 3.

本文引用的文献

1
Consensus Modeling for Predicting Chemical Binding to Transthyretin as the Winning Solution of the Tox24 Challenge.预测化学物质与转甲状腺素蛋白结合的共识建模作为Tox24挑战赛的获胜解决方案
Chem Res Toxicol. 2025 Mar 17;38(3):392-399. doi: 10.1021/acs.chemrestox.4c00421. Epub 2025 Feb 19.
2
Analysis of implicit and explicit uncertainties in QSAR prediction of chemical toxicity: A case study of neurotoxicity.定量构效关系预测化学毒性的内隐和外显不确定性分析:以神经毒性为例。
Regul Toxicol Pharmacol. 2024 Dec;154:105716. doi: 10.1016/j.yrtph.2024.105716. Epub 2024 Oct 10.
3
A Review of Machine Learning and QSAR/QSPR Predictions for Complexes of Organic Molecules with Cyclodextrins.
机器学习与有机分子与环糊精复合物的定量构效关系/定量构性关系预测综述
Molecules. 2024 Jul 2;29(13):3159. doi: 10.3390/molecules29133159.
4
Targeting bacterial growth in biofilm conditions: rational design of novel inhibitors to mitigate clinical and food contamination using QSAR.针对生物膜条件下的细菌生长:利用定量构效关系合理设计新型抑制剂以减轻临床和食品污染
J Enzyme Inhib Med Chem. 2024 Dec;39(1):2330907. doi: 10.1080/14756366.2024.2330907. Epub 2024 Apr 23.
5
Reliable method for predicting the binding affinity of RNA-small molecule interactions using machine learning.利用机器学习预测 RNA-小分子相互作用结合亲和力的可靠方法。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae002.
6
Ensemble Quantitative Read-Across Structure-Activity Relationship Algorithm for Predicting Skin Cytotoxicity.用于预测皮肤细胞毒性的定量构效关系算法集。
Chem Res Toxicol. 2023 Dec 18;36(12):1961-1972. doi: 10.1021/acs.chemrestox.3c00238. Epub 2023 Dec 4.
7
Utilizing machine learning techniques to predict the blood-brain barrier permeability of compounds detected using LCQTOF-MS in Malaysian Kelulut honey.利用机器学习技术预测使用 LCQTOF-MS 检测到的马来西亚芦菇蜜中化合物的血脑屏障通透性。
SAR QSAR Environ Res. 2023 Apr-Jun;34(6):475-500. doi: 10.1080/1062936X.2023.2230868. Epub 2023 Jul 6.
8
Machine learning-driven QSAR models for predicting the mixture toxicity of nanoparticles.基于机器学习的 QSAR 模型预测纳米粒子混合物毒性
Environ Int. 2023 Jul;177:108025. doi: 10.1016/j.envint.2023.108025. Epub 2023 Jun 9.
9
Artificial Intelligence in Drug Toxicity Prediction: Recent Advances, Challenges, and Future Perspectives.人工智能在药物毒性预测中的应用:最新进展、挑战与未来展望。
J Chem Inf Model. 2023 May 8;63(9):2628-2643. doi: 10.1021/acs.jcim.3c00200. Epub 2023 Apr 26.
10
Effects of Class Imbalance and Data Scarcity on the Performance of Binary Classification Machine Learning Models Developed Based on ToxCast/Tox21 Assay Data.基于 ToxCast/Tox21 检测数据开发的二分类机器学习模型中,类不平衡和数据稀缺对性能的影响。
Chem Res Toxicol. 2022 Dec 19;35(12):2219-2226. doi: 10.1021/acs.chemrestox.2c00189. Epub 2022 Dec 7.