• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MaNGA:一种用于 QSAR 建模的新型多目标多领域遗传算法。

MaNGA: a novel multi-niche multi-objective genetic algorithm for QSAR modelling.

机构信息

Faculty of Medicine and Health Technology, Tampere University, Tampere 33200, Finland.

Department of Mathematics and Applications, University of Napoli Federico II, Naples 80138, Italy.

出版信息

Bioinformatics. 2020 Jan 1;36(1):145-153. doi: 10.1093/bioinformatics/btz521.

DOI:10.1093/bioinformatics/btz521
PMID:31233136
Abstract

SUMMARY

Quantitative structure-activity relationship (QSAR) modelling is currently used in multiple fields to relate structural properties of compounds to their biological activities. This technique is also used for drug design purposes with the aim of predicting parameters that determine drug behaviour. To this end, a sophisticated process, involving various analytical steps concatenated in series, is employed to identify and fine-tune the optimal set of predictors from a large dataset of molecular descriptors (MDs). The search of the optimal model requires to optimize multiple objectives at the same time, as the aim is to obtain the minimal set of features that maximizes the goodness of fit and the applicability domain (AD). Hence, a multi-objective optimization strategy, improving multiple parameters in parallel, can be applied. Here we propose a new multi-niche multi-objective genetic algorithm that simultaneously enables stable feature selection as well as obtaining robust and validated regression models with maximized AD. We benchmarked our method on two simulated datasets. Moreover, we analyzed an aquatic acute toxicity dataset and compared the performances of single- and multi-objective fitness functions on different regression models. Our results show that our multi-objective algorithm is a valid alternative to classical QSAR modelling strategy, for continuous response values, since it automatically finds the model with the best compromise between statistical robustness, predictive performance, widest AD, and the smallest number of MDs.

AVAILABILITY AND IMPLEMENTATION

The python implementation of MaNGA is available at https://github.com/Greco-Lab/MaNGA.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

摘要

定量构效关系(QSAR)建模目前被广泛应用于多个领域,用于将化合物的结构性质与其生物活性联系起来。该技术也被用于药物设计目的,旨在预测决定药物行为的参数。为此,采用了一种复杂的过程,涉及串联的多个分析步骤,用于从大量分子描述符(MDs)数据集中识别和微调最佳预测器集。搜索最佳模型需要同时优化多个目标,因为目标是获得最小的特征集,最大限度地提高拟合度和适用域(AD)。因此,可以应用一种多目标优化策略,同时并行优化多个参数。在这里,我们提出了一种新的多小生境多目标遗传算法,它可以同时实现稳定的特征选择,以及获得具有最大化 AD 的稳健和验证回归模型。我们在两个模拟数据集上对我们的方法进行了基准测试。此外,我们分析了一个水生急性毒性数据集,并比较了单目标和多目标适应度函数在不同回归模型上的性能。我们的结果表明,对于连续响应值,我们的多目标算法是经典 QSAR 建模策略的有效替代方案,因为它自动找到了在统计稳健性、预测性能、最宽 AD 和最小 MDs 数量之间具有最佳折衷的模型。

可用性和实现

MaNGA 的 Python 实现可在 https://github.com/Greco-Lab/MaNGA 上获得。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
MaNGA: a novel multi-niche multi-objective genetic algorithm for QSAR modelling.MaNGA:一种用于 QSAR 建模的新型多目标多领域遗传算法。
Bioinformatics. 2020 Jan 1;36(1):145-153. doi: 10.1093/bioinformatics/btz521.
2
Exploring the QSAR's predictive truthfulness of the novel N-tuple discrete derivative indices on benchmark datasets.探索新型N元组离散导数指标在基准数据集上的定量构效关系(QSAR)预测真实性。
SAR QSAR Environ Res. 2017 May;28(5):367-389. doi: 10.1080/1062936X.2017.1326403.
3
QSAR model for predicting neuraminidase inhibitors of influenza A viruses (H1N1) based on adaptive grasshopper optimization algorithm.基于自适应草蜢优化算法的流感 A 病毒(H1N1)神经氨酸酶抑制剂的定量构效关系模型。
SAR QSAR Environ Res. 2020 Nov;31(11):803-814. doi: 10.1080/1062936X.2020.1818616. Epub 2020 Sep 17.
4
Beware of External Validation! - A Comparative Study of Several Validation Techniques used in QSAR Modelling.谨防外部验证!——QSAR建模中几种验证技术的比较研究。
Curr Comput Aided Drug Des. 2018;14(4):284-291. doi: 10.2174/1573409914666180426144304.
5
Molecular modelling studies for the discovery of new substituted pyridines derivatives with angiotensin II AT1 receptor antagonists.用于发现具有血管紧张素II AT1受体拮抗剂活性的新型取代吡啶衍生物的分子建模研究。
Interdiscip Sci. 2014 Sep;6(3):197-207. doi: 10.1007/s12539-013-0201-x. Epub 2014 Sep 11.
6
QSAR Models for Predicting Aquatic Toxicity of Esters Using Genetic Algorithm-Multiple Linear Regression Methods.基于遗传算法-多元线性回归方法预测酯类化合物水生毒性的定量构效关系模型
Comb Chem High Throughput Screen. 2019 Aug 8;22(5):317-325. doi: 10.2174/1386207322666190618150856.
7
An automated tool for obtaining QSAR-ready series of compounds using semantic web technologies.使用语义网技术获取 QSAR 就绪化合物系列的自动化工具。
Bioinformatics. 2018 Jan 1;34(1):131-133. doi: 10.1093/bioinformatics/btx566.
8
Multi-Objective Genetic Algorithm (MOGA) As a Feature Selecting Strategy in the Development of Ionic Liquids' Quantitative Toxicity-Toxicity Relationship Models.多目标遗传算法(MOGA)作为离子液体定量毒性-毒性关系模型开发中的特征选择策略。
J Chem Inf Model. 2018 Dec 24;58(12):2467-2476. doi: 10.1021/acs.jcim.8b00378. Epub 2018 Dec 14.
9
QSAR modeling for quinoxaline derivatives using genetic algorithm and simulated annealing based feature selection.基于遗传算法和模拟退火的特征选择对喹喔啉衍生物的定量构效关系建模。
Curr Med Chem. 2009;16(30):4032-48. doi: 10.2174/092986709789352303.
10
Feature set optimization in biomarker discovery from genome-scale data.从基因组规模数据中发现生物标志物的特征集优化。
Bioinformatics. 2020 Jun 1;36(11):3393-3400. doi: 10.1093/bioinformatics/btaa144.

引用本文的文献

1
Dual-stage optimizer for systematic overestimation adjustment applied to multi-objective genetic algorithms for biomarker selection.用于系统高估调整的双阶段优化器应用于生物标志物选择的多目标遗传算法
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae674.
2
In Silico Modeling and Structural Analysis of Soluble Epoxide Hydrolase Inhibitors for Enhanced Therapeutic Design.计算机模拟与可溶性环氧化物水解酶抑制剂的结构分析及其在增强治疗设计中的应用。
Molecules. 2023 Aug 31;28(17):6379. doi: 10.3390/molecules28176379.
3
Integrated Network Pharmacology Approach for Drug Combination Discovery: A Multi-Cancer Case Study.
基于整合网络药理学方法的药物组合发现:多癌种案例研究
Cancers (Basel). 2022 Apr 18;14(8):2043. doi: 10.3390/cancers14082043.
4
Nextcast: A software suite to analyse and model toxicogenomics data.Nextcast:一个用于分析和建模毒理基因组学数据的软件套件。
Comput Struct Biotechnol J. 2022 Mar 18;20:1413-1426. doi: 10.1016/j.csbj.2022.03.014. eCollection 2022.
5
Density of Deep Eutectic Solvents: The Path Forward Cheminformatics-Driven Reliable Predictions for Mixtures.深共晶溶剂的密度:化学信息学驱动的混合物可靠预测的未来之路。
Molecules. 2021 Sep 24;26(19):5779. doi: 10.3390/molecules26195779.
6
Manually curated transcriptomics data collection for toxicogenomic assessment of engineered nanomaterials.人工策管转录组学数据收集,用于工程纳米材料的毒基因组评估。
Sci Data. 2021 Feb 8;8(1):49. doi: 10.1038/s41597-021-00808-y.
7
Digital Pharmaceutical Sciences.数字药物科学。
AAPS PharmSciTech. 2020 Jul 26;21(6):206. doi: 10.1208/s12249-020-01747-4.
8
Transcriptomics in Toxicogenomics, Part III: Data Modelling for Risk Assessment.毒理基因组学中的转录组学,第三部分:风险评估的数据建模
Nanomaterials (Basel). 2020 Apr 8;10(4):708. doi: 10.3390/nano10040708.
9
NanoSolveIT Project: Driving nanoinformatics research to develop innovative and integrated tools for nanosafety assessment.纳米解决方案信息技术项目:推动纳米信息学研究,开发用于纳米安全评估的创新型集成工具。
Comput Struct Biotechnol J. 2020 Mar 7;18:583-602. doi: 10.1016/j.csbj.2020.02.023. eCollection 2020.