• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

定量构效关系中的偏最小二乘建模与遗传算法优化

Partial least squares modeling and genetic algorithm optimization in quantitative structure-activity relationships.

作者信息

Hasegawa K, Funatsu K

机构信息

Nippon Roche Research Center, Nippon Roche K.K., Kanagawa, Japan.

出版信息

SAR QSAR Environ Res. 2000;11(3-4):189-209. doi: 10.1080/10629360008033231.

DOI:10.1080/10629360008033231
PMID:10969871
Abstract

Quantitative structure-activity relationship (QSAR) studies based on chemometric techniques are reviewed. Partial least squares (PLS) is introduced as a novel robust method to replace classical methods such as multiple linear regression (MLR). Advantages of PLS compared to MLR are illustrated with typical applications. Genetic algorithm (GA) is a novel optimization technique which can be used as a search engine in variable selection. A novel hybrid approach comprising GA and PLS for variable selection developed in our group (GAPLS) is described. The more advanced method for comparative molecular field analysis (CoMFA) modeling called GA-based region selection (GARGS) is described as well. Applications of GAPLS and GARGS to QSAR and 3D-QSAR problems are shown with some representative examples. GA can be hybridized with nonlinear modeling methods such as artificial neural networks (ANN) for providing useful tools in chemometric and QSAR.

摘要

综述了基于化学计量学技术的定量构效关系(QSAR)研究。介绍了偏最小二乘法(PLS)作为一种新颖的稳健方法,以取代诸如多元线性回归(MLR)等经典方法。通过典型应用说明了PLS相对于MLR的优势。遗传算法(GA)是一种新颖的优化技术,可作为变量选择中的搜索引擎。描述了我们团队开发的一种包含GA和PLS用于变量选择的新型混合方法(GAPLS)。还介绍了一种更先进的用于比较分子场分析(CoMFA)建模的方法,即基于GA的区域选择(GARGS)。通过一些代表性实例展示了GAPLS和GARGS在QSAR和3D-QSAR问题中的应用。GA可以与非线性建模方法如人工神经网络(ANN)进行杂交,以在化学计量学和QSAR中提供有用的工具。

相似文献

1
Partial least squares modeling and genetic algorithm optimization in quantitative structure-activity relationships.定量构效关系中的偏最小二乘建模与遗传算法优化
SAR QSAR Environ Res. 2000;11(3-4):189-209. doi: 10.1080/10629360008033231.
2
Application of genetic algorithm-kernel partial least square as a novel nonlinear feature selection method: activity of carbonic anhydrase II inhibitors.遗传算法-核偏最小二乘法作为一种新型非线性特征选择方法的应用:碳酸酐酶II抑制剂的活性
Eur J Med Chem. 2007 May;42(5):649-59. doi: 10.1016/j.ejmech.2006.12.020. Epub 2007 Jan 12.
3
Comparison of MLR, PLS and GA-MLR in QSAR analysis.多元线性回归(MLR)、偏最小二乘法(PLS)和遗传算法-多元线性回归(GA-MLR)在定量构效关系(QSAR)分析中的比较。
SAR QSAR Environ Res. 2003 Oct-Dec;14(5-6):433-45. doi: 10.1080/10629360310001624015.
4
Quantitative structure-activity relationship modeling of dopamine D(1) antagonists using comparative molecular field analysis, genetic algorithms-partial least-squares, and K nearest neighbor methods.使用比较分子场分析、遗传算法-偏最小二乘法和K近邻方法对多巴胺D(1)拮抗剂进行定量构效关系建模。
J Med Chem. 1999 Aug 26;42(17):3217-26. doi: 10.1021/jm980415j.
5
A self-adaptive genetic algorithm-artificial neural network algorithm with leave-one-out cross validation for descriptor selection in QSAR study.一种带有留一交叉验证的自适应遗传算法-人工神经网络算法,用于 QSAR 研究中的描述符选择。
J Comput Chem. 2010 Jul 30;31(10):1956-68. doi: 10.1002/jcc.21471.
6
Application of the modelling power approach to variable subset selection for GA-PLS QSAR models.建模能力方法在GA-PLS QSAR模型变量子集选择中的应用。
Anal Chim Acta. 2008 Feb 25;609(2):169-74. doi: 10.1016/j.aca.2008.01.013. Epub 2008 Jan 16.
7
Improvement of multivariate image analysis applied to quantitative structure-activity relationship (QSAR) analysis by using wavelet-principal component analysis ranking variable selection and least-squares support vector machine regression: QSAR study of checkpoint kinase WEE1 inhibitors.通过使用小波主成分分析排序变量选择和最小二乘支持向量机回归改进应用于定量构效关系(QSAR)分析的多变量图像分析:关卡激酶WEE1抑制剂的QSAR研究
Chem Biol Drug Des. 2009 Feb;73(2):244-52. doi: 10.1111/j.1747-0285.2008.00764.x.
8
Variable selection in near-infrared spectroscopy: benchmarking of feature selection methods on biodiesel data.近红外光谱中的变量选择:生物柴油数据特征选择方法的基准测试。
Anal Chim Acta. 2011 Apr 29;692(1-2):63-72. doi: 10.1016/j.aca.2011.03.006. Epub 2011 Mar 8.
9
A new and efficient variable selection algorithm based on ant colony optimization. Applications to near infrared spectroscopy/partial least-squares analysis.一种新的基于蚁群优化的高效变量选择算法。在近红外光谱/偏最小二乘法分析中的应用。
Anal Chim Acta. 2011 Aug 5;699(1):18-25. doi: 10.1016/j.aca.2011.04.061. Epub 2011 May 11.
10
Combined genetic algorithm and multiple linear regression (GA-MLR) optimizer: Application to multi-exponential fluorescence decay surface.组合遗传算法与多元线性回归(GA-MLR)优化器:在多指数荧光衰减表面的应用。
J Phys Chem A. 2006 Dec 7;110(48):12977-85. doi: 10.1021/jp063998e.

引用本文的文献

1
Unveiling the antiviral inhibitory activity of ebselen and ebsulfur derivatives on SARS-CoV-2 using machine learning-based QSAR, LB-PaCS-MD, and experimental assay.利用基于机器学习的定量构效关系(QSAR)、线性约束势全原子分子动力学(LB-PaCS-MD)和实验分析揭示依布硒啉和依布硫衍生物对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的抗病毒抑制活性。
Sci Rep. 2025 Feb 26;15(1):6956. doi: 10.1038/s41598-025-91235-1.
2
Dynamic Profiling and Binding Affinity Prediction of NBTI Antibacterials against DNA Gyrase Enzyme by Multidimensional Machine Learning and Molecular Dynamics Simulations.通过多维机器学习和分子动力学模拟对NBTI抗菌剂针对DNA旋转酶的动态分析和结合亲和力预测
ACS Omega. 2024 Apr 11;9(16):18278-18295. doi: 10.1021/acsomega.4c00036. eCollection 2024 Apr 23.
3
Core, Coating, or Corona? The Importance of Considering Protein Coronas in nano-QSPR Modeling of Zeta Potential.核心、涂层还是电晕?在纳米 QSPR 模型中考虑 Zeta 电位的蛋白电晕的重要性。
ACS Nano. 2023 Feb 14;17(3):1989-1997. doi: 10.1021/acsnano.2c06977. Epub 2023 Jan 18.
4
Exploring the anti-SARS-CoV-2 main protease potential of FDA approved marine drugs using integrated machine learning templates as predictive tools.利用整合机器学习模板作为预测工具,探索已获 FDA 批准的海洋药物对 SARS-CoV-2 主要蛋白酶的抑制作用。
Int J Biol Macromol. 2022 Nov 1;220:1415-1428. doi: 10.1016/j.ijbiomac.2022.09.086. Epub 2022 Sep 16.
5
Experimentally Validated QSAR Model for Surface p Prediction of Heterolipids Having Potential as Delivery Materials for Nucleic Acid Therapeutics.用于预测具有作为核酸治疗递送材料潜力的杂脂表面p的经实验验证的定量构效关系模型。
ACS Omega. 2020 Nov 30;5(49):32023-32031. doi: 10.1021/acsomega.0c04931. eCollection 2020 Dec 15.
6
Estimation of melting points of large set of persistent organic pollutants utilizing QSPR approach.利用定量构效关系方法估算大量持久性有机污染物的熔点。
J Mol Model. 2016 Mar;22(3):55. doi: 10.1007/s00894-016-2917-0. Epub 2016 Feb 13.
7
Topological sub-structural molecular design (TOPS-MODE): a useful tool to explore key fragments of human A3 adenosine receptor ligands.拓扑亚结构分子设计(TOPS-MODE):探索人A3腺苷受体配体关键片段的有用工具。
Mol Divers. 2016 Feb;20(1):55-76. doi: 10.1007/s11030-015-9617-z. Epub 2015 Jul 24.
8
Human farnesyl pyrophosphate synthase inhibition by nitrogen bisphosphonates: a 3D-QSAR study.氮双膦酸盐对人法呢基焦磷酸合酶的抑制作用:三维定量构效关系研究。
J Comput Aided Mol Des. 2013 Aug;27(8):739-54. doi: 10.1007/s10822-013-9674-2. Epub 2013 Aug 24.
9
Classification of bioaccumulative and non-bioaccumulative chemicals using statistical learning approaches.使用统计学习方法对生物累积性和非生物累积性化学物质进行分类。
Mol Divers. 2008 Aug-Nov;12(3-4):157-69. doi: 10.1007/s11030-008-9092-x. Epub 2008 Oct 21.
10
Modeling the excitation wavelengths (lambda(ex)) of boronic acids.模拟硼酸的激发波长(λ(ex))。
J Mol Model. 2008 Jun;14(6):441-9. doi: 10.1007/s00894-008-0293-0. Epub 2008 Mar 20.