• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种带有留一交叉验证的自适应遗传算法-人工神经网络算法,用于 QSAR 研究中的描述符选择。

A self-adaptive genetic algorithm-artificial neural network algorithm with leave-one-out cross validation for descriptor selection in QSAR study.

机构信息

School of Chemistry and Chemical Engineering of Sun Yat-sen University, Guanzhou 510275, People's Republic of China.

出版信息

J Comput Chem. 2010 Jul 30;31(10):1956-68. doi: 10.1002/jcc.21471.

DOI:10.1002/jcc.21471
PMID:20512843
Abstract

Based on the quantitative structure-activity relationships (QSARs) models developed by artificial neural networks (ANNs), genetic algorithm (GA) was used in the variable-selection approach with molecule descriptors and helped to improve the back-propagation training algorithm as well. The cross validation techniques of leave-one-out investigated the validity of the generated ANN model and preferable variable combinations derived in the GAs. A self-adaptive GA-ANN model was successfully established by using a new estimate function for avoiding over-fitting phenomenon in ANN training. Compared with the variables selected in two recent QSAR studies that were based on stepwise multiple linear regression (MLR) models, the variables selected in self-adaptive GA-ANN model are superior in constructing ANN model, as they revealed a higher cross validation (CV) coefficient (Q(2)) and a lower root mean square deviation both in the established model and biological activity prediction. The introduced methods for validation, including leave-multiple-out, Y-randomization, and external validation, proved the superiority of the established GA-ANN models over MLR models in both stability and predictive power. Self-adaptive GA-ANN showed us a prospect of improving QSAR model.

摘要

基于人工神经网络 (ANN) 开发的定量构效关系 (QSAR) 模型,遗传算法 (GA) 被用于具有分子描述符的变量选择方法,并帮助改进了反向传播训练算法。留一法交叉验证技术调查了所生成的 ANN 模型的有效性和 GA 中得出的优选变量组合。通过使用新的估计函数来避免 ANN 训练中的过拟合现象,成功建立了自适应 GA-ANN 模型。与基于逐步多元线性回归 (MLR) 模型的两项最近的 QSAR 研究中选择的变量相比,自适应 GA-ANN 模型中选择的变量在构建 ANN 模型方面更具优势,因为它们在建立的模型和生物活性预测中均显示出更高的交叉验证 (CV) 系数 (Q(2)) 和更低的均方根偏差。所介绍的验证方法,包括留多次、Y 随机化和外部验证,证明了自适应 GA-ANN 模型在稳定性和预测能力方面均优于 MLR 模型。自适应 GA-ANN 为我们展示了改进 QSAR 模型的前景。

相似文献

1
A self-adaptive genetic algorithm-artificial neural network algorithm with leave-one-out cross validation for descriptor selection in QSAR study.一种带有留一交叉验证的自适应遗传算法-人工神经网络算法,用于 QSAR 研究中的描述符选择。
J Comput Chem. 2010 Jul 30;31(10):1956-68. doi: 10.1002/jcc.21471.
2
Prediction of capillary gas chromatographic retention times of fatty acid methyl esters in human blood using MLR, PLS and back-propagation artificial neural networks.采用多元线性回归、偏最小二乘法和反向传播人工神经网络预测人血中脂肪酸甲酯的毛细管气相色谱保留时间。
Talanta. 2011 Jan 15;83(3):1014-22. doi: 10.1016/j.talanta.2010.11.017. Epub 2010 Nov 11.
3
Application of genetic algorithm-kernel partial least square as a novel nonlinear feature selection method: activity of carbonic anhydrase II inhibitors.遗传算法-核偏最小二乘法作为一种新型非线性特征选择方法的应用:碳酸酐酶II抑制剂的活性
Eur J Med Chem. 2007 May;42(5):649-59. doi: 10.1016/j.ejmech.2006.12.020. Epub 2007 Jan 12.
4
A novel QSAR model for prediction of apoptosis-inducing activity of 4-aryl-4-H-chromenes based on support vector machine.基于支持向量机的用于预测4-芳基-4-H-色烯凋亡诱导活性的新型定量构效关系模型。
Bioorg Med Chem. 2007 Dec 15;15(24):7746-54. doi: 10.1016/j.bmc.2007.08.057. Epub 2007 Sep 1.
5
Quantitative structure-property relationship modelling of the degradability rate constant of alkenes by OH radicals in atmosphere.大气中烯烃被羟基自由基降解速率常数的定量结构-性质关系建模
SAR QSAR Environ Res. 2009;20(1-2):77-90. doi: 10.1080/10629360902726700.
6
QSAR study of heparanase inhibitors activity using artificial neural networks and Levenberg-Marquardt algorithm.使用人工神经网络和列文伯格-马夸尔特算法对乙酰肝素酶抑制剂活性进行定量构效关系研究。
Eur J Med Chem. 2008 Mar;43(3):548-56. doi: 10.1016/j.ejmech.2007.04.014. Epub 2007 May 18.
7
QSPR modeling of soil sorption coefficients (K(OC)) of pesticides using SPA-ANN and SPA-MLR.使用逐步回归分析-人工神经网络(SPA-ANN)和逐步回归分析-多元线性回归(SPA-MLR)对农药土壤吸附系数(K(OC))进行定量结构-性质关系(QSPR)建模。
J Agric Food Chem. 2009 Aug 12;57(15):7153-8. doi: 10.1021/jf9008839.
8
Quantitative structure-activity relationship study of serotonin (5-HT7) receptor inhibitors using modified ant colony algorithm and adaptive neuro-fuzzy interference system (ANFIS).基于改进蚁群算法和自适应神经模糊推理系统(ANFIS)的5-羟色胺(5-HT7)受体抑制剂的定量构效关系研究
Eur J Med Chem. 2009 Apr;44(4):1463-70. doi: 10.1016/j.ejmech.2008.09.050. Epub 2008 Oct 14.
9
Application of ab initio theory to QSAR study of 1,4-dihydropyridine-based calcium channel blockers using GA-MLR and PC-GA-ANN procedures.从头算理论在基于1,4 - 二氢吡啶的钙通道阻滞剂QSAR研究中的应用:使用遗传算法 - 多元线性回归(GA - MLR)和主成分 - 遗传算法 - 人工神经网络(PC - GA - ANN)方法
J Comput Chem. 2004 Sep;25(12):1495-503. doi: 10.1002/jcc.20066.
10
Prediction of cytotoxicity data (CC(50)) of anti-HIV 5-phenyl-1-phenylamino-1H-imidazole derivatives by artificial neural network trained with Levenberg-Marquardt algorithm.用Levenberg-Marquardt算法训练的人工神经网络预测抗HIV 5-苯基-1-苯基氨基-1H-咪唑衍生物的细胞毒性数据(CC(50))
J Mol Graph Model. 2007 Jul;26(1):360-7. doi: 10.1016/j.jmgm.2007.01.005. Epub 2007 Jan 18.

引用本文的文献

1
Diagnosis of T-cell-mediated kidney rejection by biopsy-based proteomic biomarkers and machine learning.基于活检的蛋白质组生物标志物和机器学习诊断 T 细胞介导的肾排斥反应。
Front Immunol. 2023 Feb 6;14:1090373. doi: 10.3389/fimmu.2023.1090373. eCollection 2023.
2
The use of predictive models to develop chromatography-based purification processes.使用预测模型来开发基于色谱的纯化工艺。
Front Bioeng Biotechnol. 2022 Oct 12;10:1009102. doi: 10.3389/fbioe.2022.1009102. eCollection 2022.
3
Implementation of a dynamic intestinal gut-on-a-chip barrier model for transport studies of lipophilic dioxin congeners.
用于亲脂性二噁英同系物转运研究的动态肠道芯片上肠道屏障模型的构建
RSC Adv. 2018 Sep 19;8(57):32440-32453. doi: 10.1039/c8ra05430d. eCollection 2018 Sep 18.
4
RankProd Combined with Genetic Algorithm Optimized Artificial Neural Network Establishes a Diagnostic and Prognostic Prediction Model that Revealed C1QTNF3 as a Biomarker for Prostate Cancer.RankProd 联合遗传算法优化的人工神经网络建立了一个诊断和预后预测模型,揭示 C1QTNF3 是前列腺癌的生物标志物。
EBioMedicine. 2018 Jun;32:234-244. doi: 10.1016/j.ebiom.2018.05.010. Epub 2018 Jun 1.
5
Molecular Dynamics Simulations with Quantum Mechanics/Molecular Mechanics and Adaptive Neural Networks.基于量子力学/分子力学和自适应神经网络的分子动力学模拟
J Chem Theory Comput. 2018 Mar 13;14(3):1442-1455. doi: 10.1021/acs.jctc.7b01195. Epub 2018 Feb 26.
6
Multiscale Quantum Mechanics/Molecular Mechanics Simulations with Neural Networks.基于神经网络的多尺度量子力学/分子力学模拟
J Chem Theory Comput. 2016 Oct 11;12(10):4934-4946. doi: 10.1021/acs.jctc.6b00663. Epub 2016 Sep 6.
7
Protein aggregation and lyophilization: Protein structural descriptors as predictors of aggregation propensity.蛋白质聚集与冻干:作为聚集倾向预测指标的蛋白质结构描述符
Comput Chem Eng. 2013 Nov 11;58(2013):369-377. doi: 10.1016/j.compchemeng.2013.07.008.
8
Application of receptor models on water quality data in source apportionment in Kuantan River Basin.受体模型在关丹河流域水质数据源解析中的应用。
Iranian J Environ Health Sci Eng. 2012 Dec 10;9(1):18. doi: 10.1186/1735-2746-9-18.