• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于定量构效关系中特征选择的序列与混合遗传算法及学习自动机(SGALA、MGALA)

Sequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR.

作者信息

MotieGhader Habib, Gharaghani Sajjad, Masoudi-Sobhanzadeh Yosef, Masoudi-Nejad Ali

机构信息

Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.

Laboratory of Bioinformatics and Drug Design (LBD), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.

出版信息

Iran J Pharm Res. 2017 Spring;16(2):533-553.

PMID:28979308
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5603862/
Abstract

Feature selection is of great importance in Quantitative Structure-Activity Relationship (QSAR) analysis. This problem has been solved using some meta-heuristic algorithms such as GA, PSO, ACO and so on. In this work two novel hybrid meta-heuristic algorithms i.e. Sequential GA and LA (SGALA) and Mixed GA and LA (MGALA), which are based on Genetic algorithm and learning automata for QSAR feature selection are proposed. SGALA algorithm uses advantages of Genetic algorithm and Learning Automata sequentially and the MGALA algorithm uses advantages of Genetic Algorithm and Learning Automata simultaneously. We applied our proposed algorithms to select the minimum possible number of features from three different datasets and also we observed that the MGALA and SGALA algorithms had the best outcome independently and in average compared to other feature selection algorithms. Through comparison of our proposed algorithms, we deduced that the rate of convergence to optimal result in MGALA and SGALA algorithms were better than the rate of GA, ACO, PSO and LA algorithms. In the end, the results of GA, ACO, PSO, LA, SGALA, and MGALA algorithms were applied as the input of LS-SVR model and the results from LS-SVR models showed that the LS-SVR model had more predictive ability with the input from SGALA and MGALA algorithms than the input from all other mentioned algorithms. Therefore, the results have corroborated that not only is the predictive efficiency of proposed algorithms better, but their rate of convergence is also superior to the all other mentioned algorithms.

摘要

特征选择在定量构效关系(QSAR)分析中至关重要。这个问题已经通过一些元启发式算法得以解决,如遗传算法(GA)、粒子群优化算法(PSO)、蚁群优化算法(ACO)等。在这项工作中,提出了两种新颖的混合元启发式算法,即基于遗传算法和学习自动机的顺序遗传算法与学习自动机(SGALA)以及混合遗传算法与学习自动机(MGALA),用于QSAR特征选择。SGALA算法依次利用遗传算法和学习自动机的优势,而MGALA算法同时利用遗传算法和学习自动机的优势。我们将所提出的算法应用于从三个不同的数据集中选择尽可能少的特征数量,并且我们观察到,与其他特征选择算法相比,MGALA和SGALA算法各自以及平均而言都具有最佳结果。通过对所提出算法的比较,我们推断出MGALA和SGALA算法收敛到最优结果的速率优于遗传算法、蚁群优化算法、粒子群优化算法和学习自动机算法。最后,将遗传算法、蚁群优化算法、粒子群优化算法、学习自动机算法、SGALA算法和MGALA算法的结果作为最小二乘支持向量回归(LS - SVR)模型的输入,LS - SVR模型的结果表明,与来自所有其他提及算法的输入相比,LS - SVR模型在输入来自SGALA和MGALA算法时具有更强的预测能力。因此,可以证实所提出算法不仅预测效率更高,而且其收敛速率也优于所有其他提及算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/cc608812ff68/ijpr-16-533-g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/76ca5a3c7519/ijpr-16-533-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/8f753a815a60/ijpr-16-533-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/9bb1713e3da2/ijpr-16-533-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/02e13ba12f2f/ijpr-16-533-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/913867a845e6/ijpr-16-533-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/edeb5b9310ef/ijpr-16-533-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/ac4d90d7e9b3/ijpr-16-533-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/fea439326965/ijpr-16-533-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/7a6a73bd8664/ijpr-16-533-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/3d84692ed650/ijpr-16-533-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/7ee7f3973626/ijpr-16-533-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/7721f493902f/ijpr-16-533-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/65eeb3ebb8ba/ijpr-16-533-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/6e6ac651b702/ijpr-16-533-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/61fcaea5bb9b/ijpr-16-533-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/73c934cd4dca/ijpr-16-533-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/1eb512626594/ijpr-16-533-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/eab0ee1f624d/ijpr-16-533-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/b17027a2bed9/ijpr-16-533-g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/cc608812ff68/ijpr-16-533-g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/76ca5a3c7519/ijpr-16-533-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/8f753a815a60/ijpr-16-533-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/9bb1713e3da2/ijpr-16-533-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/02e13ba12f2f/ijpr-16-533-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/913867a845e6/ijpr-16-533-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/edeb5b9310ef/ijpr-16-533-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/ac4d90d7e9b3/ijpr-16-533-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/fea439326965/ijpr-16-533-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/7a6a73bd8664/ijpr-16-533-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/3d84692ed650/ijpr-16-533-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/7ee7f3973626/ijpr-16-533-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/7721f493902f/ijpr-16-533-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/65eeb3ebb8ba/ijpr-16-533-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/6e6ac651b702/ijpr-16-533-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/61fcaea5bb9b/ijpr-16-533-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/73c934cd4dca/ijpr-16-533-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/1eb512626594/ijpr-16-533-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/eab0ee1f624d/ijpr-16-533-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/b17027a2bed9/ijpr-16-533-g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8031/5603862/cc608812ff68/ijpr-16-533-g021.jpg

相似文献

1
Sequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR.用于定量构效关系中特征选择的序列与混合遗传算法及学习自动机(SGALA、MGALA)
Iran J Pharm Res. 2017 Spring;16(2):533-553.
2
Evaluation of mutual information, genetic algorithm and SVR for feature selection in QSAR regression.用于定量构效关系回归中特征选择的互信息、遗传算法和支持向量回归评估。
Curr Drug Discov Technol. 2011 Jun;8(2):107-11. doi: 10.2174/157016311795563839.
3
Improved intelligent water drop-based hybrid feature selection method for microarray data processing.基于智能水滴的改进型混合特征选择方法在微阵列数据处理中的应用。
Comput Biol Chem. 2023 Apr;103:107809. doi: 10.1016/j.compbiolchem.2022.107809. Epub 2023 Jan 13.
4
Particle swarm optimization and genetic algorithm as feature selection techniques for the QSAR modeling of imidazo[1,5-a]pyrido[3,2-e]pyrazines, inhibitors of phosphodiesterase 10A.粒子群优化和遗传算法作为特征选择技术在咪唑并[1,5-a]吡啶并[3,2-e]吡嗪、磷酸二酯酶 10A 抑制剂的定量构效关系建模中的应用。
Chem Biol Drug Des. 2013 Dec;82(6):685-96. doi: 10.1111/cbdd.12196. Epub 2013 Sep 19.
5
Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification.遗传蜂群(GBC)算法:一种用于微阵列癌症分类的新基因选择方法。
Comput Biol Chem. 2015 Jun;56:49-60. doi: 10.1016/j.compbiolchem.2015.03.001. Epub 2015 Mar 18.
6
A universal deep learning approach for modeling the flow of patients under different severities.一种通用的深度学习方法,用于对不同严重程度的患者进行建模。
Comput Methods Programs Biomed. 2018 Feb;154:191-203. doi: 10.1016/j.cmpb.2017.11.003. Epub 2017 Nov 7.
7
Predicting Blast-Induced Ground Vibration in Open-Pit Mines Using Vibration Sensors and Support Vector Regression-Based Optimization Algorithms.利用振动传感器和基于支持向量回归的优化算法预测露天矿爆破地震。
Sensors (Basel). 2019 Dec 24;20(1):132. doi: 10.3390/s20010132.
8
Feature Selection for Motor Imagery EEG Classification Based on Firefly Algorithm and Learning Automata.基于萤火虫算法和学习自动机的脑电分类特征选择。
Sensors (Basel). 2017 Nov 8;17(11):2576. doi: 10.3390/s17112576.
9
QSAR modeling for quinoxaline derivatives using genetic algorithm and simulated annealing based feature selection.基于遗传算法和模拟退火的特征选择对喹喔啉衍生物的定量构效关系建模。
Curr Med Chem. 2009;16(30):4032-48. doi: 10.2174/092986709789352303.
10
Applying GA-PSO-TLBO approach to engineering optimization problems.将遗传算法-粒子群优化算法-教学学习优化算法应用于工程优化问题。
Math Biosci Eng. 2023 Jan;20(1):552-571. doi: 10.3934/mbe.2023025. Epub 2022 Oct 12.

引用本文的文献

1
Predicting the Toxicity of Drug Molecules with Selecting Effective Descriptors Using a Binary Ant Colony Optimization (BACO) Feature Selection Approach.使用二元蚁群优化(BACO)特征选择方法选择有效描述符来预测药物分子的毒性
Molecules. 2025 Mar 31;30(7):1548. doi: 10.3390/molecules30071548.
2
A fuzzy logic-based computational method for the repurposing of drugs against COVID-19.一种基于模糊逻辑的用于新冠病毒药物再利用的计算方法。
Bioimpacts. 2022;12(4):315-324. doi: 10.34172/bi.2021.40. Epub 2021 Aug 10.
3
Drug repurposing for coronavirus (SARS-CoV-2) based on gene co-expression network analysis.

本文引用的文献

1
World Competitive Contests (WCC) algorithm: A novel intelligent optimization algorithm for biological and non-biological problems.世界竞赛(WCC)算法:一种用于解决生物和非生物问题的新型智能优化算法。
Inform Med Unlocked. 2016;3:15-28. doi: 10.1016/j.imu.2016.06.002. Epub 2016 Jun 28.
2
Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection.监督式、无监督式和半监督式特征选择:基因选择综述
IEEE/ACM Trans Comput Biol Bioinform. 2016 Sep-Oct;13(5):971-989. doi: 10.1109/TCBB.2015.2478454. Epub 2015 Sep 14.
3
Discovery of inhibitors of the mitotic kinase TTK based on N-(3-(3-sulfamoylphenyl)-1H-indazol-5-yl)-acetamides and carboxamides.
基于基因共表达网络分析的冠状病毒(SARS-CoV-2)药物再利用。
Sci Rep. 2021 Nov 8;11(1):21872. doi: 10.1038/s41598-021-01410-3.
4
A machine learning method based on the genetic and world competitive contests algorithms for selecting genes or features in biological applications.一种基于遗传算法和世界竞争竞赛算法的机器学习方法,用于在生物应用中选择基因或特征。
Sci Rep. 2021 Feb 8;11(1):3349. doi: 10.1038/s41598-021-82796-y.
5
World competitive contest-based artificial neural network: A new class-specific method for classification of clinical and biological datasets.基于世界竞赛的人工神经网络:一种用于临床和生物学数据集分类的新型特定类别方法。
Genomics. 2021 Jan;113(1 Pt 2):541-552. doi: 10.1016/j.ygeno.2020.09.047. Epub 2020 Sep 28.
6
Gene biomarker discovery at different stages of Alzheimer using gene co-expression network approach.利用基因共表达网络方法在阿尔茨海默病不同阶段进行基因生物标志物发现。
Sci Rep. 2020 Jul 22;10(1):12210. doi: 10.1038/s41598-020-69249-8.
7
FeatureSelect: a software for feature selection based on machine learning approaches.FeatureSelect:一款基于机器学习方法的特征选择软件。
BMC Bioinformatics. 2019 Apr 3;20(1):170. doi: 10.1186/s12859-019-2754-0.
8
Prediction Methods of Herbal Compounds in Chinese Medicinal Herbs.中草药中草药化合物的预测方法。
Molecules. 2018 Sep 10;23(9):2303. doi: 10.3390/molecules23092303.
基于N-(3-(3-氨磺酰基苯基)-1H-吲唑-5-基)乙酰胺和羧酰胺的有丝分裂激酶TTK抑制剂的发现
Bioorg Med Chem. 2014 Sep 1;22(17):4968-97. doi: 10.1016/j.bmc.2014.06.027. Epub 2014 Jun 23.
4
Assessing the validity of QSARs for ready biodegradability of chemicals: an applicability domain perspective.从适用域角度评估化学物质快速生物降解性的定量构效关系的有效性
Curr Comput Aided Drug Des. 2014;10(2):137-47. doi: 10.2174/1573409910666140410110241.
5
Predictive QSAR modeling of aldose reductase inhibitors using Monte Carlo feature selection.基于蒙特卡罗特征选择的醛糖还原酶抑制剂的定量构效关系预测模型研究。
Eur J Med Chem. 2014 Apr 9;76:352-9. doi: 10.1016/j.ejmech.2014.02.043. Epub 2014 Feb 16.
6
Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm.基于粒子群优化算法的决策树模型在癌症识别中的基因选择。
BMC Bioinformatics. 2014 Feb 20;15:49. doi: 10.1186/1471-2105-15-49.
7
Particle swarm optimization and genetic algorithm as feature selection techniques for the QSAR modeling of imidazo[1,5-a]pyrido[3,2-e]pyrazines, inhibitors of phosphodiesterase 10A.粒子群优化和遗传算法作为特征选择技术在咪唑并[1,5-a]吡啶并[3,2-e]吡嗪、磷酸二酯酶 10A 抑制剂的定量构效关系建模中的应用。
Chem Biol Drug Des. 2013 Dec;82(6):685-96. doi: 10.1111/cbdd.12196. Epub 2013 Sep 19.
8
Feature selection methods in QSAR studies.定量构效关系(QSAR)研究中的特征选择方法。
J AOAC Int. 2012 May-Jun;95(3):636-51. doi: 10.5740/jaoacint.sge_goodarzi.
9
Comparison of different approaches to define the applicability domain of QSAR models.比较不同方法来定义定量构效关系模型的适用性域。
Molecules. 2012 Apr 25;17(5):4791-810. doi: 10.3390/molecules17054791.
10
QSAR modeling of peptide biological activity by coupling support vector machine with particle swarm optimization algorithm and genetic algorithm.QSAR 模型的肽生物活性的耦合支持向量机与粒子群优化算法和遗传算法。
J Mol Graph Model. 2010 Sep;29(2):188-96. doi: 10.1016/j.jmgm.2010.06.002. Epub 2010 Jun 18.