• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

QSAR 建模中的杂交特征选择与特征学习方法在药物发现中的应用。

Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery.

机构信息

Instituto de Ciencias e Ingeniería de la Computación (ICIC), Universidad Nacional del Sur-CONICET, San Andrés 800 - Campus Palihue, 8000, Bahía Blanca, Argentina.

Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas (CSIC), Ramiro de Maeztu 9, 28040, Madrid, Spain.

出版信息

Sci Rep. 2017 May 25;7(1):2403. doi: 10.1038/s41598-017-02114-3.

DOI:10.1038/s41598-017-02114-3
PMID:28546583
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5445096/
Abstract

Quantitative structure-activity relationship modeling using machine learning techniques constitutes a complex computational problem, where the identification of the most informative molecular descriptors for predicting a specific target property plays a critical role. Two main general approaches can be used for this modeling procedure: feature selection and feature learning. In this paper, a performance comparative study of two state-of-art methods related to these two approaches is carried out. In particular, regression and classification models for three different issues are inferred using both methods under different experimental scenarios: two drug-like properties, such as blood-brain-barrier and human intestinal absorption, and enantiomeric excess, as a measurement of purity used for chiral substances. Beyond the contrastive analysis of feature selection and feature learning methods as competitive approaches, the hybridization of these strategies is also evaluated based on previous results obtained in material sciences. From the experimental results, it can be concluded that there is not a clear winner between both approaches because the performance depends on the characteristics of the compound databases used for modeling. Nevertheless, in several cases, it was observed that the accuracy of the models can be improved by combining both approaches when the molecular descriptor sets provided by feature selection and feature learning contain complementary information.

摘要

使用机器学习技术进行定量构效关系建模是一个复杂的计算问题,其中确定最具信息量的分子描述符以预测特定目标性质起着关键作用。对于这种建模过程,可以使用两种主要的一般方法:特征选择和特征学习。在本文中,对这两种方法相关的两种最先进的方法进行了性能比较研究。特别是,在不同的实验场景下,使用这两种方法推断了三种不同问题的回归和分类模型:两种类药性,如血脑屏障和人肠吸收,以及对映体过量,作为手性物质纯度的测量。除了对比特征选择和特征学习方法作为竞争方法外,还根据材料科学中获得的先前结果评估了这些策略的混合。从实验结果可以得出结论,由于性能取决于用于建模的化合物数据库的特征,因此这两种方法之间没有明显的赢家。然而,在某些情况下,当特征选择和特征学习提供的分子描述符集包含互补信息时,观察到通过组合这两种方法可以提高模型的准确性。

相似文献

1
Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery.QSAR 建模中的杂交特征选择与特征学习方法在药物发现中的应用。
Sci Rep. 2017 May 25;7(1):2403. doi: 10.1038/s41598-017-02114-3.
2
Current approaches for choosing feature selection and learning algorithms in quantitative structure-activity relationships (QSAR).当前用于定量构效关系 (QSAR) 中选择特征选择和学习算法的方法。
Expert Opin Drug Discov. 2018 Dec;13(12):1075-1089. doi: 10.1080/17460441.2018.1542428. Epub 2018 Nov 3.
3
Finding Needles in a Haystack: Determining Key Molecular Descriptors Associated with the Blood-brain Barrier Entry of Chemical Compounds Using Machine Learning.在干草堆中找针:使用机器学习确定与化学化合物血脑屏障进入相关的关键分子描述符。
Mol Inform. 2019 Aug;38(8-9):e1800164. doi: 10.1002/minf.201800164. Epub 2019 Jul 19.
4
Pre-processing feature selection for improved C&RT models for oral absorption.预处理特征选择可提高口服吸收的 C&RT 模型。
J Chem Inf Model. 2013 Oct 28;53(10):2730-42. doi: 10.1021/ci400378j. Epub 2013 Oct 9.
5
Filter feature selectors in the development of binary QSAR models.在二元 QSAR 模型的开发中筛选特征选择器。
SAR QSAR Environ Res. 2019 May;30(5):313-345. doi: 10.1080/1062936X.2019.1588160.
6
Improved Prediction of Blood-Brain Barrier Permeability Through Machine Learning with Combined Use of Molecular Property-Based Descriptors and Fingerprints.通过机器学习结合分子性质基描述符和指纹提高血脑屏障通透性的预测。
AAPS J. 2018 Mar 21;20(3):54. doi: 10.1208/s12248-018-0215-8.
7
ADMET Evaluation in Drug Discovery. 18. Reliable Prediction of Chemical-Induced Urinary Tract Toxicity by Boosting Machine Learning Approaches.药物发现中的 ADMET 评估。18. 通过机器学习方法的提升实现可靠的化学诱导的泌尿道毒性预测。
Mol Pharm. 2017 Nov 6;14(11):3935-3953. doi: 10.1021/acs.molpharmaceut.7b00631. Epub 2017 Oct 27.
8
An Analysis of QSAR Research Based on Machine Learning Concepts.基于机器学习概念的定量构效关系研究分析。
Curr Drug Discov Technol. 2021;18(1):17-30. doi: 10.2174/1570163817666200316104404.
9
Using kernel alignment to select features of molecular descriptors in a QSAR study.使用核对齐选择 QSAR 研究中分子描述符的特征。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1373-84. doi: 10.1109/TCBB.2011.31.
10
Prediction of antibacterial compounds by machine learning approaches.通过机器学习方法预测抗菌化合物。
J Comput Chem. 2009 Jun;30(8):1202-11. doi: 10.1002/jcc.21148.

引用本文的文献

1
Machine Learning for Toxicity Prediction Using Chemical Structures: Pillars for Success in the Real World.利用化学结构进行毒性预测的机器学习:在现实世界中取得成功的支柱。
Chem Res Toxicol. 2025 May 19;38(5):759-807. doi: 10.1021/acs.chemrestox.5c00033. Epub 2025 May 2.
2
Democratizing cheminformatics: interpretable chemical grouping using an automated KNIME workflow.化学信息学的民主化:使用自动化的KNIME工作流程进行可解释的化学分组
J Cheminform. 2024 Aug 16;16(1):101. doi: 10.1186/s13321-024-00894-1.
3
Recent Advances in Machine-Learning-Based Chemoinformatics: A Comprehensive Review.

本文引用的文献

1
Descriptors and their selection methods in QSAR analysis: paradigm for drug design.定量构效关系分析中的描述符及其选择方法:药物设计范例
Drug Discov Today. 2016 Aug;21(8):1291-302. doi: 10.1016/j.drudis.2016.06.013. Epub 2016 Jun 18.
2
QSAR Accelerated Discovery of Potent Ice Recrystallization Inhibitors.定量构效关系加速高效冰重结晶抑制剂的发现
Sci Rep. 2016 May 24;6:26403. doi: 10.1038/srep26403.
3
Have artificial neural networks met expectations in drug discovery as implemented in QSAR framework?人工神经网络在 QSAR 框架中实施的药物发现方面是否达到了预期?
基于机器学习的化学信息学的最新进展:全面综述。
Int J Mol Sci. 2023 Jul 15;24(14):11488. doi: 10.3390/ijms241411488.
4
Machine intelligence-driven framework for optimized hit selection in virtual screening.用于虚拟筛选中优化命中物选择的机器智能驱动框架。
J Cheminform. 2022 Jul 22;14(1):48. doi: 10.1186/s13321-022-00630-7.
5
Artificial intelligence and machine learning approaches for drug design: challenges and opportunities for the pharmaceutical industries.人工智能和机器学习方法在药物设计中的应用:制药行业的挑战与机遇。
Mol Divers. 2022 Jun;26(3):1893-1913. doi: 10.1007/s11030-021-10326-z. Epub 2021 Oct 23.
6
CardioTox net: a robust predictor for hERG channel blockade based on deep learning meta-feature ensembles.心脏毒性网络:基于深度学习元特征集成的hERG通道阻断的强大预测器。
J Cheminform. 2021 Aug 16;13(1):60. doi: 10.1186/s13321-021-00541-z.
7
AI in drug development: a multidisciplinary perspective.人工智能在药物研发中的应用:多学科视角。
Mol Divers. 2021 Aug;25(3):1461-1479. doi: 10.1007/s11030-021-10266-8. Epub 2021 Jul 12.
8
Artificial Intelligence in Drug Discovery: A Comprehensive Review of Data-driven and Machine Learning Approaches.药物研发中的人工智能:数据驱动与机器学习方法的全面综述
Biotechnol Bioprocess Eng. 2020;25(6):895-930. doi: 10.1007/s12257-020-0049-y. Epub 2021 Jan 7.
9
Targeting Cannabinoid Receptor Activation and BACE-1 Activity Counteracts TgAPP Mice Memory Impairment and Alzheimer's Disease Lymphoblast Alterations.靶向大麻素受体激活和 BACE-1 活性可改善 TgAPP 小鼠的记忆障碍和阿尔茨海默病淋巴母细胞的改变。
Mol Neurobiol. 2020 Apr;57(4):1938-1951. doi: 10.1007/s12035-019-01813-4. Epub 2020 Jan 3.
10
Influence of feature rankers in the construction of molecular activity prediction models.特征排序器对分子活性预测模型构建的影响。
J Comput Aided Mol Des. 2020 Mar;34(3):305-325. doi: 10.1007/s10822-019-00273-1. Epub 2019 Dec 31.
Expert Opin Drug Discov. 2016 Jul;11(7):627-39. doi: 10.1080/17460441.2016.1186876. Epub 2016 May 30.
4
Predicting human intestinal absorption of diverse chemicals using ensemble learning based QSAR modeling approaches.使用基于集成学习的定量构效关系(QSAR)建模方法预测多种化学物质在人体肠道中的吸收情况。
Comput Biol Chem. 2016 Apr;61:178-96. doi: 10.1016/j.compbiolchem.2016.01.005. Epub 2016 Jan 29.
5
Chemoinformatics: Achievements and Challenges, a Personal View.化学信息学:成就与挑战,个人观点
Molecules. 2016 Jan 27;21(2):151. doi: 10.3390/molecules21020151.
6
Quantitative structure-activity relationship: promising advances in drug discovery platforms.定量构效关系:药物发现平台的有前途的进展。
Expert Opin Drug Discov. 2015 Dec;10(12):1283-300. doi: 10.1517/17460441.2015.1083006. Epub 2015 Sep 11.
7
Visual analytics in cheminformatics: user-supervised descriptor selection for QSAR methods.化学信息学中的可视化分析:用于定量构效关系方法的用户监督描述符选择
J Cheminform. 2015 Aug 19;7:39. doi: 10.1186/s13321-015-0092-4. eCollection 2015.
8
An in silico platform for predicting, screening and designing of antihypertensive peptides.用于预测、筛选和设计抗高血压肽的计算机模拟平台。
Sci Rep. 2015 Jul 27;5:12512. doi: 10.1038/srep12512.
9
Choosing feature selection and learning algorithms in QSAR.定量构效关系中特征选择与学习算法的选择
J Chem Inf Model. 2014 Mar 24;54(3):837-43. doi: 10.1021/ci400573c. Epub 2014 Mar 5.
10
QSPR models for predicting log P(liver) values for volatile organic compounds combining statistical methods and domain knowledge.基于统计方法和领域知识的挥发性有机化合物的 log P(liver) 值预测的 QSPR 模型。
Molecules. 2012 Dec 17;17(12):14937-53. doi: 10.3390/molecules171214937.