• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于数据描述和模型适用域的单类分类方法。

The One-Class Classification Approach to Data Description and to Models Applicability Domain.

作者信息

Baskin Igor I, Kireeva Natalia, Varnek Alexandre

机构信息

Department of Chemistry, Moscow State University, Moscow 119991, Russia.

Laboratoire d'Infochimie, UMR 7177 CNRS, Université de Strasbourg, 4, rue B. Pascal, Strasbourg 67000, France.

出版信息

Mol Inform. 2010 Sep 17;29(8-9):581-7. doi: 10.1002/minf.201000063. Epub 2010 Aug 30.

DOI:10.1002/minf.201000063
PMID:27463453
Abstract

In this paper, we associate an applicability domain (AD) of QSAR/QSPR models with the area in the input (descriptor) space in which the density of training data points exceeds a certain threshold. It could be proved that the predictive performance of the models (built on the training set) is larger for the test compounds inside the high density area, than for those outside this area. Instead of searching a decision surface separating high and low density areas in the input space, the one-class classification 1-SVM approach looks for a hyperplane in the associated feature space. Unlike other reported in the literature AD definitions, this approach: (i) is purely "data-based", i.e. it assigns the same AD to all models built on the same training set, (ii) provides results that depend only on the initial descriptors pool generated for the training set, (iii) can be used for the huge number of descriptors, as well as in the framework of structured kernel-based approaches, e.g., chemical graph kernels. The developed approach has been applied to improve the performance of QSPR models for stability constants of the complexes of organic ligands with alkaline-earth metals in water.

摘要

在本文中,我们将定量构效关系/定量构性关系(QSAR/QSPR)模型的适用域(AD)与输入(描述符)空间中训练数据点密度超过特定阈值的区域相关联。可以证明,对于高密度区域内的测试化合物,(基于训练集构建的)模型的预测性能要高于该区域外的测试化合物。单类分类1 - 支持向量机(1 - SVM)方法不是在输入空间中寻找分隔高密度和低密度区域的决策面,而是在相关特征空间中寻找一个超平面。与文献中报道的其他AD定义不同,该方法:(i)纯粹是“基于数据的”,即它为基于相同训练集构建的所有模型分配相同的AD,(ii)提供的结果仅取决于为训练集生成的初始描述符库,(iii)可用于大量描述符,以及基于结构化核的方法框架中,例如化学图核。所开发的方法已被应用于提高关于有机配体与碱土金属在水中配合物稳定常数的QSPR模型的性能。

相似文献

1
The One-Class Classification Approach to Data Description and to Models Applicability Domain.用于数据描述和模型适用域的单类分类方法。
Mol Inform. 2010 Sep 17;29(8-9):581-7. doi: 10.1002/minf.201000063. Epub 2010 Aug 30.
2
Combinatorial QSAR modeling of P-glycoprotein substrates.P-糖蛋白底物的组合定量构效关系建模
J Chem Inf Model. 2006 May-Jun;46(3):1245-54. doi: 10.1021/ci0504317.
3
Molecule kernels: a descriptor- and alignment-free quantitative structure-activity relationship approach.分子内核:一种无描述符和比对的定量构效关系方法。
J Chem Inf Model. 2008 Sep;48(9):1868-81. doi: 10.1021/ci800144y. Epub 2008 Sep 4.
4
Structure Modification toward Applicability Domain of a QSAR/QSPR Model Considering Activity/Property.考虑活性/性质的 QSAR/QSPR 模型适用性域的结构修饰
Mol Inform. 2017 Dec;36(12). doi: 10.1002/minf.201700076. Epub 2017 Aug 16.
5
What are the limits of applicability for graph theoretic descriptors in QSPR/QSAR? Modeling dipole moments of aromatic compounds with TOPS-MODE descriptors.定量构效关系/定量结构活性关系中,图论描述符的适用范围有哪些?使用TOPS-MODE描述符对芳香族化合物的偶极矩进行建模。
J Chem Inf Comput Sci. 2003 Jan-Feb;43(1):75-84. doi: 10.1021/ci025604w.
6
QSAR and QSPR studies of a highly structured physicochemical domain.高度结构化物理化学领域的定量构效关系和定量结构-性质关系研究
J Chem Inf Model. 2006 Jan-Feb;46(1):264-76. doi: 10.1021/ci050293l.
7
Predicting the predictability: a unified approach to the applicability domain problem of QSAR models.预测可预测性:一种解决QSAR模型适用域问题的统一方法。
J Chem Inf Model. 2009 Jul;49(7):1762-76. doi: 10.1021/ci9000579.
8
QSPR modelling of dielectric constants of π-conjugated organic compounds by means of the CORAL software.利用CORAL软件对π共轭有机化合物的介电常数进行定量结构-性质关系建模。
SAR QSAR Environ Res. 2014;25(6):507-26. doi: 10.1080/1062936X.2014.899267. Epub 2014 Apr 9.
9
Assessment of applicability domain for multivariate counter-propagation artificial neural network predictive models by minimum euclidean distance space analysis: a case study.基于最小欧式距离空间分析的多元对传人工神经网络预测模型适用性域评估:案例研究。
Anal Chim Acta. 2013 Jan 8;759:28-42. doi: 10.1016/j.aca.2012.11.002. Epub 2012 Nov 15.
10
Structural similarity based kriging for quantitative structure activity and property relationship modeling.基于结构相似性的克里金法用于定量构效关系和性质关系建模。
J Chem Inf Model. 2014 Jul 28;54(7):1833-49. doi: 10.1021/ci500110v. Epub 2014 Jun 25.

引用本文的文献

1
Machine Learning for Toxicity Prediction Using Chemical Structures: Pillars for Success in the Real World.利用化学结构进行毒性预测的机器学习:在现实世界中取得成功的支柱。
Chem Res Toxicol. 2025 May 19;38(5):759-807. doi: 10.1021/acs.chemrestox.5c00033. Epub 2025 May 2.
2
Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions.全面分析 QSPR 模型在化学反应中的适用性域。
Int J Mol Sci. 2020 Aug 3;21(15):5542. doi: 10.3390/ijms21155542.
3
Predictive cartography of metal binders using generative topographic mapping.
使用生成地形映射对金属粘合剂进行预测制图。
J Comput Aided Mol Des. 2017 Aug;31(8):701-714. doi: 10.1007/s10822-017-0033-6. Epub 2017 Jul 7.
4
The continuous molecular fields approach to building 3D-QSAR models.连续分子场方法构建 3D-QSAR 模型。
J Comput Aided Mol Des. 2013 May;27(5):427-42. doi: 10.1007/s10822-013-9656-4. Epub 2013 May 30.