Suppr超能文献

用于定义分类模型适用范围的不同方法的效率

Efficiency of different measures for defining the applicability domain of classification models.

作者信息

Klingspohn Waldemar, Mathea Miriam, Ter Laak Antonius, Heinrich Nikolaus, Baumann Knut

机构信息

Institute of Medicinal and Pharmaceutical Chemistry, University of Technology Braunschweig, Beethovenstrasse 55, 38106, Brunswick, Germany.

Bayer Pharma Aktiengesellschaft, Computational Chemistry, Müllerstrasse 178, 13353, Berlin, Germany.

出版信息

J Cheminform. 2017 Aug 3;9(1):44. doi: 10.1186/s13321-017-0230-2.

Abstract

The goal of defining an applicability domain for a predictive classification model is to identify the region in chemical space where the model's predictions are reliable. The boundary of the applicability domain is defined with the help of a measure that shall reflect the reliability of an individual prediction. Here, the available measures are differentiated into those that flag unusual objects and which are independent of the original classifier and those that use information of the trained classifier. The former set of techniques is referred to as novelty detection while the latter is designated as confidence estimation. A review of the available confidence estimators shows that most of these measures estimate the probability of class membership of the predicted objects which is inversely related to the error probability. Thus, class probability estimates are natural candidates for defining the applicability domain but were not comprehensively included in previous benchmark studies. The focus of the present study is to find the best measure for defining the applicability domain for a given binary classification technique and to determine the performance of novelty detection versus confidence estimation. Six different binary classification techniques in combination with ten data sets were studied to benchmark the various measures. The area under the receiver operating characteristic curve (AUC ROC) was employed as main benchmark criterion. It is shown that class probability estimates constantly perform best to differentiate between reliable and unreliable predictions. Previously proposed alternatives to class probability estimates do not perform better than the latter and are inferior in most cases. Interestingly, the impact of defining an applicability domain depends on the observed area under the receiver operator characteristic curve. That means that it depends on the level of difficulty of the classification problem (expressed as AUC ROC) and will be largest for intermediately difficult problems (range AUC ROC 0.7-0.9). In the ranking of classifiers, classification random forests performed best on average. Hence, classification random forests in combination with the respective class probability estimate are a good starting point for predictive binary chemoinformatic classifiers with applicability domain. Graphical abstract .

摘要

定义预测分类模型适用性域的目标是识别化学空间中模型预测可靠的区域。适用性域的边界借助一种应反映单个预测可靠性的度量来定义。在此,可用的度量分为两类:一类用于标记异常对象且独立于原始分类器,另一类使用训练后分类器的信息。前一组技术称为新颖性检测,后一组称为置信度估计。对可用置信度估计器的综述表明,这些度量大多估计预测对象的类别隶属概率,该概率与错误概率成反比。因此,类别概率估计是定义适用性域的自然选择,但在先前的基准研究中未得到全面纳入。本研究的重点是为给定的二元分类技术找到定义适用性域的最佳度量,并确定新颖性检测与置信度估计相比的性能。研究了六种不同的二元分类技术与十个数据集的组合,以对各种度量进行基准测试。采用接收器操作特征曲线下的面积(AUC ROC)作为主要基准标准。结果表明,类别概率估计在区分可靠和不可靠预测方面始终表现最佳。先前提出的类别概率估计的替代方法并不比后者表现更好,并且在大多数情况下都较差。有趣的是,定义适用性域的影响取决于接收器操作特征曲线下观察到的面积。这意味着它取决于分类问题的难度水平(以AUC ROC表示),对于中等难度的问题(AUC ROC范围为0.7 - 0.9)影响最大。在分类器排名中,分类随机森林平均表现最佳。因此,分类随机森林与相应的类别概率估计相结合,是具有适用性域的预测性二元化学信息学分类器的一个良好起点。图形摘要 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84c0/5543028/e074e69f9b95/13321_2017_230_Figa_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验