Suppr超能文献

化学计量学中新型混合整数优化稀疏回归方法。

Novel mixed integer optimization sparse regression approach in chemometrics.

机构信息

Operations Research Center, MIT, United States.

Operations Research Center, MIT, United States.

出版信息

Anal Chim Acta. 2020 Nov 15;1137:115-124. doi: 10.1016/j.aca.2020.08.054. Epub 2020 Sep 2.

Abstract

Sparse mathematical modelling plays an increasingly important role in chemometrics due to its interpretability and prediction power. While many sparse techniques used in chemometrics rely on L penalization to create sparser models, Mixed Integer Optimization (MIO) achieves sparsity by imposing constraints directly in the model. In this paper, we develop an intuitive and flexible robust sparse regression framework using MIO. We use constraints and penalization to achieve sparsity and robustness respectively. We test and compare results with those obtained using other techniques generating sparser models such as LASSO and sparse PLS. We also use PLS as a baseline to compare predictive performance. We use a LIBS data set of certified reference materials (CRM) of various mineral ores to illustrate the framework using different objective functions. The MIO framework proposed improves accuracy, sparsity and robustness vs. LASSO and SPLS. MIO achieves an average R higher than other methods on average by at least 10.6%. Robust MIO approach also improves interpretability. It also uses 4.3 variables on average while LASSO and SPLS use 16.1 and 805.8 respectively. We also illustrate how interpretability can help build better models through examples derived from the data sets used. When adding noise to the signal, MIO achieves an R of 0.69 on average when all models have negative R values. The MIO framework proposed is versatile and could be used in other chemometrics applications.

摘要

稀疏数学建模由于其可解释性和预测能力,在化学计量学中扮演着越来越重要的角色。虽然化学计量学中使用的许多稀疏技术都依赖于 L 惩罚来创建更稀疏的模型,但混合整数优化(MIO)通过在模型中直接施加约束来实现稀疏性。在本文中,我们使用 MIO 开发了一个直观且灵活的稳健稀疏回归框架。我们分别使用约束和惩罚来实现稀疏性和稳健性。我们使用 PLS 作为基线来比较预测性能,并与使用 LASSO 和稀疏 PLS 等生成更稀疏模型的其他技术的结果进行了测试和比较。我们使用各种矿物矿石的认证参考物质(CRM)的 LIBS 数据集来使用不同的目标函数说明该框架。与 LASSO 和 SPLS 相比,所提出的 MIO 框架在准确性、稀疏性和稳健性方面有所提高。稳健的 MIO 方法还提高了可解释性。它还平均使用 4.3 个变量,而 LASSO 和 SPLS 分别使用 16.1 和 805.8 个变量。我们还通过从使用的数据集中得出的示例说明了可解释性如何帮助构建更好的模型。当向信号中添加噪声时,当所有模型的 R 值均为负时,MIO 平均达到 0.69 的 R 值。所提出的 MIO 框架用途广泛,可用于其他化学计量学应用。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验