Suppr超能文献

化学计量学中的埃特马迪回归:基于可靠性的建模与预测程序。

Etemadi regression in chemometrics: Reliability-based procedures for modeling and forecasting.

作者信息

Etemadi Sepideh, Khashei Mehdi

机构信息

Department of Industrial and Systems Engineering, Isfahan University of Technology (IUT), Isfahan, 84156-83111, Iran.

出版信息

Heliyon. 2024 Feb 15;10(5):e26399. doi: 10.1016/j.heliyon.2024.e26399. eCollection 2024 Mar 15.

Abstract

The creation of predictive models with a high degree of generalizability in chemical analysis and process optimization is of paramount importance. Nonetheless, formulating a prediction model based on collected data from chemical measurements that maximize quantitative generalizability remains a challenging task for chemometrics experts. To tackle this challenge, a range of forecasting models with varying characteristics, structures, and capabilities has been developed, utilizing either accuracy-based or reliability-based modeling strategies. While the majority of models follow the accuracy-based approach, a recently proposed reliability-based approach, known as the Etemadi approach, has shown impressive performance across various scientific fields. The Etemadi models were constructed through a reliability-based parameter estimation process in such a manner that maximizes the models' reliability. However, the foundation of modeling procedures for chemometrics purposes is built upon the assumption that high generalizability in inaccessible/test data is achieved through the accuracy-based training procedure in which errors in available historical/training data are minimized. After conducting a thorough review of the current literature, we have found that none of the forecasting models for chemometrics purposes incorporate reliability into their modeling procedures. Given the dynamic and highly sensitive nature of chemistry experiments and processes, implementing a reliable model that controls performance criteria variation is a promising strategy for achieving stable and robust forecasts. To address this research gap, this paper introduces several key innovations, which can be highlighted as follows: Proposing a general design structure based on a new optimal reliability-based parameter estimation process. Introducing a novel risk-based modeling strategy that minimizes the performance variation of models implemented under different conditions in chemical laboratory experiments, to generate a more generalizable model for diverse applications in chemometrics. Specifying the degree of influence that each reliability and accuracy factor has in enhancing the generalizability and uncertainty modeling of chemometric models. Empirical evidence confirms the effectiveness and superior performance of reliability-based models compared to accuracy-based models in 78.95% of the cases across various fields, including Pharmacology, Biochemistry, Agrochemical, Geochemical, Biological, Pollutants, Physicochemical Properties, and Gases Experiment. Furthermore, the study's findings demonstrate that the reliability-based modeling approach outperforms the accuracy-based strategy in terms of MAE, MSE, ARV, and RMSE by an average of 4.697%, 5.646%, 5.646%, and 4.342%, respectively. It is also statistically proven that reliability has a more significant impact on improving the generalizability of chemometric models than accuracy. This emphasizes the importance of including reliability as a crucial factor in chemometrics modeling, a consideration that has been overlooked in traditional modeling processes. Consequently, reliability-based modeling approaches can be regarded as a viable alternative to conventional accuracy-based modeling methods for chemical modeling purposes.

摘要

在化学分析和过程优化中创建具有高度通用性的预测模型至关重要。尽管如此,基于化学测量收集的数据制定一个能使定量通用性最大化的预测模型,对化学计量学专家来说仍然是一项具有挑战性的任务。为应对这一挑战,已经开发了一系列具有不同特征、结构和能力的预测模型,采用基于准确性或基于可靠性的建模策略。虽然大多数模型采用基于准确性的方法,但最近提出的一种基于可靠性的方法,即埃泰马迪方法,在各个科学领域都表现出了令人印象深刻的性能。埃泰马迪模型是通过基于可靠性的参数估计过程构建的,其方式是使模型的可靠性最大化。然而,化学计量学建模程序的基础是建立在这样一种假设之上的,即通过基于准确性的训练程序来实现不可访问/测试数据中的高通用性,在该程序中,可用历史/训练数据中的误差被最小化。在对当前文献进行全面综述后,我们发现,用于化学计量学目的的预测模型中没有一个在其建模程序中纳入可靠性。鉴于化学实验和过程的动态性和高度敏感性,实施一个能控制性能标准变化的可靠模型是实现稳定和稳健预测的一个有前景的策略。为填补这一研究空白,本文介绍了几个关键创新点,可突出如下:提出一种基于新的最优可靠性参数估计过程的通用设计结构。引入一种新颖的基于风险的建模策略,该策略可最小化在化学实验室实验不同条件下实施的模型的性能变化,以生成一个在化学计量学中适用于多种应用的更具通用性的模型。明确每个可靠性和准确性因素在增强化学计量学模型的通用性和不确定性建模方面的影响程度。实证证据证实,在包括药理学、生物化学、农用化学品、地球化学、生物学、污染物、物理化学性质和气体实验在内的各个领域中,78.95%的情况下,基于可靠性的模型比基于准确性的模型更有效且性能更优。此外,该研究结果表明,基于可靠性的建模方法在平均绝对误差(MAE)、均方误差(MSE)、平均相对方差(ARV)和均方根误差(RMSE)方面分别比基于准确性的策略平均高出4.697%、5.646%、5.646%和4.342%。统计上也证明,可靠性对提高化学计量学模型的通用性的影响比准确性更显著。这强调了将可靠性作为化学计量学建模中的一个关键因素的重要性,而这一因素在传统建模过程中被忽视了。因此,基于可靠性的建模方法可被视为用于化学建模目的的传统基于准确性的建模方法的一个可行替代方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d21a/10907519/d6ae0befaac9/fx1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验