Murari A, Peluso E, Lungaroni M, Gaudio P, Vega J, Gelfusa M
Consorzio RFX (CNR, ENEA, INFN, Università di Padova, Acciaierie Venete SpA), Corso Stati Uniti 4, 35127, Padua, Italy.
Department of Industrial Engineering, University of Rome "Tor Vergata", via del Politecnico 1, 00133, Rome, Italy.
Sci Rep. 2020 Nov 16;10(1):19858. doi: 10.1038/s41598-020-76826-4.
In recent years, the techniques of the exact sciences have been applied to the analysis of increasingly complex and non-linear systems. The related uncertainties and the large amounts of data available have progressively shown the limits of the traditional hypothesis driven methods, based on first principle theories. Therefore, a new approach of data driven theory formulation has been developed. It is based on the manipulation of symbols with genetic computing and it is meant to complement traditional procedures, by exploring large datasets to find the most suitable mathematical models to interpret them. The paper reports on the vast amounts of numerical tests that have shown the potential of the new techniques to provide very useful insights in various studies, ranging from the formulation of scaling laws to the original identification of the most appropriate dimensionless variables to investigate a given system. The application to some of the most complex experiments in physics, in particular thermonuclear plasmas, has proved the capability of the methodology to address real problems, even highly nonlinear and practically important ones such as catastrophic instabilities. The proposed tools are therefore being increasingly used in various fields of science and they constitute a very good set of techniques to bridge the gap between experiments, traditional data analysis and theory formulation.
近年来,精密科学技术已被应用于对日益复杂的非线性系统的分析。相关的不确定性以及大量可得数据逐渐显示出基于第一性原理的传统假设驱动方法的局限性。因此,一种新的数据驱动理论构建方法应运而生。它基于利用遗传计算对符号进行操作,旨在通过探索大型数据集来寻找最合适的数学模型以解释这些数据,从而对传统程序起到补充作用。本文报告了大量数值测试,这些测试表明新技术在各种研究中具有提供非常有用见解的潜力,范围从标度律的制定到确定研究给定系统最合适的无量纲变量。将该方法应用于一些物理学中最复杂的实验,特别是热核聚变等离子体,已证明该方法有能力解决实际问题,甚至是诸如灾难性不稳定性等高度非线性且实际意义重大的问题。因此,所提出的工具在各个科学领域中越来越多地被使用,它们构成了一套很好的技术,可弥合实验、传统数据分析与理论构建之间的差距。