Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, Russia.
Department of Physics, University of Gothenburg, SE-41296 Gothenburg, Sweden.
Sensors (Basel). 2021 Oct 21;21(21):6982. doi: 10.3390/s21216982.
The power of machine learning (ML) in feature identification can be harnessed for determining quantities in experiments that are difficult to measure directly. However, if an ML model is trained on simulated data, rather than experimental results, the differences between the two can pose an obstacle to reliable data extraction. Here we report on the development of ML-based diagnostics for experiments on high-intensity laser-matter interactions. With the intention to accentuate robust, physics-governed features, the presence of which is tolerant to such differences, we test the application of principal component analysis, data augmentation and training with data that has superimposed noise of gradually increasing amplitude. Using synthetic data of simulated experiments, we identify that the approach based on the noise of increasing amplitude yields the most accurate ML models and thus is likely to be useful in similar projects on ML-based diagnostics.
机器学习(ML)在特征识别方面的强大功能可用于确定难以直接测量的实验数量。然而,如果 ML 模型是基于模拟数据而不是实验结果进行训练的,那么两者之间的差异可能会成为可靠数据提取的障碍。在这里,我们报告了用于高强度激光与物质相互作用实验的基于机器学习的诊断方法的开发。为了强调稳健的、受物理规律控制的特征,我们测试了主成分分析、数据增强以及使用逐渐增加幅度的噪声叠加数据进行训练的应用。使用模拟实验的合成数据,我们确定基于幅度增加噪声的方法可以产生最准确的 ML 模型,因此可能对基于 ML 的诊断的类似项目有用。