Yousefi Elham, Pronzato Luc, Hainy Markus, Müller Werner G, Wynn Henry P
Institute of Applied Statistics, Johannes Kepler University, Altenberger Straße 69, 4040 Linz, Austria.
Université Côte d'Azur, CNRS, Laboratoire I3S - UMR 7271, 2000, route des Lucioles-Les Algorithmes-bât. Euclide B, 06900 Sophia Antipolis, France.
Stat Pap (Berl). 2023;64(4):1275-1304. doi: 10.1007/s00362-023-01436-x. Epub 2023 Mar 30.
The paper covers the design and analysis of experiments to discriminate between two Gaussian process models with different covariance kernels, such as those widely used in computer experiments, kriging, sensor location and machine learning. Two frameworks are considered. First, we study sequential constructions, where successive design (observation) points are selected, either as additional points to an existing design or from the beginning of observation. The selection relies on the maximisation of the difference between the symmetric Kullback Leibler divergences for the two models, which depends on the observations, or on the mean squared error of both models, which does not. Then, we consider static criteria, such as the familiar log-likelihood ratios and the Fréchet distance between the covariance functions of the two models. Other distance-based criteria, simpler to compute than previous ones, are also introduced, for which, considering the framework of approximate design, a necessary condition for the optimality of a design measure is provided. The paper includes a study of the mathematical links between different criteria and numerical illustrations are provided.
本文涵盖了用于区分具有不同协方差核的两个高斯过程模型的实验设计与分析,这些模型广泛应用于计算机实验、克里金法、传感器定位和机器学习等领域。文中考虑了两种框架。首先,我们研究序贯构造,即选择连续的设计(观测)点,这些点既可以是现有设计的额外点,也可以是从观测开始时选取。选择依赖于两个模型的对称库尔贝克 - 莱布勒散度之差的最大化,该差值依赖于观测值,或者依赖于两个模型的均方误差,而均方误差则不依赖于观测值。然后,我们考虑静态准则,例如常见的对数似然比以及两个模型协方差函数之间的弗雷歇距离。还引入了其他基于距离的准则,这些准则比之前的准则更易于计算,针对这些准则,在近似设计的框架下,给出了设计度量最优性的必要条件。本文还研究了不同准则之间的数学联系,并提供了数值示例。