Brockhoff Dimo, Auger Anne, Hansen Nikolaus, Tušar Tea
Inria, CMAP, CNRS, Ecole Polytechnique, Institut Polytechnique de Paris, France
Jožef Stefan Institute, Ljubljana, Slovenia
Evol Comput. 2022 Jun 1;30(2):165-193. doi: 10.1162/evco_a_00298.
Several test function suites are being used for numerical benchmarking of multiobjective optimization algorithms. While they have some desirable properties, such as well-understood Pareto sets and Pareto fronts of various shapes, most of the currently used functions possess characteristics that are arguably underrepresented in real-world problems such as separability, optima located exactly at the boundary constraints, and the existence of variables that solely control the distance between a solution and the Pareto front. Via the alternative construction of combining existing single-objective problems from the literature, we describe the bbob-biobj test suite with 55 bi-objective functions in continuous domain, and its extended version with 92 bi-objective functions (bbob-biobj-ext). Both test suites have been implemented in the COCO platform for black-box optimization benchmarking and various visualizations of the test functions are shown to reveal their properties. Besides providing details on the construction of these problems and presenting their (known) properties, this article also aims at giving the rationale behind our approach in terms of groups of functions with similar properties, objective space normalization, and problem instances. The latter allows us to easily compare the performance of deterministic and stochastic solvers, which is an often overlooked issue in benchmarking.
几个测试函数套件正被用于多目标优化算法的数值基准测试。虽然它们具有一些理想的属性,比如广为人知的帕累托集和各种形状的帕累托前沿,但目前使用的大多数函数所具有的特征在现实世界问题中可能代表性不足,例如可分性、最优解恰好位于边界约束处,以及存在仅控制解与帕累托前沿之间距离的变量。通过结合文献中现有的单目标问题进行另类构建,我们描述了连续域中具有55个双目标函数的bbob - biobj测试套件及其具有92个双目标函数的扩展版本(bbob - biobj - ext)。这两个测试套件均已在用于黑盒优化基准测试的COCO平台中实现,并且展示了测试函数的各种可视化结果以揭示其属性。除了详细介绍这些问题的构建并呈现它们(已知的)属性外,本文还旨在从具有相似属性的函数组、目标空间归一化和问题实例等方面给出我们方法背后的基本原理。后者使我们能够轻松比较确定性和随机求解器的性能这一在基准测试中经常被忽视的问题。