School of Mathematics and Statistics, The University of Melbourne, Parkville, Victoria 3010 Australia
Evol Comput. 2020 Fall;28(3):379-404. doi: 10.1162/evco_a_00262. Epub 2019 Jul 11.
This article presents a method to generate diverse and challenging new test instances for continuous black-box optimization. Each instance is represented as a feature vector of exploratory landscape analysis measures. By projecting the features into a two-dimensional instance space, the location of existing test instances can be visualized, and their similarities and differences revealed. New instances are generated through genetic programming which evolves functions with controllable characteristics. Convergence to selected target points in the instance space is used to drive the evolutionary process, such that the new instances span the entire space more comprehensively. We demonstrate the method by generating two-dimensional functions to visualize its success, and ten-dimensional functions to test its scalability. We show that the method can recreate existing test functions when target points are co-located with existing functions, and can generate new functions with entirely different characteristics when target points are located in empty regions of the instance space. Moreover, we test the effectiveness of three state-of-the-art algorithms on the new set of instances. The results demonstrate that the new set is not only more diverse than a well-known benchmark set, but also more challenging for the tested algorithms. Hence, the method opens up a new avenue for developing test instances with controllable characteristics, necessary to expose the strengths and weaknesses of algorithms, and drive algorithm development.
本文提出了一种为连续黑盒优化生成多样化且具有挑战性的新测试实例的方法。每个实例都表示为探索性景观分析度量的特征向量。通过将特征投影到二维实例空间中,可以可视化现有测试实例的位置,并揭示它们的相似性和差异性。新实例通过遗传编程生成,遗传编程可进化出具有可控特征的函数。通过收敛到实例空间中的选定目标点来驱动进化过程,从而使新实例更全面地覆盖整个空间。我们通过生成二维函数来可视化其成功,以及生成十维函数来测试其可扩展性来演示该方法。我们表明,当目标点与现有函数重合时,该方法可以重现现有的测试函数,而当目标点位于实例空间的空区域时,该方法可以生成具有完全不同特征的新函数。此外,我们在新的实例集上测试了三种最先进算法的有效性。结果表明,新的实例集不仅比著名的基准集更加多样化,而且对测试算法也更具挑战性。因此,该方法为开发具有可控特征的测试实例开辟了新途径,这对于暴露算法的优缺点并推动算法发展是必要的。