Wang Yang, Chen Zhibin
Department of Mathematics, Kunming University of Science and Technology, Kunming 650093, China.
Math Biosci Eng. 2022 Jul 4;19(10):9730-9748. doi: 10.3934/mbe.2022452.
Recent research has showen that deep reinforcement learning (DRL) can be used to design better heuristics for the traveling salesman problem (TSP) on the small scale, but does not do well when generalized to large instances. In order to improve the generalization ability of the model when the nodes change from small to large, we propose a dynamic graph Conv-LSTM model (DGCM) to the solve large-scale TSP. The noted feature of our model is the use of a dynamic encoder-decoder architecture and a convolution long short-term memory network, which enable the model to capture the topological structure of the graph dynamically, as well as the potential relationships between nodes. In addition, we propose a dynamic positional encoding layer in the DGCM, which can improve the quality of solutions by providing more location information. The experimental results show that the performance of the DGCM on the large-scale TSP surpasses the state-of-the-art DRL-based methods and yields good performance when generalized to real-world datasets. Moreover, our model compares favorably to heuristic algorithms and professional solvers in terms of computational time.
最近的研究表明,深度强化学习(DRL)可用于为小规模旅行商问题(TSP)设计更好的启发式算法,但在推广到大规模实例时效果不佳。为了提高模型在节点从小规模变为大规模时的泛化能力,我们提出了一种动态图卷积长短期记忆模型(DGCM)来解决大规模TSP问题。我们模型的显著特点是使用了动态编码器 - 解码器架构和卷积长短期记忆网络,这使得模型能够动态地捕捉图的拓扑结构以及节点之间的潜在关系。此外,我们在DGCM中提出了一个动态位置编码层,它可以通过提供更多位置信息来提高解决方案的质量。实验结果表明,DGCM在大规模TSP上的性能超过了基于DRL的现有最优方法,并且在推广到真实世界数据集时表现良好。此外,在计算时间方面,我们的模型优于启发式算法和专业求解器。