Juan Zhang, Zhang Jing, Gao Ming
Fanli Business School, Nanyang Institute of Technology, Nanyang, Henan, China.
Hospitality Management Department, Tourism College of Zhejiang, Hangzhou, Zhejiang, China.
Front Neurorobot. 2024 Nov 26;18:1439195. doi: 10.3389/fnbot.2024.1439195. eCollection 2024.
With the rapid development of the tourism industry, the demand for accurate and personalized travel route recommendations has significantly increased. However, traditional methods often fail to effectively integrate visual and sequential information, leading to recommendations that are both less accurate and less personalized.
This paper introduces SelfAM-Vtrans, a novel algorithm that leverages multimodal data-combining visual Transformers, LSTMs, and self-attention mechanisms-to enhance the accuracy and personalization of travel route recommendations. SelfAM-Vtrans integrates visual and sequential information by employing a visual Transformer to extract features from travel images, thereby capturing spatial relationships within them. Concurrently, a Long Short-Term Memory (LSTM) network encodes sequential data to capture the temporal dependencies within travel sequences. To effectively merge these two modalities, a self-attention mechanism fuses the visual features and sequential encodings, thoroughly accounting for their interdependencies. Based on this fused representation, a classification or regression model is trained using real travel datasets to recommend optimal travel routes.
The algorithm was rigorously evaluated through experiments conducted on real-world travel datasets, and its performance was benchmarked against other route recommendation methods. The results demonstrate that SelfAM-Vtrans significantly outperforms traditional approaches in terms of both recommendation accuracy and personalization. By comprehensively incorporating both visual and sequential data, this method offers travelers more tailored and precise route suggestions, thereby enriching the overall travel experience.
随着旅游业的快速发展,对准确且个性化的旅行路线推荐的需求显著增加。然而,传统方法往往无法有效地整合视觉和序列信息,导致推荐的准确性和个性化程度都较低。
本文介绍了SelfAM-Vtrans,这是一种新颖的算法,它利用多模态数据——结合视觉Transformer、长短期记忆网络(LSTM)和自注意力机制——来提高旅行路线推荐的准确性和个性化程度。SelfAM-Vtrans通过使用视觉Transformer从旅行图像中提取特征来整合视觉和序列信息,从而捕捉其中的空间关系。同时,长短期记忆(LSTM)网络对序列数据进行编码,以捕捉旅行序列中的时间依赖性。为了有效地融合这两种模态,自注意力机制融合视觉特征和序列编码,充分考虑它们的相互依赖性。基于这种融合表示,使用真实旅行数据集训练分类或回归模型,以推荐最佳旅行路线。
通过在真实世界旅行数据集上进行的实验对该算法进行了严格评估,并将其性能与其他路线推荐方法进行了基准测试。结果表明,SelfAM-Vtrans在推荐准确性和个性化方面均显著优于传统方法。通过全面整合视觉和序列数据,该方法为旅行者提供了更具针对性和精确性的路线建议,从而丰富了整体旅行体验。