Seo In-Jae, Lee Yo-Han, Jang Beakcheol
Graduate School of Information, Yonsei University, Seoul, Seodaemun-gu, Republic of Korea.
PLoS One. 2025 May 22;20(5):e0324621. doi: 10.1371/journal.pone.0324621. eCollection 2025.
As the fashion e-commerce markets rapidly develop, tens of thousands of products are registered daily on e-commerce platforms. Individual sellers register products after setting up a product category directly on a fashion e-commerce platform. However, many sellers fail to find a suitable category and mistakenly register their products under incorrect ones. Precise category matching is important for increasing sales through search optimization and accurate product exposure. However, manually correcting registered categories is time-consuming and costly for platform managers. To resolve this problem, this study proposes a methodology for fashion e-commerce product classification based on multi-modal deep learning and transfer learning. Through the proposed methodology, three challenges in classifying fashion e-commerce products are addressed. First, the issue of extremely biased e-commerce data is addressed through under-sampling. Second, multi-modal deep learning enables the model to simultaneously use input data in different formats, which helps mitigate the impact of noisy and low-quality e-commerce data by providing richer information.Finally, the high computational cost and long training times involved in training deep learning models with both image and text data are mitigated by leveraging transfer learning. In this study, three strategies for transfer learning to fine-tune the image and text modules are presented. In addition, five methods for fusing feature vectors extracted from a single modal into one and six strategies for fine-tuning multi-modal models are presented, featuring a total of 14 strategies. The study shows that multi-modal models outperform unimodal models based solely on text or image. It also suggests the optimal conditions for classifying e-commerce products, helping fashion e-commerce practitioners construct models tailored to their respective business environments more efficiently.
随着时尚电子商务市场的迅速发展,电子商务平台上每天都有成千上万种产品被注册。个体卖家在时尚电子商务平台上直接设置产品类别后注册产品。然而,许多卖家未能找到合适的类别,错误地将其产品注册到不正确的类别下。精确的类别匹配对于通过搜索优化和准确的产品曝光来增加销售额至关重要。然而,对于平台管理者来说,手动纠正注册类别既耗时又成本高昂。为了解决这个问题,本研究提出了一种基于多模态深度学习和迁移学习的时尚电子商务产品分类方法。通过所提出的方法,解决了时尚电子商务产品分类中的三个挑战。首先,通过欠采样解决了电子商务数据极度不平衡的问题。其次,多模态深度学习使模型能够同时使用不同格式的输入数据,通过提供更丰富的信息有助于减轻嘈杂和低质量电子商务数据的影响。最后,通过利用迁移学习减轻了使用图像和文本数据训练深度学习模型时涉及的高计算成本和长时间训练。在本研究中,提出了三种用于迁移学习以微调图像和文本模块的策略。此外,还提出了五种将从单模态提取的特征向量融合为一个的方法和六种微调多模态模型的策略,共14种策略。研究表明,多模态模型优于仅基于文本或图像的单模态模型。它还提出了电子商务产品分类的最佳条件,帮助时尚电子商务从业者更有效地构建适合其各自业务环境的模型。