

Comparative analysis of BERT and FastText representations on crowdfunding campaign success prediction.

Author

Gunduz Hakan

Affiliation

Software Engineering Department, Kocaeli University, Kocaeli, Marmara, Turkey.

Publication

PeerJ Comput Sci. 2024 Sep 11;10:e2316. doi: 10.7717/peerj-cs.2316. eCollection 2024.

DOI: 10.7717/peerj-cs.2316
PMID: 39314718
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11419673/
Abstract

Crowdfunding has become a popular financing method, attracting investors, businesses, and entrepreneurs. However, many campaigns fail to secure funding, making it crucial to reduce participation risks using artificial intelligence (AI). This study investigates the effectiveness of advanced AI techniques in predicting the success of crowdfunding campaigns on Kickstarter by analyzing campaign blurbs. We compare the performance of two widely used text representation models, bidirectional encoder representations from transformers (BERT) and FastText, in conjunction with long-short term memory (LSTM) and gradient boosting machine (GBM) classifiers. Our analysis involves preprocessing campaign blurbs, extracting features using BERT and FastText, and evaluating the predictive performance of these features with LSTM and GBM models. All experimental results show that BERT representations significantly outperform FastText, with the highest accuracy of 0.745 achieved using a fine-tuned BERT model combined with LSTM. These findings highlight the importance of using deep contextual embeddings and the benefits of fine-tuning pre-trained models for domain-specific applications. The results are benchmarked against existing methods, demonstrating the superiority of our approach. This study provides valuable insights for improving predictive models in the crowdfunding domain, offering practical implications for campaign creators and investors.
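The pipeline the abstract describes — encode each campaign blurb as a fixed-length vector, then train a classifier on those vectors — can be sketched for the GBM branch as follows. This is a minimal illustration, not the paper's code: random vectors stand in for the actual BERT (768-dim) or FastText embeddings, the labels are synthetic (1 = funded, 0 = failed), and the accuracy printed here will hover near chance rather than the paper's 0.745.

```python
# Sketch of the classification stage on stand-in embeddings.
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n_campaigns, dim = 400, 768               # dim=768 mimics BERT's hidden size
X = rng.normal(size=(n_campaigns, dim))   # stand-in for blurb embeddings
y = rng.integers(0, 2, size=n_campaigns)  # synthetic funded/failed labels

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=0)

# GBM classifier, as in the paper's BERT/FastText + GBM comparison
gbm = GradientBoostingClassifier(n_estimators=100, random_state=0)
gbm.fit(X_tr, y_tr)
acc = accuracy_score(y_te, gbm.predict(X_te))
print(f"held-out accuracy: {acc:.3f}")
```

Swapping in real features would only change how `X` is built (e.g., mean-pooled BERT token embeddings per blurb); the training and evaluation steps stay the same, which is what makes the BERT-vs-FastText comparison clean.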


Figures (g001–g008):
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65a5/11419673/ffba27967cb2/peerj-cs-10-2316-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65a5/11419673/18e754a6c549/peerj-cs-10-2316-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65a5/11419673/cbac96d474b0/peerj-cs-10-2316-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65a5/11419673/416a94b44ecd/peerj-cs-10-2316-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65a5/11419673/81ec21084324/peerj-cs-10-2316-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65a5/11419673/43faf8ac75d0/peerj-cs-10-2316-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65a5/11419673/f88b3c04d210/peerj-cs-10-2316-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65a5/11419673/4cac276fc9af/peerj-cs-10-2316-g008.jpg

Similar articles

1. Comparative analysis of BERT and FastText representations on crowdfunding campaign success prediction.
   PeerJ Comput Sci. 2024 Sep 11;10:e2316. doi: 10.7717/peerj-cs.2316. eCollection 2024.
2. Fine-Tuning Large Language Models to Enhance Programmatic Assessment in Graduate Medical Education.
   J Educ Perioper Med. 2024 Sep 30;26(3):E729. doi: 10.46374/VolXXVI_Issue3_Moore. eCollection 2024 Jul-Sep.
3. Extracting comprehensive clinical information for breast cancer using deep learning methods.
   Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
4. A deep learning approach in predicting products' sentiment ratings: a comparative analysis.
   J Supercomput. 2022;78(5):7206-7226. doi: 10.1007/s11227-021-04169-6. Epub 2021 Nov 5.
5. BERT-based Ranking for Biomedical Entity Normalization.
   AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:269-277. eCollection 2020.
6. A comparative study on deep learning models for text classification of unstructured medical notes with various levels of class imbalance.
   BMC Med Res Methodol. 2022 Jul 2;22(1):181. doi: 10.1186/s12874-022-01665-y.
7. Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction.
   NPJ Digit Med. 2021 May 20;4(1):86. doi: 10.1038/s41746-021-00455-y.
8. Modified Bidirectional Encoder Representations From Transformers Extractive Summarization Model for Hospital Information Systems Based on Character-Level Tokens (AlphaBERT): Development and Performance Evaluation.
   JMIR Med Inform. 2020 Apr 29;8(4):e17787. doi: 10.2196/17787.
9. Multi-class sentiment analysis of urdu text using multilingual BERT.
   Sci Rep. 2022 Mar 31;12(1):5436. doi: 10.1038/s41598-022-09381-9.
10. A novel deep learning approach to extract Chinese clinical entities for lung cancer screening and staging.
   BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):214. doi: 10.1186/s12911-021-01575-x.
