• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于优化 XGBoost 算法的电影营销和发行环境下的票房预测模型。

The box office prediction model based on the optimized XGBoost algorithm in the context of film marketing and distribution.

机构信息

Kyonggi University, Suwon City, South Korea.

出版信息

PLoS One. 2024 Oct 3;19(10):e0309227. doi: 10.1371/journal.pone.0309227. eCollection 2024.

DOI:10.1371/journal.pone.0309227
PMID:39361570
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11449294/
Abstract

To improve the accuracy and efficiency of box office prediction, this study deeply discusses the application of the optimized eXtreme Gradient Boosting (XGBoost) model in this scenario and its advantages compared with other commonly used machine learning models. By comparing and analyzing five models, involving the Deep Neural Network, Light Gradient Boosting Machine, Random Forest, Gradient Boosting Decision Tree, and CatBoost, several key performance indicators such as accuracy, precision, recall, F1 score, generalization error, stability, robustness, and adaptability score are comprehensively investigated. The research results reveal that the optimization model proposed in this study is superior to the comparison model in most evaluation indicators, especially when the data volume reaches 2500, showing obvious advantages. For example, the accuracy is increased to 0.9, the F1 score is 0.9, the generalization error is reduced to 0.09, and the stability score is as high as 0.98. The robustness and adaptability scores are both 0.97, which proves its strong prediction ability and high stability and robustness on large-scale datasets. Therefore, this study provides scientific data support and a decision-making basis for the film industry in the formulation of marketing and distribution strategies. Moreover, film producers and distributors can reasonably estimate their market performance early in film shooting, optimize investment decisions, and reduce economic risks through accurate box office predictions.

摘要

为了提高票房预测的准确性和效率,本研究深入探讨了优化后的极端梯度提升(XGBoost)模型在这种情况下的应用,以及与其他常用机器学习模型相比的优势。通过比较和分析五个模型,包括深度神经网络、轻量级梯度提升机、随机森林、梯度提升决策树和 CatBoost,综合研究了准确性、精度、召回率、F1 分数、泛化误差、稳定性、鲁棒性和适应性得分等几个关键性能指标。研究结果表明,与比较模型相比,本研究提出的优化模型在大多数评估指标上都具有优势,特别是在数据量达到 2500 时,表现出明显的优势。例如,准确性提高到 0.9,F1 分数提高到 0.9,泛化误差降低到 0.09,稳定性得分高达 0.98。鲁棒性和适应性得分均为 0.97,这证明了它在大规模数据集上具有强大的预测能力和较高的稳定性和鲁棒性。因此,本研究为电影行业在制定营销策略和发行策略时提供了科学的数据支持和决策依据。此外,电影制作人和发行商可以通过准确的票房预测,在电影拍摄早期合理估计其市场表现,优化投资决策,降低经济风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/97c3b38bbf6f/pone.0309227.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/661e5e3e7326/pone.0309227.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/6f77b2c1a96a/pone.0309227.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/ddc848c3e930/pone.0309227.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/7fe0af2a6b65/pone.0309227.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/9f277cfc86b7/pone.0309227.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/97c3b38bbf6f/pone.0309227.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/661e5e3e7326/pone.0309227.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/6f77b2c1a96a/pone.0309227.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/ddc848c3e930/pone.0309227.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/7fe0af2a6b65/pone.0309227.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/9f277cfc86b7/pone.0309227.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e622/11449294/97c3b38bbf6f/pone.0309227.g006.jpg

相似文献

1
The box office prediction model based on the optimized XGBoost algorithm in the context of film marketing and distribution.基于优化 XGBoost 算法的电影营销和发行环境下的票房预测模型。
PLoS One. 2024 Oct 3;19(10):e0309227. doi: 10.1371/journal.pone.0309227. eCollection 2024.
2
Enhanced osteoporotic fracture prediction in postmenopausal women using Bayesian optimization of machine learning models with genetic risk score.使用具有遗传风险评分的机器学习模型的贝叶斯优化增强绝经后女性骨质疏松性骨折预测
J Bone Miner Res. 2024 May 2;39(4):462-472. doi: 10.1093/jbmr/zjae025.
3
Prediction model of atrial fibrillation recurrence after Cox-Maze IV procedure in patients with chronic valvular disease and atrial fibrillation based on machine learning algorithm.基于机器学习算法的慢性瓣膜病合并心房颤动患者 Cox-Maze IV 术后心房颤动复发预测模型。
Zhong Nan Da Xue Xue Bao Yi Xue Ban. 2023 Jul 28;48(7):995-1007. doi: 10.11817/j.issn.1672-7347.2023.230018.
4
Optimizing Film Companies' Marketing Strategy Using Blockchain and Recurrent Neural Network Model.利用区块链和递归神经网络模型优化电影公司的营销策略。
Comput Intell Neurosci. 2022 Sep 9;2022:4139074. doi: 10.1155/2022/4139074. eCollection 2022.
5
Early box office prediction in China's film market based on a stacking fusion model.基于堆叠融合模型的中国电影市场早期票房预测
Ann Oper Res. 2022;308(1-2):321-338. doi: 10.1007/s10479-020-03804-4. Epub 2020 Oct 6.
6
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用:以新生儿呼吸暂停预测为例的研究
Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.
7
Do we need different machine learning algorithms for QSAR modeling? A comprehensive assessment of 16 machine learning algorithms on 14 QSAR data sets.我们是否需要不同的机器学习算法来进行定量构效关系建模?对 16 种机器学习算法在 14 个定量构效关系数据集上的综合评估。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa321.
8
Prognosis prediction and risk stratification of transarterial chemoembolization or intraarterial chemotherapy for unresectable hepatocellular carcinoma based on machine learning.基于机器学习的经动脉化疗栓塞术或动脉内化疗对不可切除肝细胞癌的预后预测及风险分层
Eur Radiol. 2024 Aug;34(8):5094-5107. doi: 10.1007/s00330-024-10581-2. Epub 2024 Jan 30.
9
Predicting Fetal Alcohol Spectrum Disorders Using Machine Learning Techniques: Multisite Retrospective Cohort Study.使用机器学习技术预测胎儿酒精谱系障碍:多地点回顾性队列研究。
J Med Internet Res. 2023 Jul 18;25:e45041. doi: 10.2196/45041.
10
Air passenger carbon offset and carbon neutrality strategies: Implementation mechanism by convolutional neural network.航空乘客碳抵消与碳中和策略:基于卷积神经网络的实施机制
Heliyon. 2024 Sep 5;10(19):e37495. doi: 10.1016/j.heliyon.2024.e37495. eCollection 2024 Oct 15.

本文引用的文献

1
Revisiting predictions of movie economic success: random Forest applied to profits.重新审视电影经济成功的预测:应用于利润的随机森林算法
Multimed Tools Appl. 2023 Mar 28:1-24. doi: 10.1007/s11042-023-15169-4.
2
Towards Data-Driven Decision-Making in the Korean Film Industry: An XAI Model for Box Office Analysis Using Dimension Reduction, Clustering, and Classification.迈向韩国电影产业中数据驱动的决策:一种使用降维、聚类和分类的票房分析可解释人工智能模型
Entropy (Basel). 2023 Mar 27;25(4):571. doi: 10.3390/e25040571.
3
VAR-tree model based spatio-temporal characterization and prediction of O concentration in China.
基于 VAR 树模型的中国 O 浓度时空特征刻画与预测。
Ecotoxicol Environ Saf. 2023 Jun 1;257:114960. doi: 10.1016/j.ecoenv.2023.114960. Epub 2023 Apr 26.
4
Predicting Box-Office Markets with Machine Learning Methods.用机器学习方法预测票房市场。
Entropy (Basel). 2022 May 16;24(5):711. doi: 10.3390/e24050711.
5
Early box office prediction in China's film market based on a stacking fusion model.基于堆叠融合模型的中国电影市场早期票房预测
Ann Oper Res. 2022;308(1-2):321-338. doi: 10.1007/s10479-020-03804-4. Epub 2020 Oct 6.