Predictive performance of count regression models versus machine learning techniques: A comparative analysis using an automobile insurance claims frequency dataset.

Author information

Alomair Gadir

Affiliation

Department of Quantitative Methods, School of Business, King Faisal University, Al-Ahsa, Saudi Arabia.

Publication information

PLoS One. 2024 Dec 31;19(12):e0314975. doi: 10.1371/journal.pone.0314975. eCollection 2024.

DOI:10.1371/journal.pone.0314975
PMID:39739961
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11687910/
Abstract

Accurate forecasting of claim frequency in automobile insurance is essential for insurers to assess risks effectively and establish appropriate pricing policies. Traditional methods typically rely on a Poisson distribution for modeling claim counts; however, this approach can be inadequate due to frequent zero-claim periods, leading to zero inflation in the data. Zero inflation occurs when more zeros are observed than expected under standard Poisson or negative binomial (NB) models. While machine learning (ML) techniques have been explored for predictive analytics in other contexts, their application to zero-inflated insurance data remains limited. This study investigates the utility of ML in improving forecast accuracy under conditions of zero-inflation, a data characteristic common in automobile insurance. The research involved a comparative evaluation of several models, including Poisson, NB, zero-inflated Poisson (ZIP), hurdle Poisson, zero-inflated negative binomial (ZINB), hurdle negative binomial, random forest (RF), support vector machine (SVM), and artificial neural network (ANN) on an insurance dataset. The performance of these models was assessed using mean absolute error. The results reveal that the SVM model outperforms others in predictive accuracy, particularly in handling zero-inflation, followed by the ZIP and ZINB models. In contrast, the traditional Poisson and NB models showed lower predictive capabilities. By addressing the challenge of zero-inflation in automobile claim data, this study offers insights into improving the accuracy of claim frequency predictions. Although this study is based on a single dataset, the findings provide valuable perspectives on enhancing prediction accuracy and improving risk management practices in the insurance industry.
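
The abstract's definition of zero inflation (more zeros than a standard Poisson model expects) and its evaluation metric (mean absolute error) can be illustrated with a small simulation. The following is a minimal sketch, not the study's method or data: the claim counts are simulated, the parameter values `lam=0.8` and `pi=0.6` are arbitrary illustrative choices, and all function names are hypothetical.

```python
import math
import random


def poisson_pmf(k, lam):
    """Probability of observing k events under Poisson(lam)."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def zip_pmf(k, lam, pi):
    """Zero-inflated Poisson: with probability pi the count is a
    structural zero, otherwise it is drawn from Poisson(lam)."""
    base = (1.0 - pi) * poisson_pmf(k, lam)
    return pi + base if k == 0 else base


def sample_poisson(lam, rng):
    """Draw one Poisson(lam) variate (Knuth's method, fine for small lam)."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def sample_zip(n, lam, pi, rng):
    """Draw n zero-inflated Poisson counts."""
    return [0 if rng.random() < pi else sample_poisson(lam, rng)
            for _ in range(n)]


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between observed and predicted counts."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Simulated claim counts (illustrative parameters, not taken from the paper).
rng = random.Random(0)
claims = sample_zip(20000, lam=0.8, pi=0.6, rng=rng)

observed_zero_share = claims.count(0) / len(claims)
mean_claims = sum(claims) / len(claims)

# Share of zeros a plain Poisson model with the same mean would expect.
poisson_zero_share = math.exp(-mean_claims)
# Share of zeros implied by the zero-inflated model that generated the data.
zip_zero_share = zip_pmf(0, lam=0.8, pi=0.6)

# MAE of a constant prediction equal to the sample mean.
mae_mean = mean_absolute_error(claims, [mean_claims] * len(claims))

print(f"observed zeros:      {observed_zero_share:.3f}")
print(f"Poisson would expect {poisson_zero_share:.3f}")
print(f"ZIP would expect     {zip_zero_share:.3f}")
print(f"MAE (predict mean):  {mae_mean:.3f}")
```

In this sketch the observed share of zeros exceeds what a Poisson model with the same mean predicts, which is the gap that zero-inflated and hurdle models are designed to capture.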

Figures

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee84/11687910/d7601fa6d1ce/pone.0314975.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee84/11687910/0ca11b5d8826/pone.0314975.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee84/11687910/82ab014ac642/pone.0314975.g003.jpg

Similar articles

1
Predictive performance of count regression models versus machine learning techniques: A comparative analysis using an automobile insurance claims frequency dataset.
PLoS One. 2024 Dec 31;19(12):e0314975. doi: 10.1371/journal.pone.0314975. eCollection 2024.
2
A comparison of statistical methods for modeling count data with an application to hospital length of stay.
BMC Med Res Methodol. 2022 Aug 4;22(1):211. doi: 10.1186/s12874-022-01685-8.
3
Using zero-inflated and hurdle regression models to analyze schistosomiasis data of school children in the southern areas of Ghana.
PLoS One. 2024 Jul 12;19(7):e0304681. doi: 10.1371/journal.pone.0304681. eCollection 2024.
4
On the use of zero-inflated and hurdle models for modeling vaccine adverse event count data.
J Biopharm Stat. 2006;16(4):463-81. doi: 10.1080/10543400600719384.
5
On performance of parametric and distribution-free models for zero-inflated and over-dispersed count responses.
Stat Med. 2015 Oct 30;34(24):3235-45. doi: 10.1002/sim.6560. Epub 2015 Jun 15.
6
Models for analyzing zero-inflated and overdispersed count data: an application to cigarette and marijuana use.
Nicotine Tob Res. 2018 Apr 18;22(8):1390-8. doi: 10.1093/ntr/nty072.
7
Zero inflated statistical count models for analysing the costs imposed by GERD and dyspepsia.
Arab J Gastroenterol. 2013 Dec;14(4):165-8. doi: 10.1016/j.ajg.2013.09.004. Epub 2013 Nov 28.
8
Multilevel modeling in single-case studies with zero-inflated and overdispersed count data.
Behav Res Methods. 2024 Apr;56(4):2765-2781. doi: 10.3758/s13428-024-02359-7. Epub 2024 Feb 21.
9
Statistical modelling of falls count data with excess zeros.
Inj Prev. 2011 Aug;17(4):266-70. doi: 10.1136/ip.2011.031740. Epub 2011 Jun 8.
10
A simulation study of the performance of statistical models for count outcomes with excessive zeros.
Stat Med. 2024 Oct 30;43(24):4752-4767. doi: 10.1002/sim.10198. Epub 2024 Aug 28.