Alleck Thaksheel, Giovannelli Tommaso, Vicente Luis Nunes, Mitchell Roman, Remen Ori
Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, PA 18015-1582, USA.
Data Brief. 2024 Jun 10;55:110625. doi: 10.1016/j.dib.2024.110625. eCollection 2024 Aug.
In this data article, we present a dataset containing match scores from major international competitions for 12 popular team ball sports: basketball, cricket, field hockey, futsal, handball, ice hockey, lacrosse, roller hockey, rugby, soccer, volleyball, and water polo. The dataset was obtained by web scraping data available on Wikipedia pages and includes the following information related to individual matches: the year of the competition edition when a match occurred, the names of the two opposing teams, their respective scores, and the name of the winning team. Our match score dataset provides researchers in the field of sports analytics with valuable data that can be used to compute team statistics, develop team ranking and rating systems, infer patterns and trends in a team's performance across the edition years, build predictive models to forecast the outcome of future matches, and evaluate the performance of machine learning algorithms.
在本数据文章中,我们展示了一个数据集,其中包含12种热门团体球类运动在重大国际比赛中的比赛比分:篮球、板球、曲棍球、五人制足球、手球、冰球、长曲棍球、轮滑曲棍球、橄榄球、足球、排球和水球。该数据集是通过网络抓取维基百科页面上的可用数据获得的,包含与各场比赛相关的以下信息:比赛举行的竞赛年份、两支参赛队伍的名称、各自的比分以及获胜队伍的名称。我们的比赛比分数据集为体育分析领域的研究人员提供了有价值的数据,可用于计算球队统计数据、开发球队排名和评级系统、推断各年份球队表现的模式和趋势、建立预测模型以预测未来比赛结果,以及评估机器学习算法的性能。