Dertinger Stephen D
Litron Laboratories, Rochester, New York.
Environ Mol Mutagen. 2017 Jul;58(6):390-397. doi: 10.1002/em.22105. Epub 2017 Jun 24.
In many respects the evolution of baseball statistics mirrors advances made in the field of genetic toxicology. From its inception, baseball and statistics have been inextricably linked. Generations of players and fans have used a number of relatively simple measurements to describe team and individual player's current performance, as well as for historical record-keeping purposes. Over the years, baseball analytics has progressed in several important ways. Early advances were based on deriving more meaningful metrics from simpler forerunners. Now, technological innovations are delivering much deeper insights. Videography, radar, and other advances that include automatic player recognition capabilities provide the means to measure more complex and useful factors. Fielders' reaction times, efficiency of the route taken to reach a batted ball, and pitch-framing effectiveness come to mind. With the current availability of complex measurements from multiple data streams, multifactorial analyses occurring via machine learning algorithms have become necessary to make sense of the terabytes of data that are now being captured in every Major League Baseball game. Collectively, these advances have transformed baseball statistics from being largely descriptive in nature to serving data-driven, predictive roles. Whereas genetic toxicology has charted a somewhat parallel course, a case can be made that greater utilization of baseball's mindset and strategies would serve our scientific field well. This paper describes three useful lessons for genetic toxicology, courtesy of the field of baseball analytics: seek objective knowledge; incorporate multiple data streams; and embrace machine learning. Environ. Mol. Mutagen. 58:390-397, 2017. © 2017 Wiley Periodicals, Inc.
在许多方面,棒球数据统计的发展反映了遗传毒理学领域所取得的进步。从一开始,棒球与数据统计就紧密相连。几代球员和球迷都使用了一些相对简单的衡量标准来描述球队和球员个人的当前表现,以及用于历史记录保存目的。多年来,棒球分析在几个重要方面取得了进展。早期的进展是基于从更简单的前身中推导出更有意义的指标。如今,技术创新带来了更深入的见解。摄像技术、雷达以及包括自动球员识别功能在内的其他进展提供了测量更复杂和有用因素的手段。比如外野手的反应时间、接到击出球所采取路线的效率以及投球框架的有效性等。随着目前从多个数据流中获取复杂测量数据的可能性,通过机器学习算法进行多因素分析对于理解现在每场美国职业棒球大联盟比赛中所捕获的数万亿字节的数据变得必不可少。总体而言,这些进展已将棒球数据统计从本质上主要是描述性的转变为发挥数据驱动的预测作用。虽然遗传毒理学也走过了类似的历程,但可以说更多地运用棒球的思维方式和策略将对我们的科学领域大有裨益。本文介绍了遗传毒理学可从棒球分析领域借鉴的三个有益经验:寻求客观知识;纳入多个数据流;以及接受机器学习。《环境与分子突变》58:390 - 397,2017年。© 2017威利期刊公司