Vallmuur Kirsten, Marucci-Wellman Helen R, Taylor Jennifer A, Lehto Mark, Corns Helen L, Smith Gordon S
Queensland University of Technology, Centre for Accident Research and Road Safety-Queensland, Brisbane, Queensland, Australia.
Center for Injury Epidemiology, Liberty Mutual Research Institute for Safety, Hopkinton, Massachusetts, USA.
Inj Prev. 2016 Apr;22 Suppl 1(Suppl 1):i34-42. doi: 10.1136/injuryprev-2015-041813. Epub 2016 Jan 4.
Vast amounts of injury narratives are collected daily and are available electronically in real time and have great potential for use in injury surveillance and evaluation. Machine learning algorithms have been developed to assist in identifying cases and classifying mechanisms leading to injury in a much timelier manner than is possible when relying on manual coding of narratives. The aim of this paper is to describe the background, growth, value, challenges and future directions of machine learning as applied to injury surveillance.
This paper reviews key aspects of machine learning using injury narratives, providing a case study to demonstrate an application to an established human-machine learning approach.
The range of applications and utility of narrative text has increased greatly with advancements in computing techniques over time. Practical and feasible methods exist for semiautomatic classification of injury narratives which are accurate, efficient and meaningful. The human-machine learning approach described in the case study achieved high sensitivity and PPV and reduced the need for human coding to less than a third of cases in one large occupational injury database.
The last 20 years have seen a dramatic change in the potential for technological advancements in injury surveillance. Machine learning of 'big injury narrative data' opens up many possibilities for expanded sources of data which can provide more comprehensive, ongoing and timely surveillance to inform future injury prevention policy and practice.
每天都会收集大量的伤害描述,并且这些描述能够实时以电子方式获取,在伤害监测和评估中具有巨大的应用潜力。已经开发出机器学习算法,以比依赖人工对描述进行编码更及时的方式协助识别病例并对导致伤害的机制进行分类。本文旨在描述应用于伤害监测的机器学习的背景、发展、价值、挑战和未来方向。
本文回顾了使用伤害描述进行机器学习的关键方面,并提供了一个案例研究,以展示一种已确立的人机学习方法的应用。
随着时间的推移,随着计算技术的进步,叙述性文本的应用范围和实用性大大增加。存在用于伤害描述半自动分类的实用可行方法,这些方法准确、高效且有意义。案例研究中描述的人机学习方法实现了高灵敏度和阳性预测值,并将一个大型职业伤害数据库中人工编码的需求减少到不到三分之一的病例。
在过去20年中,伤害监测技术进步的潜力发生了巨大变化。对“大量伤害描述数据”进行机器学习为扩展数据源开辟了许多可能性,这些数据源可以提供更全面、持续和及时的监测,为未来的伤害预防政策和实践提供信息。