Georgia Institute of Technology, North Ave NW, Atlanta, Georgia, 30332, USA.
Comput Methods Programs Biomed. 2019 Aug;177:141-153. doi: 10.1016/j.cmpb.2019.05.024. Epub 2019 May 25.
Code assignment is of paramount importance in many levels in modern hospitals, from ensuring accurate billing process to creating a valid record of patient care history. However, the coding process is tedious and subjective, and it requires medical coders with extensive training. This study aims to evaluate the performance of deep-learning-based systems to automatically map clinical notes to ICD-9 medical codes.
The evaluations of this research are focused on end-to-end learning methods without manually defined rules. Traditional machine learning algorithms, as well as state-of-the-art deep learning methods such as Recurrent Neural Networks and Convolution Neural Networks, were applied to the Medical Information Mart for Intensive Care (MIMIC-III) dataset. An extensive number of experiments was applied to different settings of the tested algorithm.
Findings showed that the deep learning-based methods outperformed other conventional machine learning methods. From our assessment, the best models could predict the top 10 ICD-9 codes with 0.6957 F and 0.8967 accuracy and could estimate the top 10 ICD-9 categories with 0.7233 F and 0.8588 accuracy. Our implementation also outperformed existing work under certain evaluation metrics.
A set of standard metrics was utilized in assessing the performance of ICD-9 code assignment on MIMIC-III dataset. All the developed evaluation tools and resources are available online, which can be used as a baseline for further research.
在现代医院的许多层面,代码分配都至关重要,从确保准确的计费流程到创建有效的患者护理历史记录。然而,编码过程繁琐且主观,需要经过广泛培训的医疗编码员。本研究旨在评估基于深度学习的系统自动将临床记录映射到 ICD-9 医疗代码的性能。
本研究的评估重点是端到端学习方法,无需手动定义规则。传统的机器学习算法以及最先进的深度学习方法,如递归神经网络和卷积神经网络,应用于医疗信息集市重症监护(MIMIC-III)数据集。对测试算法的不同设置进行了大量实验。
研究结果表明,基于深度学习的方法优于其他传统机器学习方法。通过我们的评估,最好的模型可以以 0.6957 F 和 0.8967 的准确率预测前 10 个 ICD-9 代码,并以 0.7233 F 和 0.8588 的准确率预测前 10 个 ICD-9 类别。我们的实现也在某些评估指标下优于现有工作。
使用一组标准指标评估 MIMIC-III 数据集上的 ICD-9 代码分配性能。所有开发的评估工具和资源都在线提供,可作为进一步研究的基准。