López-Úbeda Pilar, Díaz-Galiano Manuel Carlos, Martín-Noguerol Teodoro, Luna Antonio, Ureña-López L Alfonso, Martín-Valdivia M Teresa
SINAI Group - CEATIC - Universidad de Jaén, Campus Las Lagunillas s/n, Jaén E-23071, Spain.
MRI Unit, Radiology Department, HT médica Carmelo Torres 2, Jaén 23007, Spain.
Comput Methods Programs Biomed. 2021 Mar;200:105939. doi: 10.1016/j.cmpb.2021.105939. Epub 2021 Jan 16.
Assignment of medical imaging procedure protocols requires extensive knowledge about patient's data, usually included in radiological request forms and radiological reports. Assignment of protocol is required prior to radiological study acquisition, determining procedure for each patient. The automation of this protocol assignment process could improve the efficiency of patient's diagnosis. Artificial intelligence has proven to be of great help in these healthcare-related problems, and specifically the application of Natural Language Processing (NLP) techniques for extracting information from text reports has been successfully used in automatic text classification tasks.
In this paper, machine learning classification models based on NLP have been developed using patient's data present in radiological reports and radiological imaging protocols. We have used a real corpus provided by the private medical center "HT medica" composed of almost 700,000 Computed Tomography (CT) and Magnetic Resonance Imaging (MRI) examinations obtained during routine clinical use. We have compared several models including traditional machine learning methods such as support vector machine and random forest, neural networks and transfer language techniques.
The results obtained are encouraging taking into account that the system is performing a complex text multiclass classification task. Specifically, for the best proposed system we obtain 92.2% accuracy in the CT dataset and 86.9% in the MRI dataset.
The best machine learning system is potentially efficient, quality and cost effective. For this reason it is currently used in real scenarios by radiologists as decision support tool for assigning protocols of CT and MRI studies.
医学成像检查方案的分配需要对患者数据有广泛了解,这些数据通常包含在放射检查申请单和放射学报告中。在进行放射学检查之前需要分配检查方案,以确定每位患者的检查流程。该方案分配过程的自动化可以提高患者诊断的效率。人工智能已被证明在这些与医疗保健相关的问题中非常有帮助,特别是自然语言处理(NLP)技术在从文本报告中提取信息方面的应用已成功用于自动文本分类任务。
在本文中,基于NLP开发了机器学习分类模型,使用放射学报告和放射成像检查方案中的患者数据。我们使用了私立医疗中心“HT medica”提供的真实语料库,该语料库由在常规临床使用期间获得的近700,000例计算机断层扫描(CT)和磁共振成像(MRI)检查组成。我们比较了几种模型,包括支持向量机和随机森林等传统机器学习方法、神经网络和迁移语言技术。
考虑到该系统正在执行复杂的文本多类分类任务,所获得的结果令人鼓舞。具体而言,对于最佳提出的系统,我们在CT数据集中获得了92.2%的准确率,在MRI数据集中获得了86.9%的准确率。
最佳机器学习系统具有潜在的高效性、高质量和成本效益。因此,目前放射科医生在实际场景中使用它作为分配CT和MRI检查方案的决策支持工具。