Senders Joeky T, Karhade Aditya V, Cote David J, Mehrtash Alireza, Lamba Nayan, DiRisio Aislyn, Muskens Ivo S, Gormley William B, Smith Timothy R, Broekman Marike L D, Arnaout Omar
Brigham and Women's Hospital, Harvard Medical School, Boston, MA.
Haaglanden Medical Center, The Hague, the Netherlands.
JCO Clin Cancer Inform. 2019 Apr;3:1-9. doi: 10.1200/CCI.18.00138.
Although the bulk of patient-generated health data are increasing exponentially, their use is impeded because most data come in unstructured format, namely as free-text clinical reports. A variety of natural language processing (NLP) methods have emerged to automate the processing of free text ranging from statistical to deep learning-based models; however, the optimal approach for medical text analysis remains to be determined. The aim of this study was to provide a head-to-head comparison of novel NLP techniques and inform future studies about their utility for automated medical text analysis.
Magnetic resonance imaging reports of patients with brain metastases treated in two tertiary centers were retrieved and manually annotated using a binary classification (single metastasis two or more metastases). Multiple bag-of-words and sequence-based NLP models were developed and compared after randomly splitting the annotated reports into training and test sets in an 80:20 ratio.
A total of 1,479 radiology reports of patients diagnosed with brain metastases were retrieved. The least absolute shrinkage and selection operator (LASSO) regression model demonstrated the best overall performance on the hold-out test set with an area under the receiver operating characteristic curve of 0.92 (95% CI, 0.89 to 0.94), accuracy of 83% (95% CI, 80% to 87%), calibration intercept of -0.06 (95% CI, -0.14 to 0.01), and calibration slope of 1.06 (95% CI, 0.95 to 1.17).
Among various NLP techniques, the bag-of-words approach combined with a LASSO regression model demonstrated the best overall performance in extracting binary outcomes from free-text clinical reports. This study provides a framework for the development of machine learning-based NLP models as well as a clinical vignette of patients diagnosed with brain metastases.
尽管患者生成的健康数据量正在呈指数级增长,但其应用却受到阻碍,因为大多数数据都是非结构化格式,即自由文本临床报告。各种自然语言处理(NLP)方法已经出现,用于自动化处理自由文本,从基于统计的模型到基于深度学习的模型;然而,医学文本分析的最佳方法仍有待确定。本研究的目的是对新型NLP技术进行直接比较,并为未来关于其在自动化医学文本分析中的效用的研究提供信息。
检索了在两个三级中心接受治疗的脑转移瘤患者的磁共振成像报告,并使用二元分类(单个转移瘤 两个或更多转移瘤)进行人工注释。在将注释后的报告以80:20的比例随机分为训练集和测试集后,开发并比较了多个词袋模型和基于序列的NLP模型。
共检索到1479份诊断为脑转移瘤患者的放射学报告。最小绝对收缩和选择算子(LASSO)回归模型在保留测试集上表现出最佳的整体性能,受试者操作特征曲线下面积为0.92(95%CI,0.89至0.94),准确率为83%(95%CI,80%至87%),校准截距为-0.06(95%CI,-0.14至0.01),校准斜率为1.06(95%CI,0.95至1.17)。
在各种NLP技术中,词袋方法与LASSO回归模型相结合在从自由文本临床报告中提取二元结果方面表现出最佳的整体性能。本研究为基于机器学习的NLP模型的开发提供了一个框架,以及诊断为脑转移瘤患者的临床案例。