Department of Neurology, Yale University, New Haven, Connecticut.
Department of Neurology, University of Wisconsin, Madison.
JAMA Netw Open. 2022 Aug 1;5(8):e2227109. doi: 10.1001/jamanetworkopen.2022.27109.
Clinical text reports from head computed tomography (CT) represent rich, incompletely utilized information regarding acute brain injuries and neurologic outcomes. CT reports are unstructured; thus, extracting information at scale requires automated natural language processing (NLP). However, designing new NLP algorithms for each individual injury category is an unwieldy proposition. An NLP tool that summarizes all injuries in head CT reports would facilitate exploration of large data sets for clinical significance of neuroradiological findings.
To automatically extract acute brain pathological data and their features from head CT reports.
DESIGN, SETTING, AND PARTICIPANTS: This diagnostic study developed a 2-part named entity recognition (NER) NLP model to extract and summarize data on acute brain injuries from head CT reports. The model, termed BrainNERD, extracts and summarizes detailed brain injury information for research applications. Model development included building and comparing 2 NER models using a custom dictionary of terms, including lesion type, location, size, and age, then designing a rule-based decoder using NER outputs to evaluate for the presence or absence of injury subtypes. BrainNERD was evaluated against independent test data sets of manually classified reports, including 2 external validation sets. The model was trained on head CT reports from 1152 patients generated by neuroradiologists at the Yale Acute Brain Injury Biorepository. External validation was conducted using reports from 2 outside institutions. Analyses were conducted from May 2020 to December 2021.
Performance of the BrainNERD model was evaluated using precision, recall, and F1 scores based on manually labeled independent test data sets.
A total of 1152 patients (mean [SD] age, 67.6 [16.1] years; 586 [52%] men), were included in the training set. NER training using transformer architecture and bidirectional encoder representations from transformers was significantly faster than spaCy. For all metrics, the 10-fold cross-validation performance was 93% to 99%. The final test performance metrics for the NER test data set were 98.82% (95% CI, 98.37%-98.93%) for precision, 98.81% (95% CI, 98.46%-99.06%) for recall, and 98.81% (95% CI, 98.40%-98.94%) for the F score. The expert review comparison metrics were 99.06% (95% CI, 97.89%-99.13%) for precision, 98.10% (95% CI, 97.93%-98.77%) for recall, and 98.57% (95% CI, 97.78%-99.10%) for the F score. The decoder test set metrics were 96.06% (95% CI, 95.01%-97.16%) for precision, 96.42% (95% CI, 94.50%-97.87%) for recall, and 96.18% (95% CI, 95.151%-97.16%) for the F score. Performance in external institution report validation including 1053 head CR reports was greater than 96%.
These findings suggest that the BrainNERD model accurately extracted acute brain injury terms and their properties from head CT text reports. This freely available new tool could advance clinical research by integrating information in easily gathered head CT reports to expand knowledge of acute brain injury radiographic phenotypes.
头部计算机断层扫描 (CT) 的临床文本报告代表了有关急性脑损伤和神经结局的丰富但未充分利用的信息。CT 报告是非结构化的;因此,需要自动化自然语言处理 (NLP) 才能大规模提取信息。然而,为每个单独的损伤类别设计新的 NLP 算法是一项繁琐的任务。一种能够总结头部 CT 报告中所有损伤的 NLP 工具将有助于探索大型数据集,以了解神经放射学发现的临床意义。
自动从头部 CT 报告中提取急性脑病理数据及其特征。
设计、设置和参与者:这项诊断研究开发了一个两部分的命名实体识别 (NER) NLP 模型,用于从头部 CT 报告中提取和总结急性脑损伤数据。该模型称为 BrainNERD,用于研究应用程序提取和总结详细的脑损伤信息。模型开发包括使用包括病变类型、位置、大小和年龄在内的术语自定义词典构建和比较两个 NER 模型,然后使用 NER 输出设计基于规则的解码器,以评估是否存在或不存在损伤亚型。BrainNERD 针对独立的手动分类报告测试数据集进行了评估,包括两个外部验证集。该模型是在耶鲁急性脑损伤生物库的神经放射科医生生成的 1152 名患者的头部 CT 报告上进行训练的。外部验证是使用来自两个外部机构的报告进行的。分析于 2020 年 5 月至 2021 年 12 月进行。
使用基于手动标记的独立测试数据集的精度、召回率和 F1 分数评估 BrainNERD 模型的性能。
共纳入 1152 名患者(平均[标准差]年龄,67.6[16.1]岁;586[52%]名男性),纳入训练集。使用变压器架构和来自变压器的双向编码器表示进行的 NER 训练比 spaCy 快得多。对于所有指标,10 倍交叉验证的性能为 93% 到 99%。NER 测试数据集的最终测试性能指标为 98.82%(95%CI,98.37%-98.93%)的精度,98.81%(95%CI,98.46%-99.06%)的召回率和 98.81%(95%CI,98.40%-98.94%)的 F 分数。专家审查比较指标为 99.06%(95%CI,97.89%-99.13%)的精度,98.10%(95%CI,97.93%-98.77%)的召回率和 98.57%(95%CI,97.78%-99.10%)的 F 分数。解码器测试集指标为 96.06%(95%CI,95.01%-97.16%)的精度,96.42%(95%CI,94.50%-97.87%)的召回率和 96.18%(95%CI,95.151%-97.16%)的 F 分数。包括 1053 份头部 CR 报告的外部机构报告验证的性能大于 96%。
这些发现表明,BrainNERD 模型能够准确地从头部 CT 文本报告中提取急性脑损伤术语及其属性。这个免费的新工具可以通过整合易于收集的头部 CT 报告中的信息来推进临床研究,从而扩展对急性脑损伤放射学表型的认识。