• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自然语言处理研究报告的质量:对放射学报告研究的系统评价。

The reporting quality of natural language processing studies: systematic review of studies of radiology reports.

机构信息

Centre for Clinical Brain Sciences, University of Edinburgh, Chancellor's Building, Little France, Edinburgh, EH16 4TJ, Scotland, UK.

Centre for Medical Informatics, Usher Institute, University of Edinburgh, Edinburgh, Scotland, UK.

出版信息

BMC Med Imaging. 2021 Oct 2;21(1):142. doi: 10.1186/s12880-021-00671-8.

DOI:10.1186/s12880-021-00671-8
PMID:34600486
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8487512/
Abstract

BACKGROUND

Automated language analysis of radiology reports using natural language processing (NLP) can provide valuable information on patients' health and disease. With its rapid development, NLP studies should have transparent methodology to allow comparison of approaches and reproducibility. This systematic review aims to summarise the characteristics and reporting quality of studies applying NLP to radiology reports.

METHODS

We searched Google Scholar for studies published in English that applied NLP to radiology reports of any imaging modality between January 2015 and October 2019. At least two reviewers independently performed screening and completed data extraction. We specified 15 criteria relating to data source, datasets, ground truth, outcomes, and reproducibility for quality assessment. The primary NLP performance measures were precision, recall and F1 score.

RESULTS

Of the 4,836 records retrieved, we included 164 studies that used NLP on radiology reports. The commonest clinical applications of NLP were disease information or classification (28%) and diagnostic surveillance (27.4%). Most studies used English radiology reports (86%). Reports from mixed imaging modalities were used in 28% of the studies. Oncology (24%) was the most frequent disease area. Most studies had dataset size > 200 (85.4%) but the proportion of studies that described their annotated, training, validation, and test set were 67.1%, 63.4%, 45.7%, and 67.7% respectively. About half of the studies reported precision (48.8%) and recall (53.7%). Few studies reported external validation performed (10.8%), data availability (8.5%) and code availability (9.1%). There was no pattern of performance associated with the overall reporting quality.

CONCLUSIONS

There is a range of potential clinical applications for NLP of radiology reports in health services and research. However, we found suboptimal reporting quality that precludes comparison, reproducibility, and replication. Our results support the need for development of reporting standards specific to clinical NLP studies.

摘要

背景

使用自然语言处理(NLP)对放射学报告进行自动化语言分析可以提供有关患者健康和疾病的有价值信息。随着其快速发展,NLP 研究应该具有透明的方法,以允许比较方法和可重复性。本系统评价旨在总结应用 NLP 分析放射学报告的研究的特点和报告质量。

方法

我们在 Google Scholar 上搜索了 2015 年 1 月至 2019 年 10 月期间发表的应用 NLP 分析任何成像方式放射学报告的英文研究。至少有两名审查员独立进行筛选并完成数据提取。我们为质量评估指定了 15 项与数据源、数据集、基准、结果和可重复性相关的标准。主要的 NLP 性能指标是精度、召回率和 F1 分数。

结果

在检索到的 4836 条记录中,我们纳入了 164 项使用 NLP 分析放射学报告的研究。NLP 最常见的临床应用是疾病信息或分类(28%)和诊断监测(27.4%)。大多数研究使用英语放射学报告(86%)。混合成像方式的报告在 28%的研究中使用。肿瘤学(24%)是最常见的疾病领域。大多数研究的数据集大小大于 200(85.4%),但描述其注释、训练、验证和测试集的研究比例分别为 67.1%、63.4%、45.7%和 67.7%。约一半的研究报告了精度(48.8%)和召回率(53.7%)。很少有研究报告进行了外部验证(10.8%)、数据可用性(8.5%)和代码可用性(9.1%)。性能与整体报告质量之间没有关联模式。

结论

NLP 分析放射学报告在医疗保健和研究中具有多种潜在的临床应用。然而,我们发现报告质量不理想,无法进行比较、可重复性和复制。我们的结果支持针对临床 NLP 研究制定特定报告标准的必要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce8/8487512/877634a7eb8e/12880_2021_671_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce8/8487512/35fd9d85b016/12880_2021_671_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce8/8487512/7e02aaae29f6/12880_2021_671_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce8/8487512/7627b836070f/12880_2021_671_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce8/8487512/877634a7eb8e/12880_2021_671_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce8/8487512/35fd9d85b016/12880_2021_671_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce8/8487512/7e02aaae29f6/12880_2021_671_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce8/8487512/7627b836070f/12880_2021_671_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce8/8487512/877634a7eb8e/12880_2021_671_Fig4_HTML.jpg

相似文献

1
The reporting quality of natural language processing studies: systematic review of studies of radiology reports.自然语言处理研究报告的质量:对放射学报告研究的系统评价。
BMC Med Imaging. 2021 Oct 2;21(1):142. doi: 10.1186/s12880-021-00671-8.
2
Identification of Long Bone Fractures in Radiology Reports Using Natural Language Processing to support Healthcare Quality Improvement.利用自然语言处理技术识别放射学报告中的长骨骨折以支持医疗质量改进
Appl Clin Inform. 2016 Nov 9;7(4):1051-1068. doi: 10.4338/ACI-2016-08-RA-0129.
3
A systematic review of natural language processing applied to radiology reports.自然语言处理在放射学报告中的应用的系统评价。
BMC Med Inform Decis Mak. 2021 Jun 3;21(1):179. doi: 10.1186/s12911-021-01533-7.
4
Using Natural Language Processing of Free-Text Radiology Reports to Identify Type 1 Modic Endplate Changes.利用自然语言处理技术对放射科报告中的自由文本进行分析,以识别 1 型 Modic 终板改变。
J Digit Imaging. 2018 Feb;31(1):84-90. doi: 10.1007/s10278-017-0013-3.
5
Applications of natural language processing in radiology: A systematic review.自然语言处理在放射学中的应用:一项系统综述。
Int J Med Inform. 2022 Jul;163:104779. doi: 10.1016/j.ijmedinf.2022.104779. Epub 2022 Apr 26.
6
Automated anonymization of radiology reports: comparison of publicly available natural language processing and large language models.放射学报告的自动匿名化:公开可用的自然语言处理与大语言模型的比较
Eur Radiol. 2025 May;35(5):2634-2641. doi: 10.1007/s00330-024-11148-x. Epub 2024 Oct 31.
7
Natural Language Processing in Radiology: A Systematic Review.自然语言处理在放射学中的应用:系统评价。
Radiology. 2016 May;279(2):329-43. doi: 10.1148/radiol.16142770.
8
Automated Radiology-Arthroscopy Correlation of Knee Meniscal Tears Using Natural Language Processing Algorithms.使用自然语言处理算法实现膝关节半月板撕裂的放射-关节镜自动关联。
Acad Radiol. 2022 Apr;29(4):479-487. doi: 10.1016/j.acra.2021.01.017. Epub 2021 Feb 11.
9
Natural Language Processing in Radiology: Update on Clinical Applications.自然语言处理在放射学中的应用:临床应用的更新。
J Am Coll Radiol. 2022 Nov;19(11):1271-1285. doi: 10.1016/j.jacr.2022.06.016. Epub 2022 Aug 25.
10
Deep Learning for Natural Language Processing in Radiology-Fundamentals and a Systematic Review.放射学中自然语言处理的深度学习——基础与系统综述
J Am Coll Radiol. 2020 May;17(5):639-648. doi: 10.1016/j.jacr.2019.12.026. Epub 2020 Jan 28.

引用本文的文献

1
A systematic review of natural language processing applications in Trauma & Orthopaedics.创伤与矫形外科学中自然语言处理应用的系统评价。
Bone Jt Open. 2025 Mar 5;6(3):264-274. doi: 10.1302/2633-1462.63.BJO-2024-0081.R1.
2
Revolutionizing Radiology with Natural Language Processing and Chatbot Technologies: A Narrative Umbrella Review on Current Trends and Future Directions.利用自然语言处理和聊天机器人技术革新放射学:当前趋势与未来方向的叙述性综述
J Clin Med. 2024 Dec 2;13(23):7337. doi: 10.3390/jcm13237337.
3
Comparison of an Ensemble of Machine Learning Models and the BERT Language Model for Analysis of Text Descriptions of Brain CT Reports to Determine the Presence of Intracranial Hemorrhage.

本文引用的文献

1
A systematic review of natural language processing applied to radiology reports.自然语言处理在放射学报告中的应用的系统评价。
BMC Med Inform Decis Mak. 2021 Jun 3;21(1):179. doi: 10.1186/s12911-021-01533-7.
2
Guidelines for clinical trial protocols for interventions involving artificial intelligence: the SPIRIT-AI extension.涉及人工智能干预的临床试验方案指南:SPIRIT-AI 扩展。
Lancet Digit Health. 2020 Oct;2(10):e549-e560. doi: 10.1016/S2589-7500(20)30219-3. Epub 2020 Sep 9.
3
Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI extension.
基于机器学习模型集成与 BERT 语言模型的脑 CT 报告文本描述分析用于判断颅内出血的比较研究
Sovrem Tekhnologii Med. 2024;16(1):27-34. doi: 10.17691/stm2024.16.1.03. Epub 2024 Feb 28.
4
A scoping review of large language model based approaches for information extraction from radiology reports.基于大语言模型从放射学报告中提取信息的方法的范围综述。
NPJ Digit Med. 2024 Aug 24;7(1):222. doi: 10.1038/s41746-024-01219-0.
5
Revolutionizing Pulmonary Diagnostics: A Narrative Review of Artificial Intelligence Applications in Lung Imaging.变革肺部诊断:人工智能在肺部成像中的应用叙事性综述
Cureus. 2024 Apr 5;16(4):e57657. doi: 10.7759/cureus.57657. eCollection 2024 Apr.
6
Using natural language processing to analyze unstructured patient-reported outcomes data derived from electronic health records for cancer populations: a systematic review.利用自然语言处理分析从癌症患者电子健康记录中获取的非结构化患者报告结局数据:一项系统综述。
Expert Rev Pharmacoecon Outcomes Res. 2024 Apr;24(4):467-475. doi: 10.1080/14737167.2024.2322664. Epub 2024 Mar 5.
7
Text Analysis of Radiology Reports with Signs of Intracranial Hemorrhage on Brain CT Scans Using the Decision Tree Algorithm.基于决策树算法的脑 CT 扫描有颅内出血征象的放射学报告的文本分析。
Sovrem Tekhnologii Med. 2022;14(6):34-40. doi: 10.17691/stm2022.14.6.04. Epub 2022 Nov 28.
8
Transformer versus traditional natural language processing: how much data is enough for automated radiology report classification?Transformer 与传统自然语言处理:自动化放射科报告分类需要多少数据?
Br J Radiol. 2023 Sep;96(1149):20220769. doi: 10.1259/bjr.20220769. Epub 2023 May 25.
9
A scoping review of natural language processing of radiology reports in breast cancer.乳腺癌放射学报告自然语言处理的范围综述
Front Oncol. 2023 Apr 12;13:1160167. doi: 10.3389/fonc.2023.1160167. eCollection 2023.
10
Structured reporting of computed tomography in the polytrauma patient assessment: a Delphi consensus proposal.计算机断层扫描在多发伤患者评估中的结构化报告:德尔菲共识建议。
Radiol Med. 2023 Feb;128(2):222-233. doi: 10.1007/s11547-023-01596-8. Epub 2023 Jan 19.
涉及人工智能干预的临床试验报告的报告规范:CONSORT-AI 扩展。
Lancet Digit Health. 2020 Oct;2(10):e537-e548. doi: 10.1016/S2589-7500(20)30218-1. Epub 2020 Sep 9.
4
Reply to: Transparency and reproducibility in artificial intelligence.回复:人工智能中的透明度与可重复性
Nature. 2020 Oct;586(7829):E17-E18. doi: 10.1038/s41586-020-2767-x.
5
Transparency and reproducibility in artificial intelligence.人工智能中的透明度和可重复性。
Nature. 2020 Oct;586(7829):E14-E16. doi: 10.1038/s41586-020-2766-y. Epub 2020 Oct 14.
6
Artificial intelligence versus clinicians: systematic review of design, reporting standards, and claims of deep learning studies.人工智能与临床医生:深度学习研究的设计、报告标准和主张的系统评价。
BMJ. 2020 Mar 25;368:m689. doi: 10.1136/bmj.m689.
7
Reporting quality of studies using machine learning models for medical diagnosis: a systematic review.使用机器学习模型进行医学诊断的研究报告质量:系统评价。
BMJ Open. 2020 Mar 23;10(3):e034568. doi: 10.1136/bmjopen-2019-034568.
8
Machine learning and artificial intelligence research for patient benefit: 20 critical questions on transparency, replicability, ethics, and effectiveness.机器学习和人工智能研究如何造福患者:透明度、可重复性、伦理和有效性方面的 20 个关键问题。
BMJ. 2020 Mar 20;368:l6927. doi: 10.1136/bmj.l6927.
9
Can we trust AI not to further embed racial bias and prejudice?我们能相信人工智能不会进一步强化种族偏见吗?
BMJ. 2020 Feb 12;368:m363. doi: 10.1136/bmj.m363.
10
Assessing Radiology Research on Artificial Intelligence: A Brief Guide for Authors, Reviewers, and Readers-From the Editorial Board.评估人工智能放射学研究:给作者、审稿人和读者的简要指南——来自编辑委员会
Radiology. 2020 Mar;294(3):487-489. doi: 10.1148/radiol.2019192515. Epub 2019 Dec 31.