Zhang Yaping, Liu Mingqian, Hu Shundong, Shen Yao, Lan Jun, Jiang Beibei, de Bock Geertruida H, Vliegenthart Rozemarijn, Chen Xu, Xie Xueqian
Radiology Department, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Haining Rd. 100, Shanghai, 200080 China.
Radiology Department, Shanghai General Hospital of Nanjing Medical University, Haining Rd. 100, Shanghai, 200080 China.
Commun Med (Lond). 2021 Oct 28;1:43. doi: 10.1038/s43856-021-00043-x. eCollection 2021.
Artificial intelligence can assist in interpreting chest X-ray radiography (CXR) data, but large datasets require efficient image annotation. The purpose of this study is to extract CXR labels from diagnostic reports based on natural language processing, train convolutional neural networks (CNNs), and evaluate the classification performance of CNN using CXR data from multiple centers.
We collected the CXR images and corresponding radiology reports of 74,082 subjects as the training dataset. Linguistic entities and relationships were extracted from the unstructured radiology reports by the Bidirectional Encoder Representations from Transformers (BERT) model, and a knowledge graph was constructed to represent the association between image labels of abnormal signs and the report text of CXR. Then, a 25-label classification system was built to train and test the CNN models with weakly supervised labeling.
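The abstract does not include implementation details, but the two-stage pipeline it describes (BERT-based label extraction from report text, then weakly supervised multi-label CNN training) can be sketched as follows. This is a minimal illustration assuming a PyTorch/Hugging Face stack; the model choices (bert-base-chinese, DenseNet-121), the threshold, and all hyperparameters are assumptions for illustration, not the authors' configuration.

```python
# Minimal sketch of the pipeline described above (assumed stack:
# PyTorch + Hugging Face transformers + torchvision; not the authors' code).
import torch
import torch.nn as nn
from transformers import AutoTokenizer, AutoModelForSequenceClassification
from torchvision import models

NUM_LABELS = 25  # the 25 abnormal-sign labels

# Stage 1: a BERT text classifier that turns free-text reports into weak
# multi-hot labels ("bert-base-chinese" is an assumed checkpoint).
tokenizer = AutoTokenizer.from_pretrained("bert-base-chinese")
labeler = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-chinese",
    num_labels=NUM_LABELS,
    problem_type="multi_label_classification",
)

def weak_labels(report_text: str, threshold: float = 0.5) -> torch.Tensor:
    """Predict a 25-dim multi-hot label vector from one radiology report."""
    inputs = tokenizer(report_text, truncation=True, max_length=512,
                       return_tensors="pt")
    with torch.no_grad():
        logits = labeler(**inputs).logits
    return (logits.sigmoid() > threshold).float().squeeze(0)

# Stage 2: a multi-label image CNN trained on those weak labels
# (DenseNet-121 is an illustrative choice, not stated in the abstract).
cnn = models.densenet121(weights=None)
cnn.classifier = nn.Linear(cnn.classifier.in_features, NUM_LABELS)
criterion = nn.BCEWithLogitsLoss()  # one sigmoid per abnormal sign
optimizer = torch.optim.Adam(cnn.parameters(), lr=1e-4)

def train_step(images: torch.Tensor, labels: torch.Tensor) -> float:
    """One weakly supervised update: images (N,3,H,W), labels (N,25)."""
    optimizer.zero_grad()
    loss = criterion(cnn(images), labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```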
In three external test cohorts of 5,996 symptomatic patients, 2,130 screening examinees, and 1,804 community clinic patients, the mean AUC of the CNN for identifying the 25 abnormal signs reaches 0.866 ± 0.110, 0.891 ± 0.147, and 0.796 ± 0.157, respectively. In symptomatic patients, the CNN shows no significant difference from local radiologists in identifying 21 signs (p > 0.05) but performs worse for 4 signs (p < 0.05). In screening examinees, the CNN shows no significant difference for 17 signs (p > 0.05) but performs worse at classifying nodules (p = 0.013). In community clinic patients, the CNN shows no significant difference for 12 signs (p > 0.05) and performs better for 6 signs (p < 0.001).
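To make the reported metric concrete, the per-label AUCs and their summary in the form "mean ± SD" quoted above can be computed as in the sketch below, using scikit-learn. The abstract does not state which statistical test produced the p-values for the CNN-versus-radiologist comparisons, so only the AUC summary is shown; the data here are synthetic stand-ins.

```python
# Sketch of per-label AUC evaluation as reported above (scikit-learn;
# the statistical test behind the quoted p-values is not reproduced here).
import numpy as np
from sklearn.metrics import roc_auc_score

def per_label_auc(y_true: np.ndarray, y_score: np.ndarray) -> np.ndarray:
    """AUC for each of the 25 signs; y_true and y_score have shape (N, 25)."""
    return np.array([roc_auc_score(y_true[:, k], y_score[:, k])
                     for k in range(y_true.shape[1])])

# Synthetic data standing in for one external test cohort.
rng = np.random.default_rng(0)
y_true = rng.integers(0, 2, size=(1000, 25))          # reference labels
y_score = np.clip(y_true + rng.normal(0, 0.8, size=(1000, 25)), 0, 1)

aucs = per_label_auc(y_true, y_score)
print(f"mean AUC {aucs.mean():.3f} \u00b1 {aucs.std():.3f}")  # mean ± SD over labels
```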
We construct and validate an effective CXR interpretation system based on natural language processing.