• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自然语言处理的胸部X光片解读的开发与多中心验证

Development and multicenter validation of chest X-ray radiography interpretations based on natural language processing.

作者信息

Zhang Yaping, Liu Mingqian, Hu Shundong, Shen Yao, Lan Jun, Jiang Beibei, de Bock Geertruida H, Vliegenthart Rozemarijn, Chen Xu, Xie Xueqian

机构信息

Radiology Department, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Haining Rd.100, Shanghai, 200080 China.

Radiology Department, Shanghai General Hospital of Nanjing Medical University, Haining Rd.100, Shanghai, 200080 China.

出版信息

Commun Med (Lond). 2021 Oct 28;1:43. doi: 10.1038/s43856-021-00043-x. eCollection 2021.

DOI:10.1038/s43856-021-00043-x
PMID:35602222
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9053275/
Abstract

BACKGROUND

Artificial intelligence can assist in interpreting chest X-ray radiography (CXR) data, but large datasets require efficient image annotation. The purpose of this study is to extract CXR labels from diagnostic reports based on natural language processing, train convolutional neural networks (CNNs), and evaluate the classification performance of CNN using CXR data from multiple centers.

METHODS

We collected the CXR images and corresponding radiology reports of 74,082 subjects as the training dataset. The linguistic entities and relationships from unstructured radiology reports were extracted by the bidirectional encoder representations from transformers (BERT) model, and a knowledge graph was constructed to represent the association between image labels of abnormal signs and the report text of CXR. Then, a 25-label classification system were built to train and test the CNN models with weakly supervised labeling.

RESULTS

In three external test cohorts of 5,996 symptomatic patients, 2,130 screening examinees, and 1,804 community clinic patients, the mean AUC of identifying 25 abnormal signs by CNN reaches 0.866 ± 0.110, 0.891 ± 0.147, and 0.796 ± 0.157, respectively. In symptomatic patients, CNN shows no significant difference with local radiologists in identifying 21 signs (p > 0.05), but is poorer for 4 signs (p < 0.05). In screening examinees, CNN shows no significant difference for 17 signs (p > 0.05), but is poorer at classifying nodules (p = 0.013). In community clinic patients, CNN shows no significant difference for 12 signs (p > 0.05), but performs better for 6 signs (p < 0.001).

CONCLUSION

We construct and validate an effective CXR interpretation system based on natural language processing.

摘要

背景

人工智能可辅助解读胸部X线摄影(CXR)数据,但大型数据集需要高效的图像标注。本研究的目的是基于自然语言处理从诊断报告中提取CXR标签,训练卷积神经网络(CNN),并使用来自多个中心的CXR数据评估CNN的分类性能。

方法

我们收集了74082名受试者的CXR图像及相应的放射学报告作为训练数据集。通过基于变换器的双向编码器表征(BERT)模型从未结构化的放射学报告中提取语言实体和关系,并构建知识图谱以表示异常征象的图像标签与CXR报告文本之间的关联。然后,构建一个25标签分类系统,用弱监督标注来训练和测试CNN模型。

结果

在三个外部测试队列中,分别为5996名有症状患者、2130名筛查受检者和1804名社区诊所患者,CNN识别25种异常征象的平均AUC分别达到0.866±0.110、0.891±0.147和0.796±0.157。在有症状患者中,CNN在识别21种征象方面与当地放射科医生无显著差异(p>0.05),但在4种征象上表现较差(p<0.05)。在筛查受检者中,CNN在17种征象上无显著差异(p>0.05),但在结节分类方面较差(p=0.013)。在社区诊所患者中,CNN在12种征象上无显著差异(p>0.05),但在6种征象上表现更好(p<0.001)。

结论

我们构建并验证了一个基于自然语言处理的有效的CXR解读系统。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9779/9053275/5c276eb2d72d/43856_2021_43_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9779/9053275/f78b6f06a53a/43856_2021_43_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9779/9053275/588b88618a62/43856_2021_43_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9779/9053275/77401604fe0e/43856_2021_43_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9779/9053275/f473260a59e1/43856_2021_43_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9779/9053275/5c276eb2d72d/43856_2021_43_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9779/9053275/f78b6f06a53a/43856_2021_43_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9779/9053275/588b88618a62/43856_2021_43_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9779/9053275/77401604fe0e/43856_2021_43_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9779/9053275/f473260a59e1/43856_2021_43_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9779/9053275/5c276eb2d72d/43856_2021_43_Fig5_HTML.jpg

相似文献

1
Development and multicenter validation of chest X-ray radiography interpretations based on natural language processing.基于自然语言处理的胸部X光片解读的开发与多中心验证
Commun Med (Lond). 2021 Oct 28;1:43. doi: 10.1038/s43856-021-00043-x. eCollection 2021.
2
Comparison of Chest Radiograph Captions Based on Natural Language Processing vs Completed by Radiologists.基于自然语言处理的胸片描述与放射科医生完成的描述的比较。
JAMA Netw Open. 2023 Feb 1;6(2):e2255113. doi: 10.1001/jamanetworkopen.2022.55113.
3
Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT).使用基于转换器的双向编码器表示 (BERT) 和领域内预训练 (IDPT) 对耳鸣患者的可操作放射学报告进行自动文本分类。
BMC Med Inform Decis Mak. 2022 Jul 30;22(1):200. doi: 10.1186/s12911-022-01946-y.
4
Multi-Label Classification in Patient-Doctor Dialogues With the RoBERTa-WWM-ext + CNN (Robustly Optimized Bidirectional Encoder Representations From Transformers Pretraining Approach With Whole Word Masking Extended Combining a Convolutional Neural Network) Model: Named Entity Study.基于RoBERTa-WWM-ext + CNN(带有全词掩码扩展的基于变换器预训练方法的稳健优化双向编码器表示与卷积神经网络相结合)模型的医患对话多标签分类:命名实体研究
JMIR Med Inform. 2022 Apr 21;10(4):e35606. doi: 10.2196/35606.
5
Development and validation of open-source deep neural networks for comprehensive chest x-ray reading: a retrospective, multicentre study.开源深度学习神经网络在全面胸部 X 线阅读中的开发和验证:一项回顾性、多中心研究。
Lancet Digit Health. 2024 Jan;6(1):e44-e57. doi: 10.1016/S2589-7500(23)00218-2. Epub 2023 Dec 8.
6
Development and External Validation of an Artificial Intelligence Model for Identifying Radiology Reports Containing Recommendations for Additional Imaging.开发和外部验证用于识别包含额外成像建议的放射学报告的人工智能模型。
AJR Am J Roentgenol. 2023 Sep;221(3):377-385. doi: 10.2214/AJR.23.29120. Epub 2023 Apr 19.
7
The effect of Gaussian noise on pneumonia detection on chest radiographs, using convolutional neural networks.使用卷积神经网络时高斯噪声对胸部X光片肺炎检测的影响。
Radiography (Lond). 2023 Jan;29(1):38-43. doi: 10.1016/j.radi.2022.09.011. Epub 2022 Oct 20.
8
Optimizing the Synergistic Potential of Pseudo-Labels from Radiology Notes and Annotated Ground Truth in Identifying Pulmonary Opacities on Chest Radiographs for Early Detection of Acute Respiratory Distress Syndrome.优化放射科记录中的伪标签与标注的地面实况之间的协同作用,以识别胸部 X 光片中的肺部混浊,从而实现急性呼吸窘迫综合征的早期检测。
AMIA Annu Symp Proc. 2024 Jan 11;2023:270-279. eCollection 2023.
9
RadioBERT: A deep learning-based system for medical report generation from chest X-ray images using contextual embeddings.RadioBERT:一种基于深度学习的系统,用于使用上下文嵌入从胸部 X 光图像生成医学报告。
J Biomed Inform. 2022 Nov;135:104220. doi: 10.1016/j.jbi.2022.104220. Epub 2022 Oct 10.
10
Multiple-Inputs Convolutional Neural Network for COVID-19 Classification and Critical Region Screening From Chest X-ray Radiographs: Model Development and Performance Evaluation.用于胸部X光片COVID-19分类及关键区域筛查的多输入卷积神经网络:模型开发与性能评估
JMIR Bioinform Biotechnol. 2022 Oct 4;3(1):e36660. doi: 10.2196/36660. eCollection 2022 Jan-Dec.

引用本文的文献

1
NLP-Driven Analysis of Pneumothorax Incidence Following Central Venous Catheter Procedures: A Data-Driven Re-Evaluation of Routine Imaging in Value-Based Medicine.基于自然语言处理的中心静脉导管置入术后气胸发生率分析:基于价值医疗的常规影像学数据驱动再评估
Diagnostics (Basel). 2024 Dec 12;14(24):2792. doi: 10.3390/diagnostics14242792.
2
Automated labelling of radiology reports using natural language processing: Comparison of traditional and newer methods.使用自然语言处理对放射学报告进行自动标注:传统方法与新方法的比较。
Health Care Sci. 2023 Apr 24;2(2):120-128. doi: 10.1002/hcs2.40. eCollection 2023 Apr.
3
Fully automated artificial intelligence-based coronary CT angiography image processing: efficiency, diagnostic capability, and risk stratification.

本文引用的文献

1
Stance detection with BERT embeddings for credibility analysis of information on social media.基于BERT嵌入的立场检测用于社交媒体信息可信度分析
PeerJ Comput Sci. 2021 Apr 14;7:e467. doi: 10.7717/peerj-cs.467. eCollection 2021.
2
Applying Machine Learning to Identify Anti-Vaccination Tweets during the COVID-19 Pandemic.应用机器学习识别 COVID-19 大流行期间的反疫苗推文。
Int J Environ Res Public Health. 2021 Apr 12;18(8):4069. doi: 10.3390/ijerph18084069.
3
Artificial Intelligence-Assisted Prediction of Late-Onset Cardiomyopathy Among Childhood Cancer Survivors.
基于完全自动化人工智能的冠状动脉 CT 血管造影图像处理:效率、诊断能力和风险分层。
Eur Radiol. 2024 Aug;34(8):4909-4919. doi: 10.1007/s00330-023-10494-6. Epub 2024 Jan 9.
4
Development and Evaluation of a Natural Language Processing System for Curating a Trans-Thoracic Echocardiogram (TTE) Database.用于整理经胸超声心动图(TTE)数据库的自然语言处理系统的开发与评估
Bioengineering (Basel). 2023 Nov 10;10(11):1307. doi: 10.3390/bioengineering10111307.
5
Comparison of Chest Radiograph Captions Based on Natural Language Processing vs Completed by Radiologists.基于自然语言处理的胸片描述与放射科医生完成的描述的比较。
JAMA Netw Open. 2023 Feb 1;6(2):e2255113. doi: 10.1001/jamanetworkopen.2022.55113.
6
Development and Validation of a Model to Identify Critical Brain Injuries Using Natural Language Processing of Text Computed Tomography Reports.利用文本计算机断层扫描报告的自然语言处理开发和验证一种识别关键脑损伤的模型。
JAMA Netw Open. 2022 Aug 1;5(8):e2227109. doi: 10.1001/jamanetworkopen.2022.27109.
人工智能辅助预测儿童癌症幸存者的迟发性心肌病。
JCO Clin Cancer Inform. 2021 Apr;5:459-468. doi: 10.1200/CCI.20.00176.
4
Human-recognizable CT image features of subsolid lung nodules associated with diagnosis and classification by convolutional neural networks.与卷积神经网络诊断和分类相关的亚实性肺结节的人类可识别 CT 图像特征。
Eur Radiol. 2021 Oct;31(10):7303-7315. doi: 10.1007/s00330-021-07901-1. Epub 2021 Apr 13.
5
GT-Finder: Classify the family of glucose transporters with pre-trained BERT language models.GT-Finder:使用预训练的 BERT 语言模型对葡萄糖转运蛋白家族进行分类。
Comput Biol Med. 2021 Apr;131:104259. doi: 10.1016/j.compbiomed.2021.104259. Epub 2021 Feb 7.
6
Comparison of Chest Radiograph Interpretations by Artificial Intelligence Algorithm vs Radiology Residents.人工智能算法与放射科住院医师对胸部 X 线片解读的比较。
JAMA Netw Open. 2020 Oct 1;3(10):e2022779. doi: 10.1001/jamanetworkopen.2020.22779.
7
Validation of a Deep Learning Algorithm for the Detection of Malignant Pulmonary Nodules in Chest Radiographs.深度学习算法在胸部 X 光片中检测恶性肺结节的验证。
JAMA Netw Open. 2020 Sep 1;3(9):e2017135. doi: 10.1001/jamanetworkopen.2020.17135.
8
PadChest: A large chest x-ray image dataset with multi-label annotated reports.PadChest:一个大型胸部 X 射线图像数据集,带有多标签注释报告。
Med Image Anal. 2020 Dec;66:101797. doi: 10.1016/j.media.2020.101797. Epub 2020 Aug 20.
9
Constructing knowledge graphs and their biomedical applications.构建知识图谱及其生物医学应用。
Comput Struct Biotechnol J. 2020 Jun 2;18:1414-1428. doi: 10.1016/j.csbj.2020.05.017. eCollection 2020.
10
COVID-19 on Chest Radiographs: A Multireader Evaluation of an Artificial Intelligence System.COVID-19 胸片:人工智能系统的多读者评估。
Radiology. 2020 Sep;296(3):E166-E172. doi: 10.1148/radiol.2020201874. Epub 2020 May 8.