• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度学习的自然语言处理的非英语头部 CT 报告分类的比较。

Comparison of deep learning models for natural language processing-based classification of non-English head CT reports.

机构信息

Division of Diagnostic Imaging, Sheba Medical Center, Sackler Faculty of Medicine, Tel Aviv University, Derech Sheba St 2, Ramat Gan, Israel.

DeepVision Lab, Sheba Medical Center, Ramat Gan, Israel.

出版信息

Neuroradiology. 2020 Oct;62(10):1247-1256. doi: 10.1007/s00234-020-02420-0. Epub 2020 Apr 25.

DOI:10.1007/s00234-020-02420-0
PMID:32335686
Abstract

PURPOSE

Natural language processing (NLP) can be used for automatic flagging of radiology reports. We assessed deep learning models for classifying non-English head CT reports.

METHODS

We retrospectively collected head CT reports (2011-2018). Reports were signed in Hebrew. Emergency department (ED) reports of adult patients from January to February for each year (2013-2018) were manually labeled. All other reports were used to pre-train an embedding layer. We explored two use cases: (1) general labeling use case, in which reports were labeled as normal vs. pathological; (2) specific labeling use case, in which reports were labeled as with and without intra-cranial hemorrhage. We tested long short-term memory (LSTM) and LSTM-attention (LSTM-ATN) networks for classifying reports. We also evaluated the improvement of adding Word2Vec word embedding. Deep learning models were compared with a bag-of-words (BOW) model.

RESULTS

We retrieved 176,988 head CT reports for pre-training. We manually labeled 7784 reports as normal (46.3%) or pathological (53.7%), and 7.1% with intra-cranial hemorrhage. For the general labeling, LSTM-ATN-Word2Vec showed the best results (AUC = 0.967 ± 0.006, accuracy 90.8% ± 0.01). For the specific labeling, all methods showed similar accuracies between 95.0 and 95.9%. Both LSTM-ATN-Word2Vec and BOW had the highest AUC (0.970).

CONCLUSION

For a general use case, word embedding using a large cohort of non-English head CT reports and ATN improves NLP performance. For a more specific task, BOW and deep learning showed similar results. Models should be explored and tailored to the NLP task.

摘要

目的

自然语言处理(NLP)可用于自动标记放射学报告。我们评估了深度学习模型在分类非英语头部 CT 报告中的性能。

方法

我们回顾性地收集了头部 CT 报告(2011-2018 年)。报告以希伯来语签署。每年 1 月至 2 月(2013-2018 年)从急诊科(ED)收集成年患者的 CT 报告,并进行手动标记。所有其他报告均用于预训练嵌入层。我们探索了两种用例:(1)通用标记用例,其中报告被标记为正常与异常;(2)特定标记用例,其中报告被标记为有或无颅内出血。我们测试了长短期记忆(LSTM)和 LSTM-注意力(LSTM-ATN)网络来分类报告。我们还评估了添加 Word2Vec 词嵌入的改进效果。深度学习模型与词袋(BOW)模型进行了比较。

结果

我们为预训练检索了 176988 个头部 CT 报告。我们手动标记了 7784 个报告为正常(46.3%)或异常(53.7%),7.1%有颅内出血。对于通用标记,LSTM-ATN-Word2Vec 表现最佳(AUC=0.967±0.006,准确率 90.8%±0.01)。对于特定标记,所有方法的准确率在 95.0%至 95.9%之间相似。LSTM-ATN-Word2Vec 和 BOW 均具有最高 AUC(0.970)。

结论

对于通用用例,使用大量非英语头部 CT 报告和 ATN 的词嵌入可以提高 NLP 性能。对于更具体的任务,BOW 和深度学习的表现相似。应根据 NLP 任务探索和定制模型。

相似文献

1
Comparison of deep learning models for natural language processing-based classification of non-English head CT reports.基于深度学习的自然语言处理的非英语头部 CT 报告分类的比较。
Neuroradiology. 2020 Oct;62(10):1247-1256. doi: 10.1007/s00234-020-02420-0. Epub 2020 Apr 25.
2
Natural Language-based Machine Learning Models for the Annotation of Clinical Radiology Reports.基于自然语言的机器学习模型在临床放射学报告标注中的应用。
Radiology. 2018 May;287(2):570-580. doi: 10.1148/radiol.2018171093. Epub 2018 Jan 30.
3
Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT).使用基于转换器的双向编码器表示 (BERT) 和领域内预训练 (IDPT) 对耳鸣患者的可操作放射学报告进行自动文本分类。
BMC Med Inform Decis Mak. 2022 Jul 30;22(1):200. doi: 10.1186/s12911-022-01946-y.
4
Natural language processing of head CT reports to identify intracranial mass effect: CTIME algorithm.通过头部CT报告的自然语言处理识别颅内占位效应:CTIME算法
Am J Emerg Med. 2022 Jan;51:388-392. doi: 10.1016/j.ajem.2021.11.001. Epub 2021 Nov 9.
5
Deep Learning-Based Brain Hemorrhage Detection in CT Reports.基于深度学习的 CT 报告中脑出血检测。
Stud Health Technol Inform. 2022 May 25;294:866-867. doi: 10.3233/SHTI220609.
6
Comparison of natural language processing algorithms in assessing the importance of head computed tomography reports written in Japanese.比较自然语言处理算法在评估日文版头部计算机断层扫描报告重要性方面的表现。
Jpn J Radiol. 2024 Jul;42(7):697-708. doi: 10.1007/s11604-024-01549-9. Epub 2024 Mar 29.
7
Automated outcome classification of emergency department computed tomography imaging reports.急诊 CT 影像报告的自动化结果分类。
Acad Emerg Med. 2013 Aug;20(8):848-54. doi: 10.1111/acem.12174.
8
Deep Learning for Natural Language Processing in Radiology-Fundamentals and a Systematic Review.放射学中自然语言处理的深度学习——基础与系统综述
J Am Coll Radiol. 2020 May;17(5):639-648. doi: 10.1016/j.jacr.2019.12.026. Epub 2020 Jan 28.
9
Natural language processing deep learning models for the differential between high-grade gliomas and metastasis: what if the key is how we report them?自然语言处理深度学习模型在高级别胶质瘤和转移瘤鉴别中的应用:如果关键在于我们如何报告这些结果呢?
Eur Radiol. 2024 Mar;34(3):2113-2120. doi: 10.1007/s00330-023-10202-4. Epub 2023 Sep 4.
10
Towards automated generation of curated datasets in radiology: Application of natural language processing to unstructured reports exemplified on CT for pulmonary embolism.面向放射学中经过策展的数据集的自动化生成:以 CT 肺栓塞影像报告为例的自然语言处理在非结构化报告中的应用。
Eur J Radiol. 2020 Apr;125:108862. doi: 10.1016/j.ejrad.2020.108862. Epub 2020 Feb 6.

引用本文的文献

1
Evaluation of ChatGPT 4.0 in Thoracic Imaging and Diagnostics.ChatGPT 4.0在胸部影像学与诊断中的评估
Cureus. 2024 Nov 15;16(11):e73741. doi: 10.7759/cureus.73741. eCollection 2024 Nov.
2
BioBERTurk: Exploring Turkish Biomedical Language Model Development Strategies in Low-Resource Setting.BioBERTurk:在资源匮乏环境中探索土耳其生物医学语言模型开发策略
J Healthc Inform Res. 2023 Sep 19;7(4):433-446. doi: 10.1007/s41666-023-00140-7. eCollection 2023 Dec.
3
Transformer versus traditional natural language processing: how much data is enough for automated radiology report classification?
Transformer 与传统自然语言处理:自动化放射科报告分类需要多少数据?
Br J Radiol. 2023 Sep;96(1149):20220769. doi: 10.1259/bjr.20220769. Epub 2023 May 25.
4
Natural Language Processing and Graph Theory: Making Sense of Imaging Records in a Novel Representation Frame.自然语言处理与图论:在一种新型表示框架中理解影像记录
JMIR Med Inform. 2022 Dec 21;10(12):e40534. doi: 10.2196/40534.
5
Machine learning to predict in-hospital mortality among patients with severe obesity: Proof of concept study.机器学习预测重度肥胖患者的院内死亡率:概念验证研究。
Obes Sci Pract. 2022 Mar 24;8(4):474-482. doi: 10.1002/osp4.571. eCollection 2022 Aug.
6
Labeling Noncontrast Head CT Reports for Common Findings Using Natural Language Processing.基于自然语言处理的常见头部 CT 无对比剂报告标注
AJNR Am J Neuroradiol. 2022 May;43(5):721-726. doi: 10.3174/ajnr.A7500. Epub 2022 Apr 28.
7
Deep Learning-Based Natural Language Processing in Radiology: The Impact of Report Complexity, Disease Prevalence, Dataset Size, and Algorithm Type on Model Performance.深度学习在放射学中的自然语言处理:报告复杂性、疾病流行率、数据集大小和算法类型对模型性能的影响。
J Med Syst. 2021 Sep 4;45(10):91. doi: 10.1007/s10916-021-01761-4.
8
Year 2020 (with COVID): Observation of Scientific Literature on Clinical Natural Language Processing.2020 年(含新冠疫情):临床自然语言处理相关科学文献观察
Yearb Med Inform. 2021 Aug;30(1):257-263. doi: 10.1055/s-0041-1726528. Epub 2021 Sep 3.
9
Gene Mutation Classification through Text Evidence Facilitating Cancer Tumour Detection.通过文本证据进行基因突变分类,辅助癌症肿瘤检测。
J Healthc Eng. 2021 Jul 27;2021:8689873. doi: 10.1155/2021/8689873. eCollection 2021.