Identification of patients with carotid stenosis using natural language processing.

Affiliations

Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA.

Department of Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA.

Publication information

Eur Radiol. 2020 Jul;30(7):4125-4133. doi: 10.1007/s00330-020-06721-z. Epub 2020 Feb 26.

DOI: 10.1007/s00330-020-06721-z
PMID: 32103365

Abstract

PURPOSE

The highly structured nature of medical reports makes them feasible for automated large-scale patient identification. This study aimed to develop a natural language processing (NLP) model to retrospectively retrieve patients with presence and history of carotid stenosis (CS) using their ultrasound reports.

METHODS

Ultrasound reports from our institution between January 2016 and December 2017 were selected. To process the texts, we developed a parser to divide the raw text into fields. For the baseline method, we used bag-of-n-grams and term frequency-inverse document frequency (TF-IDF) as the features with linear classifiers; logistic regression served as the baseline model. Convolutional and recurrent neural networks (CNN; RNN) with an attention mechanism were applied to the dataset to improve classification accuracy.
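The baseline described in the Methods can be sketched in a few lines of standard-library Python. This is an illustrative sketch only, not the authors' implementation: the field headers (`HISTORY`, `FINDINGS`, `IMPRESSION`), the toy reports, the smoothed idf variant, and all function names are assumptions made for the example.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical section headers; real report templates will differ.
FIELD_RE = re.compile(r"^(HISTORY|FINDINGS|IMPRESSION):[ \t]*", re.MULTILINE)

def parse_fields(raw):
    """Split a raw report into {field_name: text} at known header lines."""
    parts = FIELD_RE.split(raw)  # [preamble, name1, text1, name2, text2, ...]
    return {name.lower(): text.strip()
            for name, text in zip(parts[1::2], parts[2::2])}

def ngrams(text, n_max=2):
    """Return all word 1-grams up to n_max-grams of a lowercased text."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [" ".join(tokens[i:i + n])
            for n in range(1, n_max + 1)
            for i in range(len(tokens) - n + 1)]

def tfidf_vectors(docs):
    """Map each document to an L2-normalized {n-gram: tf-idf weight} dict."""
    counts = [Counter(ngrams(d)) for d in docs]
    doc_freq = Counter(term for c in counts for term in c)
    n_docs = len(docs)
    vectors = []
    for c in counts:
        # Smoothed idf: one common variant among several.
        vec = {t: tf * (math.log((1 + n_docs) / (1 + doc_freq[t])) + 1.0)
               for t, tf in c.items()}
        norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
        vectors.append({t: v / norm for t, v in vec.items()})
    return vectors

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def train_logreg(features, labels, epochs=200, lr=0.5):
    """Fit logistic regression on sparse dict features with plain SGD."""
    w, b = defaultdict(float), 0.0
    for _ in range(epochs):
        for x, y in zip(features, labels):
            grad = sigmoid(b + sum(w[t] * v for t, v in x.items())) - y
            b -= lr * grad
            for t, v in x.items():
                w[t] -= lr * grad * v
    return w, b

def predict(w, b, x):
    """Return 1 if the linear score is positive, else 0."""
    return 1 if b + sum(w.get(t, 0.0) * v for t, v in x.items()) > 0 else 0

# Invented toy reports, for illustration only.
reports = [
    "HISTORY: dizziness\nFINDINGS: severe stenosis of right internal carotid artery",
    "HISTORY: bruit\nFINDINGS: moderate stenosis of left internal carotid artery",
    "HISTORY: screening\nFINDINGS: no significant stenosis",
    "HISTORY: syncope\nFINDINGS: normal carotid arteries",
]
labels = [1, 1, 0, 0]
findings = [parse_fields(r)["findings"] for r in reports]
X = tfidf_vectors(findings)
w, b = train_logreg(X, labels)
predictions = [predict(w, b, x) for x in X]
```

Note that `tfidf_vectors` computes document frequencies from the documents it is given, so scoring unseen reports would require storing the training-set idf values; that step is left out to keep the sketch short.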

RESULTS

We had 1220 ultrasound reports for training and 307 for testing, totaling 1527 reports. For predicting history of CS, both CNN and RNN-attention models had a significantly higher specificity than logistic regression. In addition, RNN-attention also had a significantly higher F1 score and accuracy. For predicting presence of carotid stenosis, all models achieved above 93% accuracy. RNN-attention achieved a 95.4% accuracy, although the difference with logistic regression was not statistically significant. RNN-attention had a statistically significantly higher specificity than logistic regression.
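The metrics reported above (accuracy, specificity, F1 score) all derive from the four cells of a binary confusion matrix. The following is a minimal sketch of those definitions; the function name `binary_metrics` and the example labels are assumptions for illustration, and no values from the study are reproduced here.

```python
def binary_metrics(y_true, y_pred):
    """Compute accuracy, sensitivity, specificity, and F1 for 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        # Sensitivity (recall): share of true positives that were found.
        "sensitivity": tp / (tp + fn) if tp + fn else 0.0,
        # Specificity: share of true negatives that were correctly rejected.
        "specificity": tn / (tn + fp) if tn + fp else 0.0,
        # F1: harmonic mean of precision and recall.
        "f1": 2 * tp / (2 * tp + fp + fn) if tp + fp + fn else 0.0,
    }

# The split described in the Results: 307 of 1527 reports held out for testing.
test_fraction = 307 / (1220 + 307)  # roughly one fifth of the data

# Invented labels, for illustration only.
metrics = binary_metrics([1, 1, 1, 0, 0, 0, 0, 1], [1, 1, 0, 0, 0, 1, 0, 1])
```

Specificity matters in this setting because a retrieval model that flags too many patients without the condition would inflate the manual review workload for a follow-up cohort.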

CONCLUSIONS

We developed linear, CNN, and RNN models to predict history and presence of CS from ultrasound reports. We have demonstrated NLP to be an efficient, accurate approach for large-scale retrospective patient identification, with applications in long-term follow-up of patients and clinical research studies.

KEY POINTS

• Natural language processing models using both linear classifiers and neural networks can achieve good performance, with an overall accuracy above 90% in predicting history and presence of carotid stenosis.
• Convolutional and recurrent neural networks, especially with additional features including field awareness and an attention mechanism, outperform traditional linear classifiers.
• NLP is shown to be an efficient approach for large-scale retrospective patient identification, with applications in long-term follow-up of patients and further clinical research studies.
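The abstract does not specify the form of the attention mechanism. As a generic illustration of the idea, the sketch below uses simple dot-product attention pooling: each token's hidden state is scored against a query vector, the scores are normalized with a softmax, and the weighted sum becomes the report representation. The function names, the query vector, and the toy hidden states are all assumptions, and this is not claimed to match the authors' architecture.

```python
import math

def softmax(scores):
    """Convert raw scores to weights that are positive and sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_pool(hidden_states, query):
    """Pool per-token hidden states into one vector using dot-product attention.

    hidden_states: list of equal-length vectors, one per token (e.g. RNN outputs).
    query: a vector of the same length; learned in a real model, fixed here.
    """
    scores = [sum(h_i * q_i for h_i, q_i in zip(h, query)) for h in hidden_states]
    weights = softmax(scores)
    dim = len(query)
    context = [sum(w * h[d] for w, h in zip(weights, hidden_states))
               for d in range(dim)]
    return context, weights

# Toy hidden states for three tokens; the first aligns best with the query.
hidden = [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]]
context, weights = attention_pool(hidden, query=[2.0, 0.0])
```

In a trained model the weights indicate which tokens contributed most to the prediction, which is one reason attention is often paired with recurrent networks for report classification.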

Similar articles

1. Identification of patients with carotid stenosis using natural language processing.
   Eur Radiol. 2020 Jul;30(7):4125-4133. doi: 10.1007/s00330-020-06721-z. Epub 2020 Feb 26.
2. Establishing a carotid artery stenosis disease cohort for comparative effectiveness research using natural language processing.
   J Vasc Surg. 2021 Dec;74(6):1937-1947.e3. doi: 10.1016/j.jvs.2021.05.054. Epub 2021 Jun 25.
3. Predicting mental conditions based on "history of present illness" in psychiatric notes with deep neural networks.
   J Biomed Inform. 2017 Nov;75S:S138-S148. doi: 10.1016/j.jbi.2017.06.010. Epub 2017 Jun 10.
4. Developing a Cancer Digital Twin: Supervised Metastases Detection From Consecutive Structured Radiology Reports.
   Front Artif Intell. 2022 Mar 2;5:826402. doi: 10.3389/frai.2022.826402. eCollection 2022.
5. Towards automated generation of curated datasets in radiology: Application of natural language processing to unstructured reports exemplified on CT for pulmonary embolism.
   Eur J Radiol. 2020 Apr;125:108862. doi: 10.1016/j.ejrad.2020.108862. Epub 2020 Feb 6.
6. Prediction of Stroke Outcome Using Natural Language Processing-Based Machine Learning of Radiology Report of Brain MRI.
   J Pers Med. 2020 Dec 16;10(4):286. doi: 10.3390/jpm10040286.
7. Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification.
   Artif Intell Med. 2019 Jun;97:79-88. doi: 10.1016/j.artmed.2018.11.004. Epub 2018 Nov 23.
8. ACR-SA: attention-based deep model through two-channel CNN and Bi-RNN for sentiment analysis.
   PeerJ Comput Sci. 2022 Mar 17;8:e877. doi: 10.7717/peerj-cs.877. eCollection 2022.
9. Comparing information extraction techniques for low-prevalence concepts: The case of insulin rejection by patients.
   J Biomed Inform. 2019 Nov;99:103306. doi: 10.1016/j.jbi.2019.103306. Epub 2019 Oct 13.
10. Natural language processing and machine learning algorithm to identify brain MRI reports with acute ischemic stroke.
    PLoS One. 2019 Feb 28;14(2):e0212778. doi: 10.1371/journal.pone.0212778. eCollection 2019.

Cited by

1. The added value of including thyroid nodule features into large language models for automatic ACR TI-RADS classification based on ultrasound reports.
   Jpn J Radiol. 2025 Apr;43(4):593-602. doi: 10.1007/s11604-024-01707-z. Epub 2024 Nov 25.
2. Identifying the Severity of Heart Valve Stenosis and Regurgitation Among a Diverse Population Within an Integrated Health Care System: Natural Language Processing Approach.
   JMIR Cardio. 2024 Sep 30;8:e60503. doi: 10.2196/60503.
3. Current Applications and Future Perspectives of Artificial and Biomimetic Intelligence in Vascular Surgery and Peripheral Artery Disease.
   Biomimetics (Basel). 2024 Aug 1;9(8):465. doi: 10.3390/biomimetics9080465.
4. Comprehensive Review of Natural Language Processing (NLP) in Vascular Surgery.
   EJVES Vasc Forum. 2023 Sep 17;60:57-63. doi: 10.1016/j.ejvsvf.2023.09.002. eCollection 2023.
5. Detection of Lumbar Spondylolisthesis from X-ray Images Using Deep Learning Network.
   J Clin Med. 2022 Sep 16;11(18):5450. doi: 10.3390/jcm11185450.
6. Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches.
   Diagnostics (Basel). 2022 Aug 3;12(8):1882. doi: 10.3390/diagnostics12081882.
7. Natural Language Processing of Large-Scale Structured Radiology Reports to Identify Oncologic Patients With or Without Splenomegaly Over a 10-Year Period.
   JCO Clin Cancer Inform. 2022 Jan;6:e2100104. doi: 10.1200/CCI.21.00104.
8. A disease-specific language representation model for cerebrovascular disease research.
   Comput Methods Programs Biomed. 2021 Nov;211:106446. doi: 10.1016/j.cmpb.2021.106446. Epub 2021 Sep 30.
9. Multimodality carotid plaque tissue characterization and classification in the artificial intelligence paradigm: a narrative review for stroke application.
   Ann Transl Med. 2021 Jul;9(14):1206. doi: 10.21037/atm-20-7676.
10. Automatic Prediction of Recurrence of Major Cardiovascular Events: A Text Mining Study Using Chest X-Ray Reports.
    J Healthc Eng. 2021 Jul 9;2021:6663884. doi: 10.1155/2021/6663884. eCollection 2021.