• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用预训练语言模型实现外科医生协会成人心脏手术登记处手术程序提取的自动化。

Automating surgical procedure extraction for society of surgeons adult cardiac surgery registry using pretrained language models.

作者信息

Lee Jaehyun, Sharma Ishan, Arcaro Nichole, Blackstone Eugene H, Gillinov A Marc, Svensson Lars G, Karamlou Tara, Chen David

机构信息

Cardiovascular Outcomes Research and Registries, Cleveland Clinic, Cleveland Clinic, Cleveland, OH 44195, United States.

Heart, Vascular, and Thoracic Institute, Cleveland Clinic, Cleveland, OH 44195, United States.

出版信息

JAMIA Open. 2024 Jul 24;7(3):ooae054. doi: 10.1093/jamiaopen/ooae054. eCollection 2024 Oct.

DOI:10.1093/jamiaopen/ooae054
PMID:39049992
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11268872/
Abstract

OBJECTIVE

Surgical registries play a crucial role in clinical knowledge discovery, hospital quality assurance, and quality improvement. However, maintaining a surgical registry requires significant monetary and human resources given the wide gamut of information abstracted from medical records ranging from patient co-morbidities to procedural details to post-operative outcomes. Although natural language processing (NLP) methods such as pretrained language models (PLMs) have promised automation of this process, there are yet substantial barriers to implementation. In particular, constant shifts in both underlying data and required registry content are hurdles to the application of NLP technologies.

MATERIALS AND METHODS

In our work, we evaluate the application of PLMs for automating the population of the Society of Thoracic Surgeons (STSs) adult cardiac surgery registry (ACS) procedural elements, for which we term Cardiovascular Surgery Bidirectional Encoder Representations from Transformers (CS-BERT). CS-BERT was validated across multiple satellite sites and versions of the STS-ACS registry.

RESULTS

CS-BERT performed well (F1 score of 0.8417 ± 0.1838) in common cardiac surgery procedures compared to models based on diagnosis codes (F1 score of 0.6130 ± 0.0010). The model also generalized well to satellite sites and across different versions of the STS-ACS registry.

DISCUSSION AND CONCLUSIONS

This study provides evidence that PLMs can be used to extract the more common cardiac surgery procedure variables in the STS-ACS registry, potentially reducing need for expensive human annotation and wide scale dissemination. Further research is needed for rare procedural variables which suffer from both lack of data and variable documentation quality.

摘要

目的

手术登记在临床知识发现、医院质量保证和质量改进中起着至关重要的作用。然而,鉴于从病历中提取的信息范围广泛,从患者合并症到手术细节再到术后结果,维护手术登记需要大量的资金和人力资源。尽管诸如预训练语言模型(PLM)之类的自然语言处理(NLP)方法有望实现这一过程的自动化,但在实施方面仍存在重大障碍。特别是,基础数据和所需登记内容的不断变化是NLP技术应用的障碍。

材料与方法

在我们的工作中,我们评估了PLM在自动填充胸外科医师协会(STS)成人心脏手术登记(ACS)程序元素方面的应用,我们将其称为心血管外科双向编码器表征从变压器(CS-BERT)。CS-BERT在多个卫星站点和不同版本的STS-ACS登记中得到了验证。

结果

与基于诊断代码的模型(F1分数为0.6130±0.0010)相比,CS-BERT在常见心脏手术程序中表现良好(F1分数为0.8417±0.1838)。该模型在卫星站点和不同版本的STS-ACS登记中也具有良好的通用性。

讨论与结论

本研究提供了证据,证明PLM可用于提取STS-ACS登记中更常见的心脏手术程序变量,可能减少对昂贵人工注释的需求并实现广泛传播。对于缺乏数据且文档质量参差不齐的罕见手术变量,还需要进一步研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f90/11268872/c1a946dd2cd7/ooae054f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f90/11268872/3b8c8140cc57/ooae054f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f90/11268872/c1a946dd2cd7/ooae054f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f90/11268872/3b8c8140cc57/ooae054f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f90/11268872/c1a946dd2cd7/ooae054f2.jpg

相似文献

1
Automating surgical procedure extraction for society of surgeons adult cardiac surgery registry using pretrained language models.使用预训练语言模型实现外科医生协会成人心脏手术登记处手术程序提取的自动化。
JAMIA Open. 2024 Jul 24;7(3):ooae054. doi: 10.1093/jamiaopen/ooae054. eCollection 2024 Oct.
2
When BERT meets Bilbo: a learning curve analysis of pretrained language model on disease classification.当 BERT 遇见比尔博:预训练语言模型在疾病分类上的学习曲线分析。
BMC Med Inform Decis Mak. 2022 Apr 5;21(Suppl 9):377. doi: 10.1186/s12911-022-01829-2.
3
A Natural Language Processing Model for COVID-19 Detection Based on Dutch General Practice Electronic Health Records by Using Bidirectional Encoder Representations From Transformers: Development and Validation Study.基于荷兰全科电子健康记录的 COVID-19 检测自然语言处理模型:使用转换器的双向编码器表示进行开发和验证研究。
J Med Internet Res. 2023 Oct 4;25:e49944. doi: 10.2196/49944.
4
Evaluation of a BERT Natural Language Processing Model for Automating CT and MRI Triage and Protocol Selection.用于自动进行CT和MRI分诊及方案选择的BERT自然语言处理模型评估
Can Assoc Radiol J. 2025 May;76(2):265-272. doi: 10.1177/08465371241255895. Epub 2024 Jun 4.
5
Use of artificial intelligence to identify data elements for The Japanese Orthopaedic Association National Registry from operative records.利用人工智能从手术记录中识别日本矫形外科学会国家注册中心的数据元素。
J Orthop Sci. 2023 Nov;28(6):1392-1399. doi: 10.1016/j.jos.2022.09.003. Epub 2022 Sep 23.
6
Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework.基于 BERT(来自 Transformers 的双向编码器表示)的深度学习方法在提取中文放射学报告证据中的应用:计算机辅助肝癌诊断框架的开发。
J Med Internet Res. 2021 Jan 12;23(1):e19689. doi: 10.2196/19689.
7
A Fine-Tuned Bidirectional Encoder Representations From Transformers Model for Food Named-Entity Recognition: Algorithm Development and Validation.基于 Transformer 的双向编码器表示模型的精细调整在食品命名实体识别中的应用:算法开发与验证。
J Med Internet Res. 2021 Aug 9;23(8):e28229. doi: 10.2196/28229.
8
Ophthalmology Operation Note Encoding with Open-Source Machine Learning and Natural Language Processing.开源机器学习和自然语言处理在眼科手术记录编码中的应用。
Ophthalmic Res. 2023;66(1):928-939. doi: 10.1159/000530954. Epub 2023 May 11.
9
Identify diabetic retinopathy-related clinical concepts and their attributes using transformer-based natural language processing methods.使用基于转换器的自然语言处理方法识别与糖尿病视网膜病变相关的临床概念及其属性。
BMC Med Inform Decis Mak. 2022 Sep 27;22(Suppl 3):255. doi: 10.1186/s12911-022-01996-2.
10
Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.

引用本文的文献

1
Fine-Tuning Large Language Models for Specialized Use Cases.针对特定用例微调大语言模型。
Mayo Clin Proc Digit Health. 2024 Nov 29;3(1):100184. doi: 10.1016/j.mcpdig.2024.11.005. eCollection 2025 Mar.

本文引用的文献

1
Building large-scale registries from unstructured clinical notes using a low-resource natural language processing pipeline.利用低资源自然语言处理管道从非结构化临床笔记中构建大规模注册中心。
Artif Intell Med. 2024 May;151:102847. doi: 10.1016/j.artmed.2024.102847. Epub 2024 Mar 22.
2
Launching into clinical space with medspaCy: a new clinical text processing toolkit in Python.医学 spaCy:Python 中的新型临床文本处理工具包,助力临床应用。
AMIA Annu Symp Proc. 2022 Feb 21;2021:438-447. eCollection 2021.
3
Can Natural Language Processing and Artificial Intelligence Automate The Generation of Billing Codes From Operative Note Dictations?
自然语言处理和人工智能能否根据手术记录口述自动生成计费代码?
Global Spine J. 2023 Sep;13(7):1946-1955. doi: 10.1177/21925682211062831. Epub 2022 Feb 28.
4
Automated surveillance systems for healthcare-associated infections: results from a European survey and experiences from real-life utilization.医疗保健相关感染的自动监测系统:一项欧洲调查的结果及实际应用经验
J Hosp Infect. 2022 Apr;122:35-43. doi: 10.1016/j.jhin.2021.12.021. Epub 2022 Jan 12.
5
Natural language inference for curation of structured clinical registries from unstructured text.从非结构化文本中进行结构化临床注册管理的自然语言推理。
J Am Med Inform Assoc. 2021 Dec 28;29(1):97-108. doi: 10.1093/jamia/ocab243.
6
Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases.将注册数据标准化为OMOP通用数据模型:来自三个肺动脉高压数据库的经验。
BMC Med Res Methodol. 2021 Nov 2;21(1):238. doi: 10.1186/s12874-021-01434-3.
7
Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning.自动ICD - 10编码与训练系统:基于监督学习的深度神经网络
JMIR Med Inform. 2021 Aug 31;9(8):e23230. doi: 10.2196/23230.
8
Electronic Health Record (EHR) Abstraction.电子健康记录(EHR)提取
Perspect Health Inf Manag. 2021 Mar 15;18(Spring):1g. eCollection 2021 Spring.
9
STS Adult Cardiac Surgery Database: 2021 Update on Outcomes, Quality, and Research.STS 成人心脏外科学数据库:2021 年手术结果、质量和研究更新。
Ann Thorac Surg. 2021 Jun;111(6):1770-1780. doi: 10.1016/j.athoracsur.2021.03.043. Epub 2021 Mar 29.
10
Next Generation of Central Cancer Registries.下一代中央癌症登记处。
JCO Clin Cancer Inform. 2021 Mar;5:288-294. doi: 10.1200/CCI.20.00177.