• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BERT 是否需要进行领域适应来进行临床否定检测?

Does BERT need domain adaptation for clinical negation detection?

机构信息

Computational Health Informatics Program, Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA.

School of Information, University of Arizona, Tucson, Arizona, USA.

出版信息

J Am Med Inform Assoc. 2020 Apr 1;27(4):584-591. doi: 10.1093/jamia/ocaa001.

DOI:10.1093/jamia/ocaa001
PMID:32044989
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7075528/
Abstract

INTRODUCTION

Classifying whether concepts in an unstructured clinical text are negated is an important unsolved task. New domain adaptation and transfer learning methods can potentially address this issue.

OBJECTIVE

We examine neural unsupervised domain adaptation methods, introducing a novel combination of domain adaptation with transformer-based transfer learning methods to improve negation detection. We also want to better understand the interaction between the widely used bidirectional encoder representations from transformers (BERT) system and domain adaptation methods.

MATERIALS AND METHODS

We use 4 clinical text datasets that are annotated with negation status. We evaluate a neural unsupervised domain adaptation algorithm and BERT, a transformer-based model that is pretrained on massive general text datasets. We develop an extension to BERT that uses domain adversarial training, a neural domain adaptation method that adds an objective to the negation task, that the classifier should not be able to distinguish between instances from 2 different domains.

RESULTS

The domain adaptation methods we describe show positive results, but, on average, the best performance is obtained by plain BERT (without the extension). We provide evidence that the gains from BERT are likely not additive with the gains from domain adaptation.

DISCUSSION

Our results suggest that, at least for the task of clinical negation detection, BERT subsumes domain adaptation, implying that BERT is already learning very general representations of negation phenomena such that fine-tuning even on a specific corpus does not lead to much overfitting.

CONCLUSION

Despite being trained on nonclinical text, the large training sets of models like BERT lead to large gains in performance for the clinical negation detection task.

摘要

简介

对非结构化临床文本中的概念进行否定分类是一个尚未解决的重要问题。新的领域自适应和迁移学习方法可能会解决这个问题。

目的

我们研究了神经无监督领域自适应方法,引入了一种新的领域自适应与基于转换器的迁移学习方法的组合,以提高否定检测的效果。我们还希望更好地理解广泛使用的双向转换器表示(BERT)系统和领域自适应方法之间的相互作用。

材料和方法

我们使用 4 个具有否定状态注释的临床文本数据集。我们评估了一种神经无监督领域自适应算法和 BERT,这是一种基于转换器的模型,在大规模通用文本数据集上进行了预训练。我们开发了 BERT 的扩展版本,使用了对抗训练的领域自适应方法,该方法为否定任务添加了一个目标,即分类器不应能够区分来自 2 个不同领域的实例。

结果

我们描述的领域自适应方法显示出了积极的结果,但平均而言,最好的性能是由普通的 BERT(没有扩展)获得的。我们提供的证据表明,BERT 的收益与领域自适应的收益并非相加的关系。

讨论

我们的结果表明,至少对于临床否定检测任务,BERT 包含了领域自适应,这意味着 BERT 已经在学习否定现象的非常通用的表示,因此即使在特定语料库上进行微调也不会导致过度拟合。

结论

尽管 BERT 是在非临床文本上进行训练的,但像 BERT 这样的大型训练集的模型在临床否定检测任务中表现出了很大的性能提升。

相似文献

1
Does BERT need domain adaptation for clinical negation detection?BERT 是否需要进行领域适应来进行临床否定检测?
J Am Med Inform Assoc. 2020 Apr 1;27(4):584-591. doi: 10.1093/jamia/ocaa001.
2
Oversampling effect in pretraining for bidirectional encoder representations from transformers (BERT) to localize medical BERT and enhance biomedical BERT.在基于转换器的双向编码器表示预训练(BERT)中进行过采样,以定位医学 BERT 并增强生物医学 BERT。
Artif Intell Med. 2024 Jul;153:102889. doi: 10.1016/j.artmed.2024.102889. Epub 2024 May 5.
3
Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
4
Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT).使用基于转换器的双向编码器表示 (BERT) 和领域内预训练 (IDPT) 对耳鸣患者的可操作放射学报告进行自动文本分类。
BMC Med Inform Decis Mak. 2022 Jul 30;22(1):200. doi: 10.1186/s12911-022-01946-y.
5
The Impact of Pretrained Language Models on Negation and Speculation Detection in Cross-Lingual Medical Text: Comparative Study.预训练语言模型对跨语言医学文本中否定和推测检测的影响:比较研究
JMIR Med Inform. 2020 Dec 3;8(12):e18953. doi: 10.2196/18953.
6
A Fine-Tuned Bidirectional Encoder Representations From Transformers Model for Food Named-Entity Recognition: Algorithm Development and Validation.基于 Transformer 的双向编码器表示模型的精细调整在食品命名实体识别中的应用:算法开发与验证。
J Med Internet Res. 2021 Aug 9;23(8):e28229. doi: 10.2196/28229.
7
Deep Learning Approach for Negation and Speculation Detection for Automated Important Finding Flagging and Extraction in Radiology Report: Internal Validation and Technique Comparison Study.用于放射学报告中自动重要发现标记和提取的否定与推测检测的深度学习方法:内部验证与技术比较研究
JMIR Med Inform. 2023 Apr 25;11:e46348. doi: 10.2196/46348.
8
When BERT meets Bilbo: a learning curve analysis of pretrained language model on disease classification.当 BERT 遇见比尔博:预训练语言模型在疾病分类上的学习曲线分析。
BMC Med Inform Decis Mak. 2022 Apr 5;21(Suppl 9):377. doi: 10.1186/s12911-022-01829-2.
9
Semantic Textual Similarity in Japanese Clinical Domain Texts Using BERT.基于 BERT 的日本临床领域文本的语义文本相似性研究
Methods Inf Med. 2021 Jun;60(S 01):e56-e64. doi: 10.1055/s-0041-1731390. Epub 2021 Jul 8.
10
BioBERT and Similar Approaches for Relation Extraction.BioBERT 及其在关系抽取中的应用。
Methods Mol Biol. 2022;2496:221-235. doi: 10.1007/978-1-0716-2305-3_12.

引用本文的文献

1
Large Language Model Symptom Identification From Clinical Text: Multicenter Study.基于临床文本的大语言模型症状识别:多中心研究。
J Med Internet Res. 2025 Jul 31;27:e72984. doi: 10.2196/72984.
2
Enhancing Cross-Domain Generalizability in Social Determinants of Health Extraction with Prompt-Tuning Large Language Models.利用提示调整大型语言模型增强健康提取社会决定因素中的跨领域通用性。
AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:432-440. eCollection 2025.
3
A Large Language Model to Detect Negated Expressions in Radiology Reports.一种用于检测放射学报告中否定表达的大语言模型。
J Imaging Inform Med. 2025 Jun;38(3):1297-1303. doi: 10.1007/s10278-024-01274-9. Epub 2024 Sep 25.
4
Developing an Inpatient Electronic Medical Record Phenotype for Hospital-Acquired Pressure Injuries: Case Study Using Natural Language Processing Models.开发用于医院获得性压力性损伤的住院电子病历表型:使用自然语言处理模型的案例研究
JMIR AI. 2023 Mar 8;2:e41264. doi: 10.2196/41264.
5
Prediction of COVID-19 Patients' Emergency Room Revisit using Multi-Source Transfer Learning.基于多源迁移学习的新冠肺炎患者急诊复诊预测
Proc (IEEE Int Conf Healthc Inform). 2023 Jun;2023:138-144. doi: 10.1109/ICHI57859.2023.00028. Epub 2023 Dec 11.
6
Applying Natural Language Processing to Textual Data From Clinical Data Warehouses: Systematic Review.将自然语言处理应用于临床数据仓库中的文本数据:系统评价。
JMIR Med Inform. 2023 Dec 15;11:e42477. doi: 10.2196/42477.
7
An interpretable deep learning framework for predicting liver metastases in postoperative colorectal cancer patients using natural language processing and clinical data integration.基于自然语言处理和临床数据集成的可解释深度学习框架,用于预测结直肠癌术后肝转移患者。
Cancer Med. 2023 Sep;12(18):19337-19351. doi: 10.1002/cam4.6523. Epub 2023 Sep 11.
8
DARE: Distill and Reinforce Ensemble Neural Networks for Climate-Domain Processing.DARE:用于气候领域处理的蒸馏与强化集成神经网络。
Entropy (Basel). 2023 Apr 11;25(4):643. doi: 10.3390/e25040643.
9
Performance and Agreement When Annotating Chest X-ray Text Reports-A Preliminary Step in the Development of a Deep Learning-Based Prioritization and Detection System.标注胸部X光文本报告时的性能与一致性——基于深度学习的优先级排序和检测系统开发的初步步骤
Diagnostics (Basel). 2023 Mar 11;13(6):1070. doi: 10.3390/diagnostics13061070.
10
Negation detection in Dutch clinical texts: an evaluation of rule-based and machine learning methods.荷兰临床文本中的否定词检测:基于规则和机器学习方法的评估。
BMC Bioinformatics. 2023 Jan 9;24(1):10. doi: 10.1186/s12859-022-05130-x.

本文引用的文献

1
Simplified Neural Unsupervised Domain Adaptation.简化神经无监督域适应
Proc Conf. 2019 Jun;2019:414-419.
2
Enhancing clinical concept extraction with contextual embeddings.利用上下文嵌入增强临床概念提取。
J Am Med Inform Assoc. 2019 Nov 1;26(11):1297-1304. doi: 10.1093/jamia/ocz096.
3
Clinical information extraction applications: A literature review.临床信息提取应用:文献综述。
J Biomed Inform. 2018 Jan;77:34-49. doi: 10.1016/j.jbi.2017.11.011. Epub 2017 Nov 21.
4
MIMIC-III, a freely accessible critical care database.MIMIC-III,一个免费获取的重症监护数据库。
Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.
5
DEEPEN: A negation detection system for clinical text incorporating dependency relation into NegEx.DEEPEN:一种将依存关系纳入NegEx的临床文本否定检测系统。
J Biomed Inform. 2015 Apr;54:213-9. doi: 10.1016/j.jbi.2015.02.010. Epub 2015 Mar 16.
6
Negation's not solved: generalizability versus optimizability in clinical natural language processing.否定问题尚未解决:临床自然语言处理中的可推广性与可优化性
PLoS One. 2014 Nov 13;9(11):e112774. doi: 10.1371/journal.pone.0112774. eCollection 2014.
7
Towards comprehensive syntactic and semantic annotations of the clinical narrative.朝着临床叙述的全面句法和语义标注努力。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):922-30. doi: 10.1136/amiajnl-2012-001317. Epub 2013 Jan 25.
8
Dependency Parser-based Negation Detection in Clinical Narratives.临床叙述中基于依存句法分析器的否定检测
AMIA Jt Summits Transl Sci Proc. 2012;2012:1-8. Epub 2012 Mar 19.
9
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.2010 i2b2/VA 挑战赛:临床文本中的概念、断言和关系
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.
10
ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports.语境:一种从临床报告中确定否定、体验者和时间状态的算法。
J Biomed Inform. 2009 Oct;42(5):839-51. doi: 10.1016/j.jbi.2009.05.002. Epub 2009 May 10.