文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

Domain adaptation for semantic role labeling of clinical text.

作者信息

Zhang Yaoyun, Tang Buzhou, Jiang Min, Wang Jingqi, Xu Hua

机构信息

University of Texas School of Biomedical Informatics at Houston, Houston, TX, USA.

University of Texas School of Biomedical Informatics at Houston, Houston, TX, USA Department of Computer Science, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong, China.

出版信息

J Am Med Inform Assoc. 2015 Sep;22(5):967-79. doi: 10.1093/jamia/ocu048. Epub 2015 Jun 10.


DOI:10.1093/jamia/ocu048
PMID:26063745
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4986662/
Abstract

OBJECTIVE: Semantic role labeling (SRL), which extracts a shallow semantic relation representation from different surface textual forms of free text sentences, is important for understanding natural language. Few studies in SRL have been conducted in the medical domain, primarily due to lack of annotated clinical SRL corpora, which are time-consuming and costly to build. The goal of this study is to investigate domain adaptation techniques for clinical SRL leveraging resources built from newswire and biomedical literature to improve performance and save annotation costs. MATERIALS AND METHODS: Multisource Integrated Platform for Answering Clinical Questions (MiPACQ), a manually annotated SRL clinical corpus, was used as the target domain dataset. PropBank and NomBank from newswire and BioProp from biomedical literature were used as source domain datasets. Three state-of-the-art domain adaptation algorithms were employed: instance pruning, transfer self-training, and feature augmentation. The SRL performance using different domain adaptation algorithms was evaluated by using 10-fold cross-validation on the MiPACQ corpus. Learning curves for the different methods were generated to assess the effect of sample size. RESULTS AND CONCLUSION: When all three source domain corpora were used, the feature augmentation algorithm achieved statistically significant higher F-measure (83.18%), compared to the baseline with MiPACQ dataset alone (F-measure, 81.53%), indicating that domain adaptation algorithms may improve SRL performance on clinical text. To achieve a comparable performance to the baseline method that used 90% of MiPACQ training samples, the feature augmentation algorithm required <50% of training samples in MiPACQ, demonstrating that annotation costs of clinical SRL can be reduced significantly by leveraging existing SRL resources from other domains.

摘要

相似文献

[1]
Domain adaptation for semantic role labeling of clinical text.

J Am Med Inform Assoc. 2015-9

[2]
Domain adaptation for semantic role labeling in the biomedical domain.

Bioinformatics. 2010-2-23

[3]
Semantic Role Labeling of Clinical Text: Comparing Syntactic Parsers and Features.

AMIA Annu Symp Proc. 2017-2-10

[4]
BIOSMILE: a semantic role labeling system for biomedical verbs using a maximum-entropy model with automatically generated template features.

BMC Bioinformatics. 2007-9-1

[5]
Semi-automatic conversion of BioProp semantic annotation to PASBio annotation.

BMC Bioinformatics. 2008-12-12

[6]
Parsing clinical text: how good are the state-of-the-art parsers?

BMC Med Inform Decis Mak. 2015

[7]
Leveraging existing corpora for de-identification of psychiatric notes using domain adaptation.

AMIA Annu Symp Proc. 2018-4-16

[8]
Semantic role labeling for protein transport predicates.

BMC Bioinformatics. 2008-6-11

[9]
Large scale application of neural network based semantic role labeling for automated relation extraction from biomedical texts.

PLoS One. 2009-7-28

[10]
Towards semantic role labeling & IE in the medical literature.

AMIA Annu Symp Proc. 2005

引用本文的文献

[1]
Defining Phenotypes from Clinical Data to Drive Genomic Research.

Annu Rev Biomed Data Sci. 2018-7

[2]
Amplifying Domain Expertise in Clinical Data Pipelines.

JMIR Med Inform. 2020-11-5

[3]
Automatic Labeled Dialogue Generation for Nursing Record Systems.

J Pers Med. 2020-7-16

[4]
FasTag: Automatic text classification of unstructured medical narratives.

PLoS One. 2020-6-22

[5]
CogStack - experiences of deploying integrated information retrieval and extraction services in a large National Health Service Foundation Trust hospital.

BMC Med Inform Decis Mak. 2018-6-25

[6]
Adapting Word Embeddings from Multiple Domains to Symptom Recognition from Psychiatric Notes.

AMIA Jt Summits Transl Sci Proc. 2018-5-18

[7]
Leveraging existing corpora for de-identification of psychiatric notes using domain adaptation.

AMIA Annu Symp Proc. 2018-4-16

[8]
A bibliometric analysis of natural language processing in medical research.

BMC Med Inform Decis Mak. 2018-3-22

[9]
Ranking Medical Terms to Support Expansion of Lay Language Resources for Patient Comprehension of Electronic Health Record Notes: Adapted Distant Supervision Approach.

JMIR Med Inform. 2017-10-31

[10]
A hybrid approach to automatic de-identification of psychiatric notes.

J Biomed Inform. 2017-6-7

本文引用的文献

[1]
Statistical parsing of varieties of clinical Finnish.

Artif Intell Med. 2014-7

[2]
Predicate argument structure frames for modeling information in operative notes.

Stud Health Technol Inform. 2013

[3]
Improving performance of natural language processing part-of-speech tagging on clinical narratives through domain adaptation.

J Am Med Inform Assoc. 2013-3-13

[4]
Towards comprehensive syntactic and semantic annotations of the clinical narrative.

J Am Med Inform Assoc. 2013-1-25

[5]
A study of actions in operative notes.

AMIA Annu Symp Proc. 2012

[6]
SemMedDB: a PubMed-scale repository of biomedical semantic predications.

Bioinformatics. 2012-10-8

[7]
Domain adaptation for semantic role labeling in the biomedical domain.

Bioinformatics. 2010-2-23

[8]
UMLS content views appropriate for NLP processing of the biomedical literature vs. clinical text.

J Biomed Inform. 2010-2-10

[9]
MedEx: a medication information extraction system for clinical narratives.

J Am Med Inform Assoc. 2010

[10]
Large scale application of neural network based semantic role labeling for automated relation extraction from biomedical texts.

PLoS One. 2009-7-28

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索