• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生物医学科学话语中因果关系的半监督学习

Semi-supervised learning of causal relations in biomedical scientific discourse.

作者信息

Mihăilă Claudiu, Ananiadou Sophia

出版信息

Biomed Eng Online. 2014;13 Suppl 2(Suppl 2):S1. doi: 10.1186/1475-925X-13-S2-S1. Epub 2014 Dec 11.

DOI:10.1186/1475-925X-13-S2-S1
PMID:25559746
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4304242/
Abstract

BACKGROUND

The increasing number of daily published articles in the biomedical domain has become too large for humans to handle on their own. As a result, bio-text mining technologies have been developed to improve their workload by automatically analysing the text and extracting important knowledge. Specific bio-entities, bio-events between these and facts can now be recognised with sufficient accuracy and are widely used by biomedical researchers. However, understanding how the extracted facts are connected in text is an extremely difficult task, which cannot be easily tackled by machinery.

RESULTS

In this article, we describe our method to recognise causal triggers and their arguments in biomedical scientific discourse. We introduce new features and show that a self-learning approach improves the performance obtained by supervised machine learners to 83.47% for causal triggers. Furthermore, the spans of causal arguments can be recognised to a slightly higher level that by using supervised or rule-based methods that have been employed before.

CONCLUSION

Exploiting the large amount of unlabelled data that is already available can help improve the performance of recognising causal discourse relations in the biomedical domain. This improvement will further benefit the development of multiple tasks, such as hypothesis generation for experimental laboratories, contradiction detection, and the creation of causal networks.

摘要

背景

生物医学领域每日发表的文章数量不断增加,已多得让人类自身难以处理。因此,已开发出生物文本挖掘技术,通过自动分析文本和提取重要知识来减轻工作量。现在,特定的生物实体、它们之间的生物事件以及事实能够以足够的准确性被识别出来,并被生物医学研究人员广泛使用。然而,理解提取的事实在文本中是如何关联的是一项极其困难的任务,机器难以轻易解决。

结果

在本文中,我们描述了在生物医学科学论述中识别因果触发因素及其论据的方法。我们引入了新的特征,并表明一种自学习方法将监督式机器学习器在因果触发因素方面的性能提高到了83.47%。此外,与之前使用的监督式或基于规则的方法相比,因果论据的跨度能够被识别到略高的水平。

结论

利用现有的大量未标记数据有助于提高生物医学领域中因果论述关系识别的性能。这一改进将进一步有益于多个任务的发展,如实验实验室的假设生成、矛盾检测以及因果网络的创建。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/d93d55e387a6/1475-925X-13-S2-S1-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/c0ccaee27a9a/1475-925X-13-S2-S1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/293845125d47/1475-925X-13-S2-S1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/733788ab1a7c/1475-925X-13-S2-S1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/63e81b0da167/1475-925X-13-S2-S1-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/0e2d57423168/1475-925X-13-S2-S1-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/ab87c8740ea7/1475-925X-13-S2-S1-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/baffecbd33e1/1475-925X-13-S2-S1-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/71bc58d5128d/1475-925X-13-S2-S1-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/d93d55e387a6/1475-925X-13-S2-S1-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/c0ccaee27a9a/1475-925X-13-S2-S1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/293845125d47/1475-925X-13-S2-S1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/733788ab1a7c/1475-925X-13-S2-S1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/63e81b0da167/1475-925X-13-S2-S1-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/0e2d57423168/1475-925X-13-S2-S1-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/ab87c8740ea7/1475-925X-13-S2-S1-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/baffecbd33e1/1475-925X-13-S2-S1-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/71bc58d5128d/1475-925X-13-S2-S1-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba06/4304242/d93d55e387a6/1475-925X-13-S2-S1-9.jpg

相似文献

1
Semi-supervised learning of causal relations in biomedical scientific discourse.生物医学科学话语中因果关系的半监督学习
Biomed Eng Online. 2014;13 Suppl 2(Suppl 2):S1. doi: 10.1186/1475-925X-13-S2-S1. Epub 2014 Dec 11.
2
Exploring hedge identification in biomedical literature.探索生物医学文献中的对冲识别。
J Biomed Inform. 2008 Aug;41(4):636-54. doi: 10.1016/j.jbi.2008.01.001. Epub 2008 Jan 11.
3
Exploring supervised and unsupervised methods to detect topics in biomedical text.探索用于检测生物医学文本主题的监督和无监督方法。
BMC Bioinformatics. 2006 Mar 16;7:140. doi: 10.1186/1471-2105-7-140.
4
Automatic discourse connective detection in biomedical text.生物医学文本中的自动语篇连接词检测。
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):800-8. doi: 10.1136/amiajnl-2011-000775. Epub 2012 Jun 28.
5
Exploiting the contextual cues for bio-entity name recognition in biomedical literature.利用上下文线索识别生物医学文献中的生物实体名称。
J Biomed Inform. 2008 Aug;41(4):580-7. doi: 10.1016/j.jbi.2008.01.002. Epub 2008 Jan 11.
6
Structured learning for spatial information extraction from biomedical text: bacteria biotopes.从生物医学文本中提取空间信息的结构化学习:细菌生物栖息地
BMC Bioinformatics. 2015 Apr 25;16:129. doi: 10.1186/s12859-015-0542-z.
7
POSBIOTM-NER: a trainable biomedical named-entity recognition system.POSBIOTM-NER:一个可训练的生物医学命名实体识别系统。
Bioinformatics. 2005 Jun 1;21(11):2794-6. doi: 10.1093/bioinformatics/bti414. Epub 2005 Apr 6.
8
Status of text-mining techniques applied to biomedical text.应用于生物医学文本的文本挖掘技术现状。
Drug Discov Today. 2006 Apr;11(7-8):315-25. doi: 10.1016/j.drudis.2006.02.011.
9
Using MEDLINE as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles.使用MEDLINE作为知识来源来消除全文生物医学期刊文章中缩写词和首字母缩略词的歧义。
J Biomed Inform. 2007 Apr;40(2):150-9. doi: 10.1016/j.jbi.2006.06.001. Epub 2006 Jun 7.
10
Text classification performance: is the sample size the only factor to be considered?文本分类性能:样本量是唯一需要考虑的因素吗?
Stud Health Technol Inform. 2013;192:1193.

引用本文的文献

1
Improving ESP Writing Class Learning Outcomes Among Medical University Undergraduates: How Do Emotions Impact?提高医科大学本科生专门用途英语写作课程的学习成果:情绪如何产生影响?
Front Psychol. 2022 Jun 20;13:909590. doi: 10.3389/fpsyg.2022.909590. eCollection 2022.
2
Identification of pharmacodynamic biomarker hypotheses through literature analysis with IBM Watson.通过 IBM Watson 进行文献分析识别药效生物标志物假说。
PLoS One. 2019 Apr 8;14(4):e0214619. doi: 10.1371/journal.pone.0214619. eCollection 2019.

本文引用的文献

1
Recognising discourse causality triggers in the biomedical domain.识别生物医学领域中的语篇因果关系触发因素。
J Bioinform Comput Biol. 2013 Dec;11(6):1343008. doi: 10.1142/S0219720013430087. Epub 2013 Dec 2.
2
Deploying and sharing U-Compare workflows as web services.将U-Compare工作流程作为网络服务进行部署和共享。
J Biomed Semantics. 2013 Feb 18;4(1):7. doi: 10.1186/2041-1480-4-7.
3
BioCause: Annotating and analysing causality in the biomedical domain.生物原因:生物医学领域的因果关系标注与分析。
BMC Bioinformatics. 2013 Jan 16;14:2. doi: 10.1186/1471-2105-14-2.
4
Automatic discourse connective detection in biomedical text.生物医学文本中的自动语篇连接词检测。
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):800-8. doi: 10.1136/amiajnl-2011-000775. Epub 2012 Jun 28.
5
Boosting automatic event extraction from the literature using domain adaptation and coreference resolution.利用领域自适应和共指解析技术提高文献中自动事件抽取的性能。
Bioinformatics. 2012 Jul 1;28(13):1759-65. doi: 10.1093/bioinformatics/bts237. Epub 2012 Apr 25.
6
Argo: an integrative, interactive, text mining-based workbench supporting curation.Argonaut:一个集成的、交互的、基于文本挖掘的工作平台,支持管理。
Database (Oxford). 2012 Mar 20;2012:bas010. doi: 10.1093/database/bas010. Print 2012.
7
Software for systems biology: from tools to integrated platforms.系统生物学软件:从工具到集成平台。
Nat Rev Genet. 2011 Nov 3;12(12):821-32. doi: 10.1038/nrg3096.
8
Enhancing biomedical text summarization using semantic relation extraction.利用语义关系抽取技术增强生物医学文本摘要
PLoS One. 2011;6(8):e23862. doi: 10.1371/journal.pone.0023862. Epub 2011 Aug 26.
9
A review of causal inference for biomedical informatics.生物医学信息学因果推断研究综述。
J Biomed Inform. 2011 Dec;44(6):1102-12. doi: 10.1016/j.jbi.2011.07.001. Epub 2011 Jul 14.
10
Exploring subdomain variation in biomedical language.探索生物医学语言中的子域变化。
BMC Bioinformatics. 2011 May 27;12:212. doi: 10.1186/1471-2105-12-212.