• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
An Ontology-Enabled Natural Language Processing Pipeline for Provenance Metadata Extraction from Biomedical Text (Short Paper).一种用于从生物医学文本中提取溯源元数据的启用本体的自然语言处理管道(短文)。
On Move Meaningful Internet Syst. 2016 Oct;10033:699-708. doi: 10.1007/978-3-319-48472-3_43. Epub 2016 Oct 18.
2
ProvCaRe: Characterizing scientific reproducibility of biomedical research studies using semantic provenance metadata.ProvCaRe:使用语义来源元数据刻画生物医学研究的科学可重复性。
Int J Med Inform. 2019 Jan;121:10-18. doi: 10.1016/j.ijmedinf.2018.10.009. Epub 2018 Nov 3.
3
ProvCaRe Semantic Provenance Knowledgebase: Evaluating Scientific Reproducibility of Research Studies.ProvCaRe语义溯源知识库:评估研究的科学可重复性。
AMIA Annu Symp Proc. 2018 Apr 16;2017:1705-1714. eCollection 2017.
4
Scientific Reproducibility in Biomedical Research: Provenance Metadata Ontology for Semantic Annotation of Study Description.生物医学研究中的科学可重复性:用于研究描述语义注释的来源元数据本体论
AMIA Annu Symp Proc. 2017 Feb 10;2016:1070-1079. eCollection 2016.
5
Semantic Provenance Graph for Reproducibility of Biomedical Research Studies: Generating and Analyzing Graph Structures from Published Literature.用于生物医学研究可重复性的语义溯源图:从已发表文献中生成和分析图结构。
Stud Health Technol Inform. 2019 Aug 21;264:328-332. doi: 10.3233/SHTI190237.
6
Natural language processing (NLP) tools in extracting biomedical concepts from research articles: a case study on autism spectrum disorder.自然语言处理(NLP)工具在从研究文章中提取生物医学概念中的应用:以自闭症谱系障碍为例。
BMC Med Inform Decis Mak. 2020 Dec 30;20(Suppl 11):322. doi: 10.1186/s12911-020-01352-2.
7
Provenance Information for Biomedical Data and Workflows: Scoping Review.生物医学数据和工作流程的出处信息:范围综述。
J Med Internet Res. 2024 Aug 23;26:e51297. doi: 10.2196/51297.
8
Semantic biomedical resource discovery: a Natural Language Processing framework.语义生物医学资源发现:一种自然语言处理框架。
BMC Med Inform Decis Mak. 2015 Sep 30;15:77. doi: 10.1186/s12911-015-0200-4.
9
Approaches and Criteria for Provenance in Biomedical Data Sets and Workflows: Protocol for a Scoping Review.生物医学数据集和工作流程中出处的方法与标准:范围综述方案
JMIR Res Protoc. 2021 Nov 22;10(11):e31750. doi: 10.2196/31750.
10
Extracting Medical Information From Free-Text and Unstructured Patient-Generated Health Data Using Natural Language Processing Methods: Feasibility Study With Real-world Data.使用自然语言处理方法从自由文本和非结构化患者生成的健康数据中提取医学信息:基于真实世界数据的可行性研究
JMIR Form Res. 2023 Mar 7;7:e43014. doi: 10.2196/43014.

引用本文的文献

1
ProvCaRe: Characterizing scientific reproducibility of biomedical research studies using semantic provenance metadata.ProvCaRe:使用语义来源元数据刻画生物医学研究的科学可重复性。
Int J Med Inform. 2019 Jan;121:10-18. doi: 10.1016/j.ijmedinf.2018.10.009. Epub 2018 Nov 3.

本文引用的文献

1
Scientific Reproducibility in Biomedical Research: Provenance Metadata Ontology for Semantic Annotation of Study Description.生物医学研究中的科学可重复性:用于研究描述语义注释的来源元数据本体论
AMIA Annu Symp Proc. 2017 Feb 10;2016:1070-1079. eCollection 2016.
2
Scaling Up Scientific Discovery in Sleep Medicine: The National Sleep Research Resource.扩大睡眠医学领域的科学发现:国家睡眠研究资源
Sleep. 2016 May 1;39(5):1151-64. doi: 10.5665/sleep.5774.
3
CPAP versus oxygen in obstructive sleep apnea.CPAP 与氧疗治疗阻塞性睡眠呼吸暂停的比较。
N Engl J Med. 2014 Jun 12;370(24):2276-85. doi: 10.1056/NEJMoa1306766.
4
Policy: NIH plans to enhance reproducibility.政策:NIH 计划提高可重复性。
Nature. 2014 Jan 30;505(7485):612-3. doi: 10.1038/505612a.
5
A call for transparent reporting to optimize the predictive value of preclinical research.呼吁透明报告,以优化临床前研究的预测价值。
Nature. 2012 Oct 11;490(7419):187-91. doi: 10.1038/nature11556.
6
The open biomedical annotator.开放式生物医学注释工具
Summit Transl Bioinform. 2009 Mar 1;2009:56-60.
7
Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.梅奥临床文本分析和知识提取系统(cTAKES):架构、组件评估和应用。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13. doi: 10.1136/jamia.2009.001560.
8
caTIES: a grid based system for coding and retrieval of surgical pathology reports and tissue specimens in support of translational research.caTIES:一个基于网格的系统,用于编码和检索外科病理学报告和组织标本,以支持转化研究。
J Am Med Inform Assoc. 2010 May-Jun;17(3):253-64. doi: 10.1136/jamia.2009.002295.
9
An overview of MetaMap: historical perspective and recent advances.MetaMap 概述:历史视角与最新进展。
J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36. doi: 10.1136/jamia.2009.002733.
10
Extracting information from textual documents in the electronic health record: a review of recent research.从电子健康记录中的文本文件提取信息:近期研究综述
Yearb Med Inform. 2008:128-44.

一种用于从生物医学文本中提取溯源元数据的启用本体的自然语言处理管道(短文)。

An Ontology-Enabled Natural Language Processing Pipeline for Provenance Metadata Extraction from Biomedical Text (Short Paper).

作者信息

Valdez Joshua, Rueschman Michael, Kim Matthew, Redline Susan, Sahoo Satya S

机构信息

Division of Medical Informatics and Electrical Engineering and Computer Science Department, Case Western Reserve University, Cleveland, OH, USA.

Departments of Medicine, Brigham and Women's Hospital and Beth Israel Deaconess Medical Center, Harvard University, Boston, MA, USA.

出版信息

On Move Meaningful Internet Syst. 2016 Oct;10033:699-708. doi: 10.1007/978-3-319-48472-3_43. Epub 2016 Oct 18.

DOI:10.1007/978-3-319-48472-3_43
PMID:28664200
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5486409/
Abstract

Extraction of structured information from biomedical literature is a complex and challenging problem due to the complexity of biomedical domain and lack of appropriate natural language processing (NLP) techniques. High quality domain ontologies model both data and metadata information at a fine level of granularity, which can be effectively used to accurately extract structured information from biomedical text. Extraction of provenance metadata, which describes the history or source of information, from published articles is an important task to support scientific reproducibility. Reproducibility of results reported by previous research studies is a foundational component of scientific advancement. This is highlighted by the recent initiative by the US National Institutes of Health called "Principles of Rigor and Reproducibility". In this paper, we describe an effective approach to extract provenance metadata from published biomedical research literature using an ontology-enabled NLP platform as part of the Provenance for Clinical and Healthcare Research (ProvCaRe). The ProvCaRe-NLP tool extends the clinical Text Analysis and Knowledge Extraction System (cTAKES) platform using both provenance and biomedical domain ontologies. We demonstrate the effectiveness of ProvCaRe-NLP tool using a corpus of 20 peer-reviewed publications. The results of our evaluation demonstrate that the ProvCaRe-NLP tool has significantly higher recall in extracting provenance metadata as compared to existing NLP pipelines such as MetaMap.

摘要

由于生物医学领域的复杂性以及缺乏合适的自然语言处理(NLP)技术,从生物医学文献中提取结构化信息是一个复杂且具有挑战性的问题。高质量的领域本体以精细的粒度对数据和元数据信息进行建模,可有效用于从生物医学文本中准确提取结构化信息。从已发表文章中提取描述信息历史或来源的出处元数据,是支持科学可重复性的一项重要任务。先前研究报告结果的可重复性是科学进步的一个基础组成部分。美国国立卫生研究院最近发起的“严谨性和可重复性原则”倡议就突出了这一点。在本文中,我们描述了一种有效的方法,即使用一个启用本体的NLP平台,作为临床和医疗保健研究出处(ProvCaRe)的一部分,从已发表的生物医学研究文献中提取出处元数据。ProvCaRe-NLP工具使用出处和生物医学领域本体扩展了临床文本分析和知识提取系统(cTAKES)平台。我们使用一个包含20篇同行评审出版物的语料库来证明ProvCaRe-NLP工具的有效性。我们的评估结果表明,与现有NLP管道(如MetaMap)相比,ProvCaRe-NLP工具在提取出处元数据方面具有显著更高的召回率。