Statistical algorithms for ontology-based annotation of scientific literature.

Authors

Chakrabarti Chayan, Jones Thomas B, Luger George F, Xu Jiawei F, Turner Matthew D, Laird Angela R, Turner Jessica A

Affiliations

Department of Computer Science, University of New Mexico, Albuquerque, New Mexico, USA.

Department of Computer Science, University of New Mexico, Albuquerque, New Mexico, USA; Mind Research Network, Albuquerque, New Mexico, USA.

Publication information

J Biomed Semantics. 2014 Jun 3;5(Suppl 1 Proceedings of the Bio-Ontologies Spec Interest G):S2. doi: 10.1186/2041-1480-5-S1-S2. eCollection 2014.

DOI:10.1186/2041-1480-5-S1-S2
PMID:25093071
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4108869/
Abstract

BACKGROUND

Ontologies encode relationships within a domain in robust data structures that can be used to annotate data objects, including scientific papers, in ways that ease tasks such as search and meta-analysis. However, the annotation process requires significant time and effort when performed by humans. Text mining algorithms can facilitate this process, but they render an analysis mainly based upon keyword, synonym and semantic matching. They do not leverage information embedded in an ontology's structure.

METHODS

We present a probabilistic framework that facilitates the automatic annotation of literature by indirectly modeling the restrictions among the different classes in the ontology. Our research focuses on annotating human functional neuroimaging literature within the Cognitive Paradigm Ontology (CogPO). We use an approach that combines the stochastic simplicity of naïve Bayes with the formal transparency of decision trees. Our data structure is easily modifiable to reflect changing domain knowledge.
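The abstract does not specify how the classifiers are built, so the following is only a minimal sketch of the general idea: a multinomial naïve Bayes text classifier whose candidate labels can be narrowed by an externally supplied restriction. The toy training sentences, the label names `visual` and `auditory`, and the `allowed` parameter are all illustrative assumptions, not CogPO terms or the authors' implementation.

```python
import math
from collections import Counter, defaultdict

def train_nb(docs, labels):
    """Collect per-label word counts for a multinomial naive Bayes model."""
    word_counts = defaultdict(Counter)
    label_counts = Counter(labels)
    vocab = set()
    for text, label in zip(docs, labels):
        tokens = text.lower().split()
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return word_counts, label_counts, vocab

def log_posteriors(model, text):
    """Return the unnormalized log posterior of each label (add-one smoothing)."""
    word_counts, label_counts, vocab = model
    total_docs = sum(label_counts.values())
    scores = {}
    for label, n_docs in label_counts.items():
        total_words = sum(word_counts[label].values())
        score = math.log(n_docs / total_docs)
        for tok in text.lower().split():
            if tok in vocab:  # ignore words never seen in training
                score += math.log(
                    (word_counts[label][tok] + 1) / (total_words + len(vocab))
                )
        scores[label] = score
    return scores

def annotate(model, text, allowed=None):
    """Pick the best label; `allowed` (hypothetical) mimics an ontology
    restriction by removing labels that are not permitted in this context."""
    scores = log_posteriors(model, text)
    if allowed is not None:
        scores = {label: s for label, s in scores.items() if label in allowed}
    return max(scores, key=scores.get)

# Toy data, invented for illustration only.
docs = [
    "subjects viewed pictures of faces",
    "participants listened to spoken words",
    "subjects viewed flashing checkerboard",
    "participants listened to tones",
]
labels = ["visual", "auditory", "visual", "auditory"]
model = train_nb(docs, labels)

unrestricted = annotate(model, "subjects viewed words")
restricted = annotate(model, "subjects viewed words", allowed={"auditory"})
```

Filtering the label set before taking the maximum is one simple way to let structural knowledge override a purely text-based score; the paper's decision-tree variants may encode such dependencies quite differently.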

RESULTS

We compare our results across naïve Bayes, Bayesian Decision Tree, and Constrained Decision Tree classifiers that keep a human expert in the loop, using the F1-micro score as the quality measure.
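For readers unfamiliar with the metric, the micro-averaged F1 score pools true positives, false positives, and false negatives over all documents and labels before computing precision and recall. The sketch below assumes each document's annotation is a set of labels; the function name `micro_f1` and the example label sets are illustrative and not taken from the paper.

```python
def micro_f1(gold, predicted):
    """Micro-averaged F1 over parallel lists of label sets."""
    tp = fp = fn = 0
    for g, p in zip(gold, predicted):
        g, p = set(g), set(p)
        tp += len(g & p)   # labels correctly predicted
        fp += len(p - g)   # labels predicted but not in the gold standard
        fn += len(g - p)   # gold labels that were missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Invented example: two documents with expert (gold) and predicted labels.
gold = [{"visual", "faces"}, {"auditory"}]
predicted = [{"visual"}, {"auditory", "tones"}]
score = micro_f1(gold, predicted)  # tp=2, fp=1, fn=1, so P = R = F1 = 2/3
```

Because counts are pooled, frequent labels weigh more heavily in this score than rare ones, which is the usual reason to report it alongside or instead of a macro average.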

CONCLUSIONS

Unlike traditional text mining algorithms, our framework can model the knowledge encoded by the dependencies in an ontology, albeit indirectly. We successfully exploit the fact that CogPO has explicitly stated restrictions, and implicit dependencies in the form of patterns in the expert curated annotations.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ddf/4108869/a8bd2040b717/2041-1480-5-S1-S2-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ddf/4108869/7f48c5946fdb/2041-1480-5-S1-S2-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ddf/4108869/fb2ff2982096/2041-1480-5-S1-S2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ddf/4108869/73a1476da44e/2041-1480-5-S1-S2-3.jpg

Similar articles

1
Statistical algorithms for ontology-based annotation of scientific literature.
J Biomed Semantics. 2014 Jun 3;5(Suppl 1 Proceedings of the Bio-Ontologies Spec Interest G):S2. doi: 10.1186/2041-1480-5-S1-S2. eCollection 2014.
2
Automated ontology generation framework powered by linked biomedical ontologies for disease-drug domain.
Comput Methods Programs Biomed. 2018 Oct;165:117-128. doi: 10.1016/j.cmpb.2018.08.010. Epub 2018 Aug 16.
3
Multi-label literature classification based on the Gene Ontology graph.
BMC Bioinformatics. 2008 Dec 8;9:525. doi: 10.1186/1471-2105-9-525.
4
Automated annotation of functional imaging experiments via multi-label classification.
Front Neurosci. 2013 Dec 16;7:240. doi: 10.3389/fnins.2013.00240. eCollection 2013.
5
A relation based measure of semantic similarity for Gene Ontology annotations.
BMC Bioinformatics. 2008 Nov 4;9:468. doi: 10.1186/1471-2105-9-468.
6
How to link ontologies and protein-protein interactions to literature: text-mining approaches and the BioCreative experience.
Database (Oxford). 2012 Mar 21;2012:bas017. doi: 10.1093/database/bas017. Print 2012.
7
Textpresso: an ontology-based information retrieval and extraction system for biological literature.
PLoS Biol. 2004 Nov;2(11):e309. doi: 10.1371/journal.pbio.0020309. Epub 2004 Sep 21.
8
SIFR annotator: ontology-based semantic annotation of French biomedical text and clinical notes.
BMC Bioinformatics. 2018 Nov 6;19(1):405. doi: 10.1186/s12859-018-2429-2.
9
Data mart construction based on semantic annotation of scientific articles: A case study for the prioritization of drug targets.
Comput Methods Programs Biomed. 2018 Apr;157:225-235. doi: 10.1016/j.cmpb.2018.01.010. Epub 2018 Jan 12.
10
A multi-ontology approach to annotate scientific documents based on a modularization technique.
J Biomed Inform. 2015 Dec;58:208-219. doi: 10.1016/j.jbi.2015.09.022. Epub 2015 Oct 20.

Cited by

1
NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data.
Front Neuroinform. 2023 Aug 31;17:1215261. doi: 10.3389/fninf.2023.1215261. eCollection 2023.
2
Automated, Efficient, and Accelerated Knowledge Modeling of the Cognitive Neuroimaging Literature Using the ATHENA Toolkit.
Front Neurosci. 2019 May 15;13:494. doi: 10.3389/fnins.2019.00494. eCollection 2019.
3
Knowledge Representation and Management. From Ontology to Annotation. Findings from the Yearbook 2015 Section on Knowledge Representation and Management.
Yearb Med Inform. 2015 Aug 13;10(1):134-6. doi: 10.15265/IY-2015-038.
