• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SIENA:使用概念识别对数据集进行半自动语义增强。

SIENA: Semi-automatic semantic enhancement of datasets using concept recognition.

机构信息

Institute of Data Science, Maastricht University, Universiteitsingel 60, Maastricht, 6229 ER, Netherlands.

Department of Data Science and Knowledge Engineering, Maastricht University, Paul-Henri Spaaklaan 1, Maastricht, 6229 EN, Netherlands.

出版信息

J Biomed Semantics. 2021 Mar 24;12(1):5. doi: 10.1186/s13326-021-00239-z.

DOI:10.1186/s13326-021-00239-z
PMID:33761996
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7992819/
Abstract

BACKGROUND

The amount of available data, which can facilitate answering scientific research questions, is growing. However, the different formats of published data are expanding as well, creating a serious challenge when multiple datasets need to be integrated for answering a question.

RESULTS

This paper presents a semi-automated framework that provides semantic enhancement of biomedical data, specifically gene datasets. The framework involved a concept recognition task using machine learning, in combination with the BioPortal annotator. Compared to using methods which require only the BioPortal annotator for semantic enhancement, the proposed framework achieves the highest results.

CONCLUSIONS

Using concept recognition combined with machine learning techniques and annotation with a biomedical ontology, the proposed framework can provide datasets to reach their full potential of providing meaningful information, which can answer scientific research questions.

摘要

背景

可用于回答科学研究问题的可用数据量正在增加。然而,随着已发表数据格式的不断增加,当需要整合多个数据集来回答一个问题时,这就带来了严重的挑战。

结果

本文提出了一个半自动框架,为生物医学数据,特别是基因数据集提供语义增强。该框架涉及使用机器学习的概念识别任务,结合了 BioPortal 标注器。与仅使用 BioPortal 标注器进行语义增强的方法相比,所提出的框架实现了最高的效果。

结论

使用概念识别结合机器学习技术,并结合生物医学本体进行注释,所提出的框架可以提供数据集,以充分发挥其提供有意义信息的潜力,从而回答科学研究问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/83b7/7992819/89e5bdf93318/13326_2021_239_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/83b7/7992819/5f90400ef18b/13326_2021_239_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/83b7/7992819/7f2bd738e019/13326_2021_239_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/83b7/7992819/c6a9d75b148a/13326_2021_239_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/83b7/7992819/89e5bdf93318/13326_2021_239_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/83b7/7992819/5f90400ef18b/13326_2021_239_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/83b7/7992819/7f2bd738e019/13326_2021_239_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/83b7/7992819/c6a9d75b148a/13326_2021_239_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/83b7/7992819/89e5bdf93318/13326_2021_239_Fig4_HTML.jpg

相似文献

1
SIENA: Semi-automatic semantic enhancement of datasets using concept recognition.SIENA:使用概念识别对数据集进行半自动语义增强。
J Biomed Semantics. 2021 Mar 24;12(1):5. doi: 10.1186/s13326-021-00239-z.
2
SIFR annotator: ontology-based semantic annotation of French biomedical text and clinical notes.SIFR 标注器:基于本体论的法语生物医学文本和临床笔记的语义标注。
BMC Bioinformatics. 2018 Nov 6;19(1):405. doi: 10.1186/s12859-018-2429-2.
3
Linked open data-based framework for automatic biomedical ontology generation.基于链接开放数据的自动生物医学本体生成框架。
BMC Bioinformatics. 2018 Sep 10;19(1):319. doi: 10.1186/s12859-018-2339-3.
4
A Machine Learning-based Method for Question Type Classification in Biomedical Question Answering.一种基于机器学习的生物医学问答中问题类型分类方法。
Methods Inf Med. 2017 May 18;56(3):209-216. doi: 10.3414/ME16-01-0116. Epub 2017 Mar 31.
5
Automated ontology generation framework powered by linked biomedical ontologies for disease-drug domain.基于链接生物医学本体的疾病-药物领域自动化本体生成框架。
Comput Methods Programs Biomed. 2018 Oct;165:117-128. doi: 10.1016/j.cmpb.2018.08.010. Epub 2018 Aug 16.
6
Large scale biomedical texts classification: a kNN and an ESA-based approaches.大规模生物医学文本分类:基于k近邻算法和基于词嵌入语义分析的方法。
J Biomed Semantics. 2016 Jun 16;7:40. doi: 10.1186/s13326-016-0073-1.
7
Generation of open biomedical datasets through ontology-driven transformation and integration processes.通过本体驱动的转换和集成过程生成开放生物医学数据集。
J Biomed Semantics. 2016 Jun 3;7:32. doi: 10.1186/s13326-016-0075-z.
8
Evolving knowledge graph similarity for supervised learning in complex biomedical domains.用于复杂生物医学领域中监督学习的进化知识图相似度。
BMC Bioinformatics. 2020 Jan 3;21(1):6. doi: 10.1186/s12859-019-3296-1.
9
Semantic similarity and machine learning with ontologies.语义相似性和本体论的机器学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa199.
10
Enhanced functionalities for annotating and indexing clinical text with the NCBO Annotator.利用 NCBO Annotator 增强临床文本的标注和索引功能。
Bioinformatics. 2018 Jun 1;34(11):1962-1965. doi: 10.1093/bioinformatics/bty009.

本文引用的文献

1
Aber-OWL: a framework for ontology-based data access in biology.Aber-OWL:一种用于生物学中基于本体的数据访问的框架。
BMC Bioinformatics. 2015 Jan 28;16:26. doi: 10.1186/s12859-015-0456-9.
2
Comparison of concept recognizers for building the Open Biomedical Annotator.比较概念识别器在构建开放生物医学标注器中的应用。
BMC Bioinformatics. 2009 Sep 17;10 Suppl 9(Suppl 9):S14. doi: 10.1186/1471-2105-10-S9-S14.
3
BioPortal: ontologies and integrated data resources at the click of a mouse.生物门户:一键点击即可获取本体和集成数据资源。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W170-3. doi: 10.1093/nar/gkp440. Epub 2009 May 29.
4
Recognition of protein/gene names from text using an ensemble of classifiers.使用分类器集成从文本中识别蛋白质/基因名称。
BMC Bioinformatics. 2005;6 Suppl 1(Suppl 1):S7. doi: 10.1186/1471-2105-6-S1-S7. Epub 2005 May 24.