• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用词嵌入来学习更好的食品本体。

Using Word Embeddings to Learn a Better Food Ontology.

作者信息

Youn Jason, Naravane Tarini, Tagkopoulos Ilias

机构信息

Department of Computer Science, University of California at Davis, Davis, CA, United States.

Genome Center, University of California at Davis, Davis, CA, United States.

出版信息

Front Artif Intell. 2020 Nov 26;3:584784. doi: 10.3389/frai.2020.584784. eCollection 2020.

DOI:10.3389/frai.2020.584784
PMID:33733222
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7861243/
Abstract

Food ontologies require significant effort to create and maintain as they involve manual and time-consuming tasks, often with limited alignment to the underlying food science knowledge. We propose a semi-supervised framework for the automated ontology population from an existing ontology scaffold by using word embeddings. Having applied this on the domain of food and subsequent evaluation against an expert-curated ontology, FoodOn, we observe that the food word embeddings capture the latent relationships and characteristics of foods. The resulting ontology, which utilizes word embeddings trained from the Wikipedia corpus, has an improvement of 89.7% in precision when compared to the expert-curated ontology FoodOn (0.34 vs. 0.18, respectively, value = 2.6 × 10), and it has a 43.6% shorter path distance (hops) between predicted and actual food instances (2.91 vs. 5.16, respectively, value = 4.7 × 10) when compared to other methods. This work demonstrates how high-dimensional representations of food can be used to populate ontologies and paves the way for learning ontologies that integrate contextual information from a variety of sources and types.

摘要

食品本体需要付出巨大努力来创建和维护,因为它们涉及人工且耗时的任务,并且通常与基础食品科学知识的一致性有限。我们提出了一个半监督框架,用于通过使用词嵌入从现有的本体框架自动填充本体。在食品领域应用此方法并针对专家策划的本体FoodOn进行后续评估后,我们观察到食品词嵌入捕捉到了食品的潜在关系和特征。与专家策划的本体FoodOn相比,利用从维基百科语料库训练的词嵌入生成的本体在精度上提高了89.7%(分别为0.34对0.18, 值 = 2.6 × 10),并且与其他方法相比,在预测食品实例和实际食品实例之间的路径距离(跳数)缩短了43.6%(分别为2.91对5.16, 值 = 4.7 × 10)。这项工作展示了如何使用食品的高维表示来填充本体,并为学习整合来自各种来源和类型的上下文信息的本体铺平了道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/decd/7861243/63cbb1b0c129/frai-03-584784-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/decd/7861243/d29e12b1d244/frai-03-584784-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/decd/7861243/4d3e290ed263/frai-03-584784-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/decd/7861243/dede0283e446/frai-03-584784-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/decd/7861243/63cbb1b0c129/frai-03-584784-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/decd/7861243/d29e12b1d244/frai-03-584784-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/decd/7861243/4d3e290ed263/frai-03-584784-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/decd/7861243/dede0283e446/frai-03-584784-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/decd/7861243/63cbb1b0c129/frai-03-584784-g004.jpg

相似文献

1
Using Word Embeddings to Learn a Better Food Ontology.使用词嵌入来学习更好的食品本体。
Front Artif Intell. 2020 Nov 26;3:584784. doi: 10.3389/frai.2020.584784. eCollection 2020.
2
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
3
HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology.HPO2Vec+:利用异构知识资源丰富人类表型本体的节点嵌入。
J Biomed Inform. 2019 Aug;96:103246. doi: 10.1016/j.jbi.2019.103246. Epub 2019 Jun 27.
4
Combining lexical and context features for automatic ontology extension.基于词汇和上下文特征的本体自动扩展。
J Biomed Semantics. 2020 Jan 13;11(1):1. doi: 10.1186/s13326-019-0218-0.
5
Multi-domain knowledge graph embeddings for gene-disease association prediction.多领域知识图谱嵌入在基因-疾病关联预测中的应用。
J Biomed Semantics. 2023 Aug 14;14(1):11. doi: 10.1186/s13326-023-00291-x.
6
Biomedical ontology alignment: an approach based on representation learning.生物医学本体对齐:一种基于表征学习的方法。
J Biomed Semantics. 2018 Aug 15;9(1):21. doi: 10.1186/s13326-018-0187-8.
7
COS: A new MeSH term embedding incorporating corpus, ontology, and semantic predications.COS:一种新的包含语料库、本体和语义谓词的 MeSH 术语嵌入方法。
PLoS One. 2021 May 4;16(5):e0251094. doi: 10.1371/journal.pone.0251094. eCollection 2021.
8
Bootstrapping Adversarial Learning of Biomedical Ontology Alignments.生物医学本体对齐的自训练对抗学习
AMIA Annu Symp Proc. 2020 Mar 4;2019:627-636. eCollection 2019.
9
Linking entities through an ontology using word embeddings and syntactic re-ranking.通过使用词向量和句法重新排序将实体链接到本体中。
BMC Bioinformatics. 2019 Mar 27;20(1):156. doi: 10.1186/s12859-019-2678-8.
10
Contextual Word Embeddings and Topic Modeling in Healthy Dieting and Obesity.健康饮食与肥胖中的上下文词嵌入和主题建模
J Healthc Inform Res. 2019 Jun 10;3(2):159-183. doi: 10.1007/s41666-019-00052-5. eCollection 2019 Jun.

引用本文的文献

1
A systematic review of data and models for predicting food flavor and texture.预测食物风味和质地的数据与模型的系统综述。
Curr Res Food Sci. 2025 Jun 26;11:101127. doi: 10.1016/j.crfs.2025.101127. eCollection 2025.
2
Volatile Organic Compound-Based Predictive Modeling of Smoke Taint in Wine.基于挥发性有机化合物的葡萄酒杂醇气味预测模型。
J Agric Food Chem. 2024 Apr 10;72(14):8060-8071. doi: 10.1021/acs.jafc.3c07019. Epub 2024 Mar 27.
3
Empowering health geography research with location-based social media data: innovative food word expansion and energy density prediction via word embedding and machine learning.

本文引用的文献

1
FoodOn: a harmonized food ontology to increase global food traceability, quality control and data integration.FoodOn:一个用于提高全球食品可追溯性、质量控制和数据整合的统一食品本体。
NPJ Sci Food. 2018 Dec 18;2:23. doi: 10.1038/s41538-018-0032-6. eCollection 2018.
2
ISO-FOOD ontology: A formal representation of the knowledge within the domain of isotopes for food science.ISO-FOOD 本体:食品科学领域同位素知识的形式化表示。
Food Chem. 2019 Mar 30;277:382-390. doi: 10.1016/j.foodchem.2018.10.118. Epub 2018 Oct 28.
3
Losses, inefficiencies and waste in the global food system.
利用基于位置的社交媒体数据增强健康地理学研究:通过词嵌入和机器学习进行创新的食品词汇扩展和能量密度预测。
Int J Health Geogr. 2023 Sep 16;22(1):22. doi: 10.1186/s12942-023-00344-5.
4
Opportunities and Challenges of Integrating Food Practice into Clinical Decision-Making.将食物实践融入临床决策中的机遇与挑战。
Appl Clin Inform. 2022 Jan;13(1):252-262. doi: 10.1055/s-0042-1743237. Epub 2022 Feb 23.
全球粮食系统中的损失、低效和浪费。
Agric Syst. 2017 May;153:190-200. doi: 10.1016/j.agsy.2017.01.014.
4
node2vec: Scalable Feature Learning for Networks.节点2向量:网络的可扩展特征学习
KDD. 2016 Aug;2016:855-864. doi: 10.1145/2939672.2939754.
5
The FAIR Guiding Principles for scientific data management and stewardship.科学数据管理和保存的 FAIR 指导原则。
Sci Data. 2016 Mar 15;3:160018. doi: 10.1038/sdata.2016.18.
6
BioPortal: enhanced functionality via new Web services from the National Center for Biomedical Ontology to access and use ontologies in software applications.生物信息学知识库:通过国家生物医学本体学研究中心提供的新 Web 服务增强功能,以便在软件应用程序中访问和使用本体。
Nucleic Acids Res. 2011 Jul;39(Web Server issue):W541-5. doi: 10.1093/nar/gkr469. Epub 2011 Jun 14.
7
Building the bridges to bioinformatics in nutrition research.搭建营养研究通向生物信息学的桥梁。
Am J Clin Nutr. 2007 Nov;86(5):1261-9. doi: 10.1093/ajcn/86.5.1261.