• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于食谱生成和食物检索的结构表示学习

Learning Structural Representations for Recipe Generation and Food Retrieval.

作者信息

Wang Hao, Lin Guosheng, Hoi Steven C H, Miao Chunyan

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3363-3377. doi: 10.1109/TPAMI.2022.3181294. Epub 2023 Feb 3.

DOI:10.1109/TPAMI.2022.3181294
PMID:35687622
Abstract

Food is significant to human daily life. In this paper, we are interested in learning structural representations for lengthy recipes, that can benefit the recipe generation and food cross-modal retrieval tasks. Different from the common vision-language data, here the food images contain mixed ingredients and target recipes are lengthy paragraphs, where we do not have annotations on structure information. To address the above limitations, we propose a novel method to unsupervisedly learn the sentence-level tree structures for the cooking recipes. Our approach brings together several novel ideas in a systematic framework: (1) exploiting an unsupervised learning approach to obtain the sentence-level tree structure labels before training; (2) generating trees of target recipes from images with the supervision of tree structure labels learned from (1); and (3) integrating the learned tree structures into the recipe generation and food cross-modal retrieval procedure. Our proposed model can produce good-quality sentence-level tree structures and coherent recipes. We achieve the state-of-the-art recipe generation and food cross-modal retrieval performance on the benchmark Recipe1M dataset.

摘要

食物对人类日常生活至关重要。在本文中,我们感兴趣的是学习冗长食谱的结构表示,这可以有益于食谱生成和食物跨模态检索任务。与常见的视觉语言数据不同,这里的食物图像包含混合食材,且目标食谱是冗长的段落,我们没有关于结构信息的注释。为解决上述限制,我们提出一种新颖的方法来无监督地学习烹饪食谱的句子级树结构。我们的方法在一个系统框架中融合了几个新颖的想法:(1)利用无监督学习方法在训练前获得句子级树结构标签;(2)在从(1)中学习到的树结构标签的监督下,从图像生成目标食谱的树;(3)将学习到的树结构整合到食谱生成和食物跨模态检索过程中。我们提出的模型可以生成高质量的句子级树结构和连贯的食谱。我们在基准Recipe1M数据集上实现了当前最优的食谱生成和食物跨模态检索性能。

相似文献

1
Learning Structural Representations for Recipe Generation and Food Retrieval.用于食谱生成和食物检索的结构表示学习
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3363-3377. doi: 10.1109/TPAMI.2022.3181294. Epub 2023 Feb 3.
2
Ki-Cook: clustering multimodal cooking representations through knowledge-infused learning.Ki-Cook:通过知识注入学习对多模态烹饪表示进行聚类
Front Big Data. 2023 Jul 24;6:1200840. doi: 10.3389/fdata.2023.1200840. eCollection 2023.
3
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images.食谱1M+:用于学习烹饪食谱和食物图像跨模态嵌入的数据集。
IEEE Trans Pattern Anal Mach Intell. 2019 Jul 9. doi: 10.1109/TPAMI.2019.2927476.
4
Disambiguity and Alignment: An Effective Multi-Modal Alignment Method for Cross-Modal Recipe Retrieval.消除歧义与对齐:一种用于跨模态食谱检索的有效多模态对齐方法。
Foods. 2024 May 23;13(11):1628. doi: 10.3390/foods13111628.
5
Inclusion of Food Safety Information in Home-delivered U.K. Meal-kit Recipes.将食品安全信息纳入英国家庭送餐食谱。
J Food Prot. 2023 Nov;86(11):100162. doi: 10.1016/j.jfp.2023.100162. Epub 2023 Sep 14.
6
Large Scale Visual Food Recognition.大规模视觉食物识别。
IEEE Trans Pattern Anal Mach Intell. 2023 Aug;45(8):9932-9949. doi: 10.1109/TPAMI.2023.3237871. Epub 2023 Jun 30.
7
Cooking "shrimp à la créole": a pilot study of an ecological rehabilitation in semantic dementia.烹饪“克里奥尔虾”:语义性痴呆的生态康复试点研究。
Neuropsychol Rehabil. 2011 Aug;21(4):455-83. doi: 10.1080/09602011.2011.580614. Epub 2011 Jun 30.
8
Cooking Up a Transparent Model Following a DICE Recipe.按照 DICE 食谱制作透明模型。
Pharmacoeconomics. 2019 Nov;37(11):1341-1347. doi: 10.1007/s40273-019-00840-2.
9
Cross-Domain Image Captioning via Cross-Modal Retrieval and Model Adaptation.通过跨模态检索和模型适配实现跨域图像字幕生成
IEEE Trans Image Process. 2021;30:1180-1192. doi: 10.1109/TIP.2020.3042086. Epub 2020 Dec 17.
10
How Low-Income Mothers Select and Adapt Recipes and Implications for Promoting Healthy Recipes Online.低收入母亲如何选择和调整食谱,以及对在线推广健康食谱的启示。
Nutrients. 2019 Feb 5;11(2):339. doi: 10.3390/nu11020339.

引用本文的文献

1
Adaptafood: an intelligent system to adapt recipes to specialised diets and healthy lifestyles.适应饮食:一种将食谱适配特殊饮食和健康生活方式的智能系统。
Multimed Syst. 2025;31(1):87. doi: 10.1007/s00530-025-01667-y. Epub 2025 Feb 1.
2
Disambiguity and Alignment: An Effective Multi-Modal Alignment Method for Cross-Modal Recipe Retrieval.消除歧义与对齐:一种用于跨模态食谱检索的有效多模态对齐方法。
Foods. 2024 May 23;13(11):1628. doi: 10.3390/foods13111628.