Visual nutrition analysis: leveraging segmentation and regression for food nutrient estimation.

Authors

Zhao Yaping, Zhu Ping, Jiang Yizhang, Xia Kaijian

Affiliations

School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, Jiangsu, China.

Changshu Key Laboratory of Medical Artificial Intelligence and Big Data, Suzhou, Jiangsu, China.

Publication information

Front Nutr. 2024 Dec 17;11:1469878. doi: 10.3389/fnut.2024.1469878. eCollection 2024.

DOI:10.3389/fnut.2024.1469878

PMID:39742105

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11685081/

Abstract

INTRODUCTION

Nutrition is closely tied to health. A balanced diet not only meets the body's needs for various nutrients but also helps prevent many chronic diseases. However, because most people lack systematic nutritional knowledge, they often find it difficult to assess the nutritional content of food accurately. Image-based nutritional evaluation can help, so we aim to predict the nutritional content of dishes directly from images. Most related research estimates the volume or area of food through image segmentation and then calculates the nutritional content from the food category. However, this approach often lacks ground-truth nutritional content labels as a reference, which makes the accuracy of its predictions difficult to verify.

METHODS

To address this issue, we combined segmentation and regression tasks. The Nutrition5k dataset contains detailed nutritional content labels but no segmentation labels, so we annotated it manually with segmentation labels. Using these annotations, we developed a nutritional content prediction model that performs segmentation first and regression afterward. Specifically, we first applied a UNet model to segment the food, then used a backbone network to extract features and strengthened the feature representation with a Squeeze-and-Excitation structure. Finally, the extracted features were passed through several fully connected layers to predict weight, calories, fat, carbohydrates, and protein content.
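The segment-then-regress flow described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not say how the segmentation mask is combined with the image, which backbone is used, or how many fully connected layers follow, so the masking step, the random stand-in for backbone features, the single linear output layer, and all names (`apply_mask`, `squeeze_excite`, `regression_head`) are assumptions made for illustration. Only the Squeeze-and-Excitation step follows the standard formulation (global average pooling, a two-layer bottleneck, sigmoid gates that rescale each channel).

```python
import numpy as np

def apply_mask(image, mask):
    """Zero out non-food pixels. image: (H, W, 3); mask: (H, W) with values 0 or 1.
    Assumption: the abstract does not state how the mask is used downstream."""
    return image * mask[..., None]

def squeeze_excite(features, w1, w2):
    """Standard Squeeze-and-Excitation recalibration of a (C, H, W) feature map."""
    z = features.mean(axis=(1, 2))                  # squeeze: one value per channel, (C,)
    hidden = np.maximum(w1 @ z, 0.0)                # bottleneck FC + ReLU, (C // r,)
    gates = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))    # FC + sigmoid, gates in (0, 1), (C,)
    return features * gates[:, None, None]          # excite: rescale each channel

def regression_head(features, w_out, b_out):
    """Pool the feature map and apply one linear layer (a simplification of the
    'several fully connected layers' mentioned in the abstract)."""
    pooled = features.mean(axis=(1, 2))             # (C,)
    return w_out @ pooled + b_out                   # (5,)

rng = np.random.default_rng(0)
C, H, W, r = 8, 4, 4, 2                             # toy sizes; r is the SE reduction ratio

image = rng.random((H, W, 3))
mask = np.zeros((H, W))
mask[1:3, 1:3] = 1.0                                # hypothetical food region (UNet output stand-in)
masked = apply_mask(image, mask)

features = rng.random((C, H, W))                    # stand-in for backbone(masked)
w1 = rng.standard_normal((C // r, C))
w2 = rng.standard_normal((C, C // r))
recalibrated = squeeze_excite(features, w1, w2)

targets = ["weight", "calories", "fat", "carbohydrates", "protein"]
outputs = regression_head(recalibrated, rng.standard_normal((len(targets), C)), np.zeros(len(targets)))
print(dict(zip(targets, outputs.round(3))))
```

With untrained random weights the printed numbers are meaningless; the sketch only shows the shapes and the order of operations.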

RESULTS AND DISCUSSION

Our model achieved an average percentage mean absolute error (PMAE) of 17.06% across these five components. All manually annotated segmentation labels are available at https://doi.org/10.6084/m9.figshare.26252048.v1.
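The abstract does not define PMAE. A common convention for Nutrition5k results, assumed here, is the mean absolute error of a component divided by the mean ground-truth value of that component, expressed as a percentage and then averaged over components. The sketch below follows that assumption; the function name `pmae` and the toy numbers are hypothetical and are not taken from the paper.

```python
def pmae(y_true, y_pred):
    """Percentage MAE: mean absolute error as a percentage of the mean true value."""
    n = len(y_true)
    mae = sum(abs(t - p) for t, p in zip(y_true, y_pred)) / n
    mean_true = sum(y_true) / n
    return 100.0 * mae / mean_true

# Hypothetical toy values for two of the five components (three dishes each).
truth = {"calories": [200.0, 400.0, 600.0], "protein": [10.0, 20.0, 30.0]}
preds = {"calories": [220.0, 380.0, 660.0], "protein": [12.0, 18.0, 33.0]}

per_component = {name: pmae(truth[name], preds[name]) for name in truth}
average_pmae = sum(per_component.values()) / len(per_component)

print(per_component)
print(f"average PMAE: {average_pmae:.2f}%")
```

Normalizing by the mean true value puts components with different units and scales (grams versus kilocalories) on a comparable footing before averaging.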
