Lee Ki-Seung
Department of Electrical and Electronic Engineering, Konkuk University, 1 Hwayang-dong, Gwangjin-gu, Seoul 05029, Republic of Korea.
Foods. 2023 Aug 25;12(17):3212. doi: 10.3390/foods12173212.
Continuous monitoring and recording of the type and caloric content of ingested foods with a minimum of user intervention is very useful in preventing metabolic diseases and obesity. In this paper, automatic recognition of food type and caloric content was achieved via the use of multi-spectral images. A method of fusing the RGB image and the images captured at ultra violet, visible, and near-infrared regions at center wavelengths of 385, 405, 430, 470, 490, 510, 560, 590, 625, 645, 660, 810, 850, 870, 890, 910, 950, 970, and 1020 nm was adopted to improve the accuracy. A convolutional neural network (CNN) was adopted to classify food items and estimate the caloric amounts. The CNN was trained using 10,909 images acquired from 101 types. The objective functions including classification accuracy and mean absolute percentage error (MAPE) were investigated according to wavelength numbers. The optimal combinations of wavelengths (including/excluding the RGB image) were determined by using a piecewise selection method. Validation tests were carried out on 3636 images of the food types that were used in training the CNN. As a result of the experiments, the accuracy of food classification was increased from 88.9 to 97.1% and MAPEs were decreased from 41.97 to 18.97 even when one kind of NIR image was added to the RGB image. The highest accuracy for food type classification was 99.81% when using 19 images and the lowest MAPE for caloric content was 10.56 when using 14 images. These results demonstrated that the use of the images captured at various wavelengths in the UV and NIR bands was very helpful for improving the accuracy of food classification and caloric estimation.
在用户干预最少的情况下,持续监测和记录摄入食物的类型及热量含量,对于预防代谢疾病和肥胖非常有用。本文通过使用多光谱图像实现了食物类型和热量含量的自动识别。采用了一种融合RGB图像与在中心波长为385、405、430、470、490、510、560、590、625、645、660、810、850、870、890、910、950、970和1020纳米的紫外、可见和近红外区域捕获的图像的方法,以提高准确性。采用卷积神经网络(CNN)对食物进行分类并估计热量。使用从101种类型获取的10909张图像对CNN进行训练。根据波长数量研究了包括分类准确率和平均绝对百分比误差(MAPE)在内的目标函数。通过使用分段选择方法确定了波长的最佳组合(包括/不包括RGB图像)。对用于训练CNN的3636张食物类型图像进行了验证测试。实验结果表明,即使在RGB图像中添加一种近红外图像,食物分类的准确率也从88.9%提高到了97.1%,MAPE从41.97降低到了18.97。使用19张图像时食物类型分类的最高准确率为99.81%,使用14张图像时热量含量的最低MAPE为10.56。这些结果表明,使用紫外和近红外波段不同波长捕获的图像对于提高食物分类和热量估计的准确性非常有帮助。