Department Electrical and Computer Engineering, The University of Alabama, Tuscaloosa, AL 35487, USA.
Department Electrical and Computer Engineering, Clarkson University, Potsdam, NY 13699, USA.
Sensors (Basel). 2023 Jan 4;23(2):560. doi: 10.3390/s23020560.
Sensor-based food intake monitoring has become one of the fastest-growing fields in dietary assessment. Researchers are exploring imaging-sensor-based food detection, food recognition, and food portion size estimation. A major problem that is still being tackled in this field is the segmentation of regions of food when multiple food items are present, mainly when similar-looking foods (similar in color and/or texture) are present. Food image segmentation is a relatively under-explored area compared with other fields. This paper proposes a novel approach to food imaging consisting of two imaging sensors: color (Red-Green-Blue) and thermal. Furthermore, we propose a multi-modal four-Dimensional (RGB-T) image segmentation using a k-means clustering algorithm to segment regions of similar-looking food items in multiple combinations of hot, cold, and warm (at room temperature) foods. Six food combinations of two food items each were used to capture RGB and thermal image data. RGB and thermal data were superimposed to form a combined RGB-T image and three sets of data (RGB, thermal, and RGB-T) were tested. A bootstrapped optimization of within-cluster sum of squares (WSS) was employed to determine the optimal number of clusters for each case. The combined RGB-T data achieved better results compared with RGB and thermal data, used individually. The mean ± standard deviation (std. dev.) of the F1 score for RGB-T data was 0.87 ± 0.1 compared with 0.66 ± 0.13 and 0.64 ± 0.39, for RGB and Thermal data, respectively.
基于传感器的食物摄入量监测已成为饮食评估中发展最快的领域之一。研究人员正在探索基于成像传感器的食物检测、食物识别和食物份量估计。在这个领域,一个仍在解决的主要问题是当存在多种食物时,特别是当存在外观相似的食物(颜色和/或质地相似)时,对食物区域的分割。与其他领域相比,食物图像分割是一个相对探索较少的领域。本文提出了一种新的食物成像方法,由两个成像传感器组成:颜色(红绿蓝)和热。此外,我们提出了一种使用 k-均值聚类算法的多模态四维(RGB-T)图像分割,用于分割多种热、冷和温(室温)食物的外观相似食物的区域。使用两种食物各两个的六种食物组合来捕获 RGB 和热图像数据。将 RGB 和热数据叠加以形成组合的 RGB-T 图像,并测试三组数据(RGB、热和 RGB-T)。使用 within-cluster sum of squares (WSS) 的自举优化来确定每种情况的最佳聚类数。与单独使用 RGB 和热数据相比,组合的 RGB-T 数据的 F1 分数的平均值±标准偏差(std.dev.)为 0.87±0.1,而 RGB 数据和热数据分别为 0.66±0.13 和 0.64±0.39。