Scuccimarra Eric Antoine, Arnaud Alexandre, Tassy Marie, Lê Kim-Anne, Mainardi Fabio
Nestlé Institute of Health Sciences, Nestlé Research, Société des Produits Nestlé, Lausanne, Switzerland.
Division of Human Nutrition and Health, Wageningen University & Research, Wageningen, Netherlands.
Front Nutr. 2025 Mar 12;12:1530846. doi: 10.3389/fnut.2025.1530846. eCollection 2025.
Carbohydrates are the major contributor to the energy intake of worldwide population. There is established evidence of links of carbohydrate quality with human health. Knowledge of specific carbohydrate in packaged food, such as added and free sugars, could help further investigate this link, however this information is generally not available.
To develop an algorithm to predict the content of free sugars in a global database of packaged foods and beverages; and test the applicability of the algorithm to assess carbohydrate quality in packaged food products from different countries and monitor the evolution over time. Carbohydrate quality was defined using a 10:1|1:2 ratio for carbohydrate, fibers and free sugar, i.e., for every 10 g of total carbohydrates in a diet or product, there is at least 1 g of dietary fibers, and less than 2 g of free sugars for every 1 g of dietary fibers.
We used a machine learning approach to predict added and free sugars, which enabled us to predict the carbohydrate quality of products from a global database of packaged food. Our predictions were tested by splitting the dataset into training, validation, and test sets, using US data.
We were able to predict free sugars and carbohydrate quality for 424,543 products in the U.S. and in 14 countries. The overall mean absolute error on the test set was 0.96 g/100 g of product. The predictions generalized with a high accuracy to non-US countries, and we were able to effectively predict the proportion of products meeting the 10:1|1:2 criteria in the food supply of 15 countries.
Our methodology achieved high accuracy and is fully automated; it may be applied to other databases of packaged products and can be easily applied for continuous monitoring of the carbohydrate quality of the global supply of packaged food.
碳水化合物是全球人口能量摄入的主要来源。碳水化合物质量与人类健康之间的联系已有确凿证据。了解包装食品中的特定碳水化合物,如添加糖和游离糖,有助于进一步研究这种联系,但此类信息通常难以获取。
开发一种算法,用于预测全球包装食品和饮料数据库中游离糖的含量;并测试该算法在评估不同国家包装食品碳水化合物质量以及监测随时间变化方面的适用性。碳水化合物质量是根据碳水化合物、纤维和游离糖的10:1|1:2比例来定义的,即饮食或产品中每10克总碳水化合物中,至少有1克膳食纤维,且每1克膳食纤维对应的游离糖少于2克。
我们采用机器学习方法预测添加糖和游离糖,从而能够从全球包装食品数据库中预测产品的碳水化合物质量。我们的预测通过将数据集分为训练集、验证集和测试集,并使用美国数据进行测试。
我们能够预测美国和14个国家中424,543种产品的游离糖含量和碳水化合物质量。测试集上的总体平均绝对误差为0.96克/100克产品。这些预测在非美国国家具有很高的准确性,并且我们能够有效预测15个国家食品供应中符合10:1|1:2标准的产品比例。
我们的方法具有很高的准确性且完全自动化;它可应用于其他包装产品数据库,并可轻松用于持续监测全球包装食品供应的碳水化合物质量。