Salord Tristan, Magrini Marie-Benoît, Cabanac Guillaume
AGIR, INRAE, University Toulouse, Castanet-Tolosan, France.
CNRS, IRIT, University Toulouse, Toulouse, France.
Data Brief. 2022 Apr 14;42:108173. doi: 10.1016/j.dib.2022.108173. eCollection 2022 Jun.
Methods and tools for extracting robust information on the ingredients used in packaged foods are lacking. To address this gap, we developed an original method for parsing the ingredient lists of packaged foods and used it to build a dataset of food product innovations with their parsed ingredient lists. We describe the parser algorithm used to produce this dataset, along with a benchmark method for assessing the performance of the parsing techniques applied to these ingredient lists. The primary data used to test and apply the method were retrieved from MINTEL-GNPD and cover new food products containing pulse ingredients launched on European markets over the last decade. This work brings original results on the diversity of pulse species used in food products and on the technological features of these ingredients, from whole-grain to ultra-processed uses (such as protein isolates). The parsing techniques we developed can be reused to analyse other ingredient lists. The method also makes it possible to assess marketed crop biodiversity, in terms of how species diversity is represented in food products, as well as the complexity of food formulations. This work thus contributes to providing more complete information on the characteristics of foodstuffs supplied on markets, for both private and public stakeholders.
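To make the idea of parsing an ingredient list concrete, here is a minimal Python sketch. It is not the authors' parser, whose algorithm is described in the article itself. It assumes a simplified label syntax: ingredients separated by commas, sub-ingredients of a compound ingredient given in parentheses or brackets, and optional percentages written with a decimal point. The names `split_top_level` and `parse_ingredients` and the sample label are hypothetical and used only for illustration.

```python
import re

# Assumed syntax: percentages such as "32%" or "5.5 %" (decimal point only).
PERCENT = re.compile(r"(\d+(?:\.\d+)?)\s*%")
# Text before the first opening bracket, the bracketed content, and any trailing text.
GROUP = re.compile(r"^([^(\[]*)[(\[](.*)[)\]](.*)$")


def split_top_level(text):
    """Split on commas that are not nested inside parentheses or brackets."""
    parts, current, depth = [], [], 0
    for ch in text:
        if ch in "([":
            depth += 1
        elif ch in ")]":
            depth = max(depth - 1, 0)
        if ch == "," and depth == 0:
            parts.append("".join(current).strip())
            current = []
        else:
            current.append(ch)
    parts.append("".join(current).strip())
    return [p for p in parts if p]


def parse_ingredients(text):
    """Return a list of dicts with keys: name, percent, children."""
    items = []
    for chunk in split_top_level(text.strip().rstrip(".")):
        name, children = chunk, []
        group = GROUP.match(chunk)
        if group:
            head, inner, tail = group.groups()
            name = f"{head} {tail}"
            if PERCENT.fullmatch(inner.strip()):
                # "(32%)" is a quantity, not a list of sub-ingredients.
                name = f"{name} {inner}"
            else:
                children = parse_ingredients(inner)
        percent = None
        found = PERCENT.search(name)
        if found:
            percent = float(found.group(1))
            name = PERCENT.sub("", name)
        name = " ".join(name.split()).lower()  # normalise spacing and case
        items.append({"name": name, "percent": percent, "children": children})
    return items


# Hypothetical label text, invented for this example.
label = ("Water, chickpea flour (32%), hummus 12% (chickpeas, sesame paste, salt), "
         "pea protein isolate, Red lentils 5.5%.")
for item in parse_ingredients(label):
    print(item["name"], item["percent"], [c["name"] for c in item["children"]])
```

The nested `children` lists give one rough indicator of formulation complexity (the number and depth of sub-ingredients), and the normalised names could then be matched against a list of pulse species. Real labels are messier than this sketch assumes: decimal commas, several bracketed groups in one ingredient, allergen markup, and multilingual text would all need additional handling.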