Neto Habib Asseiss, Tavares Wanessa L F, Ribeiro Daniela C S Z, Alves Ronnie C O, Fonseca Leorges M, Campos Sérgio V A
Federal Institute of Mato Grosso do Sul, Rua Ângelo Melão, 790, Três Lagoas, 79641-162 MS Brazil.
2Department of Computer Science, Federal University of Minas Gerais, Av. Antônio Carlos, 6627, Belo Horizonte, 31270-901 MG Brazil.
BioData Min. 2019 Jul 8;12:13. doi: 10.1186/s13040-019-0200-5. eCollection 2019.
Fraudulent milk adulteration is a dangerous practice in the dairy industry that is harmful to consumers since milk is one of the most consumed food products. Milk quality can be assessed by Fourier Transformed Infrared Spectroscopy (FTIR), a simple and fast method for obtaining its compositional information. The spectral data produced by this technique can be explored using machine learning methods, such as neural networks and decision trees, in order to create models that represent the characteristics of pure and adulterated milk samples.
Thousands of milk samples were collected, some of them were manually adulterated with five different substances and subjected to infrared spectroscopy. This technique produced spectral data from the milk samples composition, which were used for training different machine learning algorithms, such as deep and ensemble decision tree learners. The proposed method is used to predict the presence of adulterants in a binary classification problem and also the specific assessment of which of five adulterants was found through multiclass classification. In deep learning, we propose a Convolutional Neural Network architecture that needs no preprocessing on spectral data. Classifiers evaluated show promising results, with classification accuracies up to 98.76%, outperforming commonly used classical learning methods.
The proposed methodology uses machine learning techniques on milk spectral data. It is able to predict common adulterations that occur in the dairy industry. Both deep and ensemble tree learners were evaluated considering binary and multiclass classifications and the results were compared. The proposed neural network architecture is able to outperform the composition recognition made by the FTIR equipment and by commonly used methods in the dairy industry.
在乳制品行业中,掺假牛奶是一种危险行为,由于牛奶是消费最多的食品之一,因此对消费者有害。牛奶质量可以通过傅里叶变换红外光谱法(FTIR)进行评估,这是一种获取其成分信息的简单快速方法。可以使用机器学习方法(如神经网络和决策树)来探索该技术产生的光谱数据,以创建代表纯牛奶和掺假牛奶样品特征的模型。
收集了数千个牛奶样品,其中一些用五种不同物质进行了人工掺假,并进行了红外光谱分析。该技术从牛奶样品成分中产生了光谱数据,这些数据用于训练不同的机器学习算法,如深度和集成决策树学习器。所提出的方法用于在二元分类问题中预测掺假物的存在,并且还通过多类分类对发现的五种掺假物中的哪一种进行具体评估。在深度学习中,我们提出了一种无需对光谱数据进行预处理的卷积神经网络架构。评估的分类器显示出有希望的结果,分类准确率高达98.76%,优于常用的经典学习方法。
所提出的方法在牛奶光谱数据上使用机器学习技术。它能够预测乳制品行业中常见的掺假情况。考虑到二元和多类分类对深度和集成树学习器进行了评估,并对结果进行了比较。所提出的神经网络架构能够优于FTIR设备和乳制品行业常用方法进行的成分识别。