College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot 010018, China.
Inner Mongolia Autonomous Region Key Laboratory of Big Data Research and Application of Agriculture and Animal Husbandry, Hohhot 010018, China.
Math Biosci Eng. 2022 Apr 7;19(6):5850-5866. doi: 10.3934/mbe.2022274.
Traditional laboratory microscopy for identifying bovine milk somatic cells is subjective, time-consuming, and labor-intensive. The accuracy of the recognition directly through a single classifier is low. In this paper, a novel algorithm that combined the feature extraction algorithm and fusion classification model was proposed to identify the somatic cells. First, 392 cell images from four types of bovine milk somatic cells dataset were trained and tested. Secondly, filtering and the K-means method were used to preprocess and segment the images. Thirdly, the color, morphological, and texture features of the four types of cells were extracted, totaling 100 features. Finally, the gradient boosting decision tree (GBDT)-AdaBoost fusion model was proposed. For the GBDT classifier, the light gradient boosting machine (LightGBM) was used as the weak classifier. The decision tree (DT) was used as the weak classifier of the AdaBoost classifier. The results showed that the average recognition accuracy of the GBDT-AdaBoost reached 98.0%. At the same time, that of random forest (RF), extremely randomized tree (ET), DT, and LightGBM was 79.9, 71.1, 67.3 and 77.2%, respectively. The recall rate of the GBDT-AdaBoost model was the best performance on all types of cells. The F1-Score of the GBDT-AdaBoost model was also better than the results of any single classifiers. The proposed algorithm can effectively recognize the image of bovine milk somatic cells. Moreover, it may provide a reference for recognizing bovine milk somatic cells with similar shape size characteristics and is difficult to distinguish.
传统的用于识别牛乳体细胞的实验室显微镜方法具有主观性、耗时且劳动强度大的特点。单一分类器的识别准确率直接较低。在本文中,提出了一种将特征提取算法和融合分类模型相结合的新算法,用于识别体细胞。首先,从四种牛乳体细胞数据集的 392 个细胞图像中进行训练和测试。其次,使用滤波和 K-均值方法对图像进行预处理和分割。然后,提取四种细胞的颜色、形态和纹理特征,共提取 100 个特征。最后,提出了梯度提升决策树(GBDT)-AdaBoost 融合模型。对于 GBDT 分类器,使用轻梯度提升机(LightGBM)作为弱分类器。AdaBoost 分类器的弱分类器使用决策树(DT)。结果表明,GBDT-AdaBoost 的平均识别准确率达到 98.0%。同时,随机森林(RF)、极端随机树(ET)、DT 和 LightGBM 的识别准确率分别为 79.9%、71.1%、67.3%和 77.2%。GBDT-AdaBoost 模型在所有类型的细胞上的召回率表现最佳。GBDT-AdaBoost 模型的 F1-Score 也优于任何单个分类器的结果。所提出的算法可以有效地识别牛乳体细胞的图像。此外,它可能为识别具有相似形状和大小特征且难以区分的牛乳体细胞提供参考。