Jensen Dan B, Hogeveen Henk, De Vries Albert
Department of Large Animal Sciences, University of Copenhagen, DK-1870 Frederiksberg C, Denmark.
Chair Group Business Economics, Wageningen University, 6706 KN Wageningen, the Netherlands.
J Dairy Sci. 2016 Sep;99(9):7344-7361. doi: 10.3168/jds.2015-10060. Epub 2016 Jun 16.
Rapid detection of dairy cow mastitis is important so corrective action can be taken as soon as possible. Automatically collected sensor data used to monitor the performance and the health state of the cow could be useful for rapid detection of mastitis while reducing the labor needs for monitoring. The state of the art in combining sensor data to predict clinical mastitis still does not perform well enough to be applied in practice. Our objective was to combine a multivariate dynamic linear model (DLM) with a naïve Bayesian classifier (NBC) in a novel method using sensor and nonsensor data to detect clinical cases of mastitis. We also evaluated reductions in the number of sensors for detecting mastitis. With the DLM, we co-modeled 7 sources of sensor data (milk yield, fat, protein, lactose, conductivity, blood, body weight) collected at each milking for individual cows to produce one-step-ahead forecasts for each sensor. The observations were subsequently categorized according to the errors of the forecasted values and the estimated forecast variance. The categorized sensor data were combined with other data pertaining to the cow (week in milk, parity, mastitis history, somatic cell count category, and season) using Bayes' theorem, which produced a combined probability of the cow having clinical mastitis. If this probability was above a set threshold, the cow was classified as mastitis positive. To illustrate the performance of our method, we used sensor data from 1,003,207 milkings from the University of Florida Dairy Unit collected from 2008 to 2014. Of these, 2,907 milkings were associated with recorded cases of clinical mastitis. Using the DLM/NBC method, we reached an area under the receiver operating characteristic curve of 0.89, with a specificity of 0.81 when the sensitivity was set at 0.80. Specificities with omissions of sensor data ranged from 0.58 to 0.81. These results are comparable to other studies, but differences in data quality, definitions of clinical mastitis, and time windows make comparisons across studies difficult. We found the DLM/NBC method to be a flexible method for combining multiple sensor and nonsensor data sources to predict clinical mastitis and accommodate missing observations. Further research is needed before practical implementation is possible. In particular, the performance of our method needs to be improved in the first 2 wk of lactation. The DLM method produces forecasts that are based on continuously estimated multivariate normal distributions, which makes forecasts and forecast errors easy to interpret, and new sensors can easily be added.
快速检测奶牛乳腺炎很重要,这样才能尽快采取纠正措施。自动收集的用于监测奶牛生产性能和健康状况的传感器数据,对于乳腺炎的快速检测可能有用,同时还能减少监测所需的劳动力。目前在结合传感器数据来预测临床乳腺炎方面的技术水平仍不足以在实际中应用。我们的目标是将多元动态线性模型(DLM)与朴素贝叶斯分类器(NBC)相结合,采用一种新颖的方法,利用传感器和非传感器数据来检测乳腺炎的临床病例。我们还评估了用于检测乳腺炎的传感器数量的减少情况。利用DLM,我们对每头奶牛每次挤奶时收集的7种传感器数据来源(产奶量、脂肪、蛋白质、乳糖、电导率、血液、体重)进行联合建模,以生成每个传感器的提前一步预测值。随后根据预测值的误差和估计的预测方差对观测值进行分类。使用贝叶斯定理将分类后的传感器数据与其他与奶牛相关的数据(泌乳周数、胎次、乳腺炎病史、体细胞计数类别和季节)相结合,得出奶牛患临床乳腺炎的综合概率。如果这个概率高于设定的阈值,则将该奶牛分类为乳腺炎阳性。为了说明我们方法的性能,我们使用了2008年至2014年从佛罗里达大学奶牛场收集的1,003,207次挤奶的传感器数据。其中,2,907次挤奶与记录的临床乳腺炎病例相关。使用DLM/NBC方法,我们得到的受试者工作特征曲线下面积为0.89,当灵敏度设定为0.80时,特异性为0.81。遗漏传感器数据时的特异性范围为0.58至0.81。这些结果与其他研究相当,但数据质量、临床乳腺炎的定义和时间窗口的差异使得跨研究比较变得困难。我们发现DLM/NBC方法是一种灵活的方法,可用于结合多个传感器和非传感器数据源来预测临床乳腺炎并处理缺失观测值。在实际应用之前还需要进一步研究。特别是,我们方法在泌乳的前两周的性能需要改进。DLM方法产生的预测基于连续估计的多元正态分布,这使得预测和预测误差易于解释,并且可以轻松添加新的传感器。