Gonçalves Lucas Melo, Fontes Pedro Levy Piza, Alves Anderson Antonio Carvalho
Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA.
Institute for Integrative Precision Agriculture - University of Georgia, Athens, GA 30602, USA.
J Anim Sci. 2025 Jan 4;103. doi: 10.1093/jas/skaf166.
This study evaluated the suitability of applying supervised deep learning (DL) algorithms for early and real-time pregnancy diagnosis in beef cattle using luteal color Doppler (CD) ultrasonography recorded on days 20 (D20) and 22 (D22) after fixed-time artificial insemination (FTAI). CD ultrasound videos from 390 females were manually evaluated by trained personnel to perform the human-based pregnancy diagnosis (Human). Images were extracted at a rate of 5 frames per second from each video, resulting in 10,533 (D20) and 10,413 (D22) valid frames after applying a frame-filtering pipeline. Three convolutional neural network (CNN) architectures-VGG19, Xception, and ResNet50-along with their averaged inference (Combined), were evaluated using restricted 5-fold cross-validation, ensuring that images from the same animal did not appear in both training and validation sets. Final inferences for each animal were determined by averaging the network outputs across all video frames. Pregnancy status was confirmed on day 29 using conventional ultrasonography and treated as ground truth for assessing both Human and DL-based predictions. Accuracy levels were similar across methods, ranging from 0.84 (VGG19) to 0.87 (Human) for D20 and from 0.86 (VGG19) to 0.93 (Human) for D22. Based on Matthew's correlation coefficient, the Combined and Xception architectures demonstrated the best overall agreement with true pregnancy status among DL models. These architectures performed comparably to human diagnosis, with the Combined model achieving similar F1 scores (0.89 vs 0.91), higher specificity (0.72 vs 0.65), and slightly lower sensitivity (0.95 vs 1.00) on D20. Xception showed similar performance to human diagnosis on D22, with comparable accuracy (0.91 vs 0.93), specificity (0.79 vs 0.81), sensitivity (0.99 vs 1.00), and F1 score (0.93 vs 0.94). In conclusion, DL algorithms can effectively predict pregnancy status using CD ultrasonography earlier than industry-standard methods, with performance comparable to that of trained personnel.
本研究评估了应用监督深度学习(DL)算法,通过在定时人工授精(FTAI)后第20天(D20)和第22天(D22)记录的黄体彩色多普勒(CD)超声检查,对肉牛进行早期实时妊娠诊断的适用性。390头母牛的CD超声视频由训练有素的人员进行人工评估,以进行基于人工的妊娠诊断(人工诊断)。从每个视频中以每秒5帧的速率提取图像,在应用帧过滤流程后,得到10533帧(D20)和10413帧(D22)有效帧。使用受限的5折交叉验证评估了三种卷积神经网络(CNN)架构——VGG19、Xception和ResNet50,以及它们的平均推理结果(组合模型),确保同一动物的图像不会同时出现在训练集和验证集中。通过对所有视频帧的网络输出求平均来确定每头动物的最终推理结果。在第29天使用传统超声检查确认妊娠状态,并将其作为评估人工诊断和基于DL的预测的真实情况。各方法的准确率水平相似,D20时从0.84(VGG19)到0.87(人工诊断),D22时从0.86(VGG19)到0.93(人工诊断)。基于马修斯相关系数,组合模型和Xception架构在DL模型中与真实妊娠状态的总体一致性最佳。这些架构的表现与人工诊断相当,组合模型在D20时的F1分数相似(0.89对0.91),特异性更高(0.72对0.65),敏感性略低(0.95对1.00)。Xception在D22时的表现与人工诊断相似,准确率(0.91对0.93)、特异性(0.79对0.81)、敏感性(0.99对1.00)和F1分数(0.93对0.94)相当。总之,DL算法可以使用CD超声检查比行业标准方法更早地有效预测妊娠状态,其性能与训练有素的人员相当。