Suppr超能文献

PoulTrans:一种基于变压器的精确家禽状况评估模型。

PoulTrans: a transformer-based model for accurate poultry condition assessment.

作者信息

Li Jun, Yang Bing, Chen Junyang, Liu Jiaxin, Amevor Felix Kwame, Chen Guanyu, Zhang Buyuan, Zhao Xiaoling

机构信息

College of Information Engineering, Sichuan Agricultural University, 46 Xinkang Road, Yucheng District, Ya'an, 625000, Sichuan Province, People's Republic of China.

Agricultural Information Engineering Higher Institution Key Laboratory of Sichuan Province, Ya'an, 625000, Sichuan Province, People's Republic of China.

出版信息

Sci Rep. 2025 Apr 23;15(1):14064. doi: 10.1038/s41598-025-98078-w.

Abstract

Recent advances in deep learning have significantly enhanced the accuracy of poultry image recognition, particularly in assessing poultry conditions. However, developing intuitive decision support tools remain a significant challenge. To address this, we present PoulTrans, an innovative image captioning framework that leverages a Convolutional Neural Network (CNN) integrated with a CSA_Encoder-Transformer architecture to generate detailed poultry status reports. This model incorporates visual features extracted by CNNs into the Channel Spatial Attention Segmentation Encoder (CSA_Encoder), which produces segmented channel and spatial attention outputs. To optimize multi-level attention and improve the semantic precision of the status descriptions, we introduced a Channel Spatial Memory-Guided Transformer (CSMT) and a novel PS-Loss function. The performance of PoulTrans was tested on the PSC-Captions dataset, achieving top scores of 0.501, 0.803, 4.927, 0.608, and 1.882 for the BLEU-4, ROUGE-L, CIDEr, SPICE, and Sm metrics, respectively. Comprehensive analyses and experiments have validated the effectiveness and reliability of our model, providing advanced tools for automated poultry status generation and enhancing the digital experience for poultry farmers. Our code is available at: https://github.com/kong1107800/PoulTrans .

摘要

深度学习的最新进展显著提高了家禽图像识别的准确性,尤其是在评估家禽状况方面。然而,开发直观的决策支持工具仍然是一项重大挑战。为了解决这一问题,我们提出了PoulTrans,这是一个创新的图像字幕框架,它利用了与CSA_Encoder-Transformer架构集成的卷积神经网络(CNN)来生成详细的家禽状态报告。该模型将CNN提取的视觉特征整合到通道空间注意力分割编码器(CSA_Encoder)中,该编码器产生分割后的通道和空间注意力输出。为了优化多级注意力并提高状态描述的语义精度,我们引入了通道空间记忆引导变压器(CSMT)和一种新颖的PS损失函数。在PSC-Captions数据集上对PoulTrans的性能进行了测试,在BLEU-4、ROUGE-L、CIDEr、SPICE和Sm指标上分别取得了0.501、0.803、4.927、0.608和1.882的最高分。综合分析和实验验证了我们模型的有效性和可靠性,为自动生成家禽状态提供了先进工具,并增强了家禽养殖户的数字体验。我们的代码可在以下网址获取:https://github.com/kong1107800/PoulTrans

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验