PoulTrans: a transformer-based model for accurate poultry condition assessment.

Author information

Li Jun, Yang Bing, Chen Junyang, Liu Jiaxin, Amevor Felix Kwame, Chen Guanyu, Zhang Buyuan, Zhao Xiaoling

Affiliations

College of Information Engineering, Sichuan Agricultural University, 46 Xinkang Road, Yucheng District, Ya'an, 625000, Sichuan Province, People's Republic of China.

Agricultural Information Engineering Higher Institution Key Laboratory of Sichuan Province, Ya'an, 625000, Sichuan Province, People's Republic of China.

Publication information

Sci Rep. 2025 Apr 23;15(1):14064. doi: 10.1038/s41598-025-98078-w.

DOI: 10.1038/s41598-025-98078-w

PMID: 40269017

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12018970/

Abstract

Recent advances in deep learning have significantly enhanced the accuracy of poultry image recognition, particularly in assessing poultry conditions. However, developing intuitive decision support tools remains a significant challenge. To address this, we present PoulTrans, an image captioning framework that integrates a Convolutional Neural Network (CNN) with a CSA_Encoder-Transformer architecture to generate detailed poultry status reports. The model feeds visual features extracted by the CNN into the Channel Spatial Attention Segmentation Encoder (CSA_Encoder), which produces segmented channel and spatial attention outputs. To optimize multi-level attention and improve the semantic precision of the status descriptions, we introduce a Channel Spatial Memory-Guided Transformer (CSMT) and a novel PS-Loss function. PoulTrans was evaluated on the PSC-Captions dataset, achieving top scores of 0.501, 0.803, 4.927, 0.608, and 1.882 for the BLEU-4, ROUGE-L, CIDEr, SPICE, and Sm metrics, respectively. Comprehensive analyses and experiments validate the effectiveness and reliability of the model, which provides a tool for automated poultry status generation and improves the digital experience for poultry farmers. Our code is available at https://github.com/kong1107800/PoulTrans.
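The abstract describes applying channel attention and spatial attention to CNN feature maps. As a rough illustration of that general idea only, the NumPy sketch below reweights a feature map first per channel and then per spatial location, returning both outputs. It is not the authors' CSA_Encoder: the function names, the bottleneck ratio `r`, the random weights, and the avg-plus-max spatial gate are all assumptions chosen to keep the example small. The actual implementation is in the linked repository.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feats, w1, w2):
    """feats: (C, H, W). Returns per-channel weights in (0, 1), shape (C,)."""
    pooled = feats.mean(axis=(1, 2))        # global average pool -> (C,)
    hidden = np.maximum(w1 @ pooled, 0.0)   # bottleneck + ReLU -> (C // r,)
    return sigmoid(w2 @ hidden)             # expand back -> (C,)

def spatial_attention(feats):
    """feats: (C, H, W). Returns per-location weights in (0, 1), shape (H, W)."""
    avg_map = feats.mean(axis=0)            # average over channels -> (H, W)
    max_map = feats.max(axis=0)             # max over channels -> (H, W)
    # Assumption: a plain sum replaces the learned convolution that
    # attention modules of this kind typically apply here.
    return sigmoid(avg_map + max_map)

def channel_spatial_attention(feats, w1, w2):
    """Apply channel attention, then spatial attention; return both outputs."""
    ch = channel_attention(feats, w1, w2)
    channel_out = feats * ch[:, None, None]      # broadcast over H and W
    sp = spatial_attention(channel_out)
    spatial_out = channel_out * sp[None, :, :]   # broadcast over channels
    return channel_out, spatial_out

# Hypothetical toy input: 8 channels on a 4x4 grid, bottleneck ratio r = 2.
rng = np.random.default_rng(0)
C, H, W, r = 8, 4, 4, 2
feats = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((C // r, C)) * 0.1   # random stand-in for learned weights
w2 = rng.standard_normal((C, C // r)) * 0.1
channel_out, spatial_out = channel_spatial_attention(feats, w1, w2)
print(channel_out.shape, spatial_out.shape)
```

Because both gates are sigmoid outputs, each stage can only shrink feature magnitudes. In a trained model the weights would be learned so that informative channels and regions are attenuated least.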

Similar articles

PoulTrans: a transformer-based model for accurate poultry condition assessment.

Sci Rep. 2025 Apr 23;15(1):14064. doi: 10.1038/s41598-025-98078-w.

Insights into Object Semantics: Leveraging Transformer Networks for Advanced Image Captioning.

Sensors (Basel). 2024 Mar 11;24(6):1796. doi: 10.3390/s24061796.

Translating medical image to radiological report: Adaptive multilevel multi-attention approach.

Comput Methods Programs Biomed. 2022 Jun;221:106853. doi: 10.1016/j.cmpb.2022.106853. Epub 2022 May 4.

TAC-UNet: transformer-assisted convolutional neural network for medical image segmentation.

Quant Imaging Med Surg. 2024 Dec 5;14(12):8824-8839. doi: 10.21037/qims-24-1229. Epub 2024 Nov 5.

MS-TCNet: An effective Transformer-CNN combined network using multi-scale feature learning for 3D medical image segmentation.

Comput Biol Med. 2024 Mar;170:108057. doi: 10.1016/j.compbiomed.2024.108057. Epub 2024 Jan 28.

Multi-level semantic-aware transformer for image captioning.

Neural Netw. 2025 Jul;187:107390. doi: 10.1016/j.neunet.2025.107390. Epub 2025 Mar 17.

Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.

Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.

A deep learning-based framework (Co-ReTr) for auto-segmentation of non-small cell-lung cancer in computed tomography images.

J Appl Clin Med Phys. 2024 Mar;25(3):e14297. doi: 10.1002/acm2.14297. Epub 2024 Feb 19.

ETUNet: Exploring efficient transformer enhanced UNet for 3D brain tumor segmentation.

Comput Biol Med. 2024 Mar;171:108005. doi: 10.1016/j.compbiomed.2024.108005. Epub 2024 Jan 23.

ResTransUNet: A hybrid CNN-transformer approach for liver and tumor segmentation in CT images.

Comput Biol Med. 2025 May;190:110048. doi: 10.1016/j.compbiomed.2025.110048. Epub 2025 Mar 28.
