• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图像与文本信息融合的病鸡粪便高精度识别方法

High-Accuracy Recognition Method for Diseased Chicken Feces Based on Image and Text Information Fusion.

作者信息

Yang Duanli, Tian Zishang, Xi Jianzhong, Chen Hui, Sun Erdong, Wang Lianzeng

机构信息

College of Information Science and Technology, Hebei Agricultural University, Baoding 071001, China.

Hebei Key Laboratory of Agricultural Big Data, Baoding 071001, China.

出版信息

Animals (Basel). 2025 Jul 22;15(15):2158. doi: 10.3390/ani15152158.

DOI:10.3390/ani15152158
PMID:40804949
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12345488/
Abstract

Poultry feces, a critical biomarker for health assessment, requires timely and accurate pathological identification for food safety. Conventional visual-only methods face limitations due to environmental sensitivity and high visual similarity among feces from different diseases. To address this, we propose MMCD (Multimodal Chicken-feces Diagnosis), a ResNet50-based multimodal fusion model leveraging semantic complementarity between images and descriptive text to enhance diagnostic precision. Key innovations include the following: (1) Integrating MASA(Manhattan self-attention)and DSconv (Depthwise Separable convolution) into the backbone network to mitigate feature confusion. (2) Utilizing a pre-trained BERT to extract textual semantic features, reducing annotation dependency and cost. (3) Designing a lightweight Gated Cross-Attention (GCA) module for dynamic multimodal fusion, achieving a 41% parameter reduction versus cross-modal transformers. Experiments demonstrate that MMCD significantly outperforms single-modal baselines in Accuracy (+8.69%), Recall (+8.72%), Precision (+8.67%), and F1 score (+8.72%). It surpasses simple feature concatenation by 2.51-2.82% and reduces parameters by 7.5M and computations by 1.62 GFLOPs versus the base ResNet50. This work validates multimodal fusion's efficacy in pathological fecal detection, providing a theoretical and technical foundation for agricultural health monitoring systems.

摘要

家禽粪便作为健康评估的关键生物标志物,为保障食品安全,需要及时且准确的病理识别。传统的仅依靠视觉的方法存在局限性,因为其对环境敏感,且不同疾病的粪便在视觉上相似度高。为解决这一问题,我们提出了MMCD(多模态鸡粪诊断),这是一个基于ResNet50的多模态融合模型,利用图像和描述性文本之间的语义互补性来提高诊断精度。关键创新点如下:(1)将曼哈顿自注意力(MASA)和深度可分离卷积(DSconv)集成到骨干网络中,以减轻特征混淆。(2)利用预训练的BERT提取文本语义特征,减少注释依赖和成本。(3)设计了一个轻量级的门控交叉注意力(GCA)模块用于动态多模态融合,与跨模态变压器相比,参数减少了41%。实验表明,MMCD在准确率(提高8.69%)、召回率(提高8.72%)、精确率(提高8.67%)和F1分数(提高8.72%)方面显著优于单模态基线。与简单的特征拼接相比,它提高了2.51 - 2.82%,与基础ResNet50相比,参数减少了750万个,计算量减少了1.62 GFLOPs。这项工作验证了多模态融合在病理性粪便检测中的有效性,为农业健康监测系统提供了理论和技术基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/4ecc2ec5c63c/animals-15-02158-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/ea587ce2ec0b/animals-15-02158-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/a1a5cfc16e7e/animals-15-02158-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/b5fdab874d79/animals-15-02158-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/64f0a104c804/animals-15-02158-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/0c4b42ee23e1/animals-15-02158-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/076e518523eb/animals-15-02158-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/8cef8ed514a4/animals-15-02158-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/079477a2dc0e/animals-15-02158-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/9703bf21dcb3/animals-15-02158-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/f0a27efb0994/animals-15-02158-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/4ecc2ec5c63c/animals-15-02158-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/ea587ce2ec0b/animals-15-02158-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/a1a5cfc16e7e/animals-15-02158-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/b5fdab874d79/animals-15-02158-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/64f0a104c804/animals-15-02158-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/0c4b42ee23e1/animals-15-02158-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/076e518523eb/animals-15-02158-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/8cef8ed514a4/animals-15-02158-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/079477a2dc0e/animals-15-02158-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/9703bf21dcb3/animals-15-02158-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/f0a27efb0994/animals-15-02158-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2d/12345488/4ecc2ec5c63c/animals-15-02158-g011.jpg

相似文献

1
High-Accuracy Recognition Method for Diseased Chicken Feces Based on Image and Text Information Fusion.基于图像与文本信息融合的病鸡粪便高精度识别方法
Animals (Basel). 2025 Jul 22;15(15):2158. doi: 10.3390/ani15152158.
2
Tomato Leaf Disease Identification Framework FCMNet Based on Multimodal Fusion.基于多模态融合的番茄叶病识别框架FCMNet
Plants (Basel). 2025 Jul 27;14(15):2329. doi: 10.3390/plants14152329.
3
A fake news detection model using the integration of multimodal attention mechanism and residual convolutional network.一种融合多模态注意力机制和残差卷积网络的假新闻检测模型。
Sci Rep. 2025 Jul 1;15(1):20544. doi: 10.1038/s41598-025-05702-w.
4
ModFus-PD: synergizing cross-modal attention and contrastive learning for enhanced multimodal diagnosis of Parkinson's disease.ModFus-PD:协同跨模态注意力与对比学习以增强帕金森病的多模态诊断
Front Comput Neurosci. 2025 Jul 16;19:1604399. doi: 10.3389/fncom.2025.1604399. eCollection 2025.
5
Hierarchical cross-modal attention and dual audio pathways for enhanced multimodal sentiment analysis.用于增强多模态情感分析的分层跨模态注意力和双音频路径
Sci Rep. 2025 Jul 15;15(1):25440. doi: 10.1038/s41598-025-09000-3.
6
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
7
Structural semantic-guided MR synthesis from PET images via a dual cross-attention mechanism.通过双交叉注意力机制从PET图像进行结构语义引导的MR合成。
Med Phys. 2025 Jul;52(7):e17957. doi: 10.1002/mp.17957.
8
Deep learning algorithm for pathological grading of renal cell carcinoma based on multi-phase enhanced CT.基于多期增强CT的肾细胞癌病理分级深度学习算法
Zhong Nan Da Xue Xue Bao Yi Xue Ban. 2025 Apr 28;50(4):651-663. doi: 10.11817/j.issn.1672-7347.2025.250082.
9
Knowledge Graph-Enhanced Deep Learning Model (H-SYSTEM) for Hypertensive Intracerebral Hemorrhage: Model Development and Validation.用于高血压性脑出血的知识图谱增强深度学习模型(H-SYSTEM):模型开发与验证
J Med Internet Res. 2025 Jun 12;27:e66055. doi: 10.2196/66055.
10
Analysis of the fusion of multimodal sentiment perception and physiological signals in Chinese-English cross-cultural communication: Transformer approach incorporating self-attention enhancement.汉英跨文化交际中多模态情感感知与生理信号融合分析:融入自注意力增强的Transformer方法
PeerJ Comput Sci. 2025 May 23;11:e2890. doi: 10.7717/peerj-cs.2890. eCollection 2025.

本文引用的文献

1
Genetic characteristics and selection signatures between Southern Chinese local and commercial chickens.中国南方地方鸡与商业鸡之间的遗传特征和选择信号。
Poult Sci. 2024 Jul;103(7):103863. doi: 10.1016/j.psj.2024.103863. Epub 2024 May 16.
2
Development of Multimodal Fusion Technology for Tomato Maturity Assessment.用于番茄成熟度评估的多模态融合技术的开发
Sensors (Basel). 2024 Apr 11;24(8):2467. doi: 10.3390/s24082467.
3
Machine vision-based automatic lamb identification and drinking activity in a commercial farm.基于机器视觉的商业养殖场羊自动识别与饮水行为监测。
Animal. 2023 Sep;17(9):100923. doi: 10.1016/j.animal.2023.100923. Epub 2023 Jul 27.
4
Effective Techniques for Multimodal Data Fusion: A Comparative Analysis.多模态数据融合的有效技术:比较分析。
Sensors (Basel). 2023 Feb 21;23(5):2381. doi: 10.3390/s23052381.
5
A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets.计算机视觉深度多模态学习综述:进展、趋势、应用及数据集
Vis Comput. 2022;38(8):2939-2970. doi: 10.1007/s00371-021-02166-7. Epub 2021 Jun 10.