• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于作物病虫害多模态识别的大语言模型。

A large language model for multimodal identification of crop diseases and pests.

作者信息

Wang Yiqun, Wang Fahai, Chen Wenbai, Lv Bowen, Liu Mengchen, Kong Xiangyuan, Zhao Chunjiang, Pan Zhaocen

机构信息

School of Automation, Beijing Information Science and Technology University, Beijing, 100192, China.

Beijing Research Center for Information Technology in Agriculture, Beijing, 100097, China.

出版信息

Sci Rep. 2025 Jul 1;15(1):21959. doi: 10.1038/s41598-025-01908-0.

DOI:10.1038/s41598-025-01908-0
PMID:40595692
Abstract

Pests and diseases significantly impact the growth and development of crops. When attempting to precisely identify disease characteristics in crop images through dialogue, existing multimodal models face numerous challenges, often leading to misinterpretation and incorrect feedback regarding disease information. This paper proposed a large language model for multimodal identification of crop diseases and pests, which can be called LLMI-CDP. It builds up on the VisualGLM model and introduces improvements to achieve precise identification of agricultural crop disease and pest images, along with providing professional recommendations for relevant preventive measures. The use of Low-Rank Adaptation (LoRA) technology, which adjusts the weights of pre-trained models, achieves significant performance improvements with a minimal increase in parameters. This ensures the precise capture and efficient identification of crop pest and disease characteristics, greatly enhancing the model's application flexibility and accuracy in the field of pest and disease recognition. Simultaneously, the model incorporates the Q-Former framework for effective modal alignment between language models and image features. Through this approach, the LLMI-CDP model is able to more deeply understand and process the complex relationships between language and visual information, further enhancing its performance in multimodal recognition tasks. Experiments are carried out in the homemade datasets, The results demonstrate that the LLMI-CDP model surpasses five leading multimodal large language models in relevant evaluation metrics, confirming its outstanding performance in Chinese multimodal dialogues related to agriculture.

摘要

病虫害对农作物的生长发育有重大影响。在试图通过对话精确识别作物图像中的病害特征时,现有的多模态模型面临诸多挑战,常常导致对病害信息的误解和错误反馈。本文提出了一种用于农作物病虫害多模态识别的大语言模型,可称为LLMI-CDP。它基于VisualGLM模型构建,并进行了改进,以实现对农作物病虫害图像的精确识别,并为相关预防措施提供专业建议。使用低秩自适应(LoRA)技术来调整预训练模型的权重,在参数增加极少的情况下实现了显著的性能提升。这确保了对作物病虫害特征的精确捕捉和高效识别,极大地提高了模型在病虫害识别领域的应用灵活性和准确性。同时,该模型纳入了Q-Former框架,以实现语言模型与图像特征之间的有效模态对齐。通过这种方法,LLMI-CDP模型能够更深入地理解和处理语言与视觉信息之间的复杂关系,进一步提升其在多模态识别任务中的性能。在自制数据集上进行了实验,结果表明LLMI-CDP模型在相关评估指标上超越了五个领先的多模态大语言模型,证实了其在与农业相关的中文多模态对话中的卓越表现。

相似文献

1
A large language model for multimodal identification of crop diseases and pests.一种用于作物病虫害多模态识别的大语言模型。
Sci Rep. 2025 Jul 1;15(1):21959. doi: 10.1038/s41598-025-01908-0.
2
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
3
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
4
Exploring the Potential of Electroencephalography Signal-Based Image Generation Using Diffusion Models: Integrative Framework Combining Mixed Methods and Multimodal Analysis.利用扩散模型探索基于脑电图信号的图像生成潜力:结合混合方法和多模态分析的综合框架
JMIR Med Inform. 2025 Jun 25;13:e72027. doi: 10.2196/72027.
5
A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes: Methodology and Validation Study.用于评估、选择和解释2型糖尿病患者心血管疾病结局机器学习模型的责任框架:方法与验证研究
JMIR Med Inform. 2025 Jun 27;13:e66200. doi: 10.2196/66200.
6
Iterative segmentation and classification for enhanced crop disease diagnosis using optimized hybrid U-Nets model.使用优化的混合U-Net模型进行迭代分割和分类以增强作物病害诊断
PeerJ Comput Sci. 2025 Jun 11;11:e2543. doi: 10.7717/peerj-cs.2543. eCollection 2025.
7
Theoretical approaches to the engagement with patients in case-management programmes and assertive outreach teams: A systematic review of the literature.以病例管理计划和积极外展团队为背景下与患者互动的理论方法:文献的系统综述。
J Psychiatr Ment Health Nurs. 2022 Oct;29(5):647-658. doi: 10.1111/jpm.12831. Epub 2022 Apr 24.
8
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
9
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
10
Evaluating and Enhancing Japanese Large Language Models for Genetic Counseling Support: Comparative Study of Domain Adaptation and the Development of an Expert-Evaluated Dataset.评估和增强用于遗传咨询支持的日本大语言模型:领域适应的比较研究与专家评估数据集的开发
JMIR Med Inform. 2025 Jan 16;13:e65047. doi: 10.2196/65047.

本文引用的文献

1
Deep learning-based classification of alfalfa varieties: A comparative study using a custom leaf image dataset.基于深度学习的苜蓿品种分类:使用自定义叶片图像数据集的比较研究。
MethodsX. 2024 Nov 16;13:103051. doi: 10.1016/j.mex.2024.103051. eCollection 2024 Dec.
2
ChatGPT for shaping the future of dentistry: the potential of multi-modal large language model.ChatGPT 塑造牙科的未来:多模态大语言模型的潜力。
Int J Oral Sci. 2023 Jul 28;15(1):29. doi: 10.1038/s41368-023-00239-y.
3
Fatigue database of complex metallic alloys.复杂金属合金疲劳数据库。
Sci Data. 2023 Jul 12;10(1):447. doi: 10.1038/s41597-023-02354-1.
4
Holistic Evaluation of Language Models.语言模型的整体评估。
Ann N Y Acad Sci. 2023 Jul;1525(1):140-146. doi: 10.1111/nyas.15007. Epub 2023 May 25.