Suppr超能文献

迈向可持续发展:作物害虫识别中数据质量与数量之间的权衡

Toward Sustainability: Trade-Off Between Data Quality and Quantity in Crop Pest Recognition.

作者信息

Li Yang, Chao Xuewei

机构信息

College of Mechanical and Electrical Engineering, Shihezi University, Shihezi, China.

School of Electrical and Information Engineering, Tianjin University, Tianjin, China.

出版信息

Front Plant Sci. 2021 Dec 24;12:811241. doi: 10.3389/fpls.2021.811241. eCollection 2021.

Abstract

The crop pest recognition based on the convolutional neural networks is meaningful and important for the development of intelligent plant protection. However, the current main implementation method is deep learning, which relies heavily on large amounts of data. As known, current big data-driven deep learning is a non-sustainable learning mode with the high cost of data collection, high cost of high-end hardware, and high consumption of power resources. Thus, toward sustainability, we should seriously consider the trade-off between data quality and quantity. In this study, we proposed an embedding range judgment (ERJ) method in the feature space and carried out many comparative experiments. The results showed that, in some recognition tasks, the selected good data with less quantity can reach the same performance with all training data. Furthermore, the limited good data can beat a lot of bad data, and their contrasts are remarkable. Overall, this study lays a foundation for data information analysis in smart agriculture, inspires the subsequent works in the related areas of pattern recognition, and calls for the community to pay more attention to the essential issue of data quality and quantity.

摘要

基于卷积神经网络的农作物害虫识别对于智能植物保护的发展具有重要意义。然而,当前的主要实现方法是深度学习,它严重依赖大量数据。众所周知,当前大数据驱动的深度学习是一种不可持续的学习模式,数据收集成本高、高端硬件成本高且电力资源消耗大。因此,为了实现可持续性,我们应该认真考虑数据质量和数量之间的权衡。在本研究中,我们在特征空间中提出了一种嵌入范围判断(ERJ)方法,并进行了许多对比实验。结果表明,在一些识别任务中,选择数量较少的优质数据可以达到与所有训练数据相同的性能。此外,有限的优质数据可以胜过大量的劣质数据,它们之间的对比非常显著。总体而言,本研究为智慧农业中的数据信息分析奠定了基础,启发了模式识别相关领域的后续工作,并呼吁业界更多地关注数据质量和数量这一关键问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3a/8739801/35ab7ba5c4c6/fpls-12-811241-g0001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验